Salma Health

Data Engineer

Posted 12 Days Ago
Remote
Hiring Remotely in USA
119K-185K Annually
Mid level
As a Data Engineer, you will build and maintain data pipelines, convert data into metrics, and operate the platform on AWS. Responsibilities include writing production code, processing data from APIs, and enhancing the orchestration layer using Dagster, while ensuring compliance in a HIPAA-regulated environment.
The summary above was generated by AI

We are looking to hire a Data Engineer to join our team as we build the data backbone for a mental and behavioral health practice. This role will build the platform that turns appointments, assessments, billing, and patient engagement data into the metrics our clinical and operations teams rely on. As a mid-level data engineer, you'll own meaningful pieces of our pipeline end-to-end: from pulling data out of third-party APIs, through medallion architecture transformations in dbt, to exposing curated metrics through our semantic layer.

This is a hands-on role on a small team. You'll write code that runs in production every day, ship improvements weekly, and have direct visibility into how the data is used. We work in a HIPAA-regulated environment, so thoughtfulness about data handling is part of the job.

Location

Hybrid role. Preference for candidates located in the San Francisco Bay Area, San Diego, or Salt Lake City. Remote is possible, with the expectation of regular in-person collaboration.


What You'll Work On
  • Maintaining and improving the orchestration layer: Dagster assets, jobs, schedules, sensors, and the dependency graph that ties extraction → loading → transformation together.

  • Adding new data sources to the pipeline; extracting from APIs (GraphQL, REST), Google Drive folders, and CSV/JSONL drops on S3, then landing them in our bronze schemas via Dagster assets.

  • Building silver and gold dbt models that transform raw source data into our unified entity model following the medallion architecture.

  • Extending our semantic layer so business metrics are available to downstream consumers (BI tool dashboards, AI agents, ad-hoc analysis) without re-deriving logic

  • Operating the platform on AWS: ECS Fargate services, RDS, S3, Secrets Manager, CloudFormation templates, and the CodePipeline-based CI/CD that deploys our data platform. All of our data platforms are deployed with IaC tools.

  • Writing tests (pytest for Python, dbt tests for models, data quality tests) and contributing to internal documentation as new patterns emerge.

What We're Looking For

Required
  • 4-7 years of professional experience building and operating data pipelines in production

  • From conversation to shipped data product: you're comfortable owning a request end-to-end, from scoping it with a non-technical stakeholder and writing requirements clear enough that you (and others) can build against them, to implementing the models or metrics and verifying with the stakeholder that what shipped solves their problem.

  • Strong Python: comfortable writing modules, structuring code for reuse and testability, and debugging issues across an async or orchestrated pipeline.

  • Solid SQL skills, including window functions, CTEs (including recursive ones), and the ability to reason about query performance.

  • Hands-on experience with dbt: building models, writing tests, and understanding materializations.

  • Working knowledge of an orchestration framework (Dagster, Airflow, Prefect, or similar), including the mental model of assets/tasks, dependencies, and scheduling.

  • Comfort with AWS fundamentals: S3, IAM, Secrets Manager, and either ECS or Lambda for compute.

  • Git-based workflows: participating in code review and writing PRs that are reviewable.

Nice to Have
  • Experience with Dagster specifically.

  • Experience with semantic layer tools (Cube.js, dbt Semantic Layer/MetricFlow, LookML)

  • Healthcare data experience (HIPAA, EHR systems, ICD-10/CPT codes)

  • CloudFormation, Terraform, or another IaC tool

  • Experience with GraphQL APIs as a consumer (pagination, introspection, dealing with rate limits and retries)

  • Familiarity with identity resolution patterns or slowly-changing dimension modeling

How We Work
  • Small, focused team; your work ships and gets used quickly

  • Pragmatic engineering: we favor readable code, clear naming conventions, and well-documented patterns over clever abstractions. Our internal "how to add X" guides are first-class artifacts.

  • Tests on everything; CI runs dbt parse, dg check defs, and pytest on every PR.

Company Mission & Vision

We are the brain health company of the future that integrates care delivery, technology innovation and research breakthroughs to better understand brain biology and diagnose, treat and ultimately cure brain disorders for all stages of life.

Who We Are

Salma Health is reimagining brain healthcare. We bring together advanced diagnostics, evidence-based treatments and continuous support under one connected system—so every person can receive the right care at the right moment.

Our multidisciplinary team of psychiatrists, neurologists, neuropsychiatrists, therapists and technologists works together to deliver personalized, compassionate care for people living with brain and mental health conditions. By combining cutting-edge science with human understanding, we’re creating a new model of care that replaces fragmentation with connection and uncertainty with clarity.

Headquartered in California, Salma Health is expanding access to innovative brain health care across the U.S., beginning with clinics in San Diego and Orange County, with clinics in the Bay Area and Los Angeles opening soon.

Compensation & Benefits
  • Base: The base salary range for this role is $119,000–$185,000, depending on geographic location, experience, and qualifications. Salma Health uses a tiered compensation structure based on candidate location. Specific range details are available during the interview process.

  • Incentives: Discretionary bonus based on company and individual performance

  • Benefits: Medical, dental, vision, PTO, and additional benefits

Work Authorization

Sponsorship for employment authorization may be considered on a case-by-case basis depending on the role and candidate qualifications.

Equal Opportunity & Accessibility Statement

We are committed to providing a workplace that is inclusive, respectful, and free from discrimination. We welcome applicants of all backgrounds and make employment decisions without regard to race, color, religion, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity or expression, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, genetic information, marital status, military or veteran status, or any other characteristic protected by California or federal law.

In accordance with the California Fair Chance Act, we will consider qualified applicants with arrest and conviction records.

If you require a reasonable accommodation during the application or hiring process, please contact us directly; we’re happy to help.

HQ

Salma Health San Mateo, California, USA Office

San Mateo, California, United States, 94402
