Komodo Health

Senior Data Engineer

Posted 5 Days Ago
Remote
Hiring Remotely in United States
207K-238K Annually
Senior level
Design, build, operate, and optimize large-scale data pipelines and foundational data products for Komodo's Healthcare Map. Improve pipeline reliability, observability, and performance; transform claims and EHR datasets; implement data quality, lineage, and monitoring; debug production workflows; collaborate with product and engineering teams; and enable downstream analytics and AI/ML use cases.
The summary above was generated by AI

We Breathe Life Into Data

At Komodo Health, our mission is to reduce the global burden of disease. And we believe that smarter use of data is essential to this mission. That’s why we built the Healthcare Map — the industry’s largest, most complete, precise view of the U.S. healthcare system — by combining de-identified, real-world patient data with innovative algorithms and decades of clinical experience. The Healthcare Map serves as our foundation for a powerful suite of software applications, helping us answer healthcare’s most complex questions for our partners. Across the healthcare ecosystem, we’re helping our clients unlock critical insights to track detailed patient behaviors and treatment patterns, identify gaps in care, address unmet patient needs, and reduce the global burden of disease. 

As we pursue these goals, it remains essential to us that we stay grounded in our values: be awesome, seek growth, deliver “wow,” and enjoy the ride. At Komodo, you will be joining a team of ambitious, supportive Dragons with diverse backgrounds but a shared passion to deliver on our mission to reduce the burden of disease — and enjoy the journey along the way.

The Opportunity at Komodo Health

Join Komodo Health's Data Foundations team and play a critical role in shaping the core data products that fuel our Healthcare Map. This role is essential for transforming massive, complex healthcare datasets into performant, trustworthy, and usable data assets that directly power both customer-facing applications and internal product innovation. By building and scaling our foundational data systems, you will directly enable the transparency and efficiency required to drive better health outcomes across the industry.

Mission

The Senior Data Engineer will design, build, operate, and improve large-scale data pipelines and foundational data products that power Komodo’s Healthcare Map, analytics products, and downstream AI/ML-enabled use cases. This is a hands-on engineering role focused on processing complex healthcare data at scale, improving reliability and performance, and contributing to the technical direction of core data systems.

Looking back on your first 12 months at Komodo Health, you will have…

  • Architectural Advancement: Delivered high-impact technical initiatives that improve pipeline performance, scalability, and system efficiency.
  • Platform Hardening: Improved the reliability, observability, and cost-efficiency of core Data Foundations systems.
  • Healthcare Data Innovation: Developed deep domain expertise and contributed novel approaches to challenges such as patient journey mapping and identity resolution.
  • Cross-Functional Delivery: Partnered with Data Product and Engineering teams to ship scalable, production-grade data solutions.
  • Partner on Architecture: Raised the bar across the team through mentorship, design reviews, and engineering best practices.

Key Responsibilities

  • Build, operate, and optimize large-scale production data pipelines using Python, SQL, Airflow, cloud infrastructure, and distributed processing frameworks.
  • Transform massive healthcare claims, EHR, and reference datasets into trusted, performant Healthcare Map data products and serving-ready data assets.
  • Strengthen pipeline reliability through data quality checks, validation, lineage, observability, monitoring, and alerting.
  • Debug complex data, system, and performance issues across computationally intensive workflows.
  • Partner with Data Product Quality, Product, Platform, and Engineering teams to translate healthcare data needs into scalable technical solutions.
  • Contribute to system design, architecture, code quality, testing, documentation, CI/CD, and rotational production support.
  • Enable downstream analytics, product, and AI/ML use cases through high-quality, well-modeled, reliable data.

What you bring to Komodo Health:

  • Healthcare data experience across claims, clinical, RWE, provider, patient, or life sciences datasets, including coding systems such as ICD-10, CPT, NDC, or NPI.
  • Strong hands-on experience building, operating, and debugging production-grade data pipelines at scale.
  • Advanced Python and SQL skills, with experience in Airflow or similar workflow orchestration tools.
  • Experience with Spark or comparable distributed data processing frameworks.
  • Proven experience designing and operating data solutions in AWS.
  • Strong instincts for data quality, reliability, root-cause analysis, and production troubleshooting.
  • Ability to communicate technical trade-offs clearly and collaborate with engineering, product, and data partners.
  • Comfort using AI-assisted engineering tools for productivity, debugging, documentation, and technical exploration.

AI-Augmented Engineering Expectations:

  • You will be expected to leverage AI-augmented engineering tools, such as ChatGPT, Gemini, or Claude, to improve productivity and technical decision-making. This may include using AI to generate and refine code, accelerate documentation, automate test case creation, debug complex issues, explore unfamiliar technical concepts, and assess architectural trade-offs and risks.

Additional skills and experience we’d prioritize (nice to have)…

  • Experience delivering external-facing data products to customers through APIs, serving layers, or production access patterns.
  • Ability to optimize high-scale data architectures for performance, cost, versioning, and large-volume productization.
  • Experience applying AI or agentic workflows to engineering, data quality, delivery, or operations.
  • Success in high-growth or ambiguous environments that require balancing architecture, speed, and quality.

Open to US remote or SF/NYC hybrid

The pay range for each job posting reflects a minimum and maximum range of annual base pay that we reasonably expect to pay for this position within the US. We carefully consider multiple business-related factors when determining compensation, including job-related skills, work experience, geographic work location, relevant training and certifications, business needs and market demands.


The starting annual base pay for this role is listed below. This position may be eligible for performance-based bonuses as determined in the Company’s sole discretion and in accordance with a written agreement or plan. This role may also be eligible for equity awards. In addition, this role is eligible for benefits including, but not limited to, comprehensive health, dental, and vision insurance; flexible time off and holidays; 401(k) with company match; disability insurance and life insurance; and leaves of absence in accordance with applicable state and local laws and regulations and company policy. 

San Francisco Bay Area and New York City:
$207,000-$238,000 USD
All Other US Locations:
$180,000-$207,000 USD

Komodo's AI Standard

At Komodo, we're not just witnessing the AI revolution – we're leading it. This is a pivotal moment in time, where being first to market with AI transforms industries and sets the bar. We've already established industry leadership in leveraging AI to revolutionize healthcare, and we expect every team member to contribute. AI here isn't optional; it's foundational. We expect you to integrate AI into your daily work – from summarizing documents to automating workflows and uncovering insights. This isn't just about efficiency; it's about making every moment more meaningful, building on trust in AI, and driving our collective success.

Join us in shaping the future of healthcare intelligence.

Where You’ll Work

Komodo Health has a hybrid work model with hubs in San Francisco, New York City, and Chicago. Roles vary — some can be performed from anywhere in the country, others are scoped to a specific region, and some are based near one of our hubs. For hub-based Dragons, we're building intentional in-office rhythms alongside the flexibility that's core to how we work. Whatever your setup, expectations will always be clear before you join.

Equal Opportunity Statement

Komodo Health provides equal employment opportunities to all applicants and employees. We prohibit discrimination and harassment of any type with regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. 

By submitting your application, you acknowledge that you have read and understand Komodo Health’s Privacy Notice for Employees and Contractors.

This notice explains how we collect, use, and retain applicant data.

HQ

Komodo Health San Francisco, California, USA Office

680 Folsom St, San Francisco, CA, United States, 94107

Komodo Health Los Altos, California, USA Office

5050 El Camino Real, Suite #100, Los Altos, CA, United States, 94022
