Dandelion Health Inc Logo

Dandelion Health Inc

Software Engineer

Posted Yesterday
Remote
Hiring Remotely in USA
135K-150K Annually
Mid level
Remote
Hiring Remotely in USA
135K-150K Annually
Mid level
Build and optimize high-throughput de-identification pipelines for large clinical datasets, execute QA, run and tune pipelines in health system cloud environments, reduce errors and costs, and collaborate with privacy and clinical informatics stakeholders.
The summary above was generated by AI

Our Team

Dandelion Health was founded in 2020 by experts in health tech, hospital systems, academia, and clinical AI. We are building the world’s largest AI training and clinical development platform. Today, we pride ourselves on our ability to make data access as easy as possible for AI developers, pharma, and medical devices, while raising the bar for patient safety and data quality. Tomorrow, we will be the place where any healthcare organization can go to build a responsible clinical AI product. Our culture is all about learning from data and improving, so we can help our clients improve health through AI. Meet the rest of our team here.

Our Data

We partner with health systems to safely and ethically make their de-identified patient data available to AI developers. Currently, the data is acquired from Sharp HealthCare, Sanford Health, and Texas Health Resources – with two additional U.S. health systems joining soon.

We have clinical data dating back to July 1, 2016. This data represents over 10 million patients and includes but is not limited to:

  • Structured data (e.g., 100% of the EMR, including some claims)

  • Unstructured text (e.g., clinical notes, radiology reports)

  • Images (e.g., DICOM, pathology)

  • Video

  • Waveforms

  • Continuous streaming monitoring data

Your Role

Dandelion is constantly expanding the breadth, depth, and completeness of health system datasets while improving the speed and quality of our de-identification pipeline. As an engineer working on our de-identification pipelines, you will:

  • Design and implement software systems that perform these de-identification rules at high scale and throughput (we de-identify billions of rows of data and millions of images each month) while constraining costs.

  • Generate and execute quality assurance plans to validate our de-identification processes.

  • Run de-identification pipelines in health system cloud environments, and optimize these pipelines to minimize error rates, improve processing efficiency, and reduce manual effort and cost.

  • Partner with our Director of Privacy and Clinical Informaticists to define de-identification rules.

Required technical skills

  • 3+ years of development experience in Python or an equivalent language in a professional setting, across the full software development lifecycle (design, implementation, testing, deployment, maintenance);

  • Familiarity with one or more command languages (e.g. Bash) and SQL.

Required Non-technical skills

  • Demonstrated ability to design and improve workflows, including associated operating procedures, cost management, and quality assurance;

  • Strong analytical decision-making and organizational skills;

  • Perseverance and practical problem solving;

  • Humility and strong team collaboration;

  • Enthusiasm about protecting patients’ personal data.

We are an AWS and Python shop, and our datasets are stored in AWS Redshift, Snowflake, or Parquet files which are processed in Pandas DataFrames.

Preferred skills

  • Proficiency with data structures such as Pandas DataFrames;

  • Previous software deployment in a cloud computing environment (e.g., AWS, Azure);

  • Familiarity with virtualization and containerization (e.g., Docker, VMware);

  • Prior experience working with healthcare data;

  • Experience interacting with non-technical stakeholders to deploy software solutions.

Team Benefits

  • Remote work and flexible hours. Availability needed for meetings, which we try to keep to a healthy minimum

  • Complete wellness benefits including healthcare, dental, vision, PTO, sick days and more. Ask for details

  • Professional development days to build your skills

  • Collegial work environment

  • Academic bent towards inquiry and problem solving but start-up speed and flexibility

  • Great balance of focus time to work on projects but easy to access team members to discuss issues and work collaboratively

  • Dandelion is a mission-driven company that is focused on improving patient care

Similar Jobs

2 Days Ago
Easy Apply
Remote
United States
Easy Apply
142K-210K Annually
Junior
142K-210K Annually
Junior
Big Data • Fintech • Mobile • Payments • Financial Services
Build and operate ML training and serving infrastructure by designing, developing, and launching backend systems. Collaborate across teams to decompose projects, support operations and on-call, create monitoring and metrics, perform code reviews, and contribute to developer velocity and platform reliability.
Top Skills: AWSKotlinKubernetesMySQLPython
2 Days Ago
Remote or Hybrid
Santa Clara, CA, USA
201K-352K Annually
Senior level
201K-352K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Design, build, and operate production-grade agentic AI systems: multi-agent orchestration, enterprise-grounded reasoning using CMDB/Knowledge Graph, retrieval/RAG pipelines, model integration with frontier SDKs, and trust/safety/governance. Lead architecture, code quality, and mentor engineers for scalable, safe autonomous agents.
Top Skills: AnthropicC++CmdbGoGoogle (Ai Sdks)Hybrid SearchInference OptimizationJavaKnowledge GraphLlm Fine-TuningMlopsModel ObservabilityOpenaiPrompt EngineeringPythonRagRe-RankingRetrieval Evaluation MetricsVector StoresWorkflow Data Fabric
3 Days Ago
Remote or Hybrid
US
126K-149K Annually
Junior
126K-149K Annually
Junior
Big Data • Fintech • Information Technology • Insurance • Software
Build and maintain React-based micro front-ends and contribute to the frontend platform. Collaborate with designers, backend engineers, and product to implement accessible, responsive interfaces, optimize GraphQL data fetching, write unit/integration tests, participate in code reviews, and manage CI/CD pipelines using NX and CircleCI.
Top Skills: Ai Development ToolsCircleCIGitGitGitlabGraphQLJestMicro FrontendsNxReactReact Testing LibraryVitest

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account