Axiom Bio

Data Engineer

Posted 6 Days Ago
In-Office
San Francisco, CA
Mid level
As a Data Engineer, you will build and maintain data systems for drug safety AI tools, ensuring reliable data processing and storage for research teams.
The summary above was generated by AI

Charter:

Be a founding member of a team building the first accurate AI systems for drug toxicity prediction to replace lab and animal experiments

About Axiom and the role:

We’re building AI systems for drug safety and toxicity assessment. Drug toxicity causes about half of drug program failures, and by tackling it we can help drug discovery teams across the industry bring new medicines to patients far faster. We’re looking for a data engineer who’s excited to own the pipelines, systems, and tooling that turn raw chemical, biological, and clinical data into ML-ready training data and customer-ready insights. You’ll work closely with our ML, lab, and product teams to build LLM-driven literature research and data platforms, scale inference of image and graph neural networks, automate ETL from diverse sources, and ensure the integrity of datasets that drive critical decisions internally and externally. This role is ideal for someone who wants to build clean, reliable systems that directly impact the success of hundreds of drug programs.

What we are looking for:

We want to hire people who inspire us and level up the entire team. They should be high energy, high agency, and have great taste for what matters. They should run a relentless “observe, orient, decide, act” loop, constantly identifying what needs to happen and getting it done. They need to be technically excellent, obsessive masters of their craft, with a deep curiosity that keeps them at the frontier of tech and helps them interface between AI, engineering, product, biology, chemistry, and business. They could work in big tech, but it won’t satisfy them. They want to go on an adventure that will be brutally challenging, and to share in the rewards and satisfaction at its end.

What you will be doing:

  • Build and maintain the core data systems for Axiom’s research platform, including ingestion, processing, storage, and serving

  • Work with scientists to understand their data needs and create simple APIs for accessing chemical and biological datasets

  • Architect LLM systems to curate, clean, and analyze human clinical trial data, and build evaluations and observability for these systems

  • Develop distributed systems to run large-scale LLM jobs that clean and curate biological and clinical data

  • Set up quality checks, testing tools, and monitoring systems to ensure data and model outputs stay accurate and reliable

Expertise that gets us interested:

  • Leading large-scale data platform buildouts serving multiple internal teams or external users

  • Designing and maintaining high-throughput data systems capable of processing petabytes of data

  • Building AI- or LLM-powered data systems, particularly for research workflows and retrieval use cases

  • Gathering technical requirements from end users and translating them into effective data infrastructure

  • Taking ownership of data systems at an early-stage startup and significantly boosting team productivity

Key criteria:

  • Strong proficiency in Python and core data libraries such as Pandas, NumPy, and the broader Python data ecosystem

  • Hands-on experience building distributed systems from scratch using tools like Kubernetes, Slurm, Modal, Anyscale, Ray, Daft, Dask, or Spark

  • Passion for large-scale data processing and building systems for high-performance computation

  • Enjoys collaborating with researchers tackling complex scientific and technical problems

  • Comfortable working in fast-changing environments with evolving research needs

  • Solid DevOps background, including experience with CI/CD systems, cloud platforms (AWS, GCP, Azure), Terraform, and compute provisioning

  • Deep, obsessive curiosity about both the science and the business driving the work

Top Skills

Anyscale
AWS
Azure
Daft
Dask
GCP
Kubernetes
Modal
NumPy
Pandas
Python
Ray
Slurm
Spark
Terraform
HQ

Axiom Bio San Francisco, California, USA Office

San Francisco, CA, United States, 94107

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms, and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food, and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (PitchBook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
