Lila Sciences Logo

Lila Sciences

Staff Engineer, Data Platform

Posted 15 Days Ago
In-Office
San Francisco, CA, USA
192K-272K Annually
Senior level
In-Office
San Francisco, CA, USA
192K-272K Annually
Senior level
The Staff Engineer will lead technical direction on data infrastructure, focusing on ingestion frameworks, storage architecture, and orchestration to support scientific workflows. Responsibilities include mentoring, ensuring reliability, and collaborating with cross-functional teams to establish coding standards and data integrity.
The summary above was generated by AI

Your Impact at LILA

Lila Sciences is building the software platform that makes automated scientific discovery possible. At the heart of that platform is data: raw outputs from laboratory instruments, experimental model results, curated public datasets, and the scientific literature that contextualizes all of it. The data platform team is responsible for the infrastructure that moves, stores, transforms, and surfaces this data across the organization.

We are looking for a Staff Engineer to set the technical direction for our core data infrastructure: ingestion frameworks, storage architecture, orchestration patterns, and the interfaces that let scientists and ML researchers work with data reliably at scale. You will work closely with software engineers, machine learning researchers, and lab scientists to understand requirements and translate them into durable platform capabilities.

This is a role for engineers who care deeply about how data systems are designed. You will establish the architectural patterns and engineering standards the broader team builds on, mentor engineers across the data platform group, and make technical decisions that compound over time.

What You'll Be Building

  • Data Platform Architecture: Design and evolve the core data infrastructure that ingests, stores, and serves data across scientific and ML workflows. Make principled build-vs-buy decisions and establish architectural patterns adopted by the broader engineering organization.
  • Ingestion and Integration: Build reliable pipelines that bring in data from diverse sources: laboratory instruments, public scientific datasets, and external research literature. Own the interfaces between upstream producers and downstream consumers.
  • Orchestration and Reliability: Operate and extend workflow orchestration systems that run complex, multi-step scientific pipelines. Ensure observability, fault tolerance, and reproducibility across the data stack.
  • Data Modeling and Schema Strategy: Define and maintain data models, schema evolution practices, and data contracts that ensure consistency, discoverability, and long-term durability of scientific and platform data assets.
  • Cross-Functional Technical Leadership: Partner with ML researchers, lab scientists, and product engineers to translate scientific and research requirements into platform capabilities. Drive alignment on data standards and integration patterns across teams.
  • Engineering Standards and Mentorship: Establish coding, review, and design standards for the data platform team. Mentor engineers, lead design reviews, and raise the technical bar across the group.

What You’ll Need to Succeed

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field, and 8+ years as a software or data engineer with a focus on building and operating data infrastructure.
  • Designed and shipped data platform components from the ground up, including ingestion frameworks, storage abstractions, and orchestration systems. Fluent in Python and SQL and writes production-quality code.
  • Production experience with relational and NoSQL databases, schema design, query optimization, and operational concerns at scale. Comfortable working across structured, semi-structured, and unstructured data.
  • Proven track record of working cross-functionally with scientists, ML researchers, and engineers. Able to translate domain requirements into platform decisions and explain technical trade-offs to diverse audiences.
  • Experience with cloud infrastructure and containerized deployment (AWS, Kubernetes).
  • Hands-on experience with modern table formats and open lakehouse patterns (Iceberg, Delta Lake, Hudi).

Bonus Points For

  • Experience with workflow orchestration systems (Flyte, Airflow, Dagster, or similar).
  • Experience building data infrastructure that serves agentic and LLM-driven workflows, including vector databases, RAG infrastructure, and retrieval-optimized data access patterns.
  • Background in scientific computing, life sciences, or research software.
  • Proficiency with AI-assisted development tools (Cursor, Claude Code, or similar) and ability to incorporate them effectively into day-to-day engineering work.

Compensation

We offer competitive base compensation with bonus potential and generous early-stage equity. Your final offer will reflect your background, expertise, and expected impact.

U.S. Benefits. Full-time U.S. employees receive a comprehensive benefits program including medical, dental, and vision coverage; employer-paid life and disability insurance; flexible time off with generous company wide holidays; paid parental leave; an educational assistance program; commuter benefits, including bike share memberships for office based employees; and a company subsidized lunch program.

International Benefits. Full-time employees outside the U.S. receive a comprehensive benefits program tailored to their region. USD salary ranges apply only to U.S.-based positions; international salaries are set to local market.

Expected Base Salary Range
$192,000$272,000 USD

About LILA

Lila Sciences is building Scientific Superintelligence™ to solve humankind's greatest challenges. We believe science is the most inspiring frontier for AI. Rather than hard-coding expert knowledge into tools, LILA builds systems that can learn for themselves.

LILA combines advanced AI models with proprietary AI Science Factory™ instruments into an operating system for science that executes the entire scientific method autonomously, accelerating discovery at unprecedented speed, scale, and impact across medicine, materials, and energy. Learn more at www.lila.ai.

Guided by our core values of truth, trust, curiosity, grit, and velocity, we move with startup speed while tackling problems of historic importance. If this sounds like an environment you'd love to work in, even if you don't meet every qualification listed above, we encourage you to apply.

We’re All In

Lila Sciences is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.

Information you provide during your application process will be handled in accordance with our Candidate Privacy Policy.

A Note to Agencies

Lila Sciences does not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to Lila Sciences or its employees is strictly prohibited unless contacted directly by Lila Science’s internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Lila Sciences, and Lila Sciences will not owe any referral or other fees with respect thereto.

Similar Jobs

12 Days Ago
In-Office or Remote
United States
Expert/Leader
Expert/Leader
Digital Media • Social Media • Software • Sports
Design and deliver scalable, reliable data services and microservices for real-time and batch processing. Build REST APIs, model data across SQL/NoSQL stores, drive data quality, own deployment/monitoring/incident response, set technical standards, and mentor engineers.
Top Skills: Ai-Powered ToolsAWSCdkCi/CdDynamoDBDynamodb StreamsGCPKinesisMicroservicesNode.jsNoSQLObservabilityRest ApisServerlessSQLSstStream ProcessingTerraformTypescript
10 Days Ago
In-Office or Remote
San Francisco, CA, USA
195K-258K Annually
Senior level
195K-258K Annually
Senior level
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Design, build, and operate the core data warehouse, ingestion, orchestration, and cataloging platform. Develop batch and streaming pipelines, ensure data quality, governance, observability, and provide ML data platform capabilities. Lead architecture, improve platform reliability and performance, and collaborate with product, engineering, data science, security, and compliance teams.
Top Skills: Apache FlinkBigtableCassandraGoogle Cloud Dataflow
26 Days Ago
In-Office
121K-164K Annually
Expert/Leader
121K-164K Annually
Expert/Leader
Software
Lead architecture and implementation of scalable, secure, serverless and event-driven data pipelines and services in a Node.js/TypeScript ecosystem; mentor engineers, improve reliability, observability, CI/CD, and collaborate on scientific data solutions.
Top Skills: AWSAws CodepipelineEvent-Driven ArchitectureGithub ActionsKubernetesNode.jsPostgresReactRestful ApisServerlessTypescript

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account