Easy Apply
Easy Apply
Design realistic biological benchmarks and curate high-quality datasets to evaluate and train AI systems; analyze model failures, iterate on data and evaluation criteria, collaborate with ML researchers, and manage workflows and documentation across domain experts.
About
Edison Scientific focuses on building and commercializing AI agents for science, and shares FutureHouse’s mission to build an AI Scientist- scaling autonomous research, productizing it, and applying it to critical challenges such as drug development.
RoleWe are seeking an ambitious, scientifically grounded person to join our team focused on developing rigorous benchmarks and training datasets that advance AI capabilities in biology. This role sits at the intersection of biology, data curation, and machine learning, and is ideal for someone with deep scientific training who is excited to shape how frontier AI systems learn to do science.
Responsibilities- Design benchmarks that capture the complexity of real biological research, drawing on your domain expertise to identify what makes scientific reasoning hard. This will include open-ended scientific benchmarks and building on prior work like LAB-Bench and BixBench.
- Curate and vet biological datasets to ensure scientific rigor.
- Analyze model outputs, identify failure modes, and contribute to iterative improvements in both datasets and evaluation criteria.
- Collaborate with AI/ML researchers to translate scientific intuition into training signal, helping AI systems learn not just facts but how scientists think.
- Coordinate operations and manage workflows, including working with domain experts, tracking task progress, and maintaining documentation.
- Have graduate-level training in biology, biochemistry, computational biology, or a related field, with hands-on research experience.
- Have working knowledge of machine learning concepts, particularly deep learning and large language models.
- Are comfortable with Python and can build workflows for data processing, analysis, and experimentation.
- Possess strong scientific taste and can identify what distinguishes expert-level reasoning from surface-level pattern matching.
- Are detail-oriented and willing to take on high-value but occasionally tedious work.
- Are energized by ambiguous, open-ended problems that require creativity, collaboration, and first-principles thinking to solve.
- Are organized and communicative, able to manage multiple workstreams and coordinate across teams.
- Prior experience creating evaluation datasets, annotation guidelines, or working on human-in-the-loop data pipelines.
- Experience with bioinformatics pipelines, biological databases, or sequence analysis tools.
- Hands-on experience fine-tuning or evaluating large language models, or familiarity with RLHF and preference-based training.
- Publications or research experience in areas relevant to AI for science.
- Collaboration is at the heart of discovery. We work on-site to stay close to the science, move faster as a team, and share the kind of energy that only happens when smart, curious people build together- in a space that we love to be in!
- Location: San Francisco (Dogpatch)
- At Edison Scientific, we know that titles can cover a range of experience levels. Actual base pay will depend on factors such as skills, experience, and scope of responsibility. Compensation ranges may evolve as we continue to grow. In addition to base pay, team members may be eligible for equity, benefits, and other perks.
- Compensation: $160,000+ $300,000 (pending experience) plus equity
Top Skills
Python,Deep Learning,Large Language Models,Rlhf,Bioinformatics Pipelines,Biological Databases,Sequence Analysis Tools
Edison Scientific San Francisco, California, USA Office
San Francisco, California, United States, 94107
Similar Jobs
Cloud • Information Technology • Security • Software • Cybersecurity
Lead global SEO and AEO strategy, manage a team of specialists, improve site performance, and enhance user experience on Zscaler's digital platforms.
Top Skills:
AeoGoogle AnalyticsSeoSeo ToolsUx DesignWeb Performance Software
Cloud • Information Technology • Security • Software • Cybersecurity
Lead modularization of the Zscaler Client Connector into reusable libraries, write maintainable unit-testable C/C++ code, improve scalability and memory safety, implement networking and VPN features, and collaborate across platforms in a hybrid San Jose team.
Top Skills:
C,C++,Windows,Vpn,Networking Protocols
Cloud • Information Technology • Security • Software • Cybersecurity
Develop instructor-led and interactive eLearning, labs, assessments, and PPT decks by translating SME workflows into task-based training. Build and validate hands-on product labs and create engaging, modular course content mapped to learning objectives.
Top Skills:
Articulate,Adobe Storyline,Adobe Rise,Adobe Captivate,Powerpoint,Zscaler,Ai Tools
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

