Astera Logo

Astera

Scientist - Computational Biophysics

Posted 3 Days Ago
Be an Early Applicant
In-Office
Emeryville, CA, USA
100K-180K Annually
Entry level
In-Office
Emeryville, CA, USA
100K-180K Annually
Entry level
The Scientist will develop protein representations and bioinformatic pipelines, integrating ML models and collaborating with experimental partners to enhance biological insights.
The summary above was generated by AI
About Astera:

Astera is a private foundation on a mission to steer science and technology toward an abundant future. We believe the coming years will bring an era of unprecedented scientific and technological advancement as exponential progress in AI converges with central advances in other fields to dramatically accelerate innovation. This inflection point provides an unparalleled opportunity to fundamentally rethink the institutions, systems, and tools that drive scientific progress.

Unlike traditional non-profit research organizations, projects supported by Astera operate like high-velocity startups, allowing us to focus on ambitious goals, match structure to problem, and attract strong technical talent and leadership. You can read more about our mission, vision, and programming here.

Position Summary:

We are seeking a scientist to join the diffUSE Project, which focuses on developing next-generation protein representations that bridge dynamic structural biology to downstream functional applications. The diffUSE Project is an ambitious initiative designed to advance our understanding of protein dynamics by building the experimental methods, computational models, and global infrastructure needed to capture molecular motion at scale. Our goal is to establish dynamic structural biology as a foundational pillar of modern science, as transformative and indispensable as the Protein Data Bank has been for static structures.

In this role, you will build ensemble-aware protein representations to enable the integration with downstream inputs such as protein language model (PLM), LLMs, or other functional predictions. You will design and maintain large-scale bioinformatic pipelines, manage complex datasets, develop metrics for dynamics and/or fine-tune or architect ML models to capture sequence-structure-function relationships. A key part of the role involves synthesizing diverse data sources to improve biological relevance. You will work closely with experimental collaborators to ground computational insights in real biological systems.

Key Responsibilities:
  • Build ensemble-aware protein representations that integrate PLM and LLM embeddings with experimentally derived structural heterogeneity for functional prediction

  • Design, develop, and maintain large-scale bioinformatic pipelines capable of processing and managing complex, high-dimensional datasets

  • Fine-tune or architect ML models to capture sequence-structure-function relationships, with a focus on dynamic and conformational features

  • Synthesize diverse data sources spanning evolutionary history, binding affinity, allostery, and functional annotations to improve model performance and biological relevance

  • Collaborate closely with experimental partners to ground computational representations in real biological measurements and ensure models are continuously refined against experimental ground truth

  • Contribute to the broader diffUSE infrastructure, helping establish community-wide standards and tools for dynamic structural biology

Required Skills and Qualifications:
  • PhD in bioinformatics, computational biology, machine learning, or a related field.

  • Strong understanding of protein structure and function.

  • Demonstrated experience building large bioinformatic pipelines and managing high-dimensional datasets.

  • Proficiency in fine-tuning or modifying ML models (e.g., transformer-based architectures).

  • Familiarity with protein language models (ESM, AlphaFold, etc.) is a plus.

  • Collaborative, team-oriented mindset with the ability to drive research questions from conception to execution.

Compensation:

The posted salary range is based on location in the Bay Area. The successful candidate will receive a competitive compensation package, commensurate with their experience and location.

Top Skills

Alphafold
Bioinformatics
Esm
Machine Learning
Protein Language Models

Similar Jobs

19 Days Ago
In-Office or Remote
6 Locations
235K-270K Annually
Senior level
235K-270K Annually
Senior level
Artificial Intelligence • Biotech
Lead development of scalable molecular dynamics pipelines, integrating physics-based models with machine learning frameworks to enhance molecular engineering and experimentations.
Top Skills: JaxPythonPyTorch
24 Days Ago
In-Office
South San Francisco, CA, USA
121K-224K Annually
Expert/Leader
121K-224K Annually
Expert/Leader
Healthtech • Biotech
The Scientist will develop and employ in silico methods to support biologics development, focusing on molecular dynamics simulations and computational methods for drug development.
Top Skills: AmberBashCGromacsMoeNamdPythonRosettaSchrodinger
9 Minutes Ago
Easy Apply
In-Office
Easy Apply
220K-250K Annually
Senior level
220K-250K Annually
Senior level
Artificial Intelligence • Computer Vision • Machine Learning • Payments • Real Estate • PropTech
The role involves technical direction and leadership in building customer-centric applications, managing cross-team initiatives, and fostering a culture of improvement within engineering teams.
Top Skills: AWSCopilotDatadogGitGitJavaMySQLPostgresReactScalaSnowflakeTypescript

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account