Lead and execute research to improve AI safety and reliability, design experiments, train models, and collaborate on publications.
The Center for AI Safety (CAIS) is a leading research and advocacy organization focused on mitigating societal-scale risks from AI. Together with our sister organization, the Center for AI Safety Action Fund, we address AI’s toughest challenges through technical research, field-building initiatives, and policy engagement.
As a Research Scientist here, you will lead and execute high-impact research that advances the safety and reliability of frontier AI systems. You'll design and run experiments on large language models, build the tooling needed to train and evaluate models at scale, and turn results into publishable research. You'll collaborate closely with CAIS researchers and external academic and commercial partners, using our compute cluster to run large-scale training and evaluation. The work spans areas like AI honesty, robustness, transparency, and trojan/backdoor behaviors, aimed at reducing real-world risks from advanced AI systems.
Key Responsibilities Include:
- Help set and lead the research agenda.
- Own end-to-end research experiments.
- Train and fine-tune large transformer models across domains.
- Build and maintain datasets and benchmarks.
- Run distributed training and evaluation at scale.
- Write and ship research, collaborate with co-authors, and support paper submissions to top conferences.
- Collaborate with researchers and external partners, contribute to the shared research direction, and respond quickly during research cycles.
- Mentor and guide others on the team.
You might be a good fit if you:
- Hold a Ph.D. in computer science, machine learning, or a related field, with 5+ years of related research experience.
- Are familiar with relevant frameworks and libraries (e.g., PyTorch and Hugging Face).
- Have experience launching and training distributed ML jobs.
- Communicate clearly and promptly with teammates.
- Have co-authored an NLP or RL paper at a top conference.
Know someone who could be a great fit for this role? Submit their details through our Referral Form. If we end up hiring your referral, you’ll receive a $1,500 bonus once they’ve been with CAIS for 90 days.
The Center for AI Safety is an Equal Opportunity Employer. We consider all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, ancestry, age, disability, medical condition, marital status, military or veteran status, or any other protected status in accordance with applicable federal, state, and local laws. In alignment with the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.
If you require a reasonable accommodation during the application or interview process, please contact [email protected].
We value diversity and encourage individuals from all backgrounds to apply.