Develop and train scalable interpretability assistants to predict and detect subtle model behaviors. Create diverse evaluations, design novel architectures and objectives, and scale training/inference pipelines to support up to 1T‑scale models. Collaborate closely with a research team and contribute to high-impact evaluations used for industry standards and regulation.
Salary range: $250,000 - $500,000/year + benefits
Description: Transluce is a non-profit research lab building tools for scalable, end-to-end oversight of AI systems. We build world-class, AI-backed analysis tools and use these to set industry standards for evaluation. Our tools are integrated with core agent benchmarks like SWE-bench, while our evaluations are directly underpinning regulation, including our role as EU AI Office’s main evaluation developer for harmful manipulation risks.
About the role: We are looking for strong scientists and engineers to help advance our vision of scalable end-to-end oversight assistants, building on our recent advances such as predictive concept decoders and user model extractors. As part of our highly collaborative team, you will learn and grow quickly, creating technology at the frontier of AI research and with high direct impact.
Core responsibility: Help us develop and train scalable interpretability assistants that can predict and detect unexpected and subtle behaviors from models’ activations. This includes:
- Creating diverse evaluations that range in difficulty. This involves finding naturally occurring interesting and undesirable behaviors exhibited by open-source models.
- Developing novel architectures and objectives for training interpretability assistants.
- Scaling up the training and inference pipelines to support up to 1T-scale models.
Qualities of a strong candidate:
- Experience with fine-tuning language models, designing new architectures, and creating evaluations.
- Reliable results: good experimental design, epistemic self-awareness and transparency
- Generativeness: coming up with original, productive ideas for unblocking progress
- Curiosity: a desire to understand ML systems and how they work
- Strong programming ability, including navigating trade-offs between prototyping speed and maintainability
- Strong communication skills, low ego, openness to giving and receiving feedback
We are located in San Francisco and enthusiastic to work together in-person. We are open to sponsoring international visas.
Transluce San Francisco, California, USA Office
1301 Sansome St, San Francisco, California, United States, 94111 1122
Similar Jobs
Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Generate new enterprise software revenue in the West by prospecting, developing pipeline, executing territory and account plans, leveraging partners and marketing, managing POCs, closing deals, and ensuring post-sale customer success through collaboration with Professional Services.
Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Own new business growth for a regional territory by building strategic account and territory plans, prospecting into enterprise IT accounts, driving complex sales cycles from discovery through close, managing proofs-of-concept, partnering with internal teams, and consistently exceeding bookings targets.
Consumer Web • eCommerce • Machine Learning • Software • Sports • Analytics
Lead and scale People Operations for a global, multi-brand workforce of 3,000+ employees. Build HR systems, policies, compliance, and vendor ecosystems; drive automation and data-driven HR processes; partner on headcount planning, M&A integrations, global mobility, and lifecycle programs (onboarding, offboarding, talent acquisition ops). Manage a People Ops team and cross-functional projects to ensure consistent, compliant employee experiences across brands and geographies.
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine


.png)