HUD (YC W25) does RL and evaluations for frontier AI agents. Our HUD agentic RL platform is used by frontier labs, Fortune 500 companies, and startups. We have grown revenue and raised funding from YC, A16Z, and other leading VCs to scale fast.
About the roleIf you would like to be at the frontier of AI agent training, apply here.
You will work with leading foundational labs to develop production agent training infrastructure. You will get a lot tokens to do this.
ResponsibilitiesDesign and build core platform systems. Post training workflows, dataset pipelines, run orchestration, and execution infrastructure.
Own the Python SDK and the overall developer experience. Developer best practices like slean APIs, sensible defaults, clear errors, and strong documentation.
Build evaluation pipelines that connect naturally to training loops; measure, create data, train, and re evaluate.
Work with Docker, Linux, and cloud infrastructure to ensure reliable and reproducible environments across local development, CI, and production.
Talk directly with customers, understand their workflows, and turn messy real world feedback into product improvements.
Strong experience owning products end-to-end. Comfortable working across the stack, including APIs, data systems, and frontend work when needed.
Real understanding of Docker and Linux environments.
Ability to design APIs and interfaces that age well. You care about ergonomics, correctness, and developer experience.
Comfort working with AI coding tools and agentic workflows, and tradeoffs between speed and technical
Experience with reinforcement learning, post training, or model training workflows.
Experience building or using LLM or agent evaluation frameworks such as Inspect, EleutherAI tooling, or custom harnesses.
Experience designing SDKs, CLIs, or developer platforms.
Kubernetes experience, including deployment, scaling, or job orchestration.
Startup experience at early stage companies with the ability to work independently.
We prioritize technical ability and learning speed over years of experience. If you have built impressive things, open source contributions, side projects, research code, or production systems, we want to see them.
Team and Company DetailsTeam Size: Approximately 15 people currently, mostly full time in person with some remote.
Our team: Includes four international Olympiad medallists across IOI, ILO, and IPhO, serial AI startup founders, and researchers with publications at ICLR, NeurIPS, and similar venues.
Company stage: We have raised tens of millions in venture funding and have strong revenue growth. We are scaling quickly and profitably to meet demand.
Employment: Full time.
Location: On site only for now. You can join the team in the San Francisco Bay Area or Singapore offices.
Visa Sponsorship: We support relocation and visas for strong candidates to the United States or Singapore.
Timeline: Applications are rolling. The process includes two technical interviews and a one week work trial.
You will have unlimited* access to API credits for providers such as OpenAI, Anthropic, Gemini, Cursor, and others. *No one on our token usage leaderboard has ever hit the limit, so we do not know what the limit actually is.
Due to high volume we may not actively respond to every application, but feel free to contact us at [email protected] if we missed your application.
Top Skills
hud San Francisco, California, USA Office
San Francisco, CA, United States, 94109
Similar Jobs
What you need to know about the San Francisco Tech Scene
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

