Code Metal Logo

Code Metal

Reinforcement Learning Engineer

Reposted 13 Days Ago
In-Office or Remote
Hiring Remotely in San Francisco, CA, USA
Junior
In-Office or Remote
Hiring Remotely in San Francisco, CA, USA
Junior
The Reinforcement Learning Engineer will build training systems using PyTorch, develop QA pipelines, and drive research innovation in RLHF for AI models.
The summary above was generated by AI

At Code Metal AI, you’ll be part of a world class team with talent from MIT, OpenAI and other top companies, focused on pioneering work in large language models (LLMs) and code generation. Our projects directly involve leading chip manufacturers, applying advanced AI to solve meaningful, practical challenges with real world impact.

This role bridges two critical areas:

Production

  • Build and maintain robust distributed training systems using PyTorch (2+ years experience required).
  • Design and implement scalable data curation and quality assurance pipelines to ensure top-tier training datasets.
  • Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.

Research

  • Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning with Human Feedback (RLHF).
  • Engage with frontier research through open-source projects and potential publications, applying RLHF to Large Language Models (LLMs), ideally focusing on code generation tasks.

Requirements
    • 2+ years experience in distributed training, preferably with PyTorch.
    • Strong background in reinforcement learning, with recent RLHF experience highly preferred.
    • Proven ability to build data curation and quality assurance pipelines.
    • Experience with evaluation framework development.
    • Ideally, experience across both data pipeline and orchestration sides.
    • Eligible for TS/SCI clearance.

Nice to have:

    • Contributions to open-source AI or ML projects.
    • Published work or demonstrable research experience in related fields.
    • Hands-on experience applying RLHF to LLMs, especially for code generation.
    • Experience with large scale synthetic data generation.

Benefits
  • Health care plan with 100% premium coverage, including medical, dental, and vision.
  • 401k with 5% matching.
  • Paid Time Off (Uncapped Vacation, plus Sick & Public Holidays).
  • Flexible hybrid work arrangement.
  • Relocation assistance for qualifying employees.

Top Skills

PyTorch
Reinforcement Learning
Rlhf

Similar Jobs

3 Days Ago
In-Office or Remote
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Software
As a Research Engineer, you'll lead and optimize large-scale synthetic data generation and AI inference pipelines, while contributing to open-source projects and publishing research.
Top Skills: Ai/MlCi/CdDistributed Inference TechniquesFrameworksMlopsSynthetic Data Generation
An Hour Ago
Remote or Hybrid
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
The Staff Software Engineer in Scale will develop and maintain a multi-tenant infrastructure, architect enterprise solutions, and lead multi-team initiatives while ensuring system design meets business needs.
Top Skills: AWSAzureGCPJavaKafkaPostgresRedisSummer Boot
An Hour Ago
Remote or Hybrid
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
As a Senior Software Engineer at Airwallex, you will architect systems for a multi-tenant infrastructure, design integrations for enterprise clients, and balance technical and business requirements while enhancing product features and solutions.
Top Skills: AWSAzureGCPJavaKafkaPostgresRedisSpring Boot

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account