Mechanize Inc. Logo

Mechanize Inc.

Software Engineer

Reposted 11 Hours Ago
In-Office
San Francisco, CA
Junior
In-Office
San Francisco, CA
Junior
As a Software Engineer, you will design evaluation scenarios, influence product development, and contribute to an early-stage startup's engineering team.
The summary above was generated by AI

About Mechanize

Mechanize builds reinforcement learning environments that frontier AI labs use to train and evaluate their coding models. Learn more at mechanize.work.

Why the work matters

AI models have gotten good at narrow coding tasks but still fail at the complex, judgment-heavy parts of software engineering. We build the environments that expose those failures and help models improve.

What you'll do

You'll design, build, and quality-assure RL tasks. Each task is a self-contained software engineering challenge with a prompt, an environment, and an automated grader. You own the full lifecycle: ideation, grading infrastructure, running frontier models against the task, failure analysis, and iteration. At this level, we expect you to consistently produce tasks that target meaningful capability gaps in frontier models, and to develop a strong sense for what makes a task informative versus merely difficult.

You will use coding agents heavily, and a large part of the job is directing them well, evaluating their output, and knowing when they are failing in subtle ways. You may also contribute to shared infrastructure: improving our build pipeline, automating parts of QA, or building tooling for other engineers.

What makes someone good at this

Strong technical fundamentals combined with a well-calibrated intuition for AI model behavior. You need to anticipate where a model will take shortcuts, distinguish genuine capability gaps from grader issues, and understand how a model will interpret a prompt. At this level, we expect extensive familiarity with what frontier coding agents can and can't do.

Good fit if you:

  • Have 2+ years of experience as a software engineer

  • Can code in Python

  • Are confident working independently at a consistent pace

  • Have developed an intuition for what coding agents can and can't do

  • No prior ML or AI experience required

Probably not a good fit if you:

  • Want a product engineering role building features for end users

  • Prefer a highly collaborative team environment with shared ownership

  • Want extensive structured mentorship

This is independent, high-ownership work. You own your tasks from start to finish, with regular check-ins and feedback. Strong performers are recognized and promoted quickly. Benefits include 401k, health, dental, vision, and life insurance. Applying takes less than one minute.

Interview process: https://www.mechanize.work/how-our-interview-process-works

Learn more about the work: https://www.mechanize.work/what-working-here-is-like

About Mechanize. ~20 person team in San Francisco. Backed by Patrick Collison, Nat Friedman, Daniel Gross, Jeff Dean, Dwarkesh Patel, and Sholto Douglas. Featured in the New York Times, the Dwarkesh Podcast and Hard Fork.

Top Skills

Python
React
Typescript

Similar Jobs

2 Days Ago
Hybrid
San Francisco, CA, USA
136K-167K Annually
Junior
136K-167K Annually
Junior
Cloud • Healthtech • Social Impact • Software • Biotech
As a Software Engineer, you'll develop APIs and services, work on platform features, and collaborate with cross-functional teams to integrate solutions that enhance scientific applications.
Top Skills: APIsGraphQLRest
3 Days Ago
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
133K-184K Annually
Mid level
133K-184K Annually
Mid level
Fintech • Machine Learning • Mobile • Security • Software
As a Mobile Software Engineer, you will develop frameworks and tooling for Chime's React Native app, improve app performance, automate mobile CI/CD, and collaborate with teams to enhance app quality.
Top Skills: AmplitudeAndroidApolloBitriseBugsnagCircleCIDatadogGithub ActionsGraphQLiOSReact NativeReduxSentryTypescript
3 Days Ago
Easy Apply
In-Office
Long Beach, CA, USA
Easy Apply
154K-211K Annually
Senior level
154K-211K Annually
Senior level
Aerospace • Hardware • Robotics • Software • Manufacturing
Develop embedded software for real-time control systems on Linux platforms, ensuring safety and performance in high-reliability applications for aerospace.
Top Skills: C/C++LinuxYocto

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account