Waymo Logo

Waymo

Technical Program Manager, ML Developer Experience and Infrastructure Reliability

Reposted 14 Hours Ago
Be an Early Applicant
In-Office
Mountain View, CA
230K-292K Annually
Senior level
In-Office
Mountain View, CA
230K-292K Annually
Senior level
As a Technical Program Manager, oversee ML development processes, manage infrastructure reliability, and ensure effective project execution to enhance developer experience.
The summary above was generated by AI

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

Waymo’s Technical Program Managers and Program Managers are accountable for Waymo’s roadmap execution by providing thoughtful cross-functional planning, clarity, and proactive risk management. In the face of complex technical and operational challenges with no established playbooks to follow, we act with thoughtful urgency, driving conversations, discussions, and outcomes. Our team partners closely with every function of Waymo to structure, own and drive work towards real-world deployments of the Waymo Driver across platforms and geographies.

In this hybrid role, you will report to a Technical Program Management Director. 

You will:

  • Drive the "Golden Path" for ML: Lead cross-functional execution to define and invest in a simplified "golden path" for ML development for Onboard and Foundation Model (WaymoFM) development, targeting the reduction of friction and low reliability in the "inner loop"
  • Manage Reliability Operations: Ensure smooth day-to-day operations of the reliability triage ecosystem, keeping queues healthy through interaction with rotation members and driving automation of queue management
  • Program Implementation for Infra Stability: Drive "contract-based reliability" programs across Onboard domains
  • Bridge ML and Infra: Facilitate communication and alignment between ML research, infrastructure foundations, and onboard teams to resolve blockers in core workflows like root-causing brittle pipelines
  • Strategic Roadmap Tracking: Contribute to strategic planning and track project progress, risks, and KPIs related to ML developer productivity and infrastructure reliability for leadership reporting
  • Resolve Systemic Blockers: Proactively identify and resolve roadblocks in the ML development cycle, such as data fragmentation and complex tooling that currently hinders developer velocity

You have:

  • Technical Education: A Bachelor's degree in Computer Science, Engineering, or a related technical field
  • TPM Experience: 5+ years of experience as a Technical Program Manager in a software engineering or large-scale infrastructure environment
  • ML/Reliability Track Record: Proven track record of managing complex technical projects involving machine learning infrastructure, developer experience (DevX), or site reliability engineering (SRE)
  • Program Ownership: Experience owning and driving programs end-to-end, including managing timelines, risks, and dependencies across multiple senior stakeholders
  • Analytical Problem Solving: Strong analytical and technical judgment skills, with the ability to use data to diagnose and solve systemic engineering bottlenecks
  • Communication Mastery: Excellent communication and interpersonal skills, with a demonstrated ability to convey complex technical concepts to both researchers and infrastructure engineers

We prefer:

  • Advanced ML Operations: Experience with ML observability, root-causing production pipelines, and automating large-scale offline inference or model training experiments
  • Large-Scale Data Management: Background in managing multi-petabyte scale datasets, data validation frameworks, or unified data management solutions
  • Reliability Frameworks: Familiarity with contract-based reliability models, SLO management for autonomous systems, or reliability triage ecosystems
  • Developer Platforms: Experience building or managing "golden path" developer platforms or developer tooling that simplifies complex, fragmented tech stacks
  • Advanced Degree: Master's degree or PhD in a related technical field
  • Autonomous Domain Knowledge: Experience with simulation environments for autonomous systems, model validation strategies, or onboard/offboard infrastructure dependencies

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process. 

Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements. 

Salary Range
$230,000$292,000 USD

Top Skills

Data Management
Infrastructure Management
Machine Learning
Software Engineering

Waymo Mountain View, California, USA Office

1600 Amphitheatre Pkwy, Mountain View, CA, United States, 94043

Similar Jobs

34 Minutes Ago
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
90K-90K Annually
Junior
90K-90K Annually
Junior
Artificial Intelligence • Enterprise Web • Software • Design • Generative AI
The Sales Development Representative will drive outbound sales opportunities, qualify leads, and work closely with Account Executives to develop the sales pipeline.
Top Skills: AICRMSales Engagement PlatformsSalesforce
45 Minutes Ago
Hybrid
Los Angeles, CA, USA
42-44 Hourly
Senior level
42-44 Hourly
Senior level
Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
The Senior Electronics Quality Technician III leads investigations on product quality issues, develops inspection protocols, performs electronic diagnostics, and mentors technicians.
Top Skills: Ipc StandardsMimo WaveformMultimetersOscilloscopesSignal GeneratorsSpectrum Analyzers
46 Minutes Ago
In-Office
San Diego, CA, USA
129K-206K Annually
Junior
129K-206K Annually
Junior
Cloud • Fintech • Food • Information Technology • Software • Hospitality
As a Territory Sales Account Executive, you will prospect, build relationships, and sign up new restaurants, utilizing a consultative sales approach, managing sales cycles, and collaborating across teams.
Top Skills: Salesforce

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account