Applied Intuition Jobs

Engineering Manager - ML Platform and Infrastructure

Applied Intuition

Engineering Manager - ML Platform and Infrastructure

Reposted 11 Days Ago

Be an Early Applicant

In-Office

Sunnyvale, CA, USA

204K-343K Annually

Mid level

In-Office

Sunnyvale, CA, USA

204K-343K Annually

Mid level

Lead an ML Platform engineering team building large-scale GPU cluster architecture, training and inference orchestration, and performance optimization. Own design, scaling, scheduling, networking, and resource management while partnering with research and stack teams to accelerate production deployment and drive hiring and mentorship.

The summary above was generated by AI

About Applied Intuition

Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon Valley company is creating the digital infrastructure needed to bring intelligence to every moving machine on the planet. Applied Intuition services the automotive, defense, trucking, construction, mining and agriculture industries in three core areas: tools and infrastructure, operating systems, and autonomy. Eighteen of the top 20 global automakers, as well as the United States military and its allies, trust the company’s solutions to deliver physical intelligence. Applied Intuition is headquartered in Sunnyvale, California, with offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Learn more at applied.co.

We are an in-office company, and our expectation is that employees primarily work from their Applied Intuition office 5 days a week. However, we also recognize the importance of flexibility and trust our employees to manage their schedules responsibly. This may include occasional remote work, starting the day with morning meetings from home before heading to the office, or leaving earlier when needed to accommodate family commitments.

About the role

As an Engineering Manager on the ML Platform team, you'll lead a world-class group of engineers focused on building the infrastructure that powers Physical AI at scale. Your team will own three critical areas: Training & Inference Orchestration, where we build frameworks to efficiently schedule and run massive jobs across thousands of GPUs; GPU Cluster Architecture, where we design and scale what will be the largest GPU cluster for Physical AI in the industry; and Performance Optimization, where we push the limits of hardware utilization, throughput, and cost efficiency for large-scale training and inference workloads. You'll work at the intersection of systems engineering and ML, partnering directly with stack development and research teams to remove bottlenecks and accelerate the path from experimentation to production.

At Applied Intuition, you will:

Grow and manage a team of world-class infrastructure and systems engineers with the goal of delivering a best-in-class ML platform for Physical AI
Own the design and evolution of frameworks for orchestrating distributed training and inference jobs across thousands of GPUs
Drive the buildout and scaling of our GPU cluster infrastructure, making critical decisions on architecture, scheduling, networking, and resource management
Lead efforts to optimize training and inference performance — including throughput, fault tolerance, GPU utilization, and cost efficiency at scale
Set team goals and roadmap in alignment with research milestones, model development timelines, and production deployment requirements
Partner closely with research, stack development, and infrastructure teams to understand their workflows and accelerate their iteration speed
Drive hiring, mentoring, and growth for a high-performing, mission-driven team

We’re looking for someone who has:

3+ years of engineering management experience, ideally leading infrastructure or platform teams
Passion for building and leading high-performing teams that operate at the frontier of scale
Deep experience with distributed systems, GPU computing, or large-scale ML infrastructure
Direct experience building or operating large GPU clusters (1,000+ GPUs)
Strong understanding of distributed training frameworks (e.g., PyTorch Distributed, Megatron-LM, DeepSpeed, FSDP) and job orchestration at scale
Familiarity with GPU cluster management, high-performance networking (InfiniBand, RDMA), and resource scheduling (Slurm, Kubernetes)
Track record of building and operating systems that run reliably at massive scale

Nice to have:

Background in training optimization techniques such as mixed-precision training, pipeline/tensor/data parallelism, or checkpointing strategies
Experience with inference optimization (batching, model serving, quantization, compiler-level optimizations)
Familiarity with Physical AI domains such as autonomous driving, robotics, or simulation
Contributions to open-source ML infrastructure projects

Compensation at Applied Intuition for eligible roles includes base salary, equity, and benefits. Base salary is a single component of the total compensation package, which may also include equity in the form of options and/or restricted stock units, comprehensive health, dental, vision, life and disability insurance coverage, 401k retirement benefits with employer match, learning and wellness stipends, and paid time off. Note that benefits are subject to change and may vary based on jurisdiction of employment.

Applied Intuition pay ranges reflect the minimum and maximum intended target base salary for new hire salaries for the position. The actual base salary offered to a successful candidate will additionally be influenced by a variety of factors including experience, credentials & certifications, educational attainment, skill level requirements, interview performance, and the level and scope of the position.

Please reference the job posting’s subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the location listed is: $204,000 - $343,000 USD annually.

Don’t meet every single requirement? If you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.

Applied Intuition is an equal opportunity employer and federal contractor or subcontractor. Consequently, the parties agree that, as applicable, they will abide by the requirements of 41 CFR 60-1.4(a), 41 CFR 60-300.5(a) and 41 CFR 60-741.5(a) and that these laws are incorporated herein by reference. These regulations prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities, and prohibit discrimination against all individuals based on their race, color, religion, sex, sexual orientation, gender identity or national origin. These regulations require that covered prime contractors and subcontractors take affirmative action to employ and advance in employment individuals without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status or disability. The parties also agree that, as applicable, they will abide by the requirements of Executive Order 13496 (29 CFR Part 471, Appendix A to Subpart A), relating to the notice of employee rights under federal labor laws.

157 S Murphy Ave, Sunnyvale, CA, United States

Similar Jobs

Adyen

Compliance Advisory Officer

35 Minutes Ago

Easy Apply

Hybrid

San Francisco, CA, USA

Easy Apply

120K-155K Annually

Senior level

120K-155K Annually

Senior level

Fintech • Payments • Financial Services

Support assessment and resolution of escalated compliance matters, analyze AML/CFT and integrity risks, partner with commercial and first-line teams to apply risk-based solutions, help develop compliance frameworks and escalation procedures, identify automation opportunities, and collaborate globally to ensure compliant onboarding and operations.

Adyen

Compliance Advisory Officer

35 Minutes Ago

Easy Apply

Hybrid

San Francisco, CA, USA

Easy Apply

120K-155K Annually

Senior level

120K-155K Annually

Senior level

Fintech • Payments • Financial Services

Support assessment and resolution of escalated compliance matters across AML/CFT, integrity, and regulatory obligations. Partner with legal, risk, commercial, and operations to provide risk-based solutions, develop compliance frameworks, improve escalation procedures, and identify automation opportunities. Translate compliance issues into actionable steps and collaborate globally to execute compliance initiatives.

Tapestry - Coach and Kate Spade

Retail Contingent

2 Hours Ago

Hybrid

Livermore, CA, USA

15-24 Hourly

Entry level

15-24 Hourly

Entry level

eCommerce • Fashion • Retail • Sales • Wearables • Design

Maintain organized, customer-ready store by processing deliveries, stocking the sales floor, executing price changes and markdowns, auditing inventory/shrinkage, and supporting daily operational standards and cleanliness.

Top Skills: Omnichannel SellingSocial Media

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Applied Intuition

Engineering Manager - ML Platform and Infrastructure

Applied Intuition Sunnyvale, California, USA Office

Similar Jobs

Compliance Advisory Officer

Compliance Advisory Officer

Retail Contingent

What you need to know about the San Francisco Tech Scene

Key Facts About San Francisco Tech