Lead the HPC Storage infrastructure team, focusing on system design, optimization, and cost-effective data management for high-throughput ML use cases. Responsible for team hiring, mentoring, and strategic planning.
Zoox is looking for an experienced Software Engineering Manager to lead our High Performance Computing Storage infrastructure team. Zoox HPC Storage provides abstraction layers for petabyte-scale data movement and management for critical, high-throughput use cases, such as ML foundation model training, synthetic data generation, and more. You will take on a breadth of end-to-end responsibilities, including distributed system design, optimization of storage-related GPU utilization bottlenecks, and cost-effective resource management.
The position comes with a high degree of independence and the opportunity to help define Zoox’s scaling strategy, both technically and organizationally. You will be responsible for hiring and maintaining the health of your team, as well as growing and coaching them to support the continued success of their careers.
In this role, you will:
- Work closely with AI teams and other software customers to holistically address pain points, find optimization opportunities, and ultimately charter systems-solutions for broad categories of storage use cases
- Develop a multi-year vision and roadmap for storage at Zoox, including investment into new data movement and management paradigms to meet Zoox’s ever growing computational and storage needs in a cost-effective manner
- Own the hiring process end-to-end, from thoughtful role definition to interview loop design to successfully hiring bar raisers
- Mentor, coach, and advocate for your direct reports
Qualifications:
- Experience managing teams of 5-10
- Demonstrated ability to prioritize development work and build cross-functional consensus across ML stakeholders
- Experience with high performance storage systems deployed on cloud providers, such as FSx for Lustre on Amazon Web Services (AWS)
- Strong operational background with highly available systems
- Bachelor's degree in computer science (or related field)
Bonus Qualifications:
- Experience with ML-specific data formats such as Mosaic Streaming Datasets (MDS)
- Experience with end-to-end hosted ML services such as AWS SageMaker HyperPod
- Proficiency with Python, Java, or other managed languages
Base Salary Range
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.
Follow us on LinkedIn
Accommodations
If you need an accommodation to participate in the application or interview process please reach out to [email protected] or your assigned recruiter.
A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
Top Skills
AWS
Fsx For Lustre
Java
Ml Services
Python
Zoox Foster City, California, USA Office
4000 E 3rd Ave, Foster City, CA, United States, 94404
Zoox Foster City, California, USA Office
1149 Chess Drive, Foster City, CA, United States, 94404
Zoox Fremont, California, USA Office
47540 Kato Road, Fremont, CA, United States, 94538
Zoox San Francisco, California, USA Office
60 Broadway St, San Francisco, CA, United States, 94111
Similar Jobs
Artificial Intelligence • Hardware • Information Technology • Machine Learning
The Principal Design Engineer at Micron will drive the design and optimization of datapath circuits for NAND flash memory, overseeing technical projects, collaborating across teams, and implementing signal and power integrity requirements.
Top Skills:
CmosDdr4/5HbmLpddr5/6Nand Flash MemoryTsv (Through-Silicon Via)Verilog
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Senior Data Engineer will design and implement data architectures and pipelines, ensure data quality, automate processes, and collaborate with stakeholders to provide actionable insights using AI and big data technologies.
Top Skills:
AWSAzureHadoopJavaKafkaPythonScalaSnowflakeSparkSQL
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
The People Success Generalist at Navan focuses on employee relations, talent management, compliance, and strategic advising while enhancing the employee experience.
Top Skills:
HrisWorkday
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

.jpeg)

.png)