Sonatus

Staff AI Engineer, DevOps/MLOps - Office of CTO

Reposted Yesterday

Easy Apply

Hybrid

Sunnyvale, CA

169K-232K Annually

Senior level

Easy Apply

Hybrid

Sunnyvale, CA

169K-232K Annually

Senior level

The Staff DevOps/MLOps Engineer will design and build end-to-end DevOps and MLOps platforms, managing cloud infrastructure, CI/CD pipelines, and machine learning lifecycles to ensure efficient model deployment and monitoring.

The summary above was generated by AI

Join a high-performing team at Sonatus that’s redefining what cars can do in the era of Software-Defined Vehicles (SDV).

At Sonatus, we’re driving the transformation to AI-enabled software-defined vehicles. Traditional automotive software methods can’t keep pace with consumer expectations shaped by the mobile industry—where features evolve rapidly, update seamlessly, and improve continuously. That’s why leading OEMs trust Sonatus to accelerate this shift. Our technology is already in production across more than 5 million vehicles on the road today and rapidly expanding.

Headquartered in Sunnyvale, CA, with 250+ employees worldwide, Sonatus combines the agility of a fast-growing company with the scale and impact of an established partner. Backed by strong funding and proven by global deployment, we’re solving some of the most interesting and complex challenges in the industry. Join us and help redefine what’s possible as we shape the future of mobility.

Role Summary:

We are seeking a highly experienced and strategicStaff AI Engineer, DevOps/MLOps to architect, build, and scale our end-to-end DevOps and MLOps platform. In this role, you will be responsible for the full cloud CI/CD pipeline, cloud infrastructure management, and machine learning model lifecycle, from implementing the MLOps framework that enables models to move from experimentation to production with velocity and reliability, to managing the serving infrastructure. You'll leverage deep expertise in DevOps and MLOps and Site Reliability Engineering (SRE) to make critical decisions that span model training, serving, and monitoring. This is a key leadership position for a hands-on engineer who will define our model versioning, production observability, and infrastructure-as-code best practices.

Roles and Responsibilities:

Design and build the foundational, end-to-end DevOps and MLOps platform for our Generative AI systems, making critical decisions that span large language model-based systems evaluation, monitoring, and deployment
Implement the full DevOps and MLOps framework. You will build the CI/CD/CT (Continuous Integration/Delivery/Training) automation that takes models from experiment to production with velocity and reliability.
Deploy, scale, and optimize our model serving infrastructure. You will manage GPU/NPU resources, minimize inference latency, and build robust monitoring to ensure our AI is always fast, accurate, and cost-effective.
Create a single, cohesive set of best practices for the entire AI lifecycle. Your work will define how we handle model versioning, infrastructure as code, and production observability in one seamless system.

Requirements:

A seasoned engineer with 8+ years of experience building and scaling production-grade cloud services and systems, with a strong focus on DevOps, MLOps, and/or SRE.
A "systems thinker" with a demonstrated ability to architect end-to-end solutions and a deep understanding of the full CI/CD pipeline and machine learning lifecycle.
Deep proficiency in Python and Infrastructure as Code (e.g., Terraform, Pulumi, etc.).
Experience with MLOps tools (e.g., MLflow, Kubeflow, Vertex AI) and production monitoring frameworks
Enforce reproducibility, approvals, audit trails, PII handling, model cards, and policy/compliance (e.g., privacy, evals, guardrails).
Experience with robust ML deployment systems (e.g., Kubeflow, MLflow, model servers like BentoML or TensorFlow Serving).
Hands-on experience with public cloud platforms (GCP, AWS, and/or Azure) and containerization/orchestration (Docker, Kubernetes).
Package, version, and deploy software modules and AI models (batch & online) with blue/green or canary rollouts; build feature & model registries, and automate retraining
Experience with Pytorch, vLLMs, and GPUs a plus
Experience with tracking Modes and Agentic drift is a plus
Experience with tuning serving stacks (GPU/CPU utilization, batching, quantization)
Direct experience building and operationalizing systems for LLMs, especially RAG pipelines, is a plus
Experience with vector databases (e.g., Pinecone, Weaviate) and embedding management from a deployment and scaling perspective is a plus

Benefits:

Sonatus is a tight-knit team aligned around a unified vision. You can expect a strong engineering-oriented culture that focuses on building the best products and solutions for our customers. We embrace equality and diversity in all regards because respect is ingrained in our every fiber. Other benefits Sonatus offers include:

Stock option plan
Health care plan (Medical, Dental & Vision)
Retirement plan (401k, IRA)
Life Insurance (Basic, Voluntary & AD&D)
Unlimited paid time off (Vacation, Sick & Public Holidays)
Family leave (Maternity, Paternity)
Flexible work arrangements
Free food & snacks in the office

The posted salary range is a general guideline and represents a good faith estimate of what Sonatus ("Company") could reasonably expect to pay for a base salary for this position. The pay offered to a selected candidate will be determined based on factors such as (but not limited to) the scope and responsibilities of the position, the qualifications of the selected candidate, departmental budget availability, geographic location and external market pay for comparable jobs. The Company reserves the right to modify this range in the future, as needed, as market conditions change.

Pay range for this role

$168,500—$232,000 USD

To all recruitment agencies: Sonatus, Inc. ("Sonatus") does not accept unsolicited agency resumes. Please do not forward resumes to our careers alias or other Sonatus' employees. Sonatus is not responsible for any fees associated with unsolicited activities.

Top Skills

AWS

Azure

Docker

GCP

Kubeflow

Kubernetes

Mlflow

Pulumi

Python

Terraform

Vertex Ai

330 Gibraltar Dr, Sunnyvale, CA, United States, 94089

Similar Jobs at Sonatus

Sonatus

Artificial Intelligence Engineer

Yesterday

Easy Apply

Hybrid

Sunnyvale, CA, USA

Easy Apply

176K-242K Annually

Senior level

176K-242K Annually

Senior level

Artificial Intelligence • Automotive • Cloud • Software

Design and implement large-scale cloud infrastructure and data processing pipelines for AI applications in software-defined vehicles, ensuring best practices in data governance and observability.

Top Skills: DockerElasticsearchGoKafkaKubernetesPulumiPythonRabbitMQTerraform

Sonatus

Test Automation Engineer

2 Days Ago

Easy Apply

Hybrid

Sunnyvale, CA, USA

Easy Apply

149K-204K Annually

Expert/Leader

149K-204K Annually

Expert/Leader

Artificial Intelligence • Automotive • Cloud • Software

Lead quality efforts in AI testing for embedded and cloud-connected systems, developing test plans and collaborating with teams on innovative projects.

Top Skills: AICloud ServicesJenkinsJIRALinuxPython

Sonatus

Staff Software Engineer

2 Days Ago

Easy Apply

Hybrid

Sunnyvale, CA, USA

Easy Apply

169K-232K Annually

Senior level

169K-232K Annually

Senior level

Artificial Intelligence • Automotive • Cloud • Software

Lead a team in developing in-vehicle Ethernet networking software, mentor engineers, and engage with automotive partners to enhance network management solutions.

Top Skills: Automotive EthernetC/C++CanCan-FdCoreconfLinuxNetconfNetwork Diagnostics ToolsSome/IpTsnYang Modeling

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sonatus

Staff AI Engineer, DevOps/MLOps - Office of CTO

Top Skills

Sonatus Sunnyvale, California, USA Office

Similar Jobs at Sonatus

Artificial Intelligence Engineer

Test Automation Engineer

Staff Software Engineer

What you need to know about the San Francisco Tech Scene

Key Facts About San Francisco Tech