Sonatus

Staff AI Engineer, DevOps/MLOps - Office of CTO

Reposted Yesterday
Hybrid
Sunnyvale, CA
169K-232K Annually
Senior level
The Staff DevOps/MLOps Engineer will design and build end-to-end DevOps and MLOps platforms, managing cloud infrastructure, CI/CD pipelines, and machine learning lifecycles to ensure efficient model deployment and monitoring.
The summary above was generated by AI

Join a high-performing team at Sonatus that’s redefining what cars can do in the era of Software-Defined Vehicles (SDV).

At Sonatus, we’re driving the transformation to AI-enabled software-defined vehicles. Traditional automotive software methods can’t keep pace with consumer expectations shaped by the mobile industry—where features evolve rapidly, update seamlessly, and improve continuously. That’s why leading OEMs trust Sonatus to accelerate this shift. Our technology is already in production across more than 5 million vehicles on the road today and rapidly expanding.

Headquartered in Sunnyvale, CA, with 250+ employees worldwide, Sonatus combines the agility of a fast-growing company with the scale and impact of an established partner. Backed by strong funding and proven by global deployment, we’re solving some of the most interesting and complex challenges in the industry. Join us and help redefine what’s possible as we shape the future of mobility.

Role Summary:

We are seeking a highly experienced and strategic Staff AI Engineer, DevOps/MLOps to architect, build, and scale our end-to-end DevOps and MLOps platform. In this role, you will be responsible for the full cloud CI/CD pipeline, cloud infrastructure management, and the machine learning model lifecycle, from implementing the MLOps framework that moves models from experimentation to production with velocity and reliability to managing the serving infrastructure. You'll leverage deep expertise in DevOps, MLOps, and Site Reliability Engineering (SRE) to make critical decisions that span model training, serving, and monitoring. This is a key leadership position for a hands-on engineer who will define our model versioning, production observability, and infrastructure-as-code best practices.

Roles and Responsibilities:
  • Design and build the foundational, end-to-end DevOps and MLOps platform for our Generative AI systems, making critical decisions that span the evaluation, monitoring, and deployment of large language model-based systems.
  • Implement the full DevOps and MLOps framework. You will build the CI/CD/CT (Continuous Integration/Delivery/Training) automation that takes models from experiment to production with velocity and reliability.
  • Deploy, scale, and optimize our model serving infrastructure. You will manage GPU/NPU resources, minimize inference latency, and build robust monitoring to ensure our AI is always fast, accurate, and cost-effective.
  • Create a single, cohesive set of best practices for the entire AI lifecycle. Your work will define how we handle model versioning, infrastructure as code, and production observability in one seamless system.
Requirements:
  • A seasoned engineer with 8+ years of experience building and scaling production-grade cloud services and systems, with a strong focus on DevOps, MLOps, and/or SRE.
  • A "systems thinker" with a demonstrated ability to architect end-to-end solutions and a deep understanding of the full CI/CD pipeline and machine learning lifecycle.
  • Deep proficiency in Python and Infrastructure as Code (e.g., Terraform, Pulumi, etc.).
  • Experience with MLOps tools (e.g., MLflow, Kubeflow, Vertex AI) and production monitoring frameworks.
  • Enforce reproducibility, approvals, audit trails, PII handling, model cards, and policy/compliance (e.g., privacy, evals, guardrails).
  • Experience with robust ML deployment systems (e.g., Kubeflow, MLflow, model servers like BentoML or TensorFlow Serving).
  • Hands-on experience with public cloud platforms (GCP, AWS, and/or Azure) and containerization/orchestration (Docker, Kubernetes).
  • Package, version, and deploy software modules and AI models (batch and online) with blue/green or canary rollouts; build feature and model registries; and automate retraining.
  • Experience with PyTorch, vLLM, and GPUs is a plus.
  • Experience tracking model and agentic drift is a plus.
  • Experience tuning serving stacks (GPU/CPU utilization, batching, quantization).
  • Direct experience building and operationalizing systems for LLMs, especially RAG pipelines, is a plus
  • Experience with vector databases (e.g., Pinecone, Weaviate) and embedding management from a deployment and scaling perspective is a plus
Benefits:

Sonatus is a tight-knit team aligned around a unified vision. You can expect a strong engineering-oriented culture that focuses on building the best products and solutions for our customers. We embrace equality and diversity in all regards because respect is ingrained in our every fiber. Other benefits Sonatus offers include:

  • Stock option plan
  • Health care plan (Medical, Dental & Vision)
  • Retirement plan (401k, IRA)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Unlimited paid time off (Vacation, Sick & Public Holidays)
  • Family leave (Maternity, Paternity)
  • Flexible work arrangements
  • Free food & snacks in the office

The posted salary range is a general guideline and represents a good faith estimate of what Sonatus ("Company") could reasonably expect to pay for a base salary for this position. The pay offered to a selected candidate will be determined based on factors such as (but not limited to) the scope and responsibilities of the position, the qualifications of the selected candidate, departmental budget availability, geographic location and external market pay for comparable jobs. The Company reserves the right to modify this range in the future, as needed, as market conditions change.

Pay range for this role
$168,500-$232,000 USD
To all recruitment agencies: Sonatus, Inc. ("Sonatus") does not accept unsolicited agency resumes. Please do not forward resumes to our careers alias or to other Sonatus employees. Sonatus is not responsible for any fees associated with unsolicited activities.

Top Skills

AWS
Azure
Docker
GCP
Kubeflow
Kubernetes
MLflow
Pulumi
Python
Terraform
Vertex AI
HQ

Sonatus Sunnyvale, California, USA Office

330 Gibraltar Dr, Sunnyvale, CA, United States, 94089
