Navan Logo

Navan

Senior AI Operations (AI Ops) Engineer

Reposted 7 Days Ago
Be an Early Applicant
Easy Apply
Hybrid
Palo Alto, CA, USA
116K-258K Annually
Senior level
Easy Apply
Hybrid
Palo Alto, CA, USA
116K-258K Annually
Senior level
The role involves orchestrating AI services, optimizing model inference, ensuring reliability and standardization, and collaborating with AI researchers on deployments.
The summary above was generated by AI

At Navan, we aren't building a single, generic chatbot. We are building a Composable AI Microservice Architecture, a swarm of hundreds of hyper-specialized AI services, each meticulously "programmed" to solve small, focused tasks with high precision. This fleet powers Ava, our AI support engine, and a suite of cutting-edge generative tools for travel and expense management.

As a Senior AI Operations (AI Ops) Engineer, you are the architect of the platform that makes this scale possible. You will move beyond traditional MLOps to manage a "factory" of Language Models. Your challenge is one of orchestration and standardization, ensuring that every service in the swarm meets a rigorous bar for quality, reliability, and cost-efficiency.

What You’ll Do
  • Orchestrate the AI Fleet: Build and own the runtime environment for 100+ specialized AI services. Manage model routing, context versioning, and standardized memory/history stores.
  • High-Density Inference Optimization: Design and implement SageMaker Multi-Model Endpoints (MME) and Inference Components to serve multiple tuned SLMs per GPU, maximizing hardware utilization while minimizing latency.
  • Deterministic Service Excellence: Treat reliability as a layered engineering problem. Build deterministic "shells" around probabilistic LM outputs, prioritizing data-layer validation and strict serialization.
  • Automated Evaluation & Observability: Implement "LLM-as-a-judge" patterns and automated benchmarking to detect semantic drift and hallucinations across the fleet before they impact the user.
  • Standardize the Workflow: Obsess over building reusable patterns and Terraform-based infrastructure that eliminate "snowflake" configurations, allowing us to deploy new specialized AI tasks in minutes.
  • Agency Strategy: Partner with AI Researchers to find the "Goldilocks zone" for agentic autonomy—balancing the flexibility of LLM tool-use with the precision required for production stability.
What We’re Looking For
  • Experience: 5+ years in SRE, Platform Engineering, or MLOps, with at least 2 years focused on deploying LLMs/SLMs in production environments.
  • SageMaker Mastery: Deep hands-on expertise with AWS SageMaker, specifically configuring Multi-Model Endpoints (MME), Inference Components, and GPU-backed instances (G5/P4).
  • SLM Expertise: Proven experience with Small Language Models (e.g., Mistral, Llama 3, Phi) and parameter-efficient fine-tuning (PEFT) deployment strategies like LoRA/QLoRA.
  • Technical Stack: * Languages: Strong proficiency in Python and Terraform.
    • Orchestration: Experience with Docker, Kubernetes (EKS), or AWS ECS/Fargate.
    • Data: Familiarity with Snowflake and Vector Databases.
  • The "AI Ops" Mindset: You understand that AI at scale is a statistical challenge. You are comfortable debugging issues at the data/serialization layer rather than defaulting to prompt tweaks.
  • CI/CD & Automation: Experience building robust pipelines (Jenkins, GitHub Actions) for non-deterministic software, including automated "eval" stages.
  • Education: BS or MS in Computer Science, Engineering, Mathematics, or a related technical field.

The posted pay range represents the anticipated low and high end of the compensation for this position and is subject to change based on business need. To determine a successful candidate’s starting pay, we carefully consider a variety of factors, including primary work location, an evaluation of the candidate’s skills and experience, market demands, and internal parity.
For roles with on-target-earnings (OTE), the pay range includes both base salary and target incentive compensation. Target incentive compensation for some roles may include a ramping draw period. Compensation is higher for those who exceed targets. Candidates may receive more information from the recruiter.

Pay Range
$116,100$258,000 USD

Top Skills

Aws Ecs
Aws Sagemaker
Docker
Github Actions
Jenkins
Kubernetes
Python
Snowflake
Terraform
Vector Databases
HQ

Navan Palo Alto, California, USA Office

3045 Park Blvd, Palo Alto, CA, United States, 94304

Navan San Francisco, California, USA Office

181 Fremont St. 23rd Floor , San Francisco, CA, United States, 94105

Similar Jobs at Navan

6 Hours Ago
Easy Apply
Hybrid
2 Locations
Easy Apply
115K-255K Annually
Senior level
115K-255K Annually
Senior level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
The Product Manager will own the vision and roadmap for Navan's Policy engine, collaborating with cross-functional teams to optimize expense policy management through AI and market research.
Top Skills: Ai-Tooling
6 Hours Ago
Easy Apply
Hybrid
Palo Alto, CA, USA
Easy Apply
73K-162K Annually
Senior level
73K-162K Annually
Senior level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Design, deploy, secure, and maintain client computing platforms (Windows/macOS/MDM/virtual desktops). Automate endpoint deployment, patching, and configuration management; provide advanced troubleshooting; monitor platform health; collaborate with security and infrastructure teams; and document procedures and integration plans.
Top Skills: AWSAzureAzure Virtual DesktopCitrixGCPJAMFKandjimacOSMdmMecmMicrosoft Endpoint Manager (Intune)PowershellPythonSccmShell ScriptingVmware HorizonWindows 10Windows 11Workspace One
6 Hours Ago
Easy Apply
Hybrid
2 Locations
Easy Apply
86K-145K Annually
Mid level
86K-145K Annually
Mid level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
The Sales Compensation Analyst manages commission plans, administers the ICM tool, supports month-end processes, resolves commission inquiries, and generates reports for compensation evaluation.
Top Skills: CaptivateiqGoogle SheetsIncentive Compensation Management IcmExcel

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account