Easy Apply
Easy Apply
Lead the transition to a self-hosted LLM ecosystem, optimizing model inference, developing agentic workflows, and executing fine-tuning strategies for chatbot enhancement.
We are looking for a highly skilled Senior ML Engineer to lead our transition from third-party LLM APIs to a fully self-hosted ecosystem by fine-tuning high-performance, domain-specific models.
Our core product is an advanced, agentic support chatbot capable of complex reasoning, API tool calling, database lookups, and orchestrating specialized LLMs for specific tasks.
What You’ll Do:
- Model Fine-Tuning: Design and execute fine-tuning strategies to improve model accuracy on specific domain tasks and tool-calling execution.
- Agentic Workflows: Develop and refine the chatbot's agentic capabilities, ensuring reliable tool-use, routing, and interactions between massive LLMs and specialized SLMs.
- Inference Optimization: Deploy and manage large-scale models using high-performance inference engines (like vLLM) to ensure low latency and high throughput for our agentic chatbot.
- Rigorous Evaluation: Build comprehensive offline and online evaluation frameworks to constantly measure model performance and business impact through structured A/B testing.
What We’re Looking For:
Core Engineering & AI Frameworks
- Deep experience with PyTorch and the Hugging Face ecosystem.
- Strong Data Engineering skills: data manipulation, synthetic data generation, and active learning/margin-sampling.
- High proficiency with AI-assisted development workflows (e.g., Claude Code, Cursor, Codex) to accelerate development.
LLMs & Agents
- Strong fundamental understanding of LLM architectures, attention mechanisms, and generation parameters.
- Hands-on experience building Agentic systems (ReAct, function/tool calling, RAG).
- Expertise in fine-tuning strategies (e.g., SFT, RLHF, DPO) and parameter-efficient techniques (PEFT/LoRA).
Bonus Points
- Alignment Techniques: Experience with RLHF and DPO strategies for future reasoning-model development.
- Containerization & Orchestration: Experience with Ray for orchestrating large-scale model deployments across multi-GPU clusters.
- Model Quantization: Experience with memory optimization techniques like AWQ, GPTQ, or GGUF to fit 70B models efficiently onto hardware.
- API Development: Proficiency in building robust, asynchronous microservices using FastAPI to serve model requests.
- Experience with core MLOps practices, including dataset versioning (e.g., DVC), experiment tracking (e.g., Weights & Biases, MLflow), and model registries.
Navan Palo Alto, California, USA Office
3045 Park Blvd, Palo Alto, CA, United States, 94304
Navan San Francisco, California, USA Office
181 Fremont St. 23rd Floor , San Francisco, CA, United States, 94105
Similar Jobs at Navan
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Lead AI measurement and technical strategies for growth, managing partnerships and ensuring data integrity and quality in AI initiatives.
Top Skills:
AIData ScienceGolden DatasetsLlm
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
The Senior Product Marketing Manager at Navan Edge leads growth and adoption efforts for an AI-powered travel concierge, facilitating cross-functional collaboration and community engagement to align product development with business traveler needs.
Top Skills:
Ai Tools
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Lead the design and development of core product features for a travel application, focusing on backend and frontend scalability and integrating AI technologies.
Top Skills:
AngularAWSClaude CodeCopilotHibernateLangchainReactSpring FrameworkTypescript
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine


.png)