Eloquent AI

AI Engineer, RAG

Reposted 10 Days Ago
In-Office
San Francisco, CA
Senior level
As a Senior AI Engineer, you will design and build optimized Retrieval-Augmented Generation systems, collaborating with teams to enhance AI capabilities in complex workflows.
The summary above was generated by AI
Meet Eloquent AI

At Eloquent AI, we’re building the next generation of AI Operators—multimodal, autonomous systems that execute complex workflows across fragmented tools with human-level precision. Our technology goes far beyond chat: it sees, reads, clicks, types, and makes decisions—transforming how work gets done in regulated, high-stakes environments.

We’re already powering some of the world’s leading financial institutions and insurers, fundamentally changing how millions of people manage their finances every day. From automating compliance reviews to handling customer operations, our Operators are quietly replacing repetitive, manual tasks with intelligent, end-to-end execution.

Headquartered in San Francisco with a global footprint, Eloquent AI is a fast-growing company backed by top-tier investors. Join us to work alongside world-class talent in AI, engineering, and product as we redefine the future of financial services.

Your Role

As a Senior AI Engineer, RAG at Eloquent AI, you will play a critical role in designing, building, and optimizing Retrieval-Augmented Generation (RAG) systems that power our enterprise AI agents. You will work on scalable, high-performance AI infrastructure, ensuring our LLM-powered agents deliver accurate, real-time responses with deep knowledge retrieval.

This role requires a strong software engineering background, expertise in LLMs, RAG architectures, and agentic frameworks, and the ability to translate cutting-edge research into production-ready AI systems. You will collaborate with researchers, engineers, and product teams to advance our AI capabilities and ensure that our agents retrieve and generate knowledge with precision and efficiency.

You will:

  • Design and implement scalable RAG pipelines that enable AI agents to retrieve and generate knowledge in real time.

  • Develop and optimize knowledge retrieval systems, fine-tuning embeddings, vector search, and ranking models.

  • Work with LLM architectures, applying prompt engineering, fine-tuning, and reinforcement learning techniques to improve response accuracy.

  • Optimize large-scale AI workloads, ensuring low latency and high efficiency for enterprise-grade AI applications.

  • Collaborate with AI researchers to translate state-of-the-art RAG advancements into deployable, high-performing solutions.

  • Leverage cloud infrastructure (AWS, GCP, or Azure) to build distributed, high-availability AI systems.

  • Continuously improve knowledge ingestion, ensuring AI agents stay up to date with evolving enterprise datasets.

Requirements
  • 5+ years of software engineering experience, with a focus on AI, NLP, or distributed systems.

  • Strong proficiency in Python and experience with AI frameworks like PyTorch and TensorFlow.

  • Expertise in RAG architectures, including experience with vector databases (e.g., FAISS, Weaviate, Pinecone, Milvus) and document retrieval methods.

  • Familiarity with LLM training, knowledge distillation, and agentic frameworks.

  • Experience with cloud computing and building scalable, production-ready AI applications.

  • Ability to optimize AI models for efficiency, balancing accuracy, latency, and cost.

  • Deep understanding of NLP and IR techniques, including tokenization, embeddings, ranking algorithms, and their evaluation.

Bonus Points If…

  • You have published research in AI, NLP, or RAG-related topics at top-tier conferences (NeurIPS, ICML, ICLR, ACL, SIGIR, etc.).

  • You have experience implementing hybrid RAG pipelines, combining retrieval with multi-step reasoning and tool use.

  • You’ve worked in high-performance AI teams, scaling AI-driven applications in fast-growth environments.

  • You have experience with Reinforcement Learning from Human Feedback (RLHF) and optimizing LLMs for enterprise use cases.

  • You are comfortable working in cross-functional AI product teams, collaborating with researchers, engineers, and product managers.

Top Skills

AWS
Azure
FAISS
GCP
Milvus
Pinecone
Python
PyTorch
TensorFlow
Weaviate
HQ

Eloquent AI San Francisco, California, USA Office

San Francisco, California, United States
