Cresta Logo

Cresta

Staff Machine Learning Engineer

Reposted 5 Days Ago
Remote
Hiring Remotely in United States
230K-300K Annually
Expert/Leader
Remote
Hiring Remotely in United States
230K-300K Annually
Expert/Leader
The Staff Machine Learning Engineer will lead AI initiatives, architect LLM systems, design evaluation frameworks, and mentor engineers.
The summary above was generated by AI

Cresta unlocks the true potential of the customer experience, turning every conversation into a competitive advantage. Cresta’s unified AI platform combines conversational AI agents, real-time human agent augmentation, and comprehensive conversation intelligence to drive revenue and efficiency gains across every channel. The world’s leading companies, including United Airlines, Cox Communications, and Marriott, use Cresta to power world-class customer experiences every day. 

Born from the Stanford AI Lab, Cresta has raised more than $270 million from the world’s leading investors, including a16z, Greylock, and Sequoia. Cresta’s leadership includes some of the leading minds in AI today. Our CEO, Ping Wu, founded and led Google's Contact Center AI and Vertex AI platforms before joining Cresta to build the future of AI-driven customer experiences.

Over the next few years, AI is going to redefine how people all over the world interact with businesses every day. Come build that future at Cresta.

About the role:

Machine Learning Engineers at Cresta work across several high-impact AI initiatives. Final team placement is determined based on experience, strengths, and business needs.

Current focus areas include:

  • Agentic Assist: Lead and build next-generation agentic AI systems that augment contact center agents in real time. This track requires strong pre-LLM ML foundations, deep expertise in LLMs and modern prompting techniques, a rapid prototyping mindset, and a proven ability to translate cutting-edge research into scalable, production-grade systems.
  • Agent & System Quality: Design evaluation frameworks and improve the reliability, robustness, and performance of LLM-powered agents. This includes diagnosing and mitigating failure modes such as hallucinations, retrieval errors, tool misuse, context drift, prompt brittleness, and multi-step reasoning breakdowns, while defining measurable quality metrics (e.g., accuracy, faithfulness, task completion, latency, and cost) for complex, non-deterministic systems.
  • Insights: Architect and scale LLM and retrieval-augmented generation pipelines that ground models in enterprise data. This track focuses on building high-performance ML systems that process complex data, extract structured insights, and deliver real-time, actionable intelligence at scale.

Responsibilities:

  • Define and lead the technical vision for Cresta’s next-generation Agentic AI systems, including Agentic Assist and enterprise AI Agents.
  • Architect scalable, production-grade LLM systems that integrate reasoning, retrieval, planning, tool use, and real-time decision-making into cohesive, intelligent workflows.
  • Design and evolve multi-agent orchestration frameworks that combine RAG, structured knowledge, domain-adapted models, and automated actions.
  • Establish best practices for building robust, reliable, and cost-efficient LLM-powered systems in high-scale production environments.
  • Own evaluation strategy for complex, non-deterministic AI systems, including offline benchmarking, online experimentation, LLM-as-a-judge methodologies, and systematic failure analysis.
  • Proactively identify and mitigate agent failure modes such as hallucinations, tool misuse, retrieval errors, prompt brittleness, context drift, and multi-step reasoning breakdowns.
  • Define measurable quality standards (accuracy, faithfulness, task completion, latency, cost efficiency, robustness) and drive continuous system improvement.
  • Influence cross-team architecture decisions across ML, backend, and product engineering to ensure seamless integration of AI capabilities.
  • Mentor senior engineers, raise the technical bar, and contribute to long-term AI strategy and roadmap planning.
  • Translate cutting-edge research advances into practical, high-impact production systems.

Qualifications We Value:

  • Bachelor’s degree in Computer Science, Mathematics, or a related field; Master’s or Ph.D. strongly preferred.
  • 7+ years of experience building and deploying machine learning systems in production, including deep hands-on experience with LLMs at scale.
  • Demonstrated leadership in architecting complex AI systems, particularly agentic or multi-step LLM workflows.
  • Deep expertise in transformer-based models, embeddings, retrieval systems, and Retrieval-Augmented Generation (RAG) pipelines.
  • Experience designing evaluation frameworks for LLM systems beyond single-turn prompts, including robustness testing and production monitoring.
  • Strong systems thinking: ability to design for scalability, latency constraints, cost efficiency, security, and long-term maintainability.
  • Extensive experience with modern ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and distributed/cloud-based infrastructure.
  • Proven ability to influence technical direction across teams as a senior individual contributor.
  • A strong bias toward action — able to prototype rapidly while maintaining production rigor.

Perks & Benefits:

We offer a comprehensive and people-first benefits package to support you at work and in life:

  • Comprehensive medical, dental, and vision coverage with plans to fit you and your family
  • Flexible PTO to take the time you need, when you need it
  • Paid parental leave for all new parents welcoming a new child
  • Retirement savings plan to help you plan for the future
  • Remote work setup budget to help you create a productive home office
  • Monthly wellness and communication stipend to keep you connected and balanced
  • In-office meal program and commuter benefits provided for onsite employees

Compensation at Cresta: 

Cresta’s approach to compensation is simple: recognize impact, reward excellence, and invest in our people. We offer competitive, location-based pay that reflects the market and what each individual brings to the table.

The posted base salary range represents what we expect to pay for this role in a given location. Final offers are shaped by factors like experience, skills, education, and geography. In addition to base pay, total compensation includes equity and a comprehensive benefits package for you and your family.

OTE Range: $230,000–$300,000 + Offers Equity

HQ

Cresta San Francisco, California, USA Office

1 Zoe St, San Francisco, CA, United States, 94107

Similar Jobs

Yesterday
Remote or Hybrid
206K-230K Annually
Senior level
206K-230K Annually
Senior level
eCommerce • Mobile • Payments
Lead design, development, and deployment of production-grade, large-scale ML systems. Influence ML strategy, integrate models with platform and data infrastructure, mentor engineers, communicate results to stakeholders, and mature ML infrastructure and abstractions across teams.
Top Skills: AWSDatabricksKafkaPythonSagemakerScikit-LearnSparkSpark MlTensorFlow
2 Days Ago
In-Office or Remote
7 Locations
200K-415K Annually
Expert/Leader
200K-415K Annually
Expert/Leader
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Lead development and production of underwriting and credit decisioning models across Cash App Borrow and Afterpay. Own full modeling lifecycle: problem formulation, feature engineering, training, calibration, experimentation, deployment, monitoring, and iteration. Build decision frameworks, agentic engineering workflows, and collaborate with cross-functional partners to align model behavior with business and regulatory goals.
Top Skills: AirflowAWSClaude CodeCopilotCursorFeature StoreGCPGitLightgbmMlflowModel Hosting PlatformNumpyPandasPrefectPythonPyTorchScikit-LearnSnowflakeSQLXgboost
3 Days Ago
Remote or Hybrid
7 Locations
200K-415K Annually
Expert/Leader
200K-415K Annually
Expert/Leader
Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Senior individual contributor building and maintaining underwriting and credit decisioning ML systems for Cash App Borrow and Afterpay. Responsibilities include feature engineering, model training, calibration, experimentation, deployment, monitoring, and portfolio-level analysis. Collaborate with cross-functional teams to align models with business and regulatory goals and develop AI-native engineering workflows and governance for reliable, auditable model development.
Top Skills: AirflowAWSClaude CodeCopilotCursorGCPGitInternal Feature StoreLightgbmMlflowModel Hosting PlatformNumpyPandasPrefectPythonPyTorchScikit-LearnSnowflakeSQLXgboost

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account