Build and own production applied-AI systems focused on information retrieval and LLM pipelines. Design experiment frameworks and evaluation pipelines, optimize for metrics (precision, recall, F1), run benchmarks, perform failure analysis, and iterate to improve real-world system performance at scale.
About CaseGuild
The Role
You Might Thrive in This Role If You:
Compensation
CaseGuild builds the industry’s most advanced evidence reasoning platform for complex litigation, helping legal teams investigate massive document sets and surface critical facts with speed and precision.
We’re a fast growing, early-stage Seattle startup building systems at the intersection of information retrieval, machine learning, and large language models, operating at billions of tokens where accuracy and speed matter. You’ll join a small, senior team that ships production systems daily, measures everything, and iterates based on real-world performance.
This role is for a startup engineer who is data-first, evaluation-driven, and has built production systems.
You’ve spent years building ML or AI systems where success isn’t measured by demos, but by metrics, benchmarks, and real-world performance. You understand that modern LLM pipelines still require datasets, experiments, baselines, and failure analysis, and you enjoy owning that end-to-end.
- Have 5+ years of experience building ML or applied AI systems where accuracy and evaluation mattered
- Have designed and owned experiment frameworks and evaluation pipelines in production
- Are fluent in metrics (precision, recall, F1) and know when each matters
- Have strong foundations in classic ML, NLP, and information retrieval, now applied to LLM-based systems
- Have experience working with multiple LLM providers and models, and don’t treat them as black boxes
- Enjoy end-to-end ownership and pragmatic tradeoffs in a startup environment
- Have a high sense of ownership and agency, with a bias toward getting 1% better every day
- Care deeply about correctness, rigor, and repeatability
Compensation
The base pay range for this role is $80,000 – $140,000 per year.
Similar Jobs
Cloud • Information Technology • Machine Learning
Design and build production-grade full-stack, AI-enabled applications. Develop React/Next.js frontends, backend services on Kubernetes, integrate LLM/AI features, connect data platforms, implement CI/CD, automated testing, observability, and ensure secure, high-performance APIs and services.
Top Skills:
Ai/MlAutomated TestingC#Ci/CdDockerGoGrpcHelmJavaJavaScriptKafkaKubernetesLlmNext.JsObservabilityPythonReactRestSparkTypescript
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead development of AI and ML solutions for healthcare systems, overseeing teams, driving use case development, and ensuring operational excellence while managing global delivery operations.
Top Skills:
AWSAzureDatabricksGCPKerasNoSQLPandasPythonScikit-LearnSQLTransformers
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead the creation of innovative AI solutions for healthcare systems, serve as a trusted advisor, foster collaboration, and drive operational excellence while managing complex challenges in healthcare AI.
Top Skills:
AIAws BedrockAzure Openai ServiceDatabricksGoogle Vertex AiMlMlopsSnowflake
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

