Sentry

Senior Software Engineer, AI Evals

Reposted 7 Days Ago
Hybrid
San Francisco, CA, USA
240K-280K Annually
Senior level
About Sentry

Software runs the world and the pace is faster than ever. Sentry helps developers fix errors and performance issues before users notice, so teams can spend less time firefighting and more time building.

Trusted by 200,000+ organizations, Sentry is today’s application monitoring standard and our team is building its AI-native future.

About the role

As a Senior Software Engineer on Sentry’s AI/ML team, you’ll be responsible for building the evaluation infrastructure that measures the accuracy, reliability, and real-world performance of our AI systems. This role is critical to ensuring that our debugging agents and AI-powered features behave correctly, safely, and predictably as they scale. You’ll design datasets, benchmarks, and test harnesses that turn ambiguous AI behavior into measurable signals, helping the team ship AI with confidence.

In this role you will
  • Design and build robust evaluation frameworks to measure accuracy, reliability, regressions, and edge cases in AI systems

  • Create and curate high-quality datasets, golden test cases, and benchmarks grounded in real production data

  • Build automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and agentic workflows

  • Partner closely with applied AI engineers and product leaders to define what “good” looks like and translate it into measurable criteria

  • Own the evaluation lifecycle for major AI initiatives, from early experimentation through production monitoring

You’ll love this job if you
  • Care deeply about correctness, rigor, and measurement in AI systems

  • Enjoy turning fuzzy product goals and model behavior into concrete tests and metrics

  • Like building foundational infrastructure that unlocks faster iteration and higher confidence for the entire AI team

  • Thrive in cross-functional environments and enjoy influencing model design through better evaluation

Qualifications
  • 5+ years of professional experience and a Bachelor’s degree in computer science, machine learning, or a related field

  • Experience building testing, evaluation, or data infrastructure for complex systems (AI/ML experience strongly preferred)

  • Comfort writing production-quality code (we use Python and TypeScript)

  • Experience working with structured and unstructured datasets, labeling workflows, or data quality pipelines

  • Familiarity with modern ML systems and evaluation techniques (e.g., offline metrics, online evaluation, regression testing for models or prompts)

  • Bonus: experience evaluating LLMs, agentic systems, or AI-assisted developer tools

The base salary range (or hourly wage range, if applicable) that Sentry reasonably expects to pay for this position is $240,000 to $280,000 USD. A successful candidate’s actual base salary (or hourly wage) amount will be determined by a variety of relevant factors including, without limitation, the candidate’s work location, education, work and other relevant experience, skills, and job-related knowledge. A successful candidate will be eligible to participate in Sentry’s employee benefit plans/programs applicable to the candidate’s position (including incentive compensation, equity grants, paid time off, and group health insurance coverage). See Sentry Benefits for more details about the Company’s benefit plans/programs.

Equal Opportunity at Sentry

Sentry is committed to providing equal employment opportunities to its employees and candidates for employment regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other legally-protected characteristic. This commitment includes the provision of reasonable accommodations to employees and candidates for employment with physical or mental disabilities who require such accommodations in order to (a) perform the essential functions of their jobs, or (b) seek employment with Sentry. We strive to build a diverse team, with an inclusive culture where every teammate can thrive. Sentry is an open-source company because we believe that everyone, everywhere, should have the ability and tools to make great software. Software should be accessible. That starts with making our industry accessible.

If you need assistance or an accommodation due to a disability, you may contact us at [email protected].

Want to learn more about how Sentry handles applicant data? Get the details in our Applicant Privacy Policy.

HQ

Sentry San Francisco, California, USA Office

45 Fremont Street, 8th floor, San Francisco, CA, United States, 94105

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
