Braintrust

Cloud Infrastructure Engineer

Reposted 3 Hours Ago
In-Office
San Francisco, CA, USA
Senior level
The Cloud Infrastructure Engineer will develop and maintain infrastructure using Terraform and Kubernetes, support multi-cloud deployments, and improve CI/CD processes.
About the company

Braintrust is the AI observability platform. By connecting evals and observability in one workflow, Braintrust gives builders the visibility to understand how AI behaves in production and the tools to improve it.

Teams at Notion, Stripe, Zapier, Vercel, and Ramp use Braintrust to compare models, test prompts, and catch regressions, turning production data into better AI with every release.

About the role

We’re looking for a Cloud Infrastructure Engineer to help us build reliable, scalable infrastructure and give developers a world-class platform to ship code with speed and confidence. You’ll lead efforts across Terraform, Kubernetes, CI/CD, observability, and support, and play a key role in how we scale Braintrust both internally and for customers self-hosting our platform.

This is a high-impact role where you’ll contribute across our internal AWS environment and help customers deploy our stack in AWS, Azure, and GCP.

What you’ll do
  • Build and maintain Terraform modules for both internal infrastructure and customer deployments

  • Work directly with customers in Slack to support self-hosting and troubleshoot infrastructure issues. Build tools to make it easier for them to support themselves.

  • Own and improve our CI/CD pipeline: reduce build times, improve failure visibility, and enable safer, faster releases

  • Centralize and scale observability, including logs, metrics, dashboards, and alerts

  • Partner with engineering teams to build and evolve a secure, developer-friendly infrastructure platform

  • Support multi-cloud deployment patterns (AWS primarily, with Azure and GCP support for enterprise customers)

  • Implement tools and automation to improve deployment, rollback, and infrastructure reliability

Ideal candidate credentials
  • 5+ years of experience in DevOps, SRE, or Infrastructure Engineering roles

  • Deep experience with Terraform and at least one major cloud provider (AWS strongly preferred)

  • Strong Kubernetes skills: deploying, debugging, and scaling real workloads

  • Proficient in scripting or programming (Python, TypeScript, or Go)

  • Experience supporting production systems and responding to incidents

  • Comfortable working directly with customers in a support or deployment context

  • Bonus: experience with multi-cloud environments or self-hosted enterprise software

Benefits include
  • Medical, dental, and vision insurance

  • Daily lunch, snacks, and beverages

  • Flexible time off

  • Competitive salary and equity

  • AI Stipend

Equal opportunity

Braintrust is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
