Sierra Logo

Sierra

Software Engineer, Site Reliability (SRE)

Posted Yesterday
Be an Early Applicant
In-Office
San Francisco, CA, USA
230K-390K Annually
Senior level
In-Office
San Francisco, CA, USA
230K-390K Annually
Senior level
As a Software Engineer on the Site Reliability team, you'll ensure system reliability, scalability, and observability while partnering with engineering teams and improving incident management processes.
The summary above was generated by AI
About us
  • At Sierra, we’re creating a platform to help businesses build better, more human customer experiences with AI. We are primarily an in-person company based in San Francisco, with growing offices in Atlanta, New York, London, Paris, Singapore, and Japan.

  • We are guided by a set of values that are at the core of our actions and define our culture: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These values are the foundation of our work, and we are committed to upholding them in everything we do.

  • Our co-founders are Bret Taylor and Clay Bavor. Bret currently serves as Board Chair of OpenAI. Previously, he was co-CEO of Salesforce (which had acquired the company he founded, Quip) and CTO of Facebook. Bret was also one of Google's earliest product managers and co-creator of Google Maps. Before founding Sierra, Clay spent 18 years at Google, where he most recently led Google Labs. Earlier, he started and led Google’s AR/VR effort, Project Starline, and Google Lens. Before that, Clay led the product and design teams for Google Workspace. 

What you'll do

As a Software Engineer on our Site Reliability team at Sierra, you will be responsible for defining and building the foundation of reliability, observability, and scalability across Sierra’s AI-driven infrastructure. You’ll partner closely with our core engineering and product teams to ensure our systems are highly available, efficient, and built for growth.

  • Own Sierra’s observability stack—monitoring, alerting, logging, and tracing—to give engineers clear visibility into system health and performance.

  • Partner with product and platform engineers to design systems that are reliable and scalable from day one—not as an afterthought.

  • Design and implement scalable, reliable, and secure cloud infrastructure (AWS) using Terraform and modern DevOps tooling.

  • Improve the reliability and scalability of our LLM deployments, ensuring robust, performant, and cost-effective operation.

  • Lead improvements to deployment pipelines, CI/CD tooling, and incident management processes to reduce downtime and response time.

  • Define the foundation of SRE practices at Sierra, influencing culture, tooling, and best practices across the engineering org.

What you'll bring
  • 5+ years of hands-on experience in Site Reliability or Infrastructure engineering roles for complex SaaS or cloud-based systems.

  • Experience designing for availability, scalability, and reliability at both infrastructure and application layers.

  • Deep experience with Terraform, AWS services, container orchestration, and cloud networking (including IAM and VPC architecture).

  • Strong background in observability systems (e.g., Prometheus, Grafana, Datadog, or similar).

  • Experience working with enterprise customers and familiarity with their compliance and networking needs along with integration patterns.

  • Comfortable working in fast-moving environments and collaborating across product, ML, and core engineering teams.

  • Degree in Computer Science or a related field, or equivalent professional experience.

Even better...
  • Experience with LLM infrastructure — optimizing inference performance, managing fine-tuned models, or large-scale model deployment.

  • Past experience in an early-stage startup environment, especially defining SRE culture and tooling from scratch.

  • Familiarity with incident management automation or self-healing infrastructure patterns.

Our values
  • Trust: We build trust with our customers with our accountability, empathy, quality, and responsiveness. We build trust in AI by making it more accessible, safe, and useful. We build trust with each other by showing up for each other professionally and personally, creating an environment that enables all of us to do our best work.

  • Customer Obsession: We deeply understand our customers’ business goals and relentlessly focus on driving outcomes, not just technical milestones. Everyone at the company knows and spends time with our customers. When our customer is having an issue, we drop everything and fix it.

  • Craftsmanship: We get the details right, from the words on the page to the system architecture. We have good taste. When we notice something isn’t right, we take the time to fix it. We are proud of the products we produce. We continuously self-reflect to continuously self-improve.

  • Intensity: We know we don’t have the luxury of patience. We play to win. We care about our product being the best, and when it isn’t, we fix it. When we fail, we talk about it openly and without blame so we succeed the next time.

  • Family: We know that balance and intensity are compatible, and we model it in our actions and processes. We are the best technology company for parents. We support and respect each other and celebrate each other’s personal and professional achievements.

What we offer

We want our benefits to reflect our values and offer the following to full-time employees:

  • Flexible (Unlimited) Paid Time Off

  • Medical, Dental, and Vision benefits for you and your family

  • Life Insurance and Disability Benefits

  • Retirement Plan (e.g., 401K, pension) with Sierra match

  • Parental Leave

  • Fertility and family building benefits through Carrot

  • Lunch, as well as delicious snacks and coffee to keep you energized 

  • Discretionary Benefit Stipend giving people the ability to spend where it matters most

  • Free alphorn lessons

These benefits are further detailed in Sierra's policies, may vary by region, and are subject to change at any time, consistent with the terms of any applicable compensation or benefits plans. Eligible full-time employees can participate in Sierra's equity plans subject to the terms of the applicable plans and policies.

Be you, with us

We're working to bring the transformative power of AI to every organization in the world. To do so, it is important to us that the diversity of our employees represents the diversity of our customers. We believe that our work and culture are better when we encourage, support, and respect different skills and experiences represented within our team. We encourage you to apply even if your experience doesn't precisely match the job description. We strive to evaluate all applicants consistently without regard to race, color, religion, gender, national origin, age, disability, veteran status, pregnancy, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.

Top Skills

AWS
Ci/Cd Tooling
Container Orchestration
Datadog
Grafana
Prometheus
Terraform

Sierra San Francisco, California, USA Office

San Francisco, California , United States

Similar Jobs

13 Days Ago
Hybrid
San Francisco, CA, USA
120K-140K Annually
Junior
120K-140K Annually
Junior
Artificial Intelligence • Security • Software
You will develop and improve cloud infrastructure, support distributed systems, and write infrastructure-as-code while collaborating across teams.
Top Skills: AWSCloudFormationDockerGoJavaKubernetesPythonTerraform
14 Hours Ago
In-Office
181K-237K Annually
Senior level
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer will lead technical projects in cloud infrastructure, mentoring teams, improving DevOps practices, and ensuring system resilience.
Top Skills: AWSCi/CdGCPGithub ActionsGrafanaIamIstioKubernetesPrometheusTerraformVpc Peering
14 Hours Ago
In-Office
San Francisco, CA, USA
181K-237K Annually
Senior level
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer will lead complex projects, mentor engineers, and ensure cloud infrastructure is resilient and automated. Responsibilities include developing software, managing production environments, and enforcing coding standards.
Top Skills: ArgocdAWSGCPGithub ActionsGrafanaIstioKubernetesPrometheusTerraform

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account