Top Senior Site Reliability Engineer Jobs in San Francisco, CA

Cisco ThousandEyes

Senior Site Reliability Engineer (FedRAMP) - Cisco ThousandEyes

Reposted 6 Days Ago

Hybrid

San Francisco Bay Area, CA

147K-278K Annually

Senior level

Cloud • Software

Responsible for maintaining FedRAMP-compliant infrastructure, collaborating with software engineers, and ensuring system availability and security. Duties include infrastructure design, automation, monitoring, and incident response.

Top Skills: AWS, Go, Kubernetes, Puppet, Python, Terraform

Phantom (phantom.com)

Staff Software Engineer (SRE)

Reposted 3 Days Ago

Remote

San Francisco Bay Area, CA

200K-250K Annually

Senior level

Software • Cryptocurrency

Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.

Top Skills: AWS (EC2, EKS, IAM, RDS, S3), Datadog, Docker, Kubernetes, OpenTelemetry, Pulumi, Terraform

HHAeXchange

SRE Technical Project Manager

Reposted 3 Days Ago

Remote

San Francisco Bay Area, CA

100K-110K Annually

Mid level

Healthtech • Software

The SRE Technical Project Manager will lead project delivery, incident management, automation processes, and uptime communication, partnering with SRE and development teams to ensure system stability and scalability.

Top Skills: AI Bots, Datadog, Jira, Jira Service Management, MS Teams, Opsgenie, PagerDuty

The Walt Disney Company

Sr Principal Site Reliability Engineer

6 Days Ago

In-Office

San Francisco Bay Area, CA

251K-336K Annually

Senior level

Digital Media • Gaming • News + Entertainment • Sports

As a Sr Principal Site Reliability Engineer, you will ensure maximum platform availability, lead incident response processes, drive automation, and collaborate across teams to optimize system performance and operational efficiency.

Top Skills: Automation Tools, Cloud Technologies, Content Delivery Networks, Media Streaming Technologies, Monitoring Tools

Speakeasy

Platform Engineer (SRE) - AI Control Plane

6 Days Ago

In-Office

San Francisco Bay Area, CA

Mid level

Software

Join a fast-growing team as a Platform Engineer to enhance AI control systems, ensuring reliability and performance while collaborating on product decisions.

Top Skills: AI, Developer Tools, Infrastructure

E2B

SRE/Infrastructure Engineer

6 Days Ago

In-Office

San Francisco Bay Area, CA

200K-350K Annually

Senior level

Artificial Intelligence

The SRE/Infrastructure Engineer will manage Terraform and Kubernetes across cloud platforms, ensuring scalable infrastructure. Responsibilities include multi-cloud deployments, observability, and creating reusable components.

Top Skills: AWS, Azure, Cloudflare, GCP, Kubernetes, Terraform

Speakeasy

Platform Engineer (SRE) - AI Control Plane

6 Days Ago

In-Office

San Francisco Bay Area, CA

Mid level

Enterprise Web • Information Technology • Software

Join a passionate team as a Platform Engineer (SRE), focusing on improving reliability, performance, and availability of AI control plane products. Collaborate closely on operational processes and foster reliability culture across engineering.

Top Skills: AI, Developer Tools, Infrastructure, MCP, Monitoring Systems, Observability, Secure Access, Security

Airwallex

Senior Site Reliability Engineer, Spend

7 Days Ago

Remote or Hybrid

San Francisco Bay Area, CA

Senior level

Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI

The Senior Site Reliability Engineer will architect and implement scalable cloud infrastructure, lead incident response, and ensure system reliability for product initiatives.

Top Skills: AWS, Cloud Infrastructure, GCP, Kubernetes

Fieldguide

Senior Site Reliability Engineer

Reposted Yesterday

In-Office or Remote

San Francisco Bay Area, CA

190K-206K Annually

Senior level

Software

As a Senior Site Reliability Engineer, you will ensure the reliability and scalability of production systems, improve system performance, and enhance observability through design and automation.

Top Skills: AWS, CloudWatch, Datadog, Grafana, Prometheus, Terraform

MongoDB

Site Reliability Engineer (Senior or Staff), Infrastructure Security

Reposted 12 Days Ago

Easy Apply

Remote or Hybrid

San Francisco Bay Area, CA

127K-249K Annually

Senior level

Big Data • Cloud • Software • Database

The Senior Site Reliability Engineer will lead security design and implementation for cloud infrastructures, mentor teams, and automate security solutions.

Top Skills: Ansible, AWS, Azure, Cloud Security Tools, CloudFormation, GCP, Go, Terraform

C3 AI

Senior/Lead Site Reliability Engineer – Federal

Reposted 6 Days Ago

In-Office

San Francisco Bay Area, CA

159K-230K Annually

Senior level

Artificial Intelligence • Big Data • Machine Learning • Software

The role involves designing and implementing custom installations of the C3 AI Platform for Federal customers, ensuring uptime, and automating system processes while collaborating with cross-functional teams.

Top Skills: Ansible, AWS, Azure, Bash, Kubernetes, Linux, Puppet, Python, Ruby, Terraform

Kentik

Staff Site Reliability Engineer, Cloud

Reposted 4 Days Ago

Remote

San Francisco Bay Area, CA

165K-200K Annually

Senior level

Cloud • Information Technology

As a Staff Site Reliability Engineer, you will enhance cloud product lines, ensuring real-time scalability, collaborating with teams, and automating builds.

Top Skills: Ansible, AWS, Azure, Bash, DNS, Docker, Envoy, GCP, Git, Go, Grafana, HAProxy, HTTP, Jenkins, Kafka, Kubernetes, Linux, MySQL, OCI, OpenTelemetry, Postgres, Prometheus, Puppet, Python, Redis, TCP/IP, Telegraf, Terraform, TLS

Coalfire

Junior Site Reliability Engineer

Reposted 4 Days Ago

Remote

San Francisco Bay Area, CA

95K-110K Annually

Junior

Cloud • Security • Cybersecurity

As a Junior Site Reliability Engineer, you will support cloud operations, implement automation for cloud infrastructure, and ensure system reliability and security.

Top Skills: Ansible, AWS, Azure, Bash, Elastic Stack, GCP, Jira, PowerShell, Python, ServiceNow, Splunk, Terraform

Precisely

Site Reliability Engineer

Reposted 4 Days Ago

Remote

San Francisco Bay Area, CA

Senior level

Software

As a Site Reliability Engineer, you will enhance system reliability, manage cloud services, respond to incidents, and support network systems.

Top Skills: Automation, Cisco Routing, Cloud Services, F5 Load Balancing, Fortinet Firewalls, Infrastructure Automation, Monitoring, Networking

Andromeda (andromeda.ai)

Staff SRE, AI Infrastructure

7 Days Ago

In-Office or Remote

San Francisco Bay Area, CA

Senior level

Artificial Intelligence • Cloud • Information Technology • Software

As a Staff SRE, you will ensure the reliability and performance of Andromeda's GPU infrastructure, lead incident responses, build observability systems, and mentor engineers, while collaborating closely with engineering and customers.

Top Skills: Ansible, CUDA, Go, Helm, Kubernetes, Linux, NCCL, NVIDIA, Python, Rust, Slurm, Terraform

Superhuman

Site Reliability Engineer

13 Days Ago

Hybrid

San Francisco Bay Area, CA

214K-260K Annually

Senior level

Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI

The SRE will ensure the reliability of backend systems, scale Kubernetes-based control planes, and improve automation mechanisms while managing incident processes.

Top Skills: AWS, Azure, Docker, GCP, Java, Kubernetes, Linux, Terraform

Altruist

Senior Site Reliability Engineer

Reposted 2 Days Ago

In-Office

San Francisco Bay Area, CA

200K-250K Annually

Senior level

Fintech • Professional Services • Software

As a Senior Site Reliability Engineer, you'll design scalable systems on AWS, mentor engineers, manage incident responses, and enhance the reliability of fintech infrastructure.

Top Skills: Spark, AWS, DevOps, Java, Kubernetes, Terraform

Circle (circle.so)

Senior Site Reliability Engineer

Reposted 6 Days Ago

Easy Apply

Remote

San Francisco Bay Area, CA

130K-140K Annually

Senior level

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software

The Senior Site Reliability Engineer will manage system incidents, improve monitoring and logging, optimize database infrastructure, and collaborate on scaling systems efficiently.

Top Skills: AWS, ClickHouse, Kubernetes, MySQL, Postgres, Redis

Latent

Site Reliability Engineer

Reposted 7 Days Ago

In-Office

San Francisco Bay Area, CA

200K-275K Annually

Senior level

Artificial Intelligence • Healthtech • Information Technology • Software

As a Site Reliability Engineer, you will manage the production environment, focusing on infrastructure design, automation, and optimizing deployment pipelines to ensure high availability.

Top Skills: Helm, Kafka, Kubernetes, Postgres, Python, Redis, Terraform, TypeScript

Cohere AI

Site Reliability Engineer, Inference Infrastructure

Reposted 7 Days Ago

In-Office or Remote

San Francisco Bay Area, CA

Senior level

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Generative AI

The Site Reliability Engineer will develop, deploy, and operate AI infrastructure, focusing on high-performance and scalable machine learning systems using Kubernetes and cloud platforms.

Top Skills: AWS, Azure, C++, GCP, Go, Kubernetes, OCI

athenahealth

Lead Site Reliability Engineer

Reposted 5 Days Ago

Remote

San Francisco Bay Area, CA

119K-203K Annually

Senior level

Healthtech • Information Technology • Telehealth

The Lead Site Reliability Engineer is responsible for the reliability, automation, and performance of cloud services while mentoring a team and collaborating cross-functionally. The role also drives initiatives to enhance incident management and enforce security compliance.

Top Skills: Ansible, AWS, AWS CloudFormation, Azure, Bash, Datadog, Docker, ELK Stack, Go, GCP, Grafana, Kubernetes, Prometheus, Puppet, Python, Terraform

Guild Mortgage

Senior Site Reliability Engineer

Yesterday

Remote

San Francisco Bay Area, CA

95K-136K Annually

Senior level

Fintech • Real Estate

The Senior Site Reliability Engineer executes reliability strategies, designs and maintains infrastructure, improves monitoring and deployment processes, and collaborates with teams on system reliability and performance optimization.

Top Skills: Automated Configuration Management, Automated Provisioning, AWS, Azure, Azure Storage, Cloud-Based Solutions, Containerization Solutions, GCP, Git, Jira, Linux, MariaDB, MySQL, RDS, SQL Server, Unix, Windows

You.com

Senior Site Reliability Engineer

Reposted 3 Days Ago

In-Office

San Francisco Bay Area, CA

195K-240K Annually

Senior level

Software

The Site Reliability Engineer will enhance reliability, observability, and incident response of You.com's production services, while collaborating with teams to implement best practices and improve operational efficiency through tooling and automation.

Top Skills: AWS, Bash, CI/CD, EKS, GHA, Git, Grafana, OpenTelemetry, Prometheus, Python, Terraform

Juul Labs

Senior Site Reliability Engineer

Reposted Yesterday

Remote

San Francisco Bay Area, CA

185K-227K Annually

Senior level

Other

The Senior Site Reliability Engineer at Juul Labs ensures operational stability and performance of hybrid cloud infrastructure, leads automation, and handles critical incidents.

Top Skills: AWS, Bash, CloudFormation, GCP, Nutanix, PowerShell, Python, Terraform

Astera

Site Reliability Engineer

Reposted 8 Days Ago

Hybrid

San Francisco Bay Area, CA

Entry level

Artificial Intelligence • Machine Learning • Biotech • Generative AI

The Site Reliability Engineer will manage digital infrastructure, ensuring access to compute resources, automating processes, and maintaining resource visibility for researchers.

Top Skills: Ansible, Docker, Grafana, Kubernetes, Prometheus, Python, Tailscale, Talos Linux