Get the job you really want.

Top Senior Site Reliability Engineer Jobs in San Francisco, CA

10 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
147K-278K Annually
Senior level
147K-278K Annually
Senior level
Cloud • Software
Responsible for maintaining FedRAMP-compliant infrastructure, collaborating with software engineers, and ensuring system availability and security. Duties include infrastructure design, automation, monitoring, and incident response.
Top Skills: AWSGoKubernetesPuppetPythonTerraform
Reposted 5 Days AgoSaved
In-Office
San Francisco Bay Area, CA
Senior level
Senior level
Software
Design, implement, and maintain scalable backend systems and APIs; build cloud infrastructure (preferably GCP) using Terraform; operate containerized workloads with Kubernetes; ensure reliability, security, and performance; participate in on-call rotations, architecture discussions, and cross-functional delivery.
Top Skills: Ci/CdCloud AutomationContainer OrchestrationGoGoogle Cloud PlatformIamInfrastructure As CodeKubernetesMicroservicesPythonService-Oriented ArchitectureTerraform
2 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
130K-140K Annually
Senior level
130K-140K Annually
Senior level
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, improve monitoring and logging, optimize database infrastructure, and collaborate on scaling systems efficiently.
Top Skills: AWSClickhouseKubernetesMySQLPostgresRedis
Reposted 6 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
The Site Reliability Engineer will ensure high availability and performance of CodeRabbit's AI-powered code review platform, enhancing system reliability through infrastructure ownership, performance engineering, and automation.
Top Skills: AWSDatadogDockerElk StackGoogle Cloud PlatformGrafanaKubernetesLinuxNode.jsPrometheusTerraformTypescript
7 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
30K-120K Annually
Senior level
30K-120K Annually
Senior level
Information Technology • Automation
The SRE/Infrastructure Engineer will architect and manage secure, scalable systems for automated penetration testing, optimizing reliability, and enhancing infrastructure based on customer demand. Responsibilities include maintaining production environments, leading technical discussions, and promoting high coding standards.
Top Skills: AWSAzureCloudFormationElkGCPNew RelicOpentelemetryPostgresPrometheusTerraform
Reposted 3 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves supporting network infrastructure, automating cloud infrastructure, managing CI/CD workflows, and ensuring operational excellence in IT support, including incident response and security practices.
Top Skills: AnsibleAWSBashDockerGitKubernetesPythonRubyTerraform
Reposted 7 Days AgoSaved
Easy Apply
In-Office or Remote
San Francisco Bay Area, CA
Easy Apply
200K-260K Annually
Senior level
200K-260K Annually
Senior level
Artificial Intelligence • Software • Generative AI
The Lead Site Reliability Engineer will drive technical strategy, ensure high service availability, manage cloud infrastructure, and lead a team to optimize systems and automate processes.
Top Skills: AWSAzureDockerGoogle Cloud PlatformKubernetesTerraform
Reposted 17 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
165K-330K Annually
Mid level
165K-330K Annually
Mid level
Software
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
Top Skills: Circle CiCloudFormationElk StackGithub ActionsGitlab CiGrafanaJenkinsKubernetesOpentelemetryPrometheusPulumiTerraform
Reposted 8 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
175K-320K Annually
Mid level
175K-320K Annually
Mid level
Artificial Intelligence • Software
The SRE at Fluidstack is responsible for ensuring infrastructure reliability and performance, handling complex production issues, and improving platform stability.
Top Skills: AnsibleBashGoKubernetesPythonSlurmTerraform
15 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
Develop and maintain Kubernetes runtime environments, support developers, resolve critical issues, and participate in on-call rotations for production systems.
Top Skills: AWSAzureCert-ManagerCorednsCrdsCriCsiGatekeeperGCPGoHelmKubernetesKustomizeOperatorsPythonTerraform
Reposted 9 Days AgoSaved
In-Office
San Francisco Bay Area, CA
116K-200K Annually
Mid level
116K-200K Annually
Mid level
Information Technology • Mobile • Software
As a Site Reliability Engineer, you'll ensure system reliability and scalability, automate processes, optimize performance, and collaborate on system design.
Top Skills: AWSAzureBashCloudFormationDatadogDockerElkGoGoogle Cloud PlatformGrafanaHelmKubernetesNew RelicPrometheusPulumiPythonTerraform
Reposted 12 Hours AgoSaved
Remote
San Francisco Bay Area, CA
Mid level
Mid level
Blockchain • Web3
As a Site Reliability Engineer, you'll enhance observability, logging, and tracing, collaborating with engineers to optimize performance and security of infrastructure.
Top Skills: AnsibleAWSAws CdkGCPGitGoGrafanaKubernetesLgtmLokiMimirOpentelemetryPrometheusRustSentryTempoTerraformTypescriptWebassembly
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 12 Hours AgoSaved
Remote
San Francisco Bay Area, CA
200K-250K Annually
Senior level
200K-250K Annually
Senior level
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills: Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform
Reposted 12 Hours AgoSaved
Remote
San Francisco Bay Area, CA
165K-200K Annually
Senior level
165K-200K Annually
Senior level
Cloud • Information Technology
As a Staff Site Reliability Engineer, you will enhance cloud product lines, ensuring real-time scalability, collaborating with teams, and automating builds.
Top Skills: AnsibleAWSAzureBashDnsDockerEnvoyGCPGitGoGrafanaHaproxyHTTPJenkinsKafkaKubernetesLinuxMySQLOciOpentelemetryPostgresPrometheusPuppetPythonRedisTcp/IpTelegrafTerraformTls
YesterdaySaved
Remote
San Francisco Bay Area, CA
119K-203K Annually
Senior level
119K-203K Annually
Senior level
Healthtech • Information Technology • Telehealth
Lead Site Reliability Engineer responsible for ensuring cloud services reliability, automation, and performance while mentoring a team and collaborating cross-functionally. Drive initiatives to enhance incident management and enforce security compliance.
Top Skills: AnsibleAWSAws CloudformationAzureBashDatadogDockerElk StackGoGCPGrafanaKubernetesPrometheusPuppetPythonTerraform
YesterdaySaved
Remote
San Francisco Bay Area, CA
86K-109K Annually
Senior level
86K-109K Annually
Senior level
Information Technology • Consulting
The Site Reliability Engineer will drive the observability roadmap, standardize monitoring practices, optimize alerting tools, and collaborate with teams to enhance operational efficiency and system reliability.
Top Skills: .NetAsp.Net CoreAWSAzureC#DatadogDockerGCPGrafanaKubernetesNew RelicPowershellPrometheusReactSplunkWeb Apis
10 Days AgoSaved
In-Office
San Francisco Bay Area, CA
194K-267K Annually
Senior level
194K-267K Annually
Senior level
Cloud
The Site Reliability Engineer will manage Kubernetes platforms, optimize AWS cloud infrastructure, ensure high availability, and automate deployment while handling troubleshooting and security compliance.
Top Skills: AWSBashCi/CdCloudwatchElk StackGoGrafanaHelmIstioKubernetesPrometheusPythonTerraform
10 Days AgoSaved
In-Office or Remote
San Francisco Bay Area, CA
250K-295K Annually
Senior level
250K-295K Annually
Senior level
Artificial Intelligence • Software
As a Senior Staff SRE Tech Lead, you'll oversee reliability and scalability, mentor engineers, optimize systems, and enhance data infrastructure.
Top Skills: ClickhouseGoPostgresPythonTypescript
Reposted YesterdaySaved
Remote
San Francisco Bay Area, CA
165K-200K Annually
Expert/Leader
165K-200K Annually
Expert/Leader
Cloud • Information Technology
As a Staff Platform Engineer, you'll develop and maintain infrastructure components using Go and Node.js, improve service reliability, mentor juniors, and manage data ecosystems.
Top Skills: EnvoyExpressGoJenkinsKafkaMySQLNode.jsPostgresPuppetPythonReactRedis
Reposted YesterdaySaved
In-Office or Remote
San Francisco Bay Area, CA
Mid level
Mid level
Sports
Manage and improve the AWS infrastructure, deploy into new regions, monitor releases, and implement new technologies in a fast-paced environment.
Top Skills: AWSDockerGrafanaKubernetesPrometheusPython
2 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
96K-120K Annually
Mid level
96K-120K Annually
Mid level
Cloud • Security • Software
As a Site Reliability Engineer, you'll design and optimize cloud infrastructure, automate compliance, manage Kubernetes, and maintain reliability in regulated environments.
Top Skills: Ci/Cd PipelinesDockerGoogle Cloud Platform (Gcp)KubernetesNist 800-53
Reposted 2 Days AgoSaved
Remote
San Francisco Bay Area, CA
Mid level
Mid level
Software • Analytics
The role involves automating and managing AWS infrastructure, ensuring reliability and scalability of stateful systems, and optimizing deployment processes. You'll also handle incident responses and improve operational tooling.
Top Skills: AWSKubernetesTerraformTerragrunt
2 Days AgoSaved
In-Office or Remote
San Francisco Bay Area, CA
76K-136K Annually
Junior
76K-136K Annually
Junior
Cloud • Security • Software • Cybersecurity
As a Site Reliability Engineer, you will troubleshoot production issues, automate systems, define database requirements, and collaborate with Dev and QA teams for stability.
Top Skills: AnsibleCassandraChefNoSQLPythonRedis
Reposted 11 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
250K-295K Annually
Senior level
250K-295K Annually
Senior level
Artificial Intelligence • Software
As Staff SRE Tech Lead, you'll oversee platform reliability and scalability, lead the SRE team, architect data infrastructures, and optimize systems while implementing automation and observability practices.
Top Skills: ClickhouseGoPostgresPythonTypescript
Reposted 11 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
180K-210K Annually
Senior level
180K-210K Annually
Senior level
AdTech • Marketing Tech • Analytics
As a Staff Software Engineer - SRE, you'll manage cloud infrastructure, improve application reliability, collaborate across teams, and support back-office systems.
Top Skills: AWSDatadogDockerKafkaKibanaKubernetesLinuxPostgresPythonRdsRedshiftShell/BashSparkTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account