Get the job you really want.

Top Senior Site Reliability Engineer Jobs in San Francisco, CA

Reposted 4 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
120K-150K Annually
Mid level
120K-150K Annually
Mid level
Cloud • Mobile • Software
Improve and protect production reliability and performance of AWS-based systems. Implement SRE practices (SLIs/SLOs, error budgets), build observability, automate infrastructure with Terraform, contribute code and tooling, participate in incident response, and document runbooks and best practices.
Top Skills: AWSDatadogDockerEcsEksGrafanaHoneycombIncident.IoKubernetesLlmsNew RelicNode.jsOpsgeniePagerdutyPrometheusPythonTerraformTypescript
Reposted 5 Days AgoSaved
In-Office
San Francisco Bay Area, CA
130K-280K Annually
Junior
130K-280K Annually
Junior
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills: ArgocdAWSKubernetesPythonTerraform
Reposted 5 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
214K-260K Annually
Senior level
214K-260K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
As a Site Reliability Engineer, you'll build software to ensure system reliability, scale infrastructure, and deploy ML systems while collaborating with cross-functional teams.
Top Skills: AWSAzureDockerGCPJavaKubernetesLinuxTerraform
Reposted 7 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
Mid level
Mid level
AdTech
As a Site Reliability Engineer, you'll maintain the infrastructure for systems, ensure efficiency, automate processes, monitor databases, and participate in architecture discussions.
Top Skills: Amazon KinesisAws LambdaAws SnsBigQueryDockerGcp (Google Cloud Platform)GitlabGoogle Cloud FunctionsGoogle Cloud RunGoogle Pub/SubGrafanaIstioKafkaKubernetesMySQLPrometheusSpannerSQLTerraform
Reposted 9 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
Senior level
Senior level
AdTech
The Site Reliability Engineer will build and maintain infrastructure, manage databases, automate operations, and ensure system efficiency and scalability at Attain.
Top Skills: Amazon KinesisAws LambdaAws SnsBigQueryDockerGCPGitlabGoogle Cloud FunctionsGoogle Cloud RunGoogle Pub/SubGrafanaIstioKafkaKubernetesMySQLPrometheusSpannerTerraform
4 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
165K-235K Annually
Mid level
165K-235K Annually
Mid level
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
The Site Reliability Engineer will automate tasks, enhance platform infrastructure, improve observability, and lead incident response efforts for optimal performance.
Top Skills: AWSGrafanaHoneycombLinuxPythonTerraform
Reposted 6 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
152K-179K Annually
Senior level
152K-179K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance CI/CD frameworks, automate cloud infrastructure, manage Kubernetes and AWS services, and ensure operational excellence.
Top Skills: AnsibleAWSBashChefCi/CdDockerGitKubernetesPuppetPythonRubySaltTerraform
8 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
The Lead Site Reliability Engineer will oversee the Infrastructure SRE team, focusing on system reliability, automation, and mentoring while collaborating with product engineering.
Top Skills: Ci/CdDatadogDockerElk StackGitopsGoKubernetesLinux/UnixNew RelicNoSQLPrometheusPythonSQLStackdriverTerraform
Reposted 8 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
118K-231K Annually
Senior level
118K-231K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills: AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
Reposted 18 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
147K-289K Annually
Senior level
147K-289K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills: AnsibleAWSAzureCloudFormationGCPGoTerraform
12 Days AgoSaved
Remote
San Francisco Bay Area, CA
150K-220K Annually
Senior level
150K-220K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
The engineer will build and operate AI/ML infrastructure, managing services on AWS and bare metal, using tools like Kubernetes and Terraform.
Top Skills: AWSBashGoKubernetesPythonSlurmTerraform
Reposted 4 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
165K-330K Annually
Mid level
165K-330K Annually
Mid level
Software
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
Top Skills: Circle CiCloudFormationElk StackGithub ActionsGitlab CiGrafanaJenkinsKubernetesOpentelemetryPrometheusPulumiTerraform
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
24 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
151K-297K Annually
Expert/Leader
151K-297K Annually
Expert/Leader
Big Data • Cloud • Software • Database
Lead a 6–8 person team managing the Kubernetes fleet and core runtime components (CoreDNS, cert-manager, Gatekeeper). Define technical vision and roadmap, guide migration from Terraform to Operator-driven lifecycle management, perform hands-on architectural reviews and PR reviews, resolve operational incidents, and collaborate with engineering leaders and stakeholders.
Top Skills: AlertingAWSAzureCert-ManagerContainerizationCorednsCrossplaneGatekeeperGCPKubernetesLoad BalancingObservabilityOperatorsService MeshTerraform
2 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
Develop and maintain Kubernetes runtime environments, support developers, resolve critical issues, and participate in on-call rotations for production systems.
Top Skills: AWSAzureCert-ManagerCorednsCrdsCriCsiGatekeeperGCPGoHelmKubernetesKustomizeOperatorsPythonTerraform
Reposted 16 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
Internship
Internship
Cloud • Information Technology • Security • Software • Cybersecurity
This internship role focuses on SRE skills, requiring collaboration and problem-solving in dynamic environments for Zscaler's Zero Trust Exchange team.
Top Skills: AnsibleAws EcsKubernetesLinuxPythonTerraform
3 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
147K-278K Annually
Senior level
147K-278K Annually
Senior level
Cloud • Software
Responsible for maintaining FedRAMP compliant services, designing infrastructure, monitoring systems, and ensuring security for federal regions, while driving automation and collaboration with development teams.
Top Skills: AWSFedrampGoKubernetesPuppetPythonTerraformUnix/Linux
Reposted 3 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
155K-196K Annually
Senior level
155K-196K Annually
Senior level
Cloud • Mobile • Software
Drive SRE practices and reliability strategy: implement SLIs/SLOs and error budgets, build observability (metrics, logs, traces, dashboards, alerts), evolve AWS/Terraform infrastructure, automate toil, participate in incident response, develop runbooks and safeguards, and collaborate with engineering and product teams to design and operate reliable services.
Top Skills: Ai-Assisted ToolingAWSDatadogDockerEcsEksGrafanaHoneycombIncident.IoInfrastructure As CodeKubernetesLlmsNew RelicNode.jsOpsgeniePagerdutyPrometheusPythonTerraformTypescript
Reposted 3 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
130K-232K Annually
Senior level
130K-232K Annually
Senior level
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you'll maintain and enhance the Currents data export system, focusing on observability, scalability, and reliability, while mentoring junior engineers and solving performance issues.
Top Skills: BuildkiteDatadogDocker SwarmGitGitlabJavaJenkinsKafkaKotlinKubernetesMongoDBPagerdutyPostgresRubySentrySidekiqSnsSqs
Reposted 14 Hours AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
180K-250K Annually
Senior level
180K-250K Annually
Senior level
Cloud • Digital Media • Information Technology
Operate and improve Kubernetes-based production systems, manage cluster lifecycle and networking, build CI/CD and GitOps pipelines, define SLOs and incident response, automate resolution with AI, implement monitoring/alerting, and drive reliability through automation and chaos engineering.
Top Skills: AnsibleArgocdBashBgpCalicoCephCiliumCni PluginsCorootDatadogDnsEbpfFalcoFluxcdGoGrafanaKubernetesLokiLonghornMetallbPrometheusPythonSIEMTerraformThanosVictoriametricsVxlanXdp
Reposted 14 Hours AgoSaved
In-Office
San Francisco Bay Area, CA
181K-237K Annually
Senior level
181K-237K Annually
Senior level
Healthtech • Insurance
The Senior Software Engineer will lead complex projects, mentor engineers, and ensure cloud infrastructure is resilient and automated. Responsibilities include developing software, managing production environments, and enforcing coding standards.
Top Skills: ArgocdAWSGCPGithub ActionsGrafanaIstioKubernetesPrometheusTerraform
YesterdaySaved
In-Office
San Francisco Bay Area, CA
230K-390K Annually
Senior level
230K-390K Annually
Senior level
Artificial Intelligence • Software
As a Software Engineer on the Site Reliability team, you'll ensure system reliability, scalability, and observability while partnering with engineering teams and improving incident management processes.
Top Skills: AWSCi/Cd ToolingContainer OrchestrationDatadogGrafanaPrometheusTerraform
Reposted 2 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
130K-185K Annually
Junior
130K-185K Annually
Junior
Healthtech • Software
As a DevOps Engineer, you'll build and maintain scalable infrastructures, manage monitoring systems, provide operational support, and collaborate across teams to enhance the company's cloud environment.
Top Skills: AnsibleAWSAzureBashChefDockerGCPGithub ActionsJenkinsPostgresPuppetPythonTerraform
Reposted 2 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
180K-275K Annually
Senior level
180K-275K Annually
Senior level
Financial Services
Design, develop, and deploy robust platform solutions while ensuring reliability, scalability, and security of the system. Collaborate with teams to enhance tooling and automation.
Top Skills: GCPKubernetesTerraform
Reposted 3 Days AgoSaved
In-Office or Remote
San Francisco Bay Area, CA
140K-205K Annually
Senior level
140K-205K Annually
Senior level
Information Technology • Legal Tech
The Senior Technology Site Reliability Engineer is responsible for maintaining and optimizing infrastructure and applications, ensuring reliability and performance while automating processes and collaborating with teams.
Top Skills: AWSChefDatadogGoGrafanaJavaPrometheusPuppetPythonSaltTerraform
Reposted 22 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
219K-245K Annually
Expert/Leader
219K-245K Annually
Expert/Leader
Big Data • Healthtech • HR Tech • Machine Learning • Software • Telehealth • Big Data Analytics
The Staff Site Reliability Engineer will architect, operate, and improve the platform while ensuring security compliance and enhancing development processes.
Top Skills: AWSElasticsearchIstioKubernetesNatsNode.jsPostgresPythonReactTerraformTypescript
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account