Get the job you really want.

Top Senior Site Reliability Engineer Jobs in San Francisco, CA

Reposted YesterdaySaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
160K-300K Annually
Senior level
160K-300K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
As a Site Reliability Engineer, you will ensure system uptime, manage CI/CD pipelines, and enhance security and observability while troubleshooting issues in a collaborative environment.
Top Skills: AWSAzureCloudFormationDatadogDockerGCPGrafanaKubernetesPrometheusTerraform
Reposted YesterdaySaved
In-Office or Remote
San Francisco Bay Area, CA
185K-327K Annually
Senior level
185K-327K Annually
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Embedded Site Reliability Engineer will develop and maintain software applications for Bitcoin mining, focusing on embedded systems and cloud observability. Responsibilities include software testing, bug triage, and collaboration with engineering teams to optimize performance and reliability.
Top Skills: CC++DatadogElasticGoGrafanaJavaScriptLinuxPythonRustSplunkSQLTypescript
Reposted 2 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
147K-289K Annually
Senior level
147K-289K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills: AnsibleAWSAzureCloudFormationGCPGoTerraform
2 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
195K-235K Annually
Senior level
195K-235K Annually
Senior level
Big Data • Information Technology • Productivity • Software • Analytics • Business Intelligence • Consulting
Join Celonis' Reliability Engineering team to ensure the health and performance of their platform, applying SRE principles and mentoring engineers while leading reliability efforts for microservices on Kubernetes.
Top Skills: ArgocdAWSAzureDatadogGCPGithub ActionsJavaKubernetesKustomizePythonSpring FrameworkTerraform
Reposted 3 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
Expert/Leader
Expert/Leader
Financial Services
As a Principal Site Reliability Engineer, you'll architect reliability solutions, lead observability initiatives, and mentor teams for enhanced operational efficiency.
Top Skills: Cloud-Native InstrumentationOpen TelemetryStreaming Data Platforms
Reposted 10 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills: AWSGCPAzureMongoDB
2 Days AgoSaved
Remote
San Francisco Bay Area, CA
84K-144K Annually
Senior level
84K-144K Annually
Senior level
Artificial Intelligence • Cloud • Consumer Web • eCommerce • Information Technology • Software
The Site Reliability Engineer will ensure application performance, architect monitoring tools, analyze systems, provide reliability recommendations, and support production.
Top Skills: AnsibleCentosDatadogDockerLinuxMySQLNew RelicRhelSQL
Reposted 12 Days AgoSaved
In-Office
San Francisco Bay Area, CA
130K-280K Annually
Junior
130K-280K Annually
Junior
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills: ArgocdAWSKubernetesPythonTerraform
Reposted 13 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
162K-198K Annually
Mid level
162K-198K Annually
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills: AnsibleApache AirflowChefConsulDockerGoGrafanaLinuxNginxNomadPostgresPrometheusPuppetPythonRustSaltstackSQLTemporal
6 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
165K-235K Annually
Mid level
165K-235K Annually
Mid level
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
The Site Reliability Engineer will support engineering teams, enhance system resilience, and drive scalable infrastructure practices.
Top Skills: Aws ServicesGrafanaHoneycombLinuxPythonTerraform
Reposted 15 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
148K-205K Annually
Senior level
148K-205K Annually
Senior level
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Design, scale, and manage AWS services for IoT devices. Collaborate on infrastructure, optimize performance, and ensure high availability of services.
Top Skills: AWSBashGoHelmKubernetesPythonRubyTerraform
Reposted 7 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
Senior level
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The AWS Cloud Architect will design, build, and optimize cloud infrastructure, ensuring scalability and security while mentoring junior SREs and defining cloud strategy.
Top Skills: AnsibleAws Api GatewayAws CloudfrontAws CloudtrailAws CloudwatchAws DocumentdbAws Ec2Aws EksAws LambdaAws RdsAws S3Aws Secrets ManagerAws SsmDockerGrafanaHashicorp ConsulHashicorp TerraformHashicorp VaultKubernetesNew RelicPrometheus
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 8 Days AgoSaved
Remote
San Francisco Bay Area, CA
140K-210K Annually
Senior level
140K-210K Annually
Senior level
Sales • Software • Automation
Join the Infrastructure Team to build and maintain critical systems, automating database lifecycles and enhancing disaster recovery with a focus on resilience and simplicity.
Top Skills: AnsibleArgocdAWSClickhouseDockerElasticsearchFlaskGithub ActionsGrafanaKubernetesMongoDBPostgresPythonRedisTerraform
Reposted 18 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
180K-230K Annually
Senior level
180K-230K Annually
Senior level
Artificial Intelligence • Healthtech • Software
As a Site Reliability Engineer, you will manage cloud infrastructure, implement observability, and ensure system reliability by collaborating with engineering teams and maintaining databases.
Top Skills: AzureBashGitGitKubernetesPostgresPythonRedisSQLTypescriptVscode
Reposted 9 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills: AWSFluentbitGCPJaegerKubernetesAzureQuickwitSplunkVectorVictoriametrics
Reposted 22 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
170K-220K Annually
Senior level
170K-220K Annually
Senior level
Artificial Intelligence • Machine Learning • Software
As a Staff Site Reliability Engineer, you will enhance the reliability, scalability, and performance of production services by applying SRE principles, implementing observability practices, automating processes, and collaborating with engineering teams.
Top Skills: AWSAzureCloudFormationDatadogDockerElk StackGCPGoGrafanaJaegerKubernetesOpentelemetryOpentofuPrometheusPythonTerraform
Reposted 13 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
175K-175K Annually
Senior level
175K-175K Annually
Senior level
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills: AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
Reposted 25 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
176K-241K Annually
Senior level
176K-241K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a team of developers to create cloud-based solutions while driving transformations using DevOps practices. Collaborate across teams to solve business challenges and mentor engineers.
Top Skills: AnsibleAWSDockerGoJavaKubernetesPythonRubySQLTerraform
Reposted 18 Hours AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
150K-200K Annually
Junior
150K-200K Annually
Junior
Artificial Intelligence • Information Technology
As a Site Reliability Engineer, maintain user-facing services, implement best practices for reliability, and manage production incidents.
Top Skills: AnsibleCloud ServicesKubernetesProgramming LanguagesTerraform
Reposted 19 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
118K-231K Annually
Senior level
118K-231K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills: AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
Reposted YesterdaySaved
In-Office
San Francisco Bay Area, CA
165K-250K Annually
Senior level
165K-250K Annually
Senior level
Artificial Intelligence • Healthtech • Information Technology • Software
As a Site Reliability Engineer, you will manage the production environment, focusing on infrastructure design, automation, and optimizing deployment pipelines to ensure high availability.
Top Skills: HelmKafkaKubernetesPostgresPythonRedisTerraformTypescript
Reposted YesterdaySaved
In-Office or Remote
San Francisco Bay Area, CA
205K-235K Annually
Senior level
205K-235K Annually
Senior level
Financial Services
The Senior Cluster Site Reliability Engineer will enhance the research compute cluster's uptime, reliability, and performance through engineering and operational improvements, ensuring high availability for researchers working on machine learning problems.
Top Skills: AnsibleAWSAWSCephDockerElkGCPGCPGrafanaHorovodHpcInfinibandKubeflowKueueLokiLustreMlflowOpentelemetryPodmanPrometheusPythonRdmaRubyS3SingularitySlurmTerraform
Reposted YesterdaySaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
180K-440K Annually
Mid level
180K-440K Annually
Mid level
Information Technology
As a Site Reliability Engineer, you'll design and operate scalable storage systems and optimize performance for AI research data management.
Top Skills: GoKubernetesPulumiRust
2 Days AgoSaved
In-Office
San Francisco Bay Area, CA
196K-248K Annually
Senior level
196K-248K Annually
Senior level
Automotive
As a Senior Technical Program Manager for SRE & On-call Excellence, you will manage projects that improve incident response, on-call protocols, and system reliability, collaborating with various engineering teams to drive successful execution.
Top Skills: Cloud InfrastructureDevops PracticesDistributed SystemsSite Reliability Engineering
2 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
169K-276K Annually
Senior level
169K-276K Annually
Senior level
Energy
The Site Reliability Engineer will design and implement scalable systems, automate IT infrastructure management, and support deployed systems, ensuring high availability and performance.
Top Skills: Active DirectoryAnsibleAWSAzureChefJSONLinuxPuppetPythonRestVMwareWindows ServerYaml
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account