Get the job you really want.

Top Reliability Engineer Jobs in San Francisco, CA

8 Days AgoSaved
Easy Apply
In-Office or Remote
San Francisco Bay Area, CA
Easy Apply
90K-140K Annually
Senior level
90K-140K Annually
Senior level
eCommerce • Retail • Software
As a Senior Database Reliability Engineer, you will manage database systems, enhance observability through automation, and lead database upgrade initiatives while ensuring security and reliability.
Top Skills: AWSCi/CdDynamoDBElasticsearchMongoDBMySQLPostgresPowershellPythonRedisSQL Server
Reposted 13 Hours AgoSaved
Remote
San Francisco Bay Area, CA
Senior level
Senior level
Database
Manage and optimize Postgres databases at scale on AWS RDS, own reliability/monitoring, execute low-downtime upgrades and migrations, troubleshoot production issues, participate in on-call rotation, and collaborate with platform and product teams.
Top Skills: Aws RdsBarmanGoPgbackrestPostgresTypescriptWal-G
Reposted 13 Hours AgoSaved
Remote
San Francisco Bay Area, CA
145K-180K Annually
Senior level
145K-180K Annually
Senior level
Legal Tech • Software
Lead automation and optimization of Filevine's data platform: performance tune MSSQL/Postgres, optimize Snowflake, provision infrastructure with Terraform/AWS, run stateful containers on Kubernetes, integrate AI/LLM and MCP for operational automation, manage CI/CD, capacity planning, documentation, and serve in 24/7 on-call rotation.
Top Skills: AWSC#DapperDockerDynamoDBEntity FrameworkGitlabKubernetesLlmsMcp (Model Context Protocol)Microsoft Sql Server (Mssql)Octopus DeployOpensearchPostgresPowershellPythonRedisSnowflakeTerraform
Reposted 6 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
152K-179K Annually
Senior level
152K-179K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance CI/CD frameworks, automate cloud infrastructure, manage Kubernetes and AWS services, and ensure operational excellence.
Top Skills: AnsibleAWSBashChefCi/CdDockerGitKubernetesPuppetPythonRubySaltTerraform
8 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
The Lead Site Reliability Engineer will oversee the Infrastructure SRE team, focusing on system reliability, automation, and mentoring while collaborating with product engineering.
Top Skills: Ci/CdDatadogDockerElk StackGitopsGoKubernetesLinux/UnixNew RelicNoSQLPrometheusPythonSQLStackdriverTerraform
Reposted 2 Days AgoSaved
Remote
San Francisco Bay Area, CA
175K-275K Annually
Senior level
175K-275K Annually
Senior level
Software
Own reliability, performance, and scalability of PostgreSQL infrastructure. Implement HA, replication, observability, capacity planning, automation, and DR. Support engineering teams with migrations, query optimization, on-call incident response, runbooks, and tooling to enable safe DB operations.
Top Skills: AnsibleAuroraAws RdsChefDatadogDynamoDBElasticacheGoGrafanaIndexingMvccPatroniPgbouncerPostgresPrometheusPythonQuery PlannerReplicationRubySQLTerraformVacuum TuningWal
Reposted 8 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
118K-231K Annually
Senior level
118K-231K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills: AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
4 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
120K-160K Annually
Mid level
120K-160K Annually
Mid level
Fintech • Financial Services
The Systems Reliability Engineer supports MEMX exchange platforms by responding to incidents, debugging issues, improving processes, and working with cross-functional teams to ensure platform availability.
Top Skills: AnsibleBashChefLinuxLinux ShellMonitoring ToolsPuppetPython
Reposted 18 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
147K-289K Annually
Senior level
147K-289K Annually
Senior level
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills: AnsibleAWSAzureCloudFormationGCPGoTerraform
Reposted 11 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
195K-270K Annually
Expert/Leader
195K-270K Annually
Expert/Leader
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Principal Software Engineer on the SRE team, lead best practices adoption, mentor engineers, and improve system reliability and user experience through automation and collaboration.
Top Skills: CdkCloudFormationDatadogGoJavaScriptPrometheusPythonTerraformTypescript
12 Days AgoSaved
Remote
San Francisco Bay Area, CA
150K-220K Annually
Senior level
150K-220K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
The engineer will build and operate AI/ML infrastructure, managing services on AWS and bare metal, using tools like Kubernetes and Terraform.
Top Skills: AWSBashGoKubernetesPythonSlurmTerraform
Reposted 16 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
123K-193K Annually
Senior level
123K-193K Annually
Senior level
Energy
As a Reliability Engineer, you'll define system reliability requirements, execute various reliability tests, and analyze data to improve product performance in hardware engineering for energy storage.
Top Skills: JmpMinitabPython
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 12 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
187K-224K Annually
Senior level
187K-224K Annually
Senior level
eCommerce • Healthtech • Kids + Family • Retail • Social Media
Seeking a Senior Software Engineer, Site Reliability to ensure system stability, scalability, and reliability, while optimizing AWS infrastructure using modern DevOps practices and tools like Terraform, Docker, and Kubernetes.
Top Skills: AWSCircleCICronitorDatadogDockerGithub ActionsJenkinsKubernetesMySQLPagerdutyReactRedisRuby On RailsSentrySidekiqTerraform
17 Days AgoSaved
In-Office
San Francisco Bay Area, CA
182K-249K Annually
Senior level
182K-249K Annually
Senior level
Cloud
As a Database Reliability Engineer, oversee MySQL database services, ensure performance and availability, coordinate infrastructure tuning, and enhance operational processes.
Top Skills: ChefCloudsqlDockerGrafanaKubernetesLinuxMySQLPostgresRds AuroraTerraform
23 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
160K-300K Annually
Senior level
160K-300K Annually
Senior level
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
Own and improve critical production services end-to-end by writing production-quality code: instrumenting services, eliminating performance bottlenecks, building deployment and observability platforms, defining SLOs, running incident response and post-mortems, capacity planning and cost optimization, maintaining CI/CD, and embedding with product teams to design reliable systems.
Top Skills: AWSC++Ci/CdContainer OrchestrationGoObservability StacksPythonRust
17 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
170K-226K Annually
Senior level
170K-226K Annually
Senior level
Artificial Intelligence • Hardware • Robotics • Software
The role involves developing and executing test strategies for autonomous systems, collaborating with engineering teams for reliability, and analyzing data for risk assessments.
Top Skills: PythonSQL
8 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
90K-140K Annually
Senior level
90K-140K Annually
Senior level
eCommerce • Retail • Software
The Senior Database Reliability Engineer ensures database availability, reliability, and efficiency, driving initiatives for upgrades, automation, and security while mentoring team members.
Top Skills: AWSDynamoDBElasticsearchMongoDBMySQLPostgresPowershellPythonRedisSQL Server
9 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
85K-90K Annually
Mid level
85K-90K Annually
Mid level
Food
The Reliability Engineer will manage maintenance of fixed assets, focusing on equipment reliability, predictive maintenance, and collaboration to reduce downtime and improve performance metrics of packaging operations.
Top Skills: Automation EquipmentThermoforming Packaging MachinesTpm
24 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
151K-297K Annually
Expert/Leader
151K-297K Annually
Expert/Leader
Big Data • Cloud • Software • Database
Lead a 6–8 person team managing the Kubernetes fleet and core runtime components (CoreDNS, cert-manager, Gatekeeper). Define technical vision and roadmap, guide migration from Terraform to Operator-driven lifecycle management, perform hands-on architectural reviews and PR reviews, resolve operational incidents, and collaborate with engineering leaders and stakeholders.
Top Skills: AlertingAWSAzureCert-ManagerContainerizationCorednsCrossplaneGatekeeperGCPKubernetesLoad BalancingObservabilityOperatorsService MeshTerraform
Reposted 18 Days AgoSaved
In-Office
San Francisco Bay Area, CA
150K-300K Annually
Mid level
150K-300K Annually
Mid level
Artificial Intelligence • Information Technology • Robotics • Software
Sieve is seeking a Founding Reliability Engineer to build and maintain infrastructure for petabyte-scale video workloads, focusing on reliability and security. Responsibilities include incident response, cloud security, and observability systems management.
Top Skills: AWSC++CloudflareGCPGoOpentelemetryOraclePrometheusPythonRustTerraformVictoriametrics
Reposted 18 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
112K-152K Annually
Senior level
112K-152K Annually
Senior level
Energy
As a Reliability Engineer, you will define system reliability requirements, conduct tests, analyze failures, and collaborate on design improvements for hardware products.
Top Skills: JmpMinitabPython
Reposted 14 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
140K-170K Annually
Senior level
140K-170K Annually
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will enhance system reliability, develop production-grade code, implement observability tools, conduct root cause analyses, and collaborate on system design for scalability.
Top Skills: ArgocdCi/CdDockerGitopsGoGrafanaHoneycombJenkinsKubernetesOpentelemetryPrometheusPythonTerraform
Reposted 19 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
Mid level
Mid level
Cloud • Internet of Things • Agriculture
The Hardware Test & Reliability Engineer will ensure product validation, oversee reliability testing, lead root cause analysis, and automate testing processes using Python to enhance data-driven quality improvement.
Top Skills: DmmsLogic AnalyzersOscilloscopesPython
16 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves supporting network infrastructure, automating cloud infrastructure, managing CI/CD workflows, and ensuring operational excellence in IT support, including incident response and security practices.
Top Skills: AnsibleAWSBashDockerGitKubernetesPythonRubyTerraform
Reposted 16 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
Internship
Internship
Cloud • Information Technology • Security • Software • Cybersecurity
This internship role focuses on SRE skills, requiring collaboration and problem-solving in dynamic environments for Zscaler's Zero Trust Exchange team.
Top Skills: AnsibleAws EcsKubernetesLinuxPythonTerraform
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account