Get the job you really want.

Top Reliability Engineer Jobs in San Francisco, CA

Reposted 12 Days AgoSaved
In-Office
San Francisco Bay Area, CA
Senior level
Senior level
Artificial Intelligence • Software
As a Senior/Staff Network Reliability Engineer, you'll optimize and maintain Fluidstack's network platform, ensuring performance and reliability for AI and HPC workloads. Responsibilities include tuning networking protocols, deploying and validating switches, automating telemetry, conducting root-cause analyses, and collaborating with vendors.
Top Skills: BgpDpdkEbpfEvpnGeneveGoPythonRdmaRustTcp/IpVxlanXdp
Reposted 12 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
112K-152K Annually
Senior level
112K-152K Annually
Senior level
Energy
As a Reliability Engineer, you will define reliability requirements, analyze design failures, run tests, and ensure high standards for hardware products.
Top Skills: JmpMinitabPython
Reposted 8 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
Senior level
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The AWS Cloud Architect will design, build, and optimize cloud infrastructure, ensuring scalability and security while mentoring junior SREs and defining cloud strategy.
Top Skills: AnsibleAws Api GatewayAws CloudfrontAws CloudtrailAws CloudwatchAws DocumentdbAws Ec2Aws EksAws LambdaAws RdsAws S3Aws Secrets ManagerAws SsmDockerGrafanaHashicorp ConsulHashicorp TerraformHashicorp VaultKubernetesNew RelicPrometheus
9 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
164K-226K Annually
Senior level
164K-226K Annually
Senior level
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Senior Software Engineer focused on Site Reliability Tooling, you'll enhance system reliability, implement SRE practices, and build automation tools to support site reliability across Upstart's infrastructure.
Top Skills: CdkCloudFormationDatadogGoJavaScriptKubernetesPrometheusPythonTerraformTypescript
13 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
160K-185K Annually
Senior level
160K-185K Annually
Senior level
Industrial • Manufacturing
Lead reliability strategy for HVAC components, develop predictive life models, design tests, and improve product durability. Analyze data and mentor junior engineers.
Top Skills: Accelerated Test MethodsHvac SystemsIot DevicesPredictive Life ModelsWeibull Modeling
13 Days AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
160K-185K Annually
Senior level
160K-185K Annually
Senior level
Appliances
Lead reliability strategies for HVAC components, develop predictive models, conduct tests, analyze data, and mentor junior engineers.
Top Skills: Accelerated Life TestingCorrosion TestingEnvironmental ChambersHvac SystemsReliability EngineeringVibration TablesWeibull Analysis
Reposted 9 Days AgoSaved
Remote
San Francisco Bay Area, CA
140K-210K Annually
Senior level
140K-210K Annually
Senior level
Sales • Software • Automation
Join the Infrastructure Team to build and maintain critical systems, automating database lifecycles and enhancing disaster recovery with a focus on resilience and simplicity.
Top Skills: AnsibleArgocdAWSClickhouseDockerElasticsearchFlaskGithub ActionsGrafanaKubernetesMongoDBPostgresPythonRedisTerraform
Reposted 4 Days AgoSaved
Easy Apply
Remote
San Francisco Bay Area, CA
Easy Apply
130K-150K Annually
Mid level
130K-150K Annually
Mid level
Marketing Tech
The Cloud Reliability Engineer develops, configures, and deploys cloud tools, enhances applications, ensures observability, and participates in on-call rotations.
Top Skills: AWSCi/CdDockerGithub ActionsGoGoogle BigqueryGCPKubernetesLinuxPythonSQLTerraform
Reposted 19 Days AgoSaved
Hybrid
San Francisco Bay Area, CA
180K-230K Annually
Senior level
180K-230K Annually
Senior level
Artificial Intelligence • Healthtech • Software
As a Site Reliability Engineer, you will manage cloud infrastructure, implement observability, and ensure system reliability by collaborating with engineering teams and maintaining databases.
Top Skills: AzureBashGitGitKubernetesPostgresPythonRedisSQLTypescriptVscode
Reposted 10 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills: AWSFluentbitGCPJaegerKubernetesAzureQuickwitSplunkVectorVictoriametrics
15 Days AgoSaved
In-Office
San Francisco Bay Area, CA
135K-155K Annually
Expert/Leader
135K-155K Annually
Expert/Leader
Information Technology • Security • Cybersecurity
Responsible for managing Oracle RAC databases, optimizing performance, ensuring security and integrity, and providing 24x7 support for production applications.
Top Skills: CassandraCephElasticsearchKafkaOracleRedis
16 Days AgoSaved
In-Office
San Francisco Bay Area, CA
152K-228K Annually
Senior level
152K-228K Annually
Senior level
Cloud
The role involves designing and optimizing PostgreSQL clusters, automating database tasks, and ensuring high availability and performance while collaborating with other engineering teams.
Top Skills: AnsibleDatadogGoGrafanaKubernetesMySQLPostgresPrometheusPythonTerraform
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 8 Days AgoSaved
Remote
San Francisco Bay Area, CA
Senior level
Senior level
Artificial Intelligence • Cybersecurity
The Database Reliability Engineer will ensure database availability, performance, scalability, and security across AWS, collaborating with application and security teams.
Top Skills: AWSCrossplaneDatadogGitlab Ci/CdKubernetesNoSQLOpensearchPostgresTerraform
Reposted 23 Days AgoSaved
Easy Apply
Hybrid
San Francisco Bay Area, CA
Easy Apply
170K-220K Annually
Senior level
170K-220K Annually
Senior level
Artificial Intelligence • Machine Learning • Software
As a Staff Site Reliability Engineer, you will enhance the reliability, scalability, and performance of production services by applying SRE principles, implementing observability practices, automating processes, and collaborating with engineering teams.
Top Skills: AWSAzureCloudFormationDatadogDockerElk StackGCPGoGrafanaJaegerKubernetesOpentelemetryOpentofuPrometheusPythonTerraform
Reposted 14 Days AgoSaved
Remote or Hybrid
San Francisco Bay Area, CA
175K-175K Annually
Senior level
175K-175K Annually
Senior level
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills: AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
Reposted 20 Days AgoSaved
In-Office
San Francisco Bay Area, CA
150K-215K Annually
Senior level
150K-215K Annually
Senior level
Aerospace • Hardware • Logistics • Robotics • Software • Transportation
Design for Reliability Engineer responsible for ensuring the safety and reliability of drone-delivery systems through testing, statistical analysis, and innovative design solutions.
Top Skills: JmpMatlabMinitabPythonReliasoft
17 Days AgoSaved
Remote
San Francisco Bay Area, CA
148K-195K Annually
Mid level
148K-195K Annually
Mid level
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
The Site Reliability Engineer will build and maintain infrastructure, improve software systems, develop scalable microservices, and ensure quality software delivery.
Top Skills: AWSGoGoogle Cloud PlatformJavaKubernetesAzureSQL
Reposted 5 Hours AgoSaved
In-Office
San Francisco Bay Area, CA
180K-200K Annually
Senior level
180K-200K Annually
Senior level
Productivity
The Senior Site Reliability Engineer will enhance site reliability through monitoring, optimizing infrastructure, collaborating on engineering projects, and ensuring systems’ stability.
Top Skills: AWSDockerKubernetesTemporal
Reposted 5 Hours AgoSaved
In-Office
San Francisco Bay Area, CA
255K-490K Annually
Mid level
255K-490K Annually
Mid level
Artificial Intelligence • Machine Learning • Generative AI
The Software Engineer in Reliability will ensure system scalability, reliability, and performance, collaborating with teams to improve infrastructure and handle incidents.
Top Skills: Cloud InfrastructureCloudFormationContainer Orchestration PlatformsContainerization TechnologiesDatadogGrafanaIac ToolsKubernetesMicroservices ArchitectureObservability ToolsProgramming LanguagesPrometheusService Mesh TechnologiesSplunkTerraform
Reposted 5 Hours AgoSaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
Senior level
Senior level
Software • Generative AI
As a Site Reliability Engineer at Fireworks AI, you'll ensure system reliability, manage incidents, develop monitoring solutions, and reduce operational toil, while collaborating with software engineers to embed reliability in the development lifecycle.
Top Skills: AWSAzureDockerElk StackGCPGoGrafanaKubernetesPrometheusPython
Reposted 5 Hours AgoSaved
Hybrid
San Francisco Bay Area, CA
175K-225K Annually
Senior level
175K-225K Annually
Senior level
Artificial Intelligence • Machine Learning • Database
The role involves ensuring the reliability and performance of distributed database systems, developing monitoring strategies, and automating operations in a cloud-native environment.
Top Skills: AnsibleArgoAWSAzureDockerGCPGitlab CiGoJavaJenkinsKubernetesPythonTerraform
Reposted YesterdaySaved
Easy Apply
In-Office
San Francisco Bay Area, CA
Easy Apply
150K-200K Annually
Junior
150K-200K Annually
Junior
Artificial Intelligence • Information Technology
As a Site Reliability Engineer, maintain user-facing services, implement best practices for reliability, and manage production incidents.
Top Skills: AnsibleCloud ServicesKubernetesProgramming LanguagesTerraform
Reposted 20 Days AgoSaved
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
Easy Apply
118K-231K Annually
Senior level
118K-231K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills: AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
Reposted 2 Days AgoSaved
In-Office or Remote
San Francisco Bay Area, CA
120K-160K Annually
Mid level
120K-160K Annually
Mid level
Consumer Web • Mobile
As a Site Reliability Engineer at Patreon, you'll improve AWS infrastructure, implement SRE practices, enhance Kubernetes capabilities, and develop automation tools.
Top Skills: AnsibleAWSChefKubernetesPuppetPythonTerraform
Reposted 2 Days AgoSaved
In-Office
San Francisco Bay Area, CA
165K-250K Annually
Senior level
165K-250K Annually
Senior level
Artificial Intelligence • Healthtech • Information Technology • Software
As a Site Reliability Engineer, you will manage the production environment, focusing on infrastructure design, automation, and optimizing deployment pipelines to ensure high availability.
Top Skills: HelmKafkaKubernetesPostgresPythonRedisTerraformTypescript
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account