Top Reliability Engineer Jobs in San Francisco, CA

Reposted 17 Days Ago
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
180K-220K Annually
Senior level
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills: AWS, Docker, GCP, Kubernetes
Reposted 17 Days Ago
Remote or Hybrid
San Francisco Bay Area, CA
190K-235K Annually
Senior level
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills: AI/LLM, ArgoCD, AWS, CI/CD, Datadog, GitHub Actions, GitOps, Go, Kubernetes, Python
19 Days Ago
Easy Apply
Remote or Hybrid
San Francisco Bay Area, CA
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
As a Senior Site Reliability Engineer, you'll design and build complex systems, support Atlas platform operations, automate processes, and ensure high availability of services.
Top Skills: AWS, Azure, DNS, GCP, Go, HTTP, Linux, Python, Ruby, TLS
Reposted 13 Hours Ago
In-Office
San Francisco Bay Area, CA
225K-445K Annually
Expert/Leader
Artificial Intelligence • Machine Learning • Generative AI
The Reliability/DFX Engineer will oversee DFX architecture and improve system reliability, working closely with teams on AI hardware design and implementation.
Top Skills: Data Analysis, DFT, ML Chip Architecture, RTL Design, Silicon ATE
Reposted 13 Hours Ago
In-Office or Remote
San Francisco Bay Area, CA
Senior level
Artificial Intelligence • Software
The Site Reliability Engineer ensures the reliability and performance of the Devin and Windsurf products by managing incident response, CI/CD pipelines, and infrastructure as code, and by fostering a reliability culture within the engineering team.
Top Skills: AWS, Azure, CI/CD, GCP, Kubernetes, Terraform
Reposted Yesterday
In-Office
San Francisco Bay Area, CA
Senior level
Artificial Intelligence • Software
As a Site Reliability Engineer at Mercor, you will ensure production reliability, develop the SRE function, and collaborate with engineering teams to maintain system performance.
Top Skills: AWS, Kubernetes, Spacelift, Terraform
2 Days Ago
Hybrid
San Francisco Bay Area, CA
140K-230K Annually
Senior level
Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
The Site Reliability Engineer at Zoox will manage the availability and resilience of services for autonomous vehicles, design systems, and lead incident resolution.
Top Skills: Ansible, AWS, Azure, C++, CloudFormation, GCP, Go, Java, Kubernetes, Python, Salt, Terraform
Reposted 2 Days Ago
Remote or Hybrid
San Francisco Bay Area, CA
Senior level
Artificial Intelligence • Information Technology • Software
The role involves defining and evolving technical foundations for AI evaluation, optimizing performance, designing resilient systems, and collaborating with various teams for infrastructure improvements.
Top Skills: Node.js, Postgres, Serverless Environments, TypeScript
Reposted 2 Days Ago
In-Office
San Francisco Bay Area, CA
170K-197K Annually
Mid level
Aerospace • Artificial Intelligence
The Site Reliability Engineer will architect and manage ground infrastructure for satellite systems, ensuring high availability, automating deployments, and optimizing data management systems.
Top Skills: Ansible, AWS, Azure, C++, CloudFormation, EKS, ELK, GCP, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Reposted 3 Days Ago
Hybrid
San Francisco Bay Area, CA
156K-261K Annually
Senior level
Consumer Web • eCommerce • Fashion • Retail
The Senior Site Reliability Engineer ensures the health of systems, automates processes, and collaborates on architecture to maintain uptime and reliability in production environments.
Top Skills: Ansible, AWS, Azure, Datadog, Docker, Elasticsearch, GCP, Graphite, HAProxy, JavaScript, Jenkins, Kubernetes, MongoDB, Nagios, New Relic, Nginx, Node.js, RabbitMQ, Redis, Ruby, Terraform, Tomcat
Reposted 3 Days Ago
Hybrid
San Francisco Bay Area, CA
Entry level
Consumer Web • eCommerce • Fashion • Retail
The Software Engineer, SRE will develop, deploy, and support new product features while ensuring operational excellence and quality support in a fast-paced environment.
Top Skills: AWS, Docker, Elasticsearch, HAProxy, JavaScript, Kubernetes, MongoDB, Nginx, Node.js, RabbitMQ, Redis, Ruby, Tomcat
Reposted 3 Days Ago
In-Office
San Francisco Bay Area, CA
194K-267K Annually
Senior level
Cloud
The role involves building and managing observability infrastructure in GCP, automating deployments, and optimizing data processes for high reliability.
Top Skills: GKE, Go, GCP, Grafana, Kubernetes, OpenTelemetry, Python, Ruby, Splunk, Terraform
Reposted 3 Days Ago
In-Office
San Francisco Bay Area, CA
255K-490K Annually
Mid level
Artificial Intelligence • Machine Learning • Generative AI
As a Site Reliability Engineer, you will manage Kubernetes clusters, automate infrastructure, improve operational metrics, and enhance reliability across data centers.
Top Skills: CloudFormation, Go, GPU, Kubernetes, Linux, Python, Terraform
Reposted 3 Days Ago
In-Office
San Francisco Bay Area, CA
Senior level
Artificial Intelligence • Information Technology • Robotics • Software
As a Senior Reliability and Test Engineer, you will identify failure risks, develop new reliability tests, ensure regulatory compliance, and conduct testing on home robotics.
Top Skills: Certification Standards, Reliability Testing, Statistical Modeling
Reposted 22 Days Ago
Easy Apply
Remote
San Francisco Bay Area, CA
167K-231K Annually
Senior level
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
Lead technical direction for software architecture and cross-team initiatives focusing on scaling consumer-facing systems and maximizing loan originations while maintaining compliance and system integrity.
Top Skills: AWS, CI/CD, Docker, GitHub Actions, Infrastructure as Code, React, Ruby on Rails
Reposted 22 Days Ago
Remote or Hybrid
San Francisco Bay Area, CA
Senior level
Fintech • Software
The Senior Site Reliability Engineer ensures fast, stable SaaS products through automation, collaboration, monitoring, and implementing AI tools to enhance performance and reliability.
Top Skills: AI Tools, Ansible, AppDynamics, AWS, Azure, Azure DevOps, Bash, C# .NET, Cosmos, Datadog, Dynatrace, Harness, Java, Jenkins, Kubernetes, New Relic, PowerShell, Python, SaaS, SQL, Terraform
19 Days Ago
Remote
San Francisco Bay Area, CA
120K-160K Annually
Mid level
Fintech • Financial Services
The Systems Reliability Engineer will support MEMX exchange platforms, handling incidents, improving processes, documenting actions, and debugging issues while collaborating with diverse teams to maintain operational efficiency.
Top Skills: Ansible, Bash, Chef, Linux, Puppet, Python
5 Days Ago
In-Office or Remote
San Francisco Bay Area, CA
210K-247K Annually
Expert/Leader
Software
As a Staff Site Reliability Engineer, you'll lead reliability strategies, design scalable systems, improve observability, and mentor engineers to enhance system performance and resilience.
Top Skills: AWS, Datadog, Grafana, Prometheus, Terraform
5 Days Ago
In-Office or Remote
San Francisco Bay Area, CA
190K-206K Annually
Senior level
Software
As a Senior Site Reliability Engineer, you will ensure the reliability and scalability of production systems, improve system performance, and enhance observability through design and automation.
Top Skills: AWS, CloudWatch, Datadog, Grafana, Prometheus, Terraform
Reposted 5 Days Ago
In-Office
San Francisco Bay Area, CA
Senior level
Artificial Intelligence • Healthtech
The Site Reliability Engineer will enhance system reliability, define observability standards, respond to incidents, and collaborate with engineering teams on performance and compliance improvements.
Top Skills: AWS, Containerized Services, Distributed Workflows, Observability Tooling, Postgres, Serverless Compute
Reposted 5 Days Ago
In-Office
San Francisco Bay Area, CA
200K-250K Annually
Senior level
Fintech • Professional Services • Software
As a Senior Site Reliability Engineer, you'll design scalable systems on AWS, mentor engineers, manage incident responses, and enhance the reliability of fintech infrastructure.
Top Skills: Spark, AWS, DevOps, Java, Kubernetes, Terraform
Reposted 6 Days Ago
In-Office
San Francisco Bay Area, CA
195K-240K Annually
Senior level
Software
The Site Reliability Engineer will enhance reliability, observability, and incident response of You.com's production services, while collaborating with teams to implement best practices and improve operational efficiency through tooling and automation.
Top Skills: AWS, Bash, CI/CD, EKS, GHA, Git, Grafana, OpenTelemetry, Prometheus, Python, Terraform
7 Days Ago
In-Office
San Francisco Bay Area, CA
175K-250K Annually
Mid level
Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)
The Site Reliability Engineer will ensure the reliability and performance of AI infrastructure, build core systems, handle incident response, and develop automation tools.
Top Skills: AWS, Datadog, ELK, GCP, GitHub Actions, GitLab CI, Go, Grafana, Jenkins, Kubernetes, Linux, Prometheus, Pulumi, Python, Rust, Terraform
7 Days Ago
In-Office
San Francisco Bay Area, CA
230K-385K Annually
Mid level
Artificial Intelligence • Machine Learning • Generative AI
The Site Reliability Engineer will manage production infrastructure, focusing on data-heavy systems and improving reliability across services, particularly using ClickHouse and Kafka.
Top Skills: ClickHouse, Cloud Infrastructure, Kafka, Kubernetes, Snowflake, Terraform
Reposted 7 Days Ago
In-Office or Remote
San Francisco Bay Area, CA
Senior level
Software
The Senior Site Reliability Engineer will lead service onboarding, maintain SLAs/SLOs, design secure infrastructure, automate operational tasks, and respond to incidents while ensuring system reliability and performance.
Top Skills: AWS, CloudFormation, ELK Stack, Go, Grafana, Hadoop, Kubernetes, Python, Terraform