Get the job you really want.
Maximum of 25 job preferences reached.
Top Senior Site Reliability Engineer Jobs in San Francisco, CA
Financial Services
As a Principal Site Reliability Engineer, you'll architect reliability solutions, lead observability initiatives, and mentor teams for enhanced operational efficiency.
Top Skills:
Cloud-Native InstrumentationOpen TelemetryStreaming Data Platforms
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills:
ArgocdAWSKubernetesPythonTerraform
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills:
AnsibleApache AirflowChefConsulDockerGoGrafanaLinuxNginxNomadPostgresPrometheusPuppetPythonRustSaltstackSQLTemporal
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
The Site Reliability Engineer will support engineering teams, enhance system resilience, and drive scalable infrastructure practices.
Top Skills:
Aws ServicesGrafanaHoneycombLinuxPythonTerraform
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Design, scale, and manage AWS services for IoT devices. Collaborate on infrastructure, optimize performance, and ensure high availability of services.
Top Skills:
AWSBashGoHelmKubernetesPythonRubyTerraform
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Embedded Site Reliability Engineer will develop and maintain software applications for Bitcoin mining, focusing on embedded systems and cloud observability. Responsibilities include software testing, bug triage, and collaboration with engineering teams to optimize performance and reliability.
Top Skills:
CC++DatadogElasticGoGrafanaJavaScriptLinuxPythonRustSplunkSQLTypescript
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The AWS Cloud Architect will design, build, and optimize cloud infrastructure, ensuring scalability and security while mentoring junior SREs and defining cloud strategy.
Top Skills:
AnsibleAws Api GatewayAws CloudfrontAws CloudtrailAws CloudwatchAws DocumentdbAws Ec2Aws EksAws LambdaAws RdsAws S3Aws Secrets ManagerAws SsmDockerGrafanaHashicorp ConsulHashicorp TerraformHashicorp VaultKubernetesNew RelicPrometheus
Sales • Software • Automation
Join the Infrastructure Team to build and maintain critical systems, automating database lifecycles and enhancing disaster recovery with a focus on resilience and simplicity.
Top Skills:
AnsibleArgocdAWSClickhouseDockerElasticsearchFlaskGithub ActionsGrafanaKubernetesMongoDBPostgresPythonRedisTerraform
Artificial Intelligence • Healthtech • Software
As a Site Reliability Engineer, you will manage cloud infrastructure, implement observability, and ensure system reliability by collaborating with engineering teams and maintaining databases.
Top Skills:
AzureBashGitGitKubernetesPostgresPythonRedisSQLTypescriptVscode
Artificial Intelligence • Legal Tech • Machine Learning • Natural Language Processing • Software • Financial Services • Generative AI
As a Site Reliability Engineer, you will ensure system uptime, manage CI/CD pipelines, and enhance security and observability while troubleshooting issues in a collaborative environment.
Top Skills:
AWSAzureCloudFormationDatadogDockerGCPGrafanaKubernetesPrometheusTerraform
Artificial Intelligence • Machine Learning • Software
As a Staff Site Reliability Engineer, you will enhance the reliability, scalability, and performance of production services by applying SRE principles, implementing observability practices, automating processes, and collaborating with engineering teams.
Top Skills:
AWSAzureCloudFormationDatadogDockerElk StackGCPGoGrafanaJaegerKubernetesOpentelemetryOpentofuPrometheusPythonTerraform
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills:
AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a team of developers to create cloud-based solutions while driving transformations using DevOps practices. Collaborate across teams to solve business challenges and mentor engineers.
Top Skills:
AnsibleAWSDockerGoJavaKubernetesPythonRubySQLTerraform
Software
As a Site Reliability Engineer, you'll ensure platform reliability through scalable systems, incident response, observability, and collaboration with engineering teams.
Top Skills:
AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Reposted 20 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
This role involves setting technical strategies, collaborating across teams, managing operations and availability, and fostering a culture of quality and ownership within the Site Reliability Engineering team.
Top Skills:
AWSKotlinKubernetesMySQLPythonSpark
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills:
Ai/MlAWSAzureDockerElk StackGCPGrafanaKubernetesMySQLNoSQLPostgresSplunk
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance reliability and observability, automate processes, support engineering teams, and promote a culture of reliability at Coinbase.
Top Skills:
AWSAzureDockerEc2GCPGoKubernetesRubyTerraform
Fintech • Software
The SRE is responsible for building cloud-native platforms, improving application reliability, and fostering collaboration within teams.
Top Skills:
Ci/CdKubernetesOpenshiftOpenstackPrometheusSplunkVMware
Artificial Intelligence • Software
The SRE at Fluidstack is responsible for ensuring infrastructure reliability and performance, handling complex production issues, and improving platform stability.
Top Skills:
AnsibleBashGoKubernetesPythonSlurmTerraform
Artificial Intelligence • Software • Conversational AI • Generative AI
As a Staff Software Engineer in Site Reliability, you'll maintain production services, develop automation tools, and collaborate with teams to ensure system reliability and performance.
Top Skills:
Ci/CdGCPGoKubernetesLinuxPythonSQLTerraform
Software
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
Top Skills:
Circle CiCloudFormationElk StackGithub ActionsGitlab CiGrafanaJenkinsKubernetesOpentelemetryPrometheusPulumiTerraform
Information Technology • Mobile • Software
As a Site Reliability Engineer, you'll ensure system reliability and scalability, automate processes, optimize performance, and collaborate on system design.
Top Skills:
AWSAzureBashCloudFormationDatadogDockerElkGoGoogle Cloud PlatformGrafanaHelmKubernetesNew RelicPrometheusPulumiPythonTerraform
Artificial Intelligence • Software
As a Site Reliability Engineer at Anyscale, you will ensure smooth operations of user-facing services, develop monitoring and alerting systems, implement incident management processes, and improve cloud service deployment methodologies.
Top Skills:
Alerting SystemsAutomationCloud ComponentsIncident ManagementMonitoring Systems
Edtech
As a Staff Site Reliability Engineer, you will lead reliability engineering by designing automation, scaling systems, driving architectural improvements, and mentoring engineers.
Top Skills:
ArgocdCi/CdCircleCIDatadogGCPGithub ActionsGoIstioKubernetesPythonTerraform
Edtech
Lead the technical vision for reliability at Quizlet by architecting self-healing systems, mentoring engineers, and improving infrastructure resilience.
Top Skills:
Ci/CdDatadogGoIstioJeliKubernetes (Gke)PythonTerraform
Top San Francisco Companies Hiring Senior Site Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results
















.png)

















