Top Senior Site Reliability Engineer Jobs in San Francisco
Seeking a Staff Site Reliability Engineer to improve security and reliability of critical cloud-based infrastructure for early cancer detection technology. Responsibilities include ensuring high availability, incident management, automation, performance optimization, security compliance, monitoring/alerting, and software development consultation.
Lead Site Reliability engineering effort to improve anomaly detection, platform stability and resilience using modern best practice.
We are looking for a Principal Site Reliability Engineer with expertise in scaling Cloud services. The candidate should have deep understanding of modern Cloud infrastructure, programming expertise, and operational experience. They will be responsible for improving services and processes to enhance reliability, performance, scalability, and cost efficiency. The engineer will work with teams across the organization to advocate for reliability methodologies and will report to the Senior Engineering Manager.
As a Principal Site Reliability Engineer, you will focus on innovating and providing strong technical vision for our platform's mission-critical datastores. You will build reliable, scalable, and highly available datastores on a multi-region scale platform. You will collaborate with leaders across the company as a subject matter expert and be a role model for the engineering team.
Join Okta as a Principal Site Reliability Engineer for Workflows team, responsible for building and leading global production infrastructure. Core responsibilities include coding with Go, Terraform, Helm, supporting Kubernetes and AWS environment, contributing to multi-cloud initiative, engaging with engineering teams, mentoring junior SREs, and automating processes.
Seeking a Senior Engineer for artifact management system design and maintenance, enforcing best practices, architecting cloud agnostic solutions, and continuous improvement of developer experience in a global hybrid environment. Must have expertise in CI/CD, artifact management, IaC provisioning, and source code management services, as well as experience with Kubernetes at scale.
Looking for a reliability expert to join our growing SRE teams. Must have deep understanding of modern Cloud Infrastructure and operational best practices. Responsible for driving change across services and processes to improve reliability, performance, scalability, and cost efficiency. Proficiency in Java, Go, or Python is required. Remote-friendly opportunity.
Seeking a Site Reliability Engineer to design and implement SRE practices and ensure availability and scalability of production systems
Featured Jobs
As a Federal SRE at ServiceNow, you will support the Government Cloud infrastructure during 3rd Shift (Nights) with a 4-day work week. Responsibilities include driving technical resolutions, enhancing platform operability, and optimizing services for customers through software development, networking, and systems engineering expertise.
RingCentral is seeking a Senior Site Reliability Engineer to work on infrastructure solutions, Docker infrastructure, automation, and deployment activities. Responsibilities include production support, research, development, IaaC with Terraform, CI/CD processes, and collaboration with teams.
Build fast, highly available infrastructure at scale. Contribute to architecture and design of new and current systems. Solid understanding of infrastructure design. Write high quality code and use modern infrastructure tools. Experience with logging, monitoring, and security.
Seeking a Senior Site Reliability Engineer with expertise in designing and operating large-scale distributed systems in the cloud, with a focus on FedRAMP-compliant infrastructure. Responsibilities include collaborating with software engineers, designing and managing infrastructure, ensuring compliance with FedRAMP controls, driving automation, and maintaining cloud-native services on AWS.
The Apple Service Engineering - Edge & Messaging SRE team is looking for Site Reliability Engineers to build and run the services that hundreds of millions of customers use every day. This team provides systems that are foundational for many of Apple's services such as iCloud, iMessage, and FaceTime. The ideal candidate will have strong Software Development skills and expertise in Linux, Systems, and Cloud technology.
Seeking a Senior Cloud Site Reliability Engineer with 8+ years of experience in SRE/MLOps. Responsibilities include automation, maintaining large-scale distributed systems, data pipelines management, and building support applications on AWS and Kubernetes.
Join the HWTE Core Infrastructure team at Apple, responsible for developing software delivery systems and infrastructure for global manufacturing lines. Looking for a DevOps/SRE engineer with experience in Unix/Linux system admin, Shell scripting, CI/CD pipeline management, infrastructure technologies like Docker, Kubernetes, and AWS, and networking concepts.
The Senior Site Reliability Engineer on the Datastores team at ThousandEyes/Cisco will focus on designing, optimizing, and maintaining mission critical datastores like ElasticSearch, Kafka, MongoDB, and MySQL. Responsibilities include collaborating with software engineers, building automation for scalability, and contributing to incident response and on-call rotation.
As a Senior Site Reliability Engineer at Atlassian, you will be responsible for improving the performance and reliability of services, addressing root causes of incidents, and automating repetitive tasks. You will collaborate with the team to develop innovative solutions and ensure high code quality, operating at scale in Amazon Web Services. Strong skills in Bash, Python, Linux, AWS, Ansible, Docker, Kubernetes, and ITIL are required.
Join Apple's Cloud Monitoring SRE team to improve reliability and performance of software systems providing visibility into Apple's services & infrastructure. Design and build next-gen cloud and systems monitoring infrastructure focusing on automation and efficiency. Deep dive into operational issues and integrate infrastructures at global scale.
Apple Services Engineering is seeking a Software Development Engineer with expertise in ML infrastructure. Responsibilities include supporting ML services, deploying new models, providing insights, and collaborating with teams. Requires strong technical skills in Java, Python, Swift, Rust, GoLang, and SRE best practices.
Data Platform SRE position at Apple Services Engineering organization, responsible for managing infrastructure and applications to deliver data processing, governance, and storage for global products. Requires proficiency in Apache Spark, Trino, and security-related infrastructure.
Join Apple's Applied Machine Learning Team as a Senior Cloud DevOps/Site Reliability Engineer to build and support innovative software applications. Responsibilities include managing applications on AWS & Kubernetes, building CI/CD pipelines, enabling auto-scaling, and ensuring high performance in cloud environments.
As a Senior Site Reliability Engineer at Reddit, you will improve the reliability and performance of engineering platforms and services, build internal services for peer engineering teams, and contribute to the evolution of Reddit at scale.
As a Senior Site Reliability Engineer at Reddit, you will improve the reliability and performance of Reddit's platforms and services using your knowledge of distributed systems. You will collaborate with engineering teams to develop resilient systems at scale and automate tasks to support Reddit's evolution. Join Reddit and be part of shaping the future of one of the largest communities on the internet.
As a Senior Site Reliability Engineer at Reddit, you will improve the reliability and performance of Reddit's engineering platforms and services by advising engineering teams, automating tasks, diagnosing issues, optimizing performance, and building internal services. You will work on high-traffic backend systems and utilize your expertise in software engineering, site reliability engineering, DevOps, Go, Python, Kubernetes, and Cloud systems.
Passionate and talented Site Reliability Engineers needed to ensure high-quality Apple Services experience. Responsibilities include managing and scaling large distributed systems, deploying and supporting new services, troubleshooting, and participating in on-call service support.
Top San Francisco Companies Hiring Senior Site Reliability Engineers
See All
Work your passion. Live your purpose.
Explore all your job opportunities on Built In.
Most Popular Searches
More Job Categories
Jobs by Expertise
Data + Analytics
Senior Analytics Jobs
Senior Analysis & Reporting Jobs
Senior Business Intelligence Jobs
Senior Data Engineering Jobs
Senior Data Science Jobs
Senior Machine Learning Jobs
Senior Management Jobs
Senior Other Jobs
Developer + Engineer
Senior Android (Java) Jobs
Senior C++ Jobs
Senior C# Jobs
Senior DevOps Jobs
Senior Front-End Jobs
Senior Golang Jobs
Senior Java Jobs
Senior Javascript Jobs
Senior Hardware Jobs
Senior iOS (Objective-C) Jobs
Senior Linux Jobs
Senior Management Jobs
Senior .NET Jobs
Senior Perl Jobs
Senior PHP Jobs
Senior Python Jobs
Senior QA Jobs
Senior Ruby Jobs
Senior Salesforce Jobs
Senior Sales Engineer Jobs
Senior Scala Jobs
Senior Other Jobs
All Filters
No Results
No Results