Top Senior Site Reliability Engineer Jobs in San Francisco, CA
As a Principal Site Reliability Engineer, you will focus on innovating and providing strong technical vision for our platform's mission-critical datastores. You will build reliable, scalable, and highly available datastores on a multi-region scale platform. You will collaborate with leaders across the company as a subject matter expert and be a role model for the engineering team.
As a Site Reliability Engineer at Orb, you will play a critical role in maintaining and scaling our robust infrastructure, ensuring stability, scalability, and performance. You will be at the heart of tackling some of the most significant engineering challenges, from scaling our data ingestion pipelines to refining our observability and reliability practices.
Guide the technical design, implementation, and optimization of global infrastructure services primarily focused on Hybrid Cloud. Research and recommend new technology solutions, ensure reliability and redundancy, manage projects, automate operational activities, integrate systems technologies with ServiceNow platform, and ensure legal compliance.
The Staff SRE - Technical Duty Officer at ServiceNow supports and protects all of ServiceNow's public services, providing technical leadership for a team of on-site engineers responsible for the availability and performance of ServiceNow's cloud platform. This role involves coordinating recovery efforts and crisis management during major outages.
Design and implement production-grade systems, establish standards for automation, plan complex migrations, improve on-call experience, and lead technical roadmaps for system reliability and scalability.
Seeking a Senior Site Reliability Engineer with expertise in designing and operating large-scale distributed systems in the cloud, with a focus on FedRAMP-compliant infrastructure. Responsibilities include collaborating with software engineers, designing and managing infrastructure, ensuring compliance with FedRAMP controls, driving automation, and maintaining cloud-native services on AWS.
As a Senior Site Reliability Engineer at Atlassian, you will be responsible for improving the performance and reliability of services, addressing root causes of incidents, and automating repetitive tasks. You will collaborate with the team to develop innovative solutions and ensure high code quality, operating at scale in Amazon Web Services. Strong skills in Bash, Python, Linux, AWS, Ansible, Docker, Kubernetes, and ITIL are required.
Build fast, highly available infrastructure at scale. Contribute to architecture and design of new and current systems. Solid understanding of infrastructure design. Write high quality code and use modern infrastructure tools. Experience with logging, monitoring, and security.
Featured Jobs
RingCentral is seeking a Senior Site Reliability Engineer to work on infrastructure solutions, Docker infrastructure, automation, and deployment activities. Responsibilities include production support, research, development, IaaC with Terraform, CI/CD processes, and collaboration with teams.
Apple Services Engineering is seeking a Senior Site Reliability Engineer experienced in software and systems to join the Storage SRE team. Responsibilities include architectural and technical leadership for operating large scale distributed storage systems, driving best practices in resiliency, and designing and developing code in Go, Rust, Java, and Python.
Principal engineers at Invisible Technologies have multiple paths, including technical leadership and overseeing technical initiatives. They have a strong understanding of cloud architecture, networking, security, authentication, authorization, and Kubernetes. Ideal candidates would also have experience with infrastructure as code tools.
Seeking a highly skilled Staff SRE to ensure reliability, scalability, and performance of critical infrastructure and applications. Responsibilities include proactive issue mitigation, monitoring and alerting setup, incident management, SLA compliance, automation, cross-team collaboration, and mentorship.
Looking for a reliability expert to join our growing SRE teams. Must have deep understanding of modern Cloud Infrastructure and operational best practices. Responsible for driving change across services and processes to improve reliability, performance, scalability, and cost efficiency. Proficiency in Java, Go, or Python is required. Remote-friendly opportunity.
Apple Services Engineering is seeking a Software Development Engineer with expertise in ML infrastructure. Responsibilities include supporting ML services, deploying new models, providing insights, and collaborating with teams. Requires strong technical skills in Java, Python, Swift, Rust, GoLang, and SRE best practices.
Seeking a Senior Cloud Site Reliability Engineer with 8+ years of experience in SRE/MLOps. Responsibilities include automation, maintaining large-scale distributed systems, data pipelines management, and building support applications on AWS and Kubernetes.
The Software Engineer in Reliability Engineering role at Grammarly involves building world-class, secure, and reliable cloud-native infrastructure solutions for Grammarly engineers. Responsibilities include improving incident management, introducing auto-scaling and resilience mechanisms, conducting chaos testing, and establishing best practices for reliability.
The Senior Infrastructure Engineer at CrowdStrike will be responsible for building and maintaining infrastructure to support the intelligence team's activities, including working on classical datacenter and cloud infrastructure. The role involves adapting to rapidly changing requirements and environments, debugging infrastructure issues, and collaborating with a remote team of experienced engineers.
Seeking a Senior Staff Engineer for Cloud Infrastructure team responsible for leading design, implementation, and maintenance of cloud infrastructure and platform services. Requires deep expertise in AWS, Docker, Kubernetes, Istio, Kafka, Kinesis, and strong programming skills for infrastructure automation.
Lead the implementation and design of infrastructure automation and IaaS capabilities, focusing on Kubernetes and cloud environments. Responsible for maintaining and troubleshooting large-scale production environments, CI/CD tools, configuration management, and scripting languages.
Crowdstrike is seeking a Sr. Cloud Engineer to work on the Detections Platform team, focusing on building the next generation of cloud-side detections processing. Responsibilities include leveraging and building cloud-based systems to detect targeted attacks, collaborating with multiple teams, mentoring engineers, and delivering feedback. The role involves building solutions in Go, reading multiple programming languages, and continuously learning and improving engineering practices.
Seeking a visionary Principal Cloud Software Engineer/DevOps Engineer to shape hybrid cloud infrastructure strategy, drive innovation, and provide technical leadership. Responsibilities include designing scalable cloud solutions, developing CI/CD pipelines, optimizing infrastructure performance, and driving DevOps processes. Requires 12+ years of experience and expertise in AWS, Azure, GCP, Terraform, Python, Bash, PowerShell, containerization, and networking.
Data Platform SRE position at Apple Services Engineering organization, responsible for managing infrastructure and applications to deliver data processing, governance, and storage for global products. Requires proficiency in Apache Spark, Trino, and security-related infrastructure.
Seeking a Senior DevOps Engineer to help build and maintain the infrastructure powering an ecommerce platform. Responsibilities include enabling product teams, leading key projects, managing infrastructure through automation, performing RCAs on incidents, and ensuring availability and scalability of production systems.
The Senior Infrastructure Engineer will be responsible for deploying and operating infrastructure technologies and hardware systems. They will work on Linux environments, automate infrastructure management tasks, and contribute to the architecture and design of infrastructure systems. This role requires experience with modern infrastructure tools, TCP/IP configurations, and programming languages such as Go, Python, or Bash. The ideal candidate will have at least 5 years of infrastructure management experience.
Infrastructure Engineer role at Academia.edu involving scaling and building out infrastructure, managing databases with large amounts of data, and implementing security practices for organizational growth.
Top San Francisco Companies Hiring Senior Site Reliability Engineers
See All
Work your passion. Live your purpose.
Explore all your job opportunities on Built In.
Most Popular Searches
More Job Categories
Jobs by Expertise
Data + Analytics
Senior Analytics Jobs
Senior Analysis & Reporting Jobs
Senior Business Intelligence Jobs
Senior Data Engineering Jobs
Senior Data Science Jobs
Senior Machine Learning Jobs
Senior Management Jobs
Senior Other Jobs
Developer + Engineer
Senior Android (Java) Jobs
Senior C++ Jobs
Senior C# Jobs
Senior DevOps Jobs
Senior Front-End Jobs
Senior Golang Jobs
Senior Java Jobs
Senior Javascript Jobs
Senior Hardware Jobs
Senior iOS (Objective-C) Jobs
Senior Linux Jobs
Senior Management Jobs
Senior .NET Jobs
Senior Perl Jobs
Senior PHP Jobs
Senior Python Jobs
Senior QA Jobs
Senior Ruby Jobs
Senior Salesforce Jobs
Senior Sales Engineer Jobs
Senior Scala Jobs
Senior Other Jobs
All Filters
No Results
No Results