Top Senior Site Reliability Engineer Jobs in San Francisco, CA
Artificial Intelligence • Software
The SRE at Fluidstack is responsible for ensuring infrastructure reliability and performance, handling complex production issues, and improving platform stability.
Top Skills:
Ansible, Bash, Go, Kubernetes, Python, Slurm, Terraform
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Payments • Software • Financial Services
The Lead Site Reliability Engineer will drive reliability strategies, architect and maintain infrastructure, lead incident responses, and influence engineering practices for operational excellence while mentoring team members.
Top Skills:
AWS, Docker, FastAPI, Kubernetes, Postgres, Python, TypeScript, Vue
Software
As a Site Reliability Engineer, you'll build and maintain infrastructure for ML models, automate processes, and collaborate cross-functionally.
Top Skills:
CircleCI, CloudFormation, ELK Stack, GitHub Actions, GitLab CI, Grafana, Jenkins, Kubernetes, OpenTelemetry, Prometheus, Pulumi, Terraform
Artificial Intelligence • Software • Conversational AI • Generative AI
As a Staff Software Engineer in Site Reliability, you'll maintain production services, develop automation tools, and collaborate with teams to ensure system reliability and performance.
Top Skills:
CI/CD, GCP, Go, Kubernetes, Linux, Python, SQL, Terraform
Information Technology • Mobile • Software
As a Site Reliability Engineer, you'll ensure system reliability and scalability, automate processes, optimize performance, and collaborate on system design.
Top Skills:
AWS, Azure, Bash, CloudFormation, Datadog, Docker, ELK, Go, Google Cloud Platform, Grafana, Helm, Kubernetes, New Relic, Prometheus, Pulumi, Python, Terraform
Artificial Intelligence • Software
As a Site Reliability Engineer at Anyscale, you will ensure smooth operations of user-facing services, develop monitoring and alerting systems, implement incident management processes, and improve cloud service deployment methodologies.
Top Skills:
Alerting Systems, Automation, Cloud Components, Incident Management, Monitoring Systems
Edtech
As a Staff Site Reliability Engineer, you will lead reliability engineering by designing automation, scaling systems, driving architectural improvements, and mentoring engineers.
Top Skills:
ArgoCD, CI/CD, CircleCI, Datadog, GCP, GitHub Actions, Go, Istio, Kubernetes, Python, Terraform
Edtech
Lead the technical vision for reliability at Quizlet by architecting self-healing systems, mentoring engineers, and improving infrastructure resilience.
Top Skills:
CI/CD, Datadog, Go, Istio, Jeli, Kubernetes (GKE), Python, Terraform
Artificial Intelligence • Software
As Staff SRE Tech Lead, you'll oversee platform reliability and scalability, lead the SRE team, architect data infrastructure, and optimize systems while implementing automation and observability practices.
Top Skills:
ClickHouse, Go, Postgres, Python, TypeScript
Fintech • Software
The SRE is responsible for building cloud-native platforms, improving application reliability, and fostering collaboration within teams.
Top Skills:
CI/CD, Kubernetes, OpenShift, OpenStack, Prometheus, Splunk, VMware
Fintech
The Principal Site Reliability Engineer designs and implements software to enhance application performance and resilience while ensuring security standards. Responsibilities include automating application management, providing observability, and leading cross-functional teams. Mentorship and on-call rotation participation are expected.
Top Skills:
Aurora, AWS, Chef, Docker, DynamoDB, Git, Go, Java, Jenkins, JMS, Kafka, Kubernetes, Maven, Memcached, Oracle, Python, Redis, SQS, Swarm
Artificial Intelligence • Healthtech • Other • Productivity • Telehealth • Conversational AI • Generative AI
As a Founding Site Reliability Engineer, you'll enhance system reliability, automate tasks, manage incidents, and mentor others. You'll shape infrastructure and ensure optimal performance and stability for healthcare services.
Top Skills:
AWS, Azure, Datadog, GCP, Grafana, Honeycomb, Kubernetes, OpenTelemetry, PagerDuty, Prometheus, Sentry, Terraform
Information Technology • Software • Big Data Analytics
The Site Reliability Engineer will design, analyze, and troubleshoot large-scale distributed systems, focusing on operating systems and performance tuning.
Top Skills:
Apache, Java
Information Technology
The Lead Observability Engineer is responsible for defining and implementing observability strategies, tools, and patterns to ensure reliable performance across various systems at Vivun.
Top Skills:
Celery, Datadog, Grafana, Honeycomb, LangChain, Node.js, Observe, OpenAI APIs, OpenTelemetry, Prometheus, Python
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
The Senior Site Reliability Engineer manages production infrastructure, ensuring performance and reliability using AI tools, Kubernetes, and CI/CD pipelines while mentoring teams.
Top Skills:
Apache Airflow, AWS, AWS Lambda, Azure, ChatGPT, CI/CD, Crossplane, GCP, Gemini, GitHub Copilot, Go, Kubernetes, OpenSearch, Postgres, Python, Redis, Snowflake, Terraform
Healthtech • Insurance
The Senior Software Engineer will lead complex projects, mentor engineers, and ensure cloud infrastructure is resilient and automated. Responsibilities include developing software, managing production environments, and enforcing coding standards.
Top Skills:
ArgoCD, AWS, GCP, GitHub Actions, Grafana, Istio, Kubernetes, Prometheus, Terraform
Financial Services
As a Site Reliability Engineer, you will enhance and monitor production systems, automate workflows, and respond to incidents to maintain system reliability.
Top Skills:
Airflow, Bazel, Git, Go, Grafana, gRPC, Jenkins, Kubernetes, Linux, Pandas, Postgres, Prometheus, Python, R, Relational Databases, SQL
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills:
AI/ML, AWS, Azure, Docker, ELK Stack, GCP, Grafana, Kubernetes, MySQL, NoSQL, Postgres, Splunk
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills:
Ansible, AWS, Azure, C#, Cloud Identity Providers, Docker, GCP, Go, Infrastructure as Code, Java, Kubernetes, Python, Ruby, Terraform
Consumer Web • eCommerce • Fashion • Retail
Poshmark seeks a Software Engineer, SRE to create a positive impact and thrive in a diverse environment. Qualifications include a strong background in software and SRE practices.
Top Skills:
Site Reliability Engineering, Software Engineering
Information Technology • Legal Tech
The Senior Technology Site Reliability Engineer is responsible for maintaining and optimizing infrastructure and applications, ensuring reliability and performance while automating processes and collaborating with teams.
Top Skills:
AWS, Chef, Datadog, Go, Grafana, Java, Prometheus, Puppet, Python, Salt, Terraform
Financial Services
Design, develop, and deploy robust platform solutions while ensuring reliability, scalability, and security of the system. Collaborate with teams to enhance tooling and automation.
Top Skills:
GCP, Kubernetes, Terraform
3D Printing • Artificial Intelligence • Software • Design
The role involves building reliable platforms for 3D/4D content delivery to AR/VR devices, monitoring system health, and improving operational practices in collaboration with the engineering team.
Top Skills:
AWS Fargate, CoreWeave, Grafana, Kubernetes, Prometheus, Terraform
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills:
ArgoCD, Bash, ELK Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Gaming • Mobile • Software
As an SRE Manager, you will lead a team to enhance infrastructure services, manage incidents, and contribute to technical decisions while ensuring high availability and scalability of systems.
Top Skills:
Amazon AWS, Ansible, Artifactory, Crossplane, Datadog, Elasticsearch, GitLab, Go, GCP, Jaeger, Jenkins, Kubernetes, Azure, MongoDB, Packer, Postgres, Python, Redis, Terraform, Vault