Top Reliability Engineer Jobs in San Francisco, CA
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
The Senior Hardware Reliability Engineer ensures product reliability through planning, testing, and collaboration across engineering and operations. Responsibilities include leading investigations, analyzing failure data, and designing reliability strategies throughout the product lifecycle.
Top Skills:
Environmental Testing, Failure Analysis, Firmware Engineering, Hardware Reliability, Reliability Modeling, Stress Testing
Information Technology • Cybersecurity
Huntress seeks an experienced Staff Database Reliability Engineer to optimize, scale, and manage PostgreSQL databases, ensuring stability and performance in a production environment.
Top Skills:
Azure, Datadog, New Relic, Postgres
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills:
AWS, GCP, Azure, MongoDB
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
As a Staff Android Engineer, lead performance and reliability optimizations for Snapchat's Android app, mentor engineers, and collaborate with cross-functional teams.
Top Skills:
Android, C/C++, Java, Kotlin
Marketing Tech • Mobile • Software
As a Senior Site Reliability Engineer, you'll maintain and enhance the Currents data export system, focusing on observability, scalability, and reliability, while mentoring junior engineers and solving performance issues.
Top Skills:
Buildkite, Datadog, Docker Swarm, Git, GitLab, Java, Jenkins, Kafka, Kotlin, Kubernetes, MongoDB, PagerDuty, Postgres, Ruby, Sentry, Sidekiq, SNS, SQS
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills:
Argo CD, AWS, Kubernetes, Python, Terraform
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills:
Ansible, Apache Airflow, Chef, Consul, Docker, Go, Grafana, Linux, Nginx, Nomad, Postgres, Prometheus, Puppet, Python, Rust, SaltStack, SQL, Temporal
Fintech • Machine Learning • Payments • Software • Financial Services
Lead a diverse technology team to deliver backend solutions using cloud technologies and manage complex projects while mentoring developers.
Top Skills:
AWS, Docker, Go, Java, Kubernetes, Node.js, Python, Scala, SQL
Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
As a Site Reliability Engineer, you'll build software to ensure system reliability, scale infrastructure, and deploy ML systems while collaborating with cross-functional teams.
Top Skills:
AWS, Azure, Docker, GCP, Java, Kubernetes, Linux, Terraform
AdTech
As a Site Reliability Engineer, you'll maintain the infrastructure for systems, ensure efficiency, automate processes, monitor databases, and participate in architecture discussions.
Top Skills:
Amazon Kinesis, AWS Lambda, AWS SNS, BigQuery, Docker, GCP (Google Cloud Platform), GitLab, Google Cloud Functions, Google Cloud Run, Google Pub/Sub, Grafana, Istio, Kafka, Kubernetes, MySQL, Prometheus, Spanner, SQL, Terraform
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Embedded Site Reliability Engineer will develop and maintain software applications for Bitcoin mining, focusing on embedded systems and cloud observability. Responsibilities include software testing, bug triage, and collaboration with engineering teams to optimize performance and reliability.
Top Skills:
C, C++, Datadog, Elastic, Go, Grafana, JavaScript, Linux, Python, Rust, Splunk, SQL, TypeScript
Software
In this role, you will provide expert support for customers using Kubernetes and Replicated products, enabling their success with application deployment and management, while collaborating with engineering teams for product improvement.
Top Skills:
CNCF Tools, Go, Helm, Kubernetes, Linux
Energy
The Electrical Reliability Engineer focuses on improving equipment reliability, maintenance support, troubleshooting, budgeting for equipment replacements, and implementing reliability programs in the petroleum industry.
Top Skills:
ETAP, GE APM, PowerDB, SAP
Healthtech • Telehealth
The Senior/Staff Site Reliability Engineer will build AI-driven reliability systems, enhance incident management, and lead architecture improvements while collaborating across teams to improve patient care and system performance.
Top Skills:
AWS, Kubernetes, Node.js, Postgres, Python, Redis, SQL, TypeScript
AdTech
The Site Reliability Engineer will build and maintain infrastructure, manage databases, automate operations, and ensure system efficiency and scalability at Attain.
Top Skills:
Amazon Kinesis, AWS Lambda, AWS SNS, BigQuery, Docker, GCP, GitLab, Google Cloud Functions, Google Cloud Run, Google Pub/Sub, Grafana, Istio, Kafka, Kubernetes, MySQL, Prometheus, Spanner, Terraform
Robotics • Pharmaceutical
The Hardware Reliability Engineer ensures the robustness of robotic systems through testing, analysis, and collaboration across teams to improve designs and reduce risks.
Top Skills:
Onshape CAD, Python
Artificial Intelligence • Information Technology • Machine Learning • Marketing Tech • Software • Biotech • Design
The Hardware Reliability Engineer plans and executes reliability testing, develops testing methods, performs failure analysis, and collaborates with cross-functional teams to ensure product quality.
Top Skills:
Data Analysis, Electrical Engineering, Environmental Reliability, Mechanical Engineering, Reliability Testing
Big Data • Healthtech • HR Tech • Machine Learning • Software • Telehealth • Big Data Analytics
The Staff Site Reliability Engineer will architect, operate, and improve the platform while ensuring security compliance and enhancing development processes.
Top Skills:
AWS, Elasticsearch, Istio, Kubernetes, NATS, Node.js, Postgres, Python, React, Terraform, TypeScript
Fintech • Payments • Productivity • Financial Services
As a Senior Database Reliability Engineer, you will enhance database reliability and performance, develop automation tools, and support GCP persistence tools.
Top Skills:
Bash, Chef, GCP, MySQL, Perl, Puppet, Python, Ruby, Salt, Terraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills:
Ansible, AWS, Azure, C#, Cloud Identity Providers, Docker, GCP, Go, Infrastructure as Code, Java, Kubernetes, Python, Ruby, Terraform
Energy
As a Reliability Engineer, you will define reliability requirements, analyze design failures, run tests, and ensure high standards for hardware products.
Top Skills:
JMP, Minitab, Python
Aerospace • Hardware • Logistics • Robotics • Software • Transportation
The Senior Hardware Reliability Engineer measures and improves the reliability of Zipline's drone systems through data analysis and reliability modeling, ensuring operational safety and efficiency.
Top Skills:
JMP, MATLAB, Minitab, Python, ReliaSoft
Food • Marketing Tech • Manufacturing
The Senior Reliability Engineer enhances equipment reliability, reduces downtime, and improves maintenance strategies across production. This role involves collaboration with engineering and operations, leading reliability programs, and mentoring junior staff.
Top Skills:
Advanced Analytics, CMMS, Digital Tools, Reliability Modeling Tools
Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
The Design Reliability Engineer will establish reliability targets, lead DFMEA processes, develop test plans, and implement monitoring systems for sensors and automotive electronics.
Top Skills:
NumPy, pandas, PySpark, Python, SciPy
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves improving software reliability, automating processes, collaborating with teams on system optimization, and mentoring engineers to establish reliability as a core value.
Top Skills:
AWS, Azure, Datadog, Docker, EC2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform