Get the job you really want.
Maximum of 25 job preferences reached.
Top Reliability Engineer Jobs in San Francisco, CA
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills:
AnsibleApache AirflowChefConsulDockerGoGrafanaLinuxNginxNomadPostgresPrometheusPuppetPythonRustSaltstackSQLTemporal
Security • Software • Cybersecurity • Automation
As a Staff Database Reliability Engineer, lead strategic initiatives for database architecture, optimize performance, mentor teams, and promote best practices in data usage.
Top Skills:
Aws AuroraAws DynamodbAws ElasticacheMySQLObject-Relational MappingRedisSQLTypeormTypescript
Big Data • Information Technology • Productivity • Software • Analytics • Business Intelligence • Consulting
As a Staff Reliability Engineer, you'll enhance platform health by applying SRE principles, leading reliability for microservices, mentoring teams, and automating operational processes.
Top Skills:
AWSAzureDatadogGCPJavaKubernetesPythonSpringTerraform
Financial Services
As a Principal Site Reliability Engineer, you'll architect reliability solutions, lead observability initiatives, and mentor teams for enhanced operational efficiency.
Top Skills:
Cloud-Native InstrumentationOpen TelemetryStreaming Data Platforms
Reposted 4 Days AgoSaved
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
As a Staff Android Engineer, lead performance and reliability optimizations for Snapchat's Android app, mentor engineers, and collaborate with cross-functional teams.
Top Skills:
AndroidC/C++JavaKotlin
Reposted 5 Hours AgoSaved
Easy Apply
Easy Apply
Energy
As a Reliability Engineer, you will define system reliability requirements, conduct tests, analyze failures, and collaborate on design improvements for hardware products.
Top Skills:
JmpMinitabPython
Software
In this role, you will provide expert support for customers using Kubernetes and Replicated products, enabling their success with application deployment and management, while collaborating with engineering teams for product improvement.
Top Skills:
Cncf ToolsGoHelmKubernetesLinux
Digital Media • eCommerce • Gaming • Mobile • News + Entertainment
As a Senior Software Engineer in Device Reliability, you will design automation for diverse devices, build automated test systems, and ensure quality across installations, collaborating with various teams.
Top Skills:
HTML5JavaScriptReactTypescript
Energy
The Reliability Engineer will maintain instrumentation equipment, improve reliability, support maintenance teams, ensure compliance, and lead incident investigations.
Top Skills:
Ge ApmExcelMicrosoft OutlookMicrosoft WordSAP
Energy
The Electrical Reliability Engineer will maintain reliability programs, troubleshoot electrical systems, and optimize maintenance protocols for long-term equipment performance and safety.
Top Skills:
Cmms SoftwareElectrical Protection SystemsEtapGe ApmHigh Voltage DistributionMotor Control CentersPowerdbReliability ToolsSAP
Cloud • Hardware • Security • Software
Manage and enhance infrastructure, automate processes, define roadmaps, and support engineering teams while ensuring uptime and efficiency.
Top Skills:
ArgocdAWSKubernetesPythonTerraform
Big Data • Cloud • Productivity • Software • Database • Analytics • Automation
The Site Reliability Engineer will support engineering teams, enhance system resilience, and drive scalable infrastructure practices.
Top Skills:
Aws ServicesGrafanaHoneycombLinuxPythonTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Food • Marketing Tech • Manufacturing
The Senior Reliability Engineer enhances equipment reliability, reduces downtime, and improves maintenance strategies across production. This role involves collaboration with engineering and operations, leading reliability programs, and mentoring junior staff.
Top Skills:
Advanced AnalyticsCmmsDigital ToolsReliability Modeling Tools
Reposted 4 Days AgoSaved
Easy Apply
Easy Apply
Cloud • Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer at GitLab, you will automate and manage the lifecycle of GitLab environments, ensuring reliability and scalability while leading incident responses and architectural decisions.
Top Skills:
AnsibleAWSElkGCPGoGrafanaKubernetesPrometheusRubyTerraform
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Design, scale, and manage AWS services for IoT devices. Collaborate on infrastructure, optimize performance, and ensure high availability of services.
Top Skills:
AWSBashGoHelmKubernetesPythonRubyTerraform
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Embedded Site Reliability Engineer will develop and maintain software applications for Bitcoin mining, focusing on embedded systems and cloud observability. Responsibilities include software testing, bug triage, and collaboration with engineering teams to optimize performance and reliability.
Top Skills:
CC++DatadogElasticGoGrafanaJavaScriptLinuxPythonRustSplunkSQLTypescript
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The AWS Cloud Architect will design, build, and optimize cloud infrastructure, ensuring scalability and security while mentoring junior SREs and defining cloud strategy.
Top Skills:
AnsibleAws Api GatewayAws CloudfrontAws CloudtrailAws CloudwatchAws DocumentdbAws Ec2Aws EksAws LambdaAws RdsAws S3Aws Secrets ManagerAws SsmDockerGrafanaHashicorp ConsulHashicorp TerraformHashicorp VaultKubernetesNew RelicPrometheus
Artificial Intelligence • Software
As a Senior/Staff Network Reliability Engineer, you'll optimize and maintain Fluidstack's network platform, ensuring performance and reliability for AI and HPC workloads. Responsibilities include tuning networking protocols, deploying and validating switches, automating telemetry, conducting root-cause analyses, and collaborating with vendors.
Top Skills:
BgpDpdkEbpfEvpnGeneveGoPythonRdmaRustTcp/IpVxlanXdp
Reposted 9 Days AgoSaved
Easy Apply
Easy Apply
Energy
As a Reliability Engineer, you will define reliability requirements, analyze design failures, run tests, and ensure high standards for hardware products.
Top Skills:
JmpMinitabPython
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Senior Software Engineer focused on Site Reliability Tooling, you'll enhance system reliability, implement SRE practices, and build automation tools to support site reliability across Upstart's infrastructure.
Top Skills:
CdkCloudFormationDatadogGoJavaScriptKubernetesPrometheusPythonTerraformTypescript
Industrial • Manufacturing
Lead reliability strategy for HVAC components, develop predictive life models, design tests, and improve product durability. Analyze data and mentor junior engineers.
Top Skills:
Accelerated Test MethodsHvac SystemsIot DevicesPredictive Life ModelsWeibull Modeling
Appliances
Lead reliability strategies for HVAC components, develop predictive models, conduct tests, analyze data, and mentor junior engineers.
Top Skills:
Accelerated Life TestingCorrosion TestingEnvironmental ChambersHvac SystemsReliability EngineeringVibration TablesWeibull Analysis
Sales • Software • Automation
Join the Infrastructure Team to build and maintain critical systems, automating database lifecycles and enhancing disaster recovery with a focus on resilience and simplicity.
Top Skills:
AnsibleArgocdAWSClickhouseDockerElasticsearchFlaskGithub ActionsGrafanaKubernetesMongoDBPostgresPythonRedisTerraform
Marketing Tech
The Cloud Reliability Engineer develops, configures, and deploys cloud tools, enhances applications, ensures observability, and participates in on-call rotations.
Top Skills:
AWSCi/CdDockerGithub ActionsGoGoogle BigqueryGCPKubernetesLinuxPythonSQLTerraform
Artificial Intelligence • Healthtech • Software
As a Site Reliability Engineer, you will manage cloud infrastructure, implement observability, and ensure system reliability by collaborating with engineering teams and maintaining databases.
Top Skills:
AzureBashGitGitKubernetesPostgresPythonRedisSQLTypescriptVscode
Top San Francisco Companies Hiring Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results

































