Senior Site Reliability Engineer

DO NOT USE - Agero

| Hybrid

Sorry, this job was removed at 2:17 p.m. (PST) on Wednesday, April 6, 2022

View 1903 Jobs

Find out who's hiring in San Francisco.

See all Developer + Engineer jobs in San Francisco

View 1903 Jobs

Easy Apply

By clicking Apply Now you agree to share your profile information with the hiring company.

Save job

Agero + Swoop provides simplicity and peace of mind in the chaotic world of roadside and accident assistance. Nimble and energetic, our Swoop SaaS technology team is transforming the entire industry, taking manual processes and redefining them as digital experiences, with a focus on speed to market, immediate impact and rapid growth.

Our automotive clients represent more than 2 in 3 passenger vehicles sold in the U.S., and two-thirds of insurance companies are served by Agero. We handle over 12 million vehicle disablement events annually, serving 115 million drivers across the U.S.

It’s not just a job, it’s a mission to make driving safer, smarter and more enjoyable for everyone.

Role Description and Mission:

As a member of the Site Reliability Team, the Senior Site Reliability Engineer will conceive and execute a blueprint aimed at increasing service availability, forecasting monitoring needs and requirements, and automating resolution of future issues. This is achieved by focusing on proactive and holistic approaches to continuously improving the customer experience.

Key Outcomes:

Improve and influence key processes across the company, such as incident response, durable ownership, and on-call rotations
Steward reliability as a feature across the organization through concepts such as SLOs and service maturity
An interest in distributed systems, especially those with high availability requirements
An ability to troubleshoot and debug issues, especially those that relate to emerging problems on unfamiliar architectures
Great communication and problem-solving skills, coupled with a willingness to collaborate and work with empathy
Set standards for deployments at scale, infrastructure reliability, and scalability. Iterate, revisit, and optimize service availability, scalability, and performance
Experience with being on-call and willingness to represent our team in incidents

Skills, Education and Experience:

5+ years of experience in a Reliability Engineering, DevOps or infrastructure focused role
Container orchestration with Kubernetes
Passion for designing and building reliable systems
Advanced experience with programming languages (Ruby on Rails, Python, Bash Scripting)
Deep systems and infrastructure knowledge leveraging infrastructure as code (Terraform)
Automation advocate - you truly believe in removing operation load with software
Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks
Experience with scale testing, disaster recovery, and capacity planning
Excellent troubleshooting and problem solving skills (P1 production issues)
Demonstrated ability to deliver results on time and of high quality

Read Full Job Description

Senior Site Reliability Engineer

Location

Similar Jobs