Senior Site Reliability Engineer

Sorry, this job was removed at 2:17 p.m. (PST) on Wednesday, April 6, 2022
Find out who's hiring in San Francisco.
See all Developer + Engineer jobs in San Francisco
Easy Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

Agero + Swoop provides simplicity and peace of mind in the chaotic world of roadside and accident assistance. Nimble and energetic, our Swoop SaaS technology team is transforming the entire industry, taking manual processes and redefining them as digital experiences, with a focus on speed to market, immediate impact and rapid growth. 

Our automotive clients represent more than 2 in 3 passenger vehicles sold in the U.S., and two-thirds of insurance companies are served by Agero. We handle over 12 million vehicle disablement events annually, serving 115 million drivers across the U.S. 

It’s not just a job, it’s a mission to make driving safer, smarter and more enjoyable for everyone. 

Role Description and Mission:

As a member of the Site Reliability Team, the Senior Site Reliability Engineer will conceive and execute a blueprint aimed at increasing service availability, forecasting monitoring needs and requirements, and automating resolution of future issues. This is achieved by focusing on proactive and holistic approaches to continuously improving the customer experience.

Key Outcomes

  • Improve and influence key processes across the company, such as incident response, durable ownership, and on-call rotations
  • Steward reliability as a feature across the organization through concepts such as SLOs and service maturity
  • An interest in distributed systems, especially those with high availability requirements
  • An ability to troubleshoot and debug issues, especially those that relate to emerging problems on unfamiliar architectures
  • Great communication and problem-solving skills, coupled with a willingness to collaborate and work with empathy
  • Set standards for deployments at scale, infrastructure reliability, and scalability. Iterate, revisit, and optimize service availability, scalability, and performance
  • Experience with being on-call and willingness to represent our team in incidents

Skills, Education and Experience:

  • 5+ years of experience in a Reliability Engineering, DevOps or infrastructure focused role
  • Container orchestration with Kubernetes 
  • Passion for designing and building reliable systems 
  • Advanced experience with programming languages (Ruby on Rails, Python, Bash Scripting) 
  • Deep systems and infrastructure knowledge leveraging infrastructure as code (Terraform) 
  • Automation advocate - you truly believe in removing operation load with software 
  • Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks
  • Experience with scale testing, disaster recovery, and capacity planning 
  • Excellent troubleshooting and problem solving skills (P1 production issues) 
  • Demonstrated ability to deliver results on time and of high quality





Read Full Job Description
Easy Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

Location

San Francisco, CA

Similar Jobs

Easy Apply
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about DO NOT USE - AgeroFind similar jobs