Ciroos

Senior Site Reliability Engineer (Senior SRE)

Reposted 12 Days Ago

Be an Early Applicant

In-Office

Pleasanton, CA

Senior level

In-Office

Pleasanton, CA

Senior level

The Senior Site Reliability Engineer will manage operational performance and availability, optimize AI tools, and collaborate with product teams to improve customer satisfaction.

The summary above was generated by AI

About Ciroos

Ciroos (pronounced “Sai rose”) is a seed-stage startup founded in February 2025 by a team of experienced executives and distinguished engineers with deep expertise in observability, AI, distributed systems, cloud, cybersecurity, and networking. Our mission is to provide an AI SRE Teammate that empowers SREs to be superheroes. Our AI SRE Teammate is based on a multi-agentic AI platform that uses expert human-like reasoning to decrease toil, investigate incidents faster, and drive autonomous operations for SREs. To date, we have raised $21M in seed funding led by Energy Impact Partners and several prominent angel investors. We are headquartered in Pleasanton, CA with operations in India.

Job Summary

We're looking for an experienced and curious Senior Site Reliability Engineer (SRE) to join our team. You'll work closely with our product and engineering teams, and our customers, to ensure our AI SRE Teammate delights SREs like you! You'll have a strong incentive to use the Ciroos AI SRE Teammate as your primary on-call delegate, freeing you up for more proactive tasks (or helping you get your sleep back!). You'll be responsible for the availability, operational performance, emergency response, and planning of our service. This is a unique opportunity to shape the product experience in cutting-edge technology—after all, you are customer zero for what we're building!

Responsibilities

Provide input to product and engineering teams to shape the direction of our product offering.
Implement and optimize our AI SRE Teammate to handle primary on-call rotations, providing backup when needed.
Participate in on-call rotations to support our production systems and respond to incidents when needed.
Recommend best practices to customers for implementing the Ciroos AI SRE Teammate in their environment.
Plan, design, build, and maintain highly scalable, reliable, and efficient infrastructure for our AI SRE Teammate.
Conduct post-incident reviews to identify root causes and implement preventative measures.
Ensure security best practices are integrated into our infrastructure and operations.
Contribute to a culture of continuous learning and improvement.

Qualifications

Deep empathy for customers in operational roles such as yours (SREs, ITOps, and DevOps) with a passion to reduce their toil and innovate on their behalf.
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
At least 7 years of experience in Site Reliability Engineering, DevOps, or a similar role.
Strong proficiency in at least one programming language (e.g., Python, Go, Java).
Extensive experience with cloud platforms (AWS, GCP, or Azure).
Solid understanding of Kubernetes, infrastructure as code tools (e.g., Terraform, CloudFormation, Ansible), and CI/CD pipelines.
Strong knowledge of open-source and commercial observability tools, ticketing systems, and incident management/response systems.
Excellent problem-solving and analytical skills to diagnose complex systems systematically.
Strong written and verbal communication skills with a collaborative mindset.

Bonus Points

Experience with machine learning infrastructure or MLOps.
Familiarity with AI/ML concepts and technologies.
Contributions to open-source projects.

Why Work at Ciroos?

At Ciroos, our mission is to provide an AI SRE Teammate that empowers SREs to be superheroes. We dream big and execute fast. Build. Test. Iterate. All at lightning speed in service of our customers. Tickle your curiosity and shape the future of agentic AI for SREs like you!

We work hard and play hard. Our “BEST” perks are designed to help you operate at your peak potential in a fun environment:

Benefits: Comprehensive medical, vision, and dental benefits. 401k plans and commuter benefits. Free lunches, snacks, and top-of-the-line espressos!
Equity: Equity that could change your life, not just look nice on paper. QSBS eligibility for US employees
Stewardship: A career-defining, high-impact role with plenty of mentorship opportunities from founders and other coworkers
Teamwork: Collaborative coworkers with high IQ and high EQ. No politics. No bureaucracy. No permission-seeking

Ciroos is an equal opportunity employer.

Top Skills

Ansible

AWS

Azure

CloudFormation

GCP

Java

Kubernetes

Python

Terraform

7901 Stoneridge Dr, Suite #210, Pleasanton, California, United States, 94588

Similar Jobs

Anduril

Senior Site Reliability Engineer

6 Days Ago

In-Office

Costa Mesa, CA, USA

166K-220K Annually

Senior level

166K-220K Annually

Senior level

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense

The Senior Site Reliability Engineer will build, deploy, and maintain critical infrastructure for Business Systems, enhancing CI/CD processes and promoting system reliability.

Top Skills: AnsibleAWSAzureBashCloudFormationDockerGoGoogle Cloud PlatformHelmKubernetesPuppetPythonRustTerraform

General Motors

Site Reliability Engineer

4 Days Ago

Hybrid

202K-302K Annually

Senior level

202K-302K Annually

Senior level

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing

Lead a team responsible for the reliability of hybrid cloud systems, defining SLOs/SLIs, managing on-prem utilities, and ensuring environment integrity for autonomous vehicle operations.

Top Skills: AnsibleChefConfiguration Management ToolsDhcpHybrid CloudLinuxNtpPxeSite Reliability EngineeringSlo Frameworks

Mochi Health

Site Reliability Engineer

6 Days Ago

Easy Apply

In-Office

San Francisco, CA, USA

Easy Apply

250K-300K Annually

Senior level

250K-300K Annually

Senior level

Healthtech • Telehealth

The Senior/Staff Site Reliability Engineer will build AI-driven reliability systems, enhance incident management, and lead architecture improvements while collaborating across teams to improve patient care and system performance.

Top Skills: AWSKubernetesNode.jsPostgresPythonRedisSQLTypescript

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Ciroos

Senior Site Reliability Engineer (Senior SRE)

Top Skills

Ciroos Pleasanton, California, USA Office

Similar Jobs

Senior Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

What you need to know about the San Francisco Tech Scene

Key Facts About San Francisco Tech