Ciroos

Senior Site Reliability Engineer (Senior SRE)

Reposted 12 Days Ago
In-Office
Pleasanton, CA
Senior level
The Senior Site Reliability Engineer will manage operational performance and availability, optimize AI tools, and collaborate with product teams to improve customer satisfaction.
The summary above was generated by AI

About Ciroos

Ciroos (pronounced “Sai rose”) is a seed-stage startup founded in February 2025 by a team of experienced executives and distinguished engineers with deep expertise in observability, AI, distributed systems, cloud, cybersecurity, and networking. Our mission is to provide an AI SRE Teammate that empowers SREs to be superheroes. Our AI SRE Teammate is based on a multi-agentic AI platform that uses expert human-like reasoning to decrease toil, investigate incidents faster, and drive autonomous operations for SREs. To date, we have raised $21M in seed funding led by Energy Impact Partners and several prominent angel investors. We are headquartered in Pleasanton, CA with operations in India.

Job Summary

We're looking for an experienced and curious Senior Site Reliability Engineer (SRE) to join our team. You'll work closely with our product and engineering teams, and our customers, to ensure our AI SRE Teammate delights SREs like you! You'll have a strong incentive to use the Ciroos AI SRE Teammate as your primary on-call delegate, freeing you up for more proactive tasks (or helping you get your sleep back!). You'll be responsible for the availability, operational performance, emergency response, and planning of our service. This is a unique opportunity to shape the product experience in cutting-edge technology—after all, you are customer zero for what we're building!

Responsibilities

  • Provide input to product and engineering teams to shape the direction of our product offering.

  • Implement and optimize our AI SRE Teammate to handle primary on-call rotations, providing backup when needed.

  • Participate in on-call rotations to support our production systems and respond to incidents when needed.

  • Recommend best practices to customers for implementing the Ciroos AI SRE Teammate in their environment.

  • Plan, design, build, and maintain highly scalable, reliable, and efficient infrastructure for our AI SRE Teammate.

  • Conduct post-incident reviews to identify root causes and implement preventative measures.

  • Ensure security best practices are integrated into our infrastructure and operations.

  • Contribute to a culture of continuous learning and improvement.

Qualifications

  • Deep empathy for customers in operational roles like yours (SREs, ITOps, and DevOps), with a passion for reducing their toil and innovating on their behalf.

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

  • At least 7 years of experience in Site Reliability Engineering, DevOps, or a similar role.

  • Strong proficiency in at least one programming language (e.g., Python, Go, Java).

  • Extensive experience with cloud platforms (AWS, GCP, or Azure).

  • Solid understanding of Kubernetes, infrastructure as code tools (e.g., Terraform, CloudFormation, Ansible), and CI/CD pipelines.

  • Strong knowledge of open-source and commercial observability tools, ticketing systems, and incident management/response systems.

  • Excellent problem-solving and analytical skills to diagnose complex systems systematically.

  • Strong written and verbal communication skills with a collaborative mindset.

Bonus Points

  • Experience with machine learning infrastructure or MLOps.

  • Familiarity with AI/ML concepts and technologies.

  • Contributions to open-source projects.

Why Work at Ciroos?

At Ciroos, our mission is to provide an AI SRE Teammate that empowers SREs to be superheroes. We dream big and execute fast. Build. Test. Iterate. All at lightning speed in service of our customers. Tickle your curiosity and shape the future of agentic AI for SREs like you!

We work hard and play hard. Our “BEST” perks are designed to help you operate at your peak potential in a fun environment:

  • Benefits: Comprehensive medical, vision, and dental benefits. 401k plans and commuter benefits. Free lunches, snacks, and top-of-the-line espressos!

  • Equity: Equity that could change your life, not just look nice on paper. QSBS eligibility for US employees.

  • Stewardship: A career-defining, high-impact role with plenty of mentorship opportunities from founders and other coworkers.

  • Teamwork: Collaborative coworkers with high IQ and high EQ. No politics. No bureaucracy. No permission-seeking.

Ciroos is an equal opportunity employer.

Top Skills

Ansible
AWS
Azure
CloudFormation
GCP
Go
Java
Kubernetes
Python
Terraform
HQ

Ciroos Pleasanton, California, USA Office

7901 Stoneridge Dr, Suite #210, Pleasanton, California, United States, 94588
