Obsidian Security

Sr. Staff Site Reliability Engineer

Reposted 5 Hours Ago
In-Office
Palo Alto, CA, USA
232K-263K Annually
Senior level
As a Sr. Staff Site Reliability Engineer, you will define the reliability vision for a multi-tenant SaaS platform, lead the architecture of detection systems, and partner across teams to improve incident management and system resilience, ensuring issues are resolved before affecting customers.
The summary above was generated by AI

Obsidian Security is the leading SaaS security platform, trusted by global enterprises like Snowflake, T-Mobile, and Algolia. We protect 200+ organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand, including many of the world’s largest Fortune 1000 and Global 2000 companies.

Founded in 2017 and backed by top investors like Greylock, Obsidian was built to close a critical gap: securing SaaS apps where business happens—Microsoft 365, Salesforce, and hundreds more. The company does this by offering a complete SaaS security platform to reduce risk, detect and respond to threats, and prevent breaches at the source. Obsidian was built by leaders who redefined endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black. Now, they’re transforming how SaaS is secured.

With AI driving rapid SaaS growth and complexity, agentic AI tools gain privileged access to sensitive data through integrations, creating new risks most security tools miss. Obsidian uniquely detects anomalous OAuth token activity and manages integration risks. Major announcements are on the horizon. Recognizing that SaaS security needs to evolve, Obsidian enables growing organizations to start with a lightweight, prevention-focused browser extension and expand coverage over time.

With global momentum, a growing partner ecosystem including SentinelOne, Databricks, and Google Cloud, and a major fundraise ahead, Obsidian is scaling rapidly toward long-term growth and IPO readiness.

Sr. Staff Site Reliability Engineer

As a Sr. Staff SRE at Obsidian, you will define and drive the company-wide reliability vision for a complex, multi-tenant SaaS platform serving enterprise and financial customers. You will operate as a strategic partner to DevOps and Platform Engineering leadership, shaping a unified reliability strategy that scales across the organization.

Your core mandate: ensure Obsidian detects, diagnoses, and communicates system issues before customers are impacted—consistently and predictably.

This is a hands-on technical role that involves architecting and leading the implementation of systems that handle real-world complexity, including upstream SaaS dependencies, sparse and noisy signals, and mission-critical enterprise workloads.

Key Responsibilities

  • Reliability Strategy & Architecture - Define and lead long-term reliability strategy across services. Establish end-to-end system visibility frameworks and guide architecture for observability, detection, and resilience.
  • Cross-Org Leadership - Partner across teams to embed reliability, standardize SLI/SLOs, and serve as a technical escalation expert.
  • Detection & Observability - Build intelligent detection systems (anomaly detection, connector health models) and enable self-service observability.
  • Incident Management - Define and evolve a tiered incident communication strategy, improve response practices, and lead postmortems to strengthen reliability and customer trust.
  • Execution - Contribute hands-on to system design, monitoring, and debugging across distributed systems and data pipelines.

Required Qualifications

  • 5+ years in SRE, Production Engineering, or related roles
  • 3+ years operating at a senior or technical leadership level (Staff or equivalent scope)
  • Deep expertise in:
    • AWS and/or GCP
    • Kubernetes and Helm
    • Observability stacks (Prometheus, Grafana, or equivalent)
    • CI/CD systems (GitLab CI/CD, ArgoCD, etc.)
  • Proven experience designing and scaling reliability systems for multi-tenant SaaS platforms
  • Strong debugging and systems thinking across distributed microservices and legacy systems
  • Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience
  • Hands-on engineering approach with a track record of building—not just configuring—reliability systems

Preferred Qualifications

  • Experience in B2B SaaS serving enterprise or financial customers
  • Familiarity with third-party SaaS connector architectures and ingestion patterns
  • Experience building anomaly detection or intelligent alerting systems
  • Experience designing customer-facing status pages and incident communication frameworks

Why This Role

  • Drive org-wide reliability strategy
  • Own and build new detection & observability systems
  • Tackle complex distributed systems challenges
  • Safeguard critical infrastructure for financial customers

What Success Looks Like

  • Issues caught and resolved before customer impact
  • Reliability is measurable and continuously improving
  • Teams self-serve observability with scalable tools
  • Clear, proactive incident communication builds trust
  • Reliability becomes a competitive advantage

Employee Benefits

Our competitive benefits packages are designed to support our employees' well-being, both at work and at home. Our US-based employees enjoy:

  • Competitive compensation with equity and 401k
  • Comprehensive healthcare with dental and vision coverage
  • Flexible paid time off and paid holiday time off 
  • 12 weeks of new parent or family leave
  • Personal and professional development resources

For more details on our US benefits, or for information on our international benefits, please see here.

Pay Transparency

Please note that the base pay range is a guideline. For candidates who receive an offer, base pay will vary based on factors such as work location and the candidate's knowledge, skills, and experience. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation based on the role or function within the company.

At Obsidian, we are proud to be an equal-opportunity employer. We value diversity and hire for talent, passion, and compassion. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization.  If you have a need that requires accommodation, please contact [email protected]

Information collected and processed as part of any job applications you choose to submit is subject to Obsidian’s Applicant Privacy Policy.

Base Salary Range
$232,000-$263,000 USD

Obsidian Security Palo Alto, California, USA Office

577 College Ave, Palo Alto, United States, 94306

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms, and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food, and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
