Illumio Logo

Illumio

Sr. Manager, Site Reliability Engineering

Posted 3 Days Ago
Be an Early Applicant
In-Office
Sunnyvale, CA, USA
233K-280K Annually
Senior level
In-Office
Sunnyvale, CA, USA
233K-280K Annually
Senior level
Lead a team managing SaaS security platform reliability, focusing on CI/CD automation and infrastructure scaling while ensuring product health and meeting SLAs.
The summary above was generated by AI
Onwards Together!

Illumio is the leader in ransomware and breach containment, redefining how organizations contain cyberattacks and enable operational resilience. Powered by the Illumio AI Security Graph, our breach containment platform identifies and contains threats across hybrid multi-cloud environments – stopping the spread of attacks before they become disasters.
Recognized as a Leader in the Forrester Wave™ for Microsegmentation, Illumio enables Zero Trust, strengthening cyber resilience for the infrastructure, systems, and organizations that keep the world running.

Location: 5 on-site days a week in Sunnyvale, CA Headquarters.

Our Team's Vision:

Our Engineering team is shaping the future of cybersecurity. We thrive on visionary leadership, autonomy, and ownership, fostering a culture of innovation that propels us forward in the ever-evolving cybersecurity landscape.

As a leader in Zero Trust Segmentation, we are redefining security for a world facing unprecedented cyber threats. You’ll work with a highly scalable SaaS service built using cloud-native technologies while simultaneously shipping the solution on-premises.

Our guiding philosophy in Engineering is to get things right through practicing disciplined engineering, focusing, not cutting corners, and of course having fun while we are at it. We believe in enabling ownership at all levels of the organization and empowering teams. If you thrive in this culture, come join us!

 
Your Impact:

In this role, you will lead a team of talented engineers to help build a world-class SaaS security platform so we can continue to provide quality security solutions for our customers.

Every day you will lead a small team to ensure our SaaS security platform is available and performing, finding problems before our customers do, building tools to improve speed, confidence, and visibility, while embedding security into every step of the software and infrastructure life cycle.

To thrive in this role you must have at least 5 years of people leadership experience; be fluent in AWS/Azure cloud platforms and have programming language experience while hands-on building infrastructure tooling and automation at least 50% of the time.

  • Manage Illumio’s SRE team to deliver SaaS security products to companies including the Fortune 100.

  • Work closely with Development, QA, Customer success, and Technical Support to ensure the health of our products and that all SLAs are being met for our customers.

  • Manage infrastructure to scale globally, utilizing automation tools to maximize operational efficiency on public clouds.

  • Lead the team responsible for supporting the infrastructure that powers the Illumio SaaS products.

  • Own and improve the SaaS delivery efficiency with end-to-end responsibility for application lifecycle, availability, performance, and SLAs.

  • Work with the team to improve CI/CD automation for deploying applications using Infrastructure as Code to minimize downtime and ensuring adherence to any contractual commitments.

  • Work with senior management in developing a long-term product reliability and technology road map, using strategies to align with business objectives and large scale.

  • Develop and maintain automation which can be consumed by multiple teams to deploy SaaS clusters.

  • Create a high-performing Cloud Operations and SRE team through career development, mentorship, and training

  • Ensure adherence of infrastructure and processes to FedRAMP, SOC2 and other requirements, and work with PM to adjust the infrastructure to meet any federal changes.

  • Continue to evolve product architecture and DevOps processes to ensure reliable CI/CD pipelines and continuous delivery.

Your Toolkit:
  • Bachelor's or Master’s degree in Computer Engineering, Computer Science, or related field, or equivalent relevant experience

  • 5+ years of Unix or Linux system administration experience with Chef/Ansible, Ruby and/or Python

  • 7+ years of hands-on technical experience managing/developing CI/CD solution using Concourse, Gitlab, or equivalent

  • Proven track record of improving uptime (at least 99.9%) and SLAs.

  • Experience establishing support SLOs.

  • Working experience with Cloud Technology such as application gateway, HAProxy, Nginx, microservices, databases(Postgresql), Redis

  • Experience building observability for 24/7 monitoring with Grafana, Prometheus, Splunk, Datadog, or equivalent and ability to improve uptime and meet SLAs.

  • Experience building/running revenue generating enterprise applications in at least one of the three big public cloud providers: AWS(Prefer), Azure, or GCP

  • Experience managing and running multi-tenant cloud infrastructure.

  • Extensive experience developing and using Terraform in production

  • Strong troubleshooting experience and skillset to resolve incidents working across functional teams.

  • Ability to nurture and support a strong operations culture: customer focus, excellent technology, high quality implementations, self-motivated innovation, and problem-solving.

  • A positive can-do attitude and passion to succeed.

  • Experience with managing certifications and audits such as SOC2, FedRAMP is a big plus.

  • Experience with Vault, CloudFormation, and EKS a big plus.

This position involves access to software/technology that is subject to U.S. export controls. Any job offer made will be contingent upon the applicant’s capacity to serve in compliance with U.S. export controls

#LI-KD1 #LI-ONSITE

Our Commitment

Illumio believes that an environment of unique backgrounds, experiences, viewpoints, and individual contributions creates a culture of belonging, drives our future, and makes us stronger together in support of our customers and their success.

All official job offers from our company are extended directly by our recruitment team and will be sent through an official E-Signature document for your review and signature. Please be aware that we do not ask for any personal information in the process of extending offers of employment, such as financial details or social security numbers. Upon acceptance of any offer, we will request such information as part of the onboarding process prior to or on your first day of employment, and only after completing a background check through an authorized third-party vendor. If you receive any communication asking for personal details outside of these processes, please contact us immediately to verify the authenticity of the request. Your security is important to us, and we are committed to a safe and transparent hiring experience.

For roles in San Francisco and Los Angeles: Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Illumio will consider for employment qualified applicants with arrest and conviction records.

Illumio Sunnyvale, California, USA Office

920 De Guigne Dr, Sunnyvale, CA, United States, 94085

Similar Jobs

8 Days Ago
In-Office
Santa Clara, CA, USA
200K-322K Annually
Senior level
200K-322K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Lead and reshape IT operations by managing Incident, Problem, and Change Management using AI and automation to enhance reliability, speed, and employee experience.
Top Skills: Advanced AnalyticsAIAutomationObservabilitySre Principles
3 Days Ago
In-Office
San Francisco, CA, USA
227K-325K Annually
Senior level
227K-325K Annually
Senior level
News + Entertainment
Lead and grow the Site Reliability Engineering team, ensuring platform reliability and performance through strategic planning and incident management. Champion AI integration into SRE practices to improve automation and operational efficiency while fostering a blameless learning culture.
Top Skills: AnsibleArgo RolloutsAWSDatadogFirehydrantGrafanaKubernetesLaunchdarklyOpentelemetryPagerdutyPrometheusSplunkTerraform
8 Days Ago
In-Office
San Jose, CA, USA
153K-262K Annually
Senior level
153K-262K Annually
Senior level
Fintech • Payments
Lead and grow a North America SRE/Production Operations team to ensure 24/7 reliability, scalability, and performance of payment infrastructure. Drive incident management, AIOps and automation-first initiatives, capacity planning, disaster recovery, KPIs (MTTD/MTTR/SLOs), cross-functional collaboration, and continuous improvement while mentoring managers and engineers.
Top Skills: Ai/MlAiopsAuto-RemediationAutomation PlatformsAWSAzureCi/CdContainerizationDevOpsGCPInfrastructure-As-CodeMonitoring ToolsOrchestrationRunbook Automation

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account