Cloudbeds Logo

Cloudbeds

Senior Site Reliability Engineer

Posted 7 Days Ago
Be an Early Applicant
Easy Apply
Remote
Hiring Remotely in Canada
Senior level
Easy Apply
Remote
Hiring Remotely in Canada
Senior level
The Senior Site Reliability Engineer ensures platform reliability and performance, architecting AWS cloud solutions and fostering automation and resilience across engineering teams.
The summary above was generated by AI

What Makes Us Unique 

At Cloudbeds, we're not just building software, we’re transforming hospitality. Our intelligently designed platform powers properties across 150 countries, processing billions in bookings annually. From independent properties to hotel groups, we help hoteliers transform operations and uplevel their commercial strategy through a unified platform that integrates with hundreds of partners. And we do it with a completely remote team. Imagine working alongside global innovators to build AI-powered solutions that solve hoteliers' biggest challenges. Since our founding in 2012, we've become the World's Best Hotel PMS Solutions Provider and landed on Deloitte's Technology Fast 500 again in 2024 – but we're just getting started. 



As a Sr. Site Reliability Engineer, you'll be the guardian of our platform's reliability and performance, ensuring millions of hospitality transactions flow seamlessly across the globe. You'll architect and implement scalable AWS cloud solutions that keep the most ambitious hotels running 24/7, while fostering a culture of automation, resilience, and continuous improvement across our engineering teams.

Our SRE Team:

We're a bottom-up, collaborative team that thrives on healthy debate and shared ownership of our infrastructure. You'll have endless opportunities to influence architecture decisions while working with cutting-edge cloud technologies at scale. We believe the best solutions come from engineers who are empowered to innovate, experiment, and challenge the status quo.

What You Bring to the Team:

  • Design and implement reliable and scalable AWS architecture to meet the needs of the organization.
  • Maintain and support highly loaded Kubernetes (EKS) clusters and infrastructure-related components.
  • Support the CICD process with ArgoCD and GitOps.
  • Automate the platform deployments with Terraform infrastructure-as-code.
  • Develop and continuously improve product Observability and Monitoring systems based on the Grafana, Prometheus, DataDog, and Cloudwatch.
  • Respond and participate with Incident Management and Root Cause Analysis, ensuring minimal impact on services.
  • Optimize system performance and troubleshoot issues as they arise.
  • Collaborate with development teams to establish monitoring best practices and ensure systems meet reliability targets. 
  • Collaborate with security teams to implement and maintain security best practices.
  • Infrastructure support rotation providing guidance to other engineering teams.

What Sets You Up for Success:

  • 5+ years of experience as a DevOps or SRE working within the AWS ecosystem.
  • 5+ years of experience with Kubernetes (EKS) and Helm charts.
  • Experience with designing, building, and supporting CI/CD pipelines with ArgoCD and GitHub actions.
  • Experience with infrastructure-as-code methodologies with Terraform.
  • Experience with Observability and Monitoring with Grafana, Prometheus, DataDog, and Cloudwatch.
  • Experience with Incident Management, full stack troubleshooting, performance analysis and root cause analysis (RCA).
  • Experience with Web application systems such as Nginx, Ingress controllers, load balancing and Content Delivery Networks. 
  • Experience with Databases (MySQL, PostgreSQL, Aurora) and Middleware technologies (Redis, Memcached and SQS)
  • Good networking skills with VPC, Security Groups and Network ACLs.
  • Ability to work remotely and manage your own time in a global team.
  • Good written and verbal communication in English.
  • Bachelor’s degree in Computer Science or equivalent experience.

Bonus Skills to Stand Out:

  • Advanced experience with Database Administration (Aurora, MySQL, PostgreSQL).
  • Experience working in a PCI-compliant environment.
  • Experience working with Kong API Gateway.

#LI-IK1

What to Expect - Your Journey with Us 

Behind Cloudbeds' revolutionary technology is a team of redefining what's possible in hospitality. We're 650+ employees across 40+ countries, bringing together elite engineers, AI architects, world-class designers, and hospitality veterans to solve challenges others haven't dared to tackle. Our diverse team speaks 30+ languages, but we all share one language: a passion for innovation and travel. From pioneering breakthroughs in machine learning to revolutionizing how hotels operate, we're not just watching the future of hospitality unfold – we're coding it, designing it, writing it and shipping it. If you're ready to work alongside some of the brightest minds in tech who are obsessed with using AI to transform a trillion-dollar industry, this is your chance to be part of something extraordinary.

Learn more online at cloudbeds.com

Company Awards to Check Out! 
  • Best All-In-One Hotel Management System | HotelTechAwards (2025)
  • Overall 10 Best Places to Work | HotelTechAwards (2025)
  • Most Loved Workplace® Certified (2024) 
  • Top 10 People’s Choice(2024)
  • Deloitte Technology Fast 500 (2024)
 Discover our Benefits:
  • Remote First, Remote Always 
  • PTO in accordance with local labor requirements
  • Monthly Wellness Fridays - enjoy an extra long weekend every month
  • Full Paid Parental Leave
  • Home office stipend based on country of residency
  • Professional development courses in Cloudbeds University
  • Access to professional development, including manager training, upskilling and knowledge transfer.
Everyone is Welcome - A Culture of Inclusion  

Cloudbeds is proud to be an Equal Opportunity Employer that celebrates the diversity in our global team! We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Cloudbeds is committed to the full inclusion of all qualified individuals. As part of this commitment, Cloudbeds will ensure that persons with disabilities are provided reasonable accommodations in the hiring process. We encourage deaf, hard of hearing, deaf-blind, and deaf-disabled individuals to apply. If reasonable accommodation is needed to participate in the job application or interview process or to perform essential job functions, please contact our HR team by phone at (858) 201-7832 or via email at [email protected]. Cloudbeds will provide an American Sign Language (ASL) interpreter where needed as a reasonable accommodation for the hiring processes.

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Cloudbeds. Staffing, recruiting agencies, and individuals being represented by an agency are not authorized to use this site or to submit applications, and any such submissions will be considered unsolicited. Cloudbeds does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Cloudbeds employees, or any other company location. Cloudbeds is not responsible for any fees related to unsolicited resumes/applications.

Top Skills

Argocd
Aurora
AWS
Cloudwatch
Datadog
Gitops
Grafana
Kubernetes
Memcached
MySQL
Nginx
Postgres
Prometheus
Redis
Sqs
Terraform

Similar Jobs

Yesterday
In-Office or Remote
6 Locations
Senior level
Senior level
Software
The Senior Site Reliability Engineer will lead service onboarding, maintain SLAs/SLOs, design secure infrastructure, automate operational tasks, and respond to incidents while ensuring system reliability and performance.
Top Skills: AWSCloudFormationElk StackGoGrafanaHadoopKubernetesPythonTerraform
7 Days Ago
Easy Apply
Remote
Easy Apply
Senior level
Senior level
Information Technology • Software • Travel • Hospitality
As a Senior Site Reliability Engineer at Cloudbeds, you'll ensure platform reliability and performance, manage AWS cloud solutions, maintain Kubernetes clusters, support CI/CD processes, and collaborate across teams to foster a culture of automation and resilience.
Top Skills: ArgocdAuroraAWSCloudwatchDatadogGitopsGrafanaKubernetesMemcachedMySQLNginxPostgresPrometheusRedisSqsTerraform
11 Days Ago
Easy Apply
Remote
Easy Apply
Senior level
Senior level
Database • Analytics
The Senior Site Reliability Engineer will ensure reliability and scalability of cloud infrastructure, enhance incident management, and optimize operational efficiencies through collaboration with various teams.
Top Skills: AnsibleAWSAzureDocker SwarmGoGoogle Cloud PlatformKubernetesPuppetPythonTerraform

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account