The Senior Site Reliability Engineer will manage cloud infrastructure, improve reliability, enhance security, and collaborate with teams to design reliable systems.
Your Career, our Future—Together.
What You'll Do
What You'll Bring
Workplace & Compensation
Let's Start the Conversation
Ready to join something big? At SoundHound AI, we bring voice, generative, and conversational AI together to transform how people interact with products and services. From voice-enabled vehicles to food ordering and customer support, our multilingual, omnichannel technology already impacts hundreds of millions worldwide.
The OpportunityThis is a high-ownership role with direct influence over infrastructure decisions. The team has a clear roadmap focused on improving reliability, security posture, and operational maturity. The Senior Site Reliability Engineer helps build first-class infrastructure to deliver our best-in-class technology to the world. The infrastructure is large and complex, running in the cloud and on Kubernetes, so there's no shortage of interesting problems.
- Build software and systems for cloud infrastructure management and automation (Terraform, Ansible, Oracle Cloud, GCP)
- Participate in developing frameworks for application deployment, customization, and upgrades (Kubernetes, ArgoCD, Vault, Jenkins)
- Ensure application and infrastructure security complies with ISO 27001 / SOX / PCI
- Improve observability, implement and measure key metrics, and define and enforce SLOs/SLAs (Prometheus, Grafana, ELK)
- Collaborate with engineering, quality engineering, and product management to architect and build highly available, reliable, and secure systems
- 8 years of experience working with cloud services at scale in a high-volume customer-facing environment with a Bachelor's degree in Computer Science or equivalent
- Willing to participate in on-call rotation
- Vast experience working in Linux environments, security, and networking with Python, Go, or Bash
- Very experienced with monitoring and alerting tools such as Prometheus, Grafana, ELK stack, and PagerDuty
- Experience with deployments in cloud technologies and architectures, CI/CD tools, and configuration management such as Ansible, Terraform, and Kubernetes
- Proficient with a wide range of relevant server-side technologies such as Consul, Vault, Kafka, MongoDB, PostgreSQL, MySQL
- Pragmatic, problem-solving approach when designing and implementing solutions
This role is available throughout Canada. Employees within a 100-kilometer radius of our Toronto office are expected to work from the office on three pre-scheduled “core days” each month to encourage cross-team connection and in-person collaboration.
Compensation includes salary, equity, comprehensive healthcare, paid time off, and other benefits. Our recruiting team will provide a specific salary range based on location and years of experience.
#LI-MQ1 #LI-REMOTE
Join SoundHound AI and collaborate with colleagues worldwide who are shaping the future of voice AI. Guided by our values—supportive, open, undaunted, nimble, and determined to win—we strive to build breakthrough AI experiences together.
We provide reasonable accommodations for individuals with disabilities throughout the hiring process and employment. To view our job applicant privacy policy, please visit https://static.soundhound.com/corpus/ta/applicantprivacynotice.html.
Discover more about our philosophy, benefits, and culture at https://www.soundhound.com/careers.
***Please beware of agency recruiters falsely stating that they represent SoundHound AI on job posts. Our job post above will note if we are utilizing a specific agency to assist with the search. Our recruiters use @soundhound.com email addresses exclusively.
SoundHound Santa Clara, California, USA Office
5400 Betsy Ross Drive, Santa Clara, CA, United States, 95054
SoundHound San Francisco, California, USA Office
544 Market St, San Francisco, CA, United States, 94104
Similar Jobs
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Lead SRE work to keep Circle highly available and performant: respond to incidents, own monitoring/alerting/log management, manage and optimize MySQL/Postgres/ClickHouse/Redis databases, maintain server infrastructure and deployment pipelines, collaborate with engineering teams, and build internal SRE tooling and automation.
Top Skills:
AWSClickhouseKubernetesLlm-Based Tools (Copilots)MySQLPostgresRedis
Cloud • Security • Software • Generative AI
Lead engineering initiatives to automate and scale Elastic's multi-cloud platform. Build and maintain software, tooling, and automations for reliability; manage Kubernetes at scale; respond to major incidents and drive problem management; collaborate across distributed teams and participate in a follow-the-sun on-call rotation to prevent customer impact.
Top Skills:
CrossplaneDockerElastic CloudElastic StackGoInfluxdbInfrastructure-As-CodeKubernetesLinuxPrometheusServerlessTerraform
Software
Operate and maintain production AWS/EKS Kubernetes clusters; design and ship infrastructure-as-code with Terraform; manage Helm charts and ArgoCD GitOps for multi-region SaaS; maintain observability (Grafana, alerting, logs); improve CI/CD pipelines; remediate container and infrastructure CVEs; support compliance (FedRAMP/SOC2/NIST); create runbooks and lead incident response and post-incident reviews.
Top Skills:
Amazon EksArgocdAWSCi/CdClaudeDockerGitopsGrafanaHelmKubernetesTerraform
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine



