As a Site Reliability Engineer, you'll enhance the performance and reliability of infrastructure and products by collaborating with engineering teams, automating configurations, and implementing monitoring systems.
At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we’re guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed, ownership, and a relentless focus on outcomes. We work in tight, skilled teams, prioritize clarity over complexity, and continuously evolve through curiosity, data, and craftsmanship. We’re seeking technologists and problem solvers who thrive in fast-paced environments, love collaborating with great talent, and approach every day like it’s Day 1.
We're a globally diverse team with hubs in New York City, Mountain View, Latin America, and India—embracing both hybrid and remote work to bring the best minds together, wherever they are. If you're driven by continuous learning, rapid pivots, and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job—it’s a journey.
Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of ASAPP's infrastructure and products. The team owns the entire infrastructure stacks. SREs design and implement the tools that automate building reliable and performant systems. We emphasize building tools over manual processes. We implement, not administer. We’re obsessed with automation, not repetition. Our job is to focus on building reliable infrastructure and tools for our product teams so that they can solve customer problems and deliver new features, not reinvent platforms.
What you'll do
- Work with product engineering teams on service architecture and implementation
- Deliver Infrastructure configuration as code and automate everything
- Direct and implement monitoring and alerting systems to support rapid problem diagnosis
- Perform Root Cause Analysis and design and deliver resolutions
- Work on our Kubernetes / AWS infrastructure to support our product engineers
- Design secure and performant networking solutions in our production systems
What you'll need
- +4 years of relevant experience bringing software to production at high scale
- Participation in on-call rotation, triaging and addressing production issues
- Obsession with automation and instrumentation
- Understanding of complex systems and failure scenarios
- Excellent communication skills
- Knowledge of AWS services, containers and container management frameworks
- Familiarity with Message Bus based systems and distributed architectures
- Proficiency in Terraform , Python and/or Go
What we'd like to see
- BS or MS degree in the Computer Science field, or equivalent hands-on experience.
- Experience in product oriented environments
- Scalable distributed applications experience
Benefits
- Competitive compensation with stock options
- Comprehensive medical, vision, and dental insurance
- 401k matching
- Fitness and wellness stipend
- Mobile phone reimbursement
- Mental well-being benefits
- Professional learning and development stipend
- Parental leave, including adoptive and foster parents
- 3 weeks paid time off (increases with tenure) and unlimited sick leave
ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at [email protected] to obtain assistance. #LI-SL1 #LI-Hybrid
ASAPP Mountain View, California, USA Office
717 North Shoreline Boulevard, Mountain View, California, United States, 94043
Similar Jobs
eCommerce • Fintech • Payments • Software
The role involves ensuring software reliability and performance, managing incidents, developing infrastructure automation, and mentoring junior engineers within a platform team.
Top Skills:
AWSCloudFormationDatadogKubernetesOpentelemetryRubyRuby On RailsTerraform
Healthtech • Pharmaceutical • Telehealth
As a Senior Site Reliability Engineer, you will ensure the reliability and scalability of production systems, drive incident response, and collaborate with cross-functional teams on best practices for resilience and observability.
Top Skills:
AWSDatadogEksElasticacheGoPulumiPythonRdsRoute53S3Terraform
Financial Services
The Senior Site Reliability Engineer will enhance production insights, manage scalable infrastructure, optimize Kubernetes, and develop automation tools while ensuring high availability and performance in cloud-based systems.
Top Skills:
AnsibleAWSGCPGitopsGoGrafanaHelmIacKubernetesPythonSplunkTerraformTerragrunt
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

.png)
.png)
