Xpert Development LLC

Senior DevOps & Site Reliability Engineer

Posted Yesterday
Remote
Hiring Remotely in United States
165K-190K Annually
Senior level
Own US PST coverage for releases and incidents as the first SRE; bridge infrastructure and code by working with Kubernetes, Terraform, and AWS and patching Elixir when needed; lead incident response and post-mortems; define SLOs and observability; author runbooks and support HIPAA-aligned compliance for a regulated medical-device platform.
The summary above was generated by AI

About the role

As our first SRE & first engineer in the US, you will own the platform’s stability and releases, especially during PST hours.

You are the perfect "bridge" profile: part system administrator, part software engineer. You don't just manage infrastructure; you understand the code running on it. You will operate with high autonomy, making critical decisions during incidents and ensuring that our production environment is state-of-the-art, secure, and resilient.

You’ll report to our Lead DevOps Engineer, Pierre. Your main missions will be:

  • Own US coverage for releases and incidents as the first responder during PST hours.

  • Bridge infra and code by working hand-in-hand with our DevOps team on Kubernetes, Terraform, and AWS, while being able to read and patch Elixir code to unblock yourself without waiting for a backend engineer.

  • Drive incident response end-to-end, managing triage, mitigation, and blameless post-mortems with real follow-through.

  • Improve the platform’s operability by defining SLOs, tuning alerts to reduce toil, and pushing observability (metrics, logs, tracing) where it’s lacking.

  • Transfer operational knowledge from France to the US by authoring runbooks and documenting procedures so local teams are empowered to act when something breaks.

  • Support compliance and security in our regulated medical-device environment, maintaining HIPAA-aligned controls and an audit-ready infrastructure.

About the profile

Sonio is a mission-driven company, so interest in our mission is critical. Other requirements are:

  • 4+ years of experience in SRE, DevOps, or Production Engineering, including significant on-call experience on a 24/7 product

  • You possess a hybrid "code-literate" mindset, acting as an infrastructure expert who can also navigate a backend codebase to triage and patch issues independently.

  • You bring strong technical foundations in Kubernetes, Terraform, and AWS, along with the ability to architect and tune your own observability signals.

  • You are highly autonomous and comfortable making technical decisions with limited supervision, which is essential given the timezone difference with France.

  • You maintain operational rigor and stay calm under pressure, with the written English skills necessary to produce high-quality runbooks and handle async handoffs.

Location: anywhere you can cover the PST timezone (not necessarily in the US)

Salary: $165,000-$190,000 + 10% bonus

Benefits:

⚕️ Health Insurance (medical, vision, dental) - up to $30,000 per year + FSA & HSA

👵 401(k) - up to 4% of your salary matched

⛑️ Life Insurance - covering 2 times your salary, up to $200k

🐣 An attractive Parental Policy for primary and secondary caregivers

🏝️ 20 PTO days + 1 week off offered between Christmas and New Year

🖥️ Offices in Boston (HQ) & New York (incl. free breakfast, drinks & gym)

⏰ Flexible hours & remote policies

🚎 Commuter Benefits

✈️ One offsite per year in France & regular team building with US team

🚀 Ongoing training and continuous opportunities for professional growth and development, including unlimited access to coaching

We move fast and aim to be transparent about the process: our goal is for the time from the first chat to an offer to be no longer than a month.
