Unusual Ventures Logo

Unusual Ventures

Software Engineer, ML Serving - Rime Ai

Posted Yesterday
Be an Early Applicant
In-Office
San Francisco, CA, USA
Senior level
In-Office
San Francisco, CA, USA
Senior level
Design, implement, and operate Rime's real-time TTS model serving infrastructure across GPU-backed inference engines and APIs. Optimize models from single-node to disaggregated fleets, ensure compatibility with NVIDIA hardware, own CI/CD for serving, manage SRE responsibilities including on-call, monitoring, cost and resource provisioning for GPU fleets.
The summary above was generated by AI

Rime is a foundation modeling company that builds voice AI for enterprises running customer experiences at scale. Our models are purpose-built for high-volume conversational deployments, engineered for the accuracy, performance, and deployment flexibility that production environments actually demand.

We started from a different premise than the rest of the field: build voice AI for human connection, not slop. Before we trained a single model, we built our own corpus: full-duplex, studio-quality conversational speech of normal people, recorded and annotated by linguists. It's why our models are unparalleled in naturalism, and it's why enterprises pick Rime when pilots need to make it to production.

 Role Overview

We're hiring a Software Engineer to own the serving infrastructure that connects Rime's inference engines to the world. This role sits at the intersection of ML systems and cloud infrastructure — you'll work directly on model inference and cloud infrastructure to build, harden, and scale the systems that stream voice at real-time latency. As Rime moves toward its next-generation architecture, you'll be a core architect of how our models get served.


What You'll Own

  • Architecture and implementation of Rime's TTS serving infrastructure, from GPU-backed inference engines to the API surface.

  • Model optimization from a single-node to disaggregated fleet serving.

  • Compatibility with different NVIDIA hardwares from Hopper to Blackwell and beyond for on-prem and cloud deployments.

  • Continuous integration and deployment workflows for the model serving pipeline.

  • Site reliability: on-call rotation, monitoring, alerting, and observability across the serving stack.

  • Resource provision, cost management across our GPU fleet.

What We're Looking For

  • Hands-on experience with real-time multinode ML serving infrastructure — ML serving framework experience: NVIDIA Dynamo/Triton, vLLM, SGLang, or equivalent.

  • Experience with distributed or disaggregated model serving (Tensor Parallel, Pipeline Parallel, or equivalent).

  • Strong cloud infrastructure fundamentals: Linux internals, networking, containerization (Docker, Kubernetes).

  • IaC experience — Terraform, Packer, or comparable. You should have opinions about how to do this right.

  • On-call is part of the job. You treat production reliability as a shared responsibility.

Nice to Have

  • Experience with multinode training (DDP, FSDP, etc.).

  • Experience with gRPC or other bidirectional binary streaming protocols.

  • Experience with audio streaming and related technologies (WebRTC, WebSockets, etc.).

  • Experience with a multilingual monorepo where you pick the best language out of merit more than personal experience.

  • Experience with multi-cloud infrastructures (AWS, GCP, OCI, etc.).

  • Comfort with configuration management tooling (Ansible, Chef, Puppet, or similar).

  • SRE, DevOps, or platform engineering background at a startup.

  • Experience at an early-stage company.

Why Join Rime

  • Build the serving infrastructure behind a category-defining voice AI company from the ground up.

  • You will bring in experience that no one else currently has at the company: you can help us set the vision.

  • Direct collaboration with the inference, platform, and ML teams — no handoff culture.

  • The systems you build determine what experiences our customers can deploy at scale.

  • Meaningful equity upside at an early stage.

  • High ownership, high standards, low bureaucracy.

  • SF / Bay Area.

At Rime, we...

  • Are outliers

  • Cut through the hype to focus on the craft

  • Move fast with agency and freedom

  • Maintain a growth mindset, finding joy in the struggle

  • Do the right things, knowing that it'll lead to making money

  • If that sounds like you too, you'll be a great fit for Rime!

Unusual Ventures Menlo Park, California, USA Office

200 Middlefield Rd, Menlo Park, CA, United States, 94025

Similar Jobs

7 Minutes Ago
Remote or Hybrid
United States
165K-215K Annually
Expert/Leader
165K-215K Annually
Expert/Leader
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Lead stabilization and modernization of Leave Solutions operations, drive AI (agentic) enablement and SKAN.AI adoption, set governance and readiness, deliver measurable KPI improvements, oversee change management, workforce enablement, and cross-functional alignment for safe, auditable AI decisioning.
Top Skills: Advanced Decisioning SystemsAgentic AiIntelligent AutomationSkan.Ai
7 Minutes Ago
Hybrid
San Francisco, CA, USA
62K-83K Annually
Entry level
62K-83K Annually
Entry level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Provide paralegal support for commercial real estate and corporate transactions, managing entity governance, drafting and reviewing transaction documents, maintaining trackers and digital minute books, supporting closing and post-closing deliverables, driving process improvements, and assisting attorneys and cross-functional stakeholders with workflow automation and legal-technology initiatives.
Top Skills: CtadvantageGemsGenerative Ai ToolsImanageIntralinksLexisnexisLiteraMicrosoft 365ProdealWestlawWorkflow Automation
7 Minutes Ago
Remote or Hybrid
United States
42K-54K Annually
Junior
42K-54K Annually
Junior
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Review and evaluate Group Life claim documentation for completeness and accuracy, process payments, respond to policyholder and beneficiary inquiries, ensure regulatory compliance, meet production and quality targets, mentor reviewers, and support claimants compassionately throughout the claims process.
Top Skills: Bios (Windows-Based Claim System)ExcelMicrosoft TeamsMicrosoft WordWindows

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account