Speak

Machine Learning Engineer, Voice

Reposted 18 Days Ago
Hybrid
San Francisco, CA, USA
Mid level
The role involves training and deploying ASR models, improving pronunciation feedback, measuring performance, and expanding ASR systems for language learning.
The summary above was generated by AI
About us

Our mission is to reinvent the way people learn, starting with language.

Learning a language can change a life by opening doors to new cultures, careers, and communities. Two billion people around the world are actively trying to learn a language, but the best way to learn (one-on-one tutoring) is hard to access at scale and hasn’t been meaningfully improved in decades. Speak is building a human-level, AI-powered tutor in your pocket: a conversation-first experience that lets learners actually speak, get instant feedback, and progress through carefully designed lessons. The result is a complete path from beginner to confident speaker across multiple languages.

Speak first launched in South Korea in 2019, where it has since become the number one language learning app, and we now serve learners across many markets and 15+ languages. Speak is one of the world’s leading AI companies, with over $150M raised in venture investment from OpenAI, Accel, Founders Fund, Khosla Ventures, and more, and a distributed team across San Francisco, Seoul, Tokyo, Taipei, and Ljubljana.

About this role

We are looking for an experienced Machine Learning Engineer to join our team and help develop cutting-edge speech recognition models that help teach language fluency. In this role you will take ownership of the end-to-end modeling pipeline, from training and experimentation to deployment and monitoring. You will also work closely with Product teams to design innovative learning experiences and measure the efficacy of production models as they affect our end users. We are a small, dynamic team where you will contribute as a developer and thought partner on team projects like ASR, assessment, pronunciation, content personalization, and much more. This is an incredibly exciting time to join an ML team designing a personalized learning experience that will revolutionize language learning for millions of learners worldwide — come join us!

What you'll be doing
  • Training and deploying ASR models end-to-end, including monitoring, performance tracking, and retraining

  • Improving the pronunciation model that provides precise feedback, and making it more central to our learning app

  • Creating metrics to measure ASR performance across tasks and languages

  • Expanding our ASR systems to new languages and markets

  • Building and maintaining data infrastructure such as training/evaluation datasets and labeling pipelines

What we're looking for
  • Extensive experience training large models on GPUs and deploying custom deep learning models

  • Proficiency in Python and common Deep Learning frameworks like PyTorch

  • Demonstrated experience owning ML pipelines end to end, from POC to production

  • Strong communication skills and the ability to explain complex ML concepts to non-technical stakeholders

  • Sharp product sense and an ability to think broadly and cross-functionally about model quality in the context of user experience

  • Bonus

    • Experience with speech or audio

Office
  • San Francisco, CA

Why work at Speak
  1. Join a fantastic, tight-knit team at the right time: we're growing very quickly, we've most recently raised our Series C from some of the top investors in the valley, and we've achieved product-market fit in our initial markets. You'd join at a magical time when a single person could significantly change the course of the company.

  2. Do your life's work with people you’ll love working with: we care strongly about our craft and want every person at Speak to feel like they're growing every day. We believe in the idea that working with people you both enjoy and have respect for makes everything better. We hire thoughtfully and only work with people we admire deeply.

  3. Global in nature: We're live in over 40 countries and launching in a number of new markets soon. We have dedicated offices in San Francisco, Ljubljana, Seoul, and Tokyo, and you’ll have the opportunity to talk to users in each of these regions on a regular basis as well as travel.

  4. Impact people's lives in a major way: Learning a language is one of the single most life-changing skills one can learn, and right now 99% of people never achieve their goal because the process is broken. We’re helping millions of people achieve their goals and improve their lives.

Speak does not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Speak San Francisco, California, USA Office

San Francisco, CA, United States, 94107

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms, and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food, and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
