Machine Learning Engineer, Assessments

Speak

Reposted 19 Days Ago

Hybrid

San Francisco, CA, USA

Mid level

You will build and maintain assessment ML systems, define evaluation frameworks, and collaborate with design teams to improve language proficiency assessments.

The summary above was generated by AI

About us

Our mission is to reinvent the way people learn, starting with language.

Learning a language can change a life by opening doors to new cultures, careers, and communities. Two billion people around the world are actively trying to learn a language, but the best way to learn (one-on-one tutoring) is hard to access at scale and hasn’t been meaningfully improved in decades. Speak is building a human-level, AI-powered tutor in your pocket: a conversation-first experience that lets learners actually speak, get instant feedback, and progress through carefully designed lessons. The result is a complete path from beginner to confident speaker across multiple languages.

Speak first launched in South Korea in 2019, where it has since become the number one language learning app, and we now serve learners across many markets and 15+ languages. Speak is one of the world’s leading AI companies, with over $150M raised in venture investment from OpenAI, Accel, Founders Fund, Khosla Ventures, and more, and a distributed team across San Francisco, Seoul, Tokyo, Taipei, and Ljubljana.

About this role

We’re hiring an ML Engineer, Assessments to help build best-in-class assessment systems across multiple products (Speak for Business, B2C, and new surfaces). You will work in a tight loop with our Assessment Design Lead (Content/Learning Design), Machine Learning, Product, and Engineering to turn assessment constructs and rubrics into reliable, scalable scoring + feedback systems.

This role owns the implementation, deployment, and ongoing quality of our assessment algorithms and ML systems. While there is an immediate need to improve and expand production assessments, this work also builds a platform capability that can be reused across the app.

What you’ll be doing

Ship and own assessment ML systems end-to-end
- Build, deploy, and maintain scoring models/pipelines (feature extraction → model training → inference → feedback generation)
- Own monitoring, regression tests, and ongoing iteration to maintain accuracy targets
Define and operationalize evaluation
- Implement validation/evaluation frameworks for assessments, including metrics, test sets, and offline/online analysis
- Translate assessment requirements into measurable acceptance criteria and guardrails
Partner deeply with the Assessment Design Lead
- Co-develop the strategy, together with the Content team, to grow assessments into a core platform at Speak
- Work in a tight weekly loop to deliver incremental improvement
Drive near-term delivery across products
- Stand up or improve summative assessments (spoken language ability) and bring them reliably to production
- Prototype and validate formative assessment approaches to measure improvement over weeks/months
Support data and labeling strategy
- Help define data needs for training/evaluation (including psychometric measurement needs)
- Build or improve pipelines that support label collection and analysis (especially for efficacy studies)

What we’re looking for

Domain expertise in spoken language proficiency assessment (linguistics, applied linguistics, pedagogy, or equivalent experience)
Strong experience designing and running evaluation + validation for assessment/scoring systems, and tailoring approaches to a specific product use case
4+ years building automatic proficiency assessment systems (or equivalent depth in closely related scoring/evaluation domains)
- PhD is helpful but not required
Proven ability to ship ML models to production (not only research), including reliability, monitoring, and iteration
Strong generalist ML/analysis skills (statistics, Python, PyTorch/model training)
Ability to operate cross-functionally and communicate clearly with non-technical partners (Content/LD, PM, leadership)

Nice to have

Experience with speech/audio ML
Experience with psychometrics concepts (reliability/validity, calibration)

How we work (collaboration expectations)

This role is designed to be highly collaborative with the Assessment Design Lead. Success depends on a tight loop where constructs/rubrics and model outputs co-evolve — not a sequential handoff.

Why work at Speak

Join a fantastic, tight-knit team at the right time: we're growing very quickly, we recently raised our Series C from some of the top investors in the Valley, and we've achieved product-market fit in our initial markets. You'd join at a magical time when a single person could significantly change the course of the company.
Do your life's work with people you’ll love working with: we care strongly about our craft and want every person at Speak to feel like they're growing every day. We believe in the idea that working with people you both enjoy and have respect for makes everything better. We hire thoughtfully and only work with people we admire deeply.
Global in nature: We're live in over 40 countries and launching in a number of new markets soon. We have dedicated offices in San Francisco, Ljubljana, Seoul, and Tokyo, and you’ll have the opportunity to talk to users in each of these regions on a regular basis as well as travel.
Impact people's lives in a major way: Learning a language is one of the single most life-changing skills one can learn, and right now 99% of people never achieve their goal because the process is broken. We’re helping millions of people achieve their goals and improve their lives.

Speak does not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

San Francisco, CA, United States, 94107
