Knowtex

Applied ML Engineer

Posted 20 Days Ago
Hybrid
San Francisco, CA, USA
Mid level
The Applied ML Engineer will productionize and scale machine learning systems for a voice AI platform, optimizing deployment and performance in healthcare environments.
The summary above was generated by AI

About Knowtex

Knowtex is building the future of voice AI operating systems for clinicians, transforming how healthcare documentation happens at the point of care. Founded by Stanford AI scientists with deep clinical experience, we're experiencing explosive growth across both commercial health systems and federal healthcare, with our ambient documentation platform scaling rapidly to thousands of clinicians across hundreds of specialties. We're at an inflection point where cutting-edge AI meets real clinical impact, giving clinicians hours back each day to focus on what matters most: their patients.

Position Overview

We are seeking an Applied ML Engineer to productionize and scale the machine learning systems powering our voice AI platform. This role bridges research and engineering, transforming models into reliable, low-latency, production-grade systems deployed across enterprise healthcare environments.

You will work closely with ML Scientists, Backend Engineers, and Platform teams to optimize inference performance, build evaluation pipelines, and ensure robust model deployment in regulated environments.

Key Responsibilities

  • Productionize ML models for real-time clinical applications

  • Optimize inference pipelines for low latency and high throughput

  • Deploy and scale models using AWS-based infrastructure

  • Build automated evaluation and regression testing frameworks for LLM outputs

  • Implement monitoring systems for model performance and drift detection

  • Collaborate with Backend teams to integrate ML services into APIs and workflows

  • Improve model efficiency through quantization, batching, caching, and other optimization techniques

  • Support specialty-level model evaluation and performance analysis

  • Contribute to CI/CD workflows for ML deployment

Required Qualifications

  • 3–7+ years of experience in machine learning engineering or applied ML roles

  • Strong proficiency in Python and PyTorch (or TensorFlow)

  • Experience deploying ML models in production environments

  • Familiarity with transformer architectures and large language models

  • Experience with model optimization techniques (quantization, distillation, pruning)

  • Experience working with cloud infrastructure (AWS preferred)

  • Strong software engineering fundamentals and debugging skills

Preferred Qualifications

  • Experience with speech recognition systems or NLP pipelines

  • Experience with Triton Inference Server or similar deployment frameworks

  • Familiarity with healthcare data or clinical documentation workflows

  • Experience working in regulated environments (HIPAA, GovCloud, etc.)

  • Knowledge of medical coding systems (ICD-10, CPT)

Technical Environment

  • Python, PyTorch / TensorFlow

  • Transformer-based LLM architectures

  • AWS (SageMaker, ECS, Lambda, S3)

  • Triton Inference Server

  • CI/CD pipelines for ML deployment

  • Observability tools for performance and drift monitoring

Compensation & Benefits

  • Meaningful equity compensation

  • Unlimited PTO

  • Premium health, dental, and vision coverage

  • 401(k) plan

Top Skills

AWS
Python
PyTorch
TensorFlow
Triton Inference Server
HQ

Knowtex San Francisco, California, USA Office

San Francisco, California, United States, 94108
