Performance Engineer

Hippocratic AI

Reposted 20 Days Ago

In-Office

Menlo Park, CA, USA

Senior level

As a Performance Engineer, you will build automated performance testing frameworks, characterize VoIP performance, and collaborate with cross-functional teams to improve system efficiency and reliability.

The summary above was generated by AI

About Us

Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations with patients. We have trained our own LLMs as part of our Polaris constellation, resulting in a system with over 99.9% accuracy.

Why Join Our Team

Reinvent healthcare with AI that puts safety first. We’re building the world’s first healthcare‑only, safety‑focused LLM — a breakthrough platform designed to transform patient outcomes at a global scale. This is category creation.

Work with the people shaping the future. Hippocratic AI was co‑founded by CEO Munjal Shah and a team of physicians, hospital leaders, AI pioneers, and researchers from institutions like El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft, and NVIDIA.

Backed by the world’s leading healthcare and AI investors. We recently raised a $126M Series C at a $3.5B valuation, led by Avenir Growth, bringing total funding to $404M with participation from CapitalG, General Catalyst, a16z, Kleiner Perkins, Premji Invest, UHS, Cincinnati Children’s, WellSpan Health, John Doerr, Rick Klausner, and others.

Build alongside the best in healthcare and AI. Join experts who’ve spent their careers improving care, advancing science, and building world‑changing technologies — ensuring our platform is powerful, trusted, and truly transformative.

About the Role

We're hiring a Performance Engineer to own performance across our entire stack. You'll build the automated harnesses that keep us honest, continuously measuring every model, microservice, and infrastructure component, and be our expert voice on VoIP quality characterization. This is a high-impact, highly technical role that cuts across ML infrastructure, backend services, and real-time communications.

What You'll Do

Build Automated Performance Harnesses

Design and maintain automated performance testing frameworks covering the full system: LLM inference, REST/gRPC microservices, and infrastructure components including PostgreSQL, Redis, message queues, and object storage
Integrate performance suites into CI/CD so every deploy is gated against latency and throughput regressions
Define SLIs/SLOs and build dashboards (e.g., Grafana) that give engineering teams real-time visibility into system health
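
To illustrate what gating a deploy on latency regressions can look like, here is a minimal sketch. It assumes the harness has already collected per-request latencies (in milliseconds) for a baseline build and a candidate build. The function names `percentile` and `gate_passes`, the 95th-percentile choice, and the 10% tolerance are all hypothetical and not taken from our actual tooling.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers (pct in 1..100)."""
    if not samples:
        raise ValueError("samples must be non-empty")
    ordered = sorted(samples)
    # Nearest-rank method: smallest value with at least pct% of samples at or below it.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def gate_passes(baseline_ms, candidate_ms, pct=95, tolerance=0.10):
    """Return True if the candidate's percentile latency stays within
    `tolerance` (a fraction, e.g. 0.10 = 10%) of the baseline's."""
    base = percentile(baseline_ms, pct)
    cand = percentile(candidate_ms, pct)
    return cand <= base * (1 + tolerance)
```

A CI step could call `gate_passes` and fail the pipeline when it returns False. A real gate would likely also account for run-to-run noise, for example by comparing several runs rather than a single pair.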

Characterize VoIP Performance

Measure and track VoIP quality metrics: MOS, jitter, packet loss, latency, echo, and codec fidelity
Build synthetic call load testing infrastructure to stress telephony paths at scale
Correlate audio degradation signals with underlying infrastructure metrics to root-cause issues
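
As an illustration of two of the metrics named above, the sketch below computes the running interarrival jitter estimate defined in RFC 3550 and converts an E-model R-factor to an estimated MOS using the ITU-T G.107 mapping. The function names and the plain-list inputs are assumptions made for the example. A real pipeline would read these values from RTP packet captures or RTCP reports.

```python
def interarrival_jitter(send_times, recv_times):
    """RFC 3550 running jitter estimate.

    send_times and recv_times are per-packet timestamps in the same unit
    (e.g. milliseconds), in arrival order. Returns jitter in that unit.
    """
    jitter = 0.0
    for i in range(1, len(send_times)):
        # Difference in relative transit time between consecutive packets.
        d = (recv_times[i] - recv_times[i - 1]) - (send_times[i] - send_times[i - 1])
        # Exponential smoothing with gain 1/16, as specified in RFC 3550.
        jitter += (abs(d) - jitter) / 16
    return jitter


def r_factor_to_mos(r):
    """Map an E-model R-factor to an estimated MOS (ITU-T G.107)."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    return 1 + 0.035 * r + r * (r - 60) * (100 - r) * 7e-6
```

For example, evenly paced packets that arrive evenly paced yield a jitter of 0, and an R-factor near 93 maps to a MOS of roughly 4.4.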

Drive a Performance Culture

Partner with ML, Speech, Backend, and Infra teams to turn performance findings into prioritized engineering work
Contribute to incident reviews where latency or audio quality was a factor
Write runbooks, share learnings, and help other engineers instrument their own services

What You Bring

Must-Have:

BS in Computer Science or equivalent
10+ years in performance engineering with a solid software development and SRE background
Proven track record building automated performance test harnesses (Locust, k6, Gatling, JMeter, or custom tooling)
Deep hands-on experience with PostgreSQL performance tuning (execution plans, indexing strategies, connection pooling, autovacuum) and Redis (eviction policies, clustering, pipelining)
Solid grasp of distributed systems fundamentals: queueing theory, tail latency, backpressure, cascading failures
Fluency with observability tools: Prometheus, Grafana, CloudWatch, etc.

VoIP / Telephony

Working knowledge of SIP, RTP/RTCP, and RTCP-XR
Hands-on experience with VoIP testing tools (SIPp or equivalent) and interpreting call quality reports
Ability to diagnose audio degradation across network, codec, and application layers
Familiarity with MOS scoring methodologies (PESQ, POLQA, or E-model)

Nice-to-Have:

Experience benchmarking ML inference servers: vLLM, TensorRT-LLM, Triton, or similar
Kubernetes workload profiling and resource right-sizing
Chaos engineering experience: Toxiproxy, Gremlin, or Chaos Monkey
Background in healthcare tech, real-time communications, or other high-reliability, latency-sensitive systems

Join us and help build the future of safe, life-changing AI in healthcare. There’s never been a more exciting moment to make an impact.

Please be aware of recruitment scams impersonating Hippocratic AI. All recruiting communication will come from @hippocraticai.com email addresses. We will never request payment or sensitive personal information during the hiring process.

167 Hamilton Ave, 3rd Floor, Palo Alto, California, United States, 94301
