The Generative AI Engineer will develop AI-driven features for medical records, collaborating with teams to create systems extracting and generating clinical insights from EHR data, while ensuring accuracy and clinical relevance.
As a Generative AI Engineer at Regard, you’ll work across the full lifecycle of developing and deploying AI-driven features, from ideation and design to prototyping, implementation, evaluation, and iteration. You’ll collaborate closely with product and clinical teams to build systems that transform medical records into structured insights and clinician-ready documentation.
Your work will center on applying modern LLMs to extract, summarize, normalize, and generate clinical information from diverse electronic health record (EHR) data sources. This includes developing robust pipelines, running model and prompt-engineering experiments, integrating models into production services, and ensuring outputs remain factual, safe, and clinically aligned.
You’ll directly contribute to high-priority product initiatives, shape new AI capabilities, and advance our LLM platform. Your work will have a tangible impact on how clinicians understand patient data and how healthcare organizations improve care quality.
About Regard
Our mission is to bring world-class healthcare to everyone. Regard is an AI-powered Proactive Documentation platform that advances how care is delivered by reviewing all patient data in the EHR to recommend diagnoses and surface clinical evidence. Regard drafts a note even before the physician sees the patient, enabling an approach that gets documentation right at the point of care - we call it Proactive Documentation. This improves quality of care, reduces physician burden, and improves hospital finances. We are excited by challenges, mission-oriented work, and meaningful relationships. We work closely with some of the top health systems in the country and are leading the change that healthcare - one of the largest and most inefficient industries in the world - needs. We want you to join us.
Our Tech Stack:
- Frontend: TypeScript, React
- Backend: Python, PostgreSQL, Redis, AWS
- AI: OpenAI, Anthropic, Langfuse
Responsibilities:
- Build and refine LLM-powered systems to extract structured medical concepts, diagnoses, medications, labs, and timelines from unstructured records
- Develop generation pipelines that produce clinically accurate drafts of notes (H&P, progress notes, discharge summaries, etc.) from factual inputs
- Design, prototype, and evaluate prompts, agent workflows, and retrieval-augmented generation (RAG) components
- Benchmark LLM systems to evaluate new models and audit accuracy
- Optimize inference cost, latency, and throughput through batching, caching, and model-selection strategies
Qualifications:
- BS in Computer Science or equivalent experience
- 3+ years of professional experience with software development in one or more programming languages (Python preferred)
- 1+ years of professional experience building generative AI products, such as RAGs, agents and chatbots
- Able to participate in on-call operational support for their areas of responsibility
- Able to travel up to 4 weeks a year for company co-working and/or retreat weeks
- Strong verbal and written communication skills
- Practical experience leveraging AI coding tools in day-to-day software development (e.g., Cursor, Claude Code, Codex)
Preferred Qualifications:
- Familiarity with vector databases and embeddings generation
- Experience working on a mature enterprise SaaS technology product
- Exposure to startup and/or high growth environments
Hybrid Work | Location | Work Authorization
- For this role, Regard is currently only considering candidates who are authorized to work in the US without visa sponsorship, and are within the New York City metro area, San Francisco Bay Area, or Los Angeles
- We expect our Engineers to be in the office on Tuesdays and Thursdays. We may request more frequent in-office work during the onboarding period
- We will provide relocation assistance to anyone who does not already reside in the NYC metro area
- We prefer hiring people within commuting distance of our offices because we value getting together in person regularly
- For those who enjoy working from the office on a more regular basis, we offer catered lunches and other fun perks
- Additionally, hybrid employees have the flexibility to work from locations outside of their home office from up to 6 weeks per year
Comp | Perks | Benefits
- Eligible for equity
- 99% employer paid health benefits (Medical, Dental, and Vision) + One Medical subscription
- 18 PTO days/yr + 1 week holiday break
- Monthly health & wellness budget
- Company-sponsored team retreat + social events
- A sabbatical program
Our goal at Regard is to provide and maintain a work environment that fosters mutual respect, professionalism and cooperation. Regard is proud to be an equal opportunity employer that does not discriminate on the basis of actual or perceived race, creed, color, religion, national origin, ancestry, alienage or citizenship status, age, disability or handicap, sex, gender identity, marital status, familial status, veteran status, sexual orientation or any other characteristic protected by applicable federal, state or local laws. We celebrate diversity and are proud of our supportive, inclusive workplace.
All candidates must successfully complete a background check as part of the hiring process.
Top Skills
Anthropic
AWS
Langfuse
Openai
Postgres
Python
React
Redis
Typescript
Similar Jobs
Big Data • Cloud • Digital Media • Machine Learning • Mobile • Software • Industrial
Lead the development of generative AI tools and scalable data pipelines for the AEC industry, mentoring engineers and collaborating on ML systems.
Top Skills:
AWSPythonPyTorch
Fintech • Financial Services
The role involves designing and building reliable software, partnering with teams, ensuring compliance, and innovating at scale in the Generative AI platform.
Top Skills:
AICi/CdClojureElixirKubernetesLlmsMlMlflowMlopsPythonRagRestfulRustScalaW&B
Gaming • Mobile • Software
The role focuses on machine learning, especially in video and image embedding, and improving user features using ML recommendations. Requires expertise in GCP and video production metrics.
Top Skills:
AirflowBigQueryC#C++CircleCIDockerElectronGCPGithub ActionsJavaKotlinKubeflow PipelinesKubernetesOpencvPyTorchRabbitMQReactRedisReduxSaltSwiftTerraform
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

.png)

