CTGT Logo

CTGT

Machine Learning Engineer: LLM Interpretability & Systems

Posted 18 Days Ago
In-Office
San Francisco, CA, USA
175K-250K Annually
Senior level
In-Office
San Francisco, CA, USA
175K-250K Annually
Senior level
Build production systems that probe and modify LLM internals to improve reliability and enforce deterministic policy. Implement mechanistic-interpretability techniques (activation patching, control vectors), work with weights/activations, build evaluation and deployment loops, and design feature-level intervention systems for enterprise inference.
The summary above was generated by AI
About CTGT & The Mission

Despite massive investment in commercial AI, organizations often find that demonstrated value is elusive, primarily due to the non-deterministic risk inherent to generative models. CTGT is the deterministic governance layer that enables the most important global institutions to deploy AI workflows with confidence.

Born out of Stanford University research, we provide the control plane that makes it possible. A lightweight, model-agnostic system that enforces policy, prevents drift, and produces auditable decisions in real time.

While we sit on the edge of AI research, CTGT brings frontier intelligence into real-world environments. We apply cutting-edge theory directly in production to make large language models more reliable, controllable, and performant in practice.

Our mission is to bring models to the level of performance and accountability required by the Fortune 500. By bridging the gap between LLM capabilities and domain-specific requirements, we unlock the true potential of generative AI to solve the most pressing problems in our world today.

The Role

A new open-source model is released and you are compelled to reach inside and understand how it actually works. You instinctively try to push it beyond what most people say is already impressive. You observe model behavior and don’t think, “What’s a better prompt?”, but “How do I improve its fundamentals?”

CTGT’s Senior Machine Learning Engineer will operate deep within the model stack, working directly with weights, activations, and architectures to build the systems that make AI governance deterministic. Your work powers the Policy Engine, the core technology that gives enterprises real-time, auditable control over model behavior in production. Your mandate is ostensibly simple but difficult in execution: determine how a model can be improved for a specific purpose and build the systems that operationalize that within our platform.

As opposed to simply using models, you will probe the mechanics of their cognition.

What You Will Do
  • Take ideas from mechanistic interpretability and related work and turn them into code that runs in production, making research into reality.

  • Work directly with model internals to improve behavior and performance across commercial and open-source models.

  • Leverage techniques like activation patching, control vectors, and feature extraction to achieve targeted, repeatable improvements in model output.

  • Build the evaluation and deployment loops needed to ship changes reliably into enterprise environments.

  • Design and optimize the feature-level intervention systems that enable deterministic policy enforcement at inference time.

Who You Are
  • Strong understanding of Transformer architectures, PyTorch internals, and the mathematical foundations of deep learning.

  • Have trained, fine-tuned, or optimized models beyond superficial augmentation.

  • Can read a paper, decide what matters, and implement it.

  • Notice when something is not working and take ownership of fixing it.

  • Motivated by the challenge of making large language models reliable and controllable enough for the highest-stakes enterprise applications.

Our Stack
  • Languages: Python, Rust, and Node/TypeScript, with React on the frontend

  • Data: Postgresql, vector, and graph databases

  • Infra: Docker, Kubernetes, Terraform, across several cloud providers and customer VPCs

  • ML: Self hosted models on multiple GPU providers and frontier APIs

What We Offer

Compensation & Equity: Competitive base compensation, plus significant equity in a venture-backed company with institutional investors including Google’s Gradient Ventures, General Catalyst, and Y Combinator. We want people who think and act like owners.

Real Impact: You will work directly on the core systems that determine how models perform in the wild. Your work ships into real, high-stakes environments where governance, auditability, and performance are non-negotiable.

Autonomy & Trust: We operate with a high degree of trust. You are expected to form strong technical opinions and execute on them.

Similar Jobs

An Hour Ago
In-Office
San Jose, CA, USA
177K-301K Annually
Senior level
177K-301K Annually
Senior level
Artificial Intelligence • Hardware • Information Technology • Machine Learning
Lead development and maintenance of advanced CAD tools and methodologies for semiconductor memory. Establish CAD build standards, integrate solutions with cross-functional teams, benchmark against industry, mentor engineers, and coordinate continuous improvement of CAD tools to support DRAM/NAND device development.
Top Skills: C++CadDramNandPythonSemiconductor Processes
An Hour Ago
In-Office
San Jose, CA, USA
141K-319K Annually
Senior level
141K-319K Annually
Senior level
Artificial Intelligence • Hardware • Information Technology • Machine Learning
Lead account strategy and business development for cloud/hyperscale customers; influence product roadmaps across DDR/LPDDR/HBM; analyze market and segment data; define TAM/SOM; document product requirements; and partner with sales, engineering, operations, and supply chain to drive adoption and market share.
Top Skills: Data Center DesignDdrHbmLpddrServer Architecture
An Hour Ago
In-Office
San Jose, CA, USA
46K-106K Hourly
Senior level
46K-106K Hourly
Senior level
Artificial Intelligence • Hardware • Information Technology • Machine Learning
Develop and prepare multi-dimensional semiconductor layouts from schematics, lead major block developments, verify data integrity, debug physical verification issues, improve layout efficiency via automation, and collaborate with global design, verification, and CAD teams to meet scheduled deadlines.
Top Skills: Agentic AiCadence VirtuosoCalibreSynopsys

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account