Judgment Labs

Senior Data Infrastructure Engineer

Reposted 20 Days Ago
In-Office
San Francisco, CA, USA
Senior level
Build and scale real-time data pipelines processing 100k+ traces/sec, run LLM-based scoring and clustering near-real time, optimize LLM serving and ClickHouse OLAP performance, and own infrastructure roadmap from ingestion through analytics.
The summary above was generated by AI

Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). While traditional observability focuses on logging exceptions and latency, our ABM surfaces behavioral anomalies such as instruction drifts and context retrieval loss in scaled production environments.

Hundreds of teams building autonomous agents rely on Judgment to understand how their systems behave post-deployment. Instead of reactive incident triage, they cluster patterns across conversations and workflows, correlate regressions to specific interaction types, and pinpoint where reliability breaks down.

We’ve raised $30M+ across two rounds in the past five months. Our investors include Lightspeed, SV Angel, Valor Equity Partners, Nova Global, Chris Manning, Michael Ovitz, Michael Abbott, Cory Levy, Kevin Hartz, and others.

The Role:

We are looking for a Senior Data Infrastructure Engineer to build and scale the real-time data pipelines that power agent behavior analysis at production scale. This role is crucial for processing hundreds of thousands of traces per second, running LLM-based scoring and clustering in near-real time, and delivering the low-latency query performance that enables teams to understand agent behavior as it happens. We need someone who has built petabyte-scale data systems, knows how to squeeze performance out of OLAP databases, and can own the data infrastructure from ingestion through analytics.

What You'll Do:
  • Design and automate large-scale, high-performance streaming and batch data processing systems to power Judgment's behavioral analysis products.

  • Partner closely with infrastructure and backend teams to improve scalability, data governance, and efficiency.

  • Evangelize high-quality software engineering practices for data infrastructure at scale.

  • Advocate for a high bar on data and engineering quality: reliable, efficient, well-documented, testable, and maintainable.

  • Design data models for optimal storage and access, with thoughtful data flows to power critical product requirements.

  • Optimize OLAP database performance through schema design, partitioning strategies, storage tiering, and access pattern analysis.

What We're Looking For:
  • 6+ years of relevant industry experience building and operating high-throughput, petabyte-scale data pipelines in production.

  • Experience collaborating with infrastructure, backend, and product partners to align on data flow and system design.

  • Experience designing and deploying high-performance systems with reliable monitoring and observability practices.

  • Deep expertise with streaming and batch processing systems (Kafka, Spark, Flink, or Ray) operating at petabyte scale.

  • Hands-on OLAP database engineering experience, including with columnar databases (ClickHouse or similar) and distributed query engines (Presto or similar).

  • Excellent communication skills, both written and verbal.

Nice to have:

  • Experience building pipelines that call LLM APIs at scale: request batching, rate limit management, cost optimization.

  • Familiarity with ML workflow orchestration (Airflow, Dagster, Prefect).

  • Experience with embedding generation pipelines or vector search infrastructure.

  • Background in observability, log processing, or event stream platforms (Datadog, Honeycomb, Sentry).

  • Experience with data quality monitoring and anomaly detection within pipelines.

Why Judgment?
  • Agents can’t work without this. Today’s agents hallucinate, drift, and break in production. We’re building the infrastructure that fixes this: the monitoring layer that makes agents self-improving.

  • We’re wired to win. We’re a team of fewer than 20, but we ship like 50+ on the daily. You'll be working with olympiad medalists, debate champions, and competitive athletes who bring that same intensity to company building.

  • Fast track to founding. Our engineers interface directly with customers, ship code into their environments, and use their feedback to dictate what’s next on the roadmap. Everyone on the team is either an ex-founder or a founder-to-be.

  • We make sure our people do their best work. If you deserve a spot on the team, money will never get in the way of it. Full benefits, Equinox, and a private chef to take care of you. We sprint hard but we play hard; ask us about our Smash/Mario Kart tournaments.

    We work in person in San Francisco.

HQ

Judgment Labs San Francisco, California, USA Office

425 Bush St, San Francisco, California, United States, 94108-3708
