Judgment Labs

Senior Data Infrastructure Engineer

Posted 3 Days Ago
In-Office
San Francisco, CA
Senior level
Build and scale real-time data pipelines processing 100k+ traces/sec, run LLM-based scoring and clustering in near-real time, optimize LLM serving and ClickHouse OLAP performance, and own the infrastructure roadmap from ingestion through analytics.
The summary above was generated by AI

Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). While traditional observability focuses on logging exceptions and latency, our ABM surfaces behavioral anomalies such as instruction drift and context retrieval loss in scaled production environments.

Hundreds of teams building autonomous agents rely on Judgment to understand how their systems are behaving post-deployment. Instead of reactive incident triage, they cluster patterns across conversations and workflows, correlate regressions to specific interaction types, and pinpoint where reliability breaks down in their usage context.

We’ve raised $30M+ across two rounds in the past five months. Our investors include Lightspeed, SV Angel, Valor Equity Partners, Nova Global, Chris Manning, Michael Ovitz, Michael Abbott, Cory Levy, Kevin Hartz, and others.

The Role:

We are looking for a Senior Data Infrastructure Engineer to build and scale the real-time data pipelines that power agent behavior analysis at production scale. This role is crucial for processing hundreds of thousands of traces per second, running LLM-based scoring and clustering in near-real time, and delivering the low-latency query performance that enables teams to understand agent behavior as it happens. We need someone who has built petabyte-scale data systems, knows how to squeeze performance out of OLAP databases, and can own the data infrastructure from ingestion through analytics.

What You'll Do:
  • Design the streaming pipeline that scores and clusters a 100k+ traces/s workload using LLM APIs in near-real time (Kafka + Spark/Ray).

  • Identify LLM API serving bottlenecks using flamegraphs, and raise RPS via smart batching/streaming, adaptive concurrency, and connection pooling.

  • Speed up ClickHouse queries and reduce p95/p99 query latency with better schemas/partitions, projections/materialized views, and tiered storage.

What We're Looking For:
  • Experience building and tuning high-throughput, petabyte-scale data pipelines

  • Deep knowledge of data infrastructure (Apache Spark, Ray, dbt, Airflow/Dagster)

  • Experience with OLAP database engineering

  • Comfortable with cloud infrastructure and batch + streaming pipelines

  • Senior-level ownership: you will own the infrastructure roadmap and architecture design, set practices, identify bottlenecks, and ship fixes.

Nice to have:

  • Experience working with LLM inference and serving optimization techniques such as:

    • Speculative decoding

    • Continuous batching and dynamic batching strategies

    • KV cache optimization and management

    • Quantization techniques (INT8, INT4) for reduced memory footprint

    • Multi-GPU serving and tensor parallelism

Target Profile:

  • Senior+ Infrastructure Engineer from an observability company (Datadog/Sentry/Honeycomb), a trading company, RecSys/ML big tech (Netflix/Google/Meta), or an AI lab.

Why Judgment?
  • Agents can’t work without this. Today’s agents hallucinate, drift, and break in production. We’re building the infrastructure that fixes this: the monitoring layer that makes agents self-improving.

  • We’re wired to win. We’re a team of fewer than 20, but we ship like 50+ on the daily. You’ll be working with olympiad medalists, debate champions, and competitive athletes who bring that same intensity to company building.

  • Fast track to founding. Our engineers interface directly with customers, ship code into their environments, and use their feedback to dictate what’s next on the roadmap. Everyone on the team is either an ex-founder or a founder-to-be.

  • We make sure our people do their best work. If you deserve a spot on the team, money will never get in the way of it. Full benefits, Equinox, and a private chef to take care of you. We sprint hard but we play hard. Ask us about our Smash/Mario Kart tournaments.

    We work in person in San Francisco.

Top Skills

Kafka, Apache Spark, Ray, ClickHouse, dbt, Airflow, Dagster, LLM APIs, OLAP Databases, Flamegraphs, Connection Pooling, Speculative Decoding, Continuous Batching, Dynamic Batching, KV Cache, INT8 Quantization, INT4 Quantization, Multi-GPU Serving, Tensor Parallelism
HQ

Judgment Labs San Francisco, California, USA Office

425 Bush St, San Francisco, California, United States, 94108-3708
