Hilbert's AI Logo

Hilbert's AI

Data Engineer (Core Platform & Optimization)

Posted 8 Days Ago
Hybrid
San Francisco, CA
Senior level
Hybrid
San Francisco, CA
Senior level
Build and optimize core data platform: improve ClickHouse performance, create configurable ingestion schemas, develop shared Python libraries, design integration pipelines, and architect monitoring/observability for reliable, production-grade data delivery.
The summary above was generated by AI
The Mission: Build the High-Performance Refinery

At Hilbert’s AI, we shorten "months" to "minutes." To achieve this, our internal data systems must be more than just pipelines—they must be a high-velocity refinery. We are looking for a Core Data Engineer to build the foundation that allows our integration team to move 10x faster.

While others focus on connecting the pipes for specific customers, you focus on the integrity, flow, and scalability of the entire system. You will own our "shared brain": the common code, the optimization of our ClickHouse clusters, and the AI-driven engines that turn manual integration into automated ingestion.

The Hard Problem: Abstracting the Chaos

Our challenge is building a "Generic Growth Engine" that stays performant even as we scale to hundreds of diverse B2C retailers. We don't want to write custom code for every new data source; we want to build a meta-platform where new integrations are defined by configuration and generated by AI.

Our Current Hurdles:
  • The Programmable Refinery: You aren't just building pipelines; you are building the infrastructure that our Discovery Agent uses to generate pipelines. You’ll design the configuration schemas that allow AI to scaffold Dagster orchestrations automatically.

  • OLAP at Scale: We rely on Clickhouse for sub-second analytical queries. You’ll be responsible for shard tuning and ensuring our canonical models are structured for maximum performance when queried by AI agents.

  • The "Shared Brain": You will architect the core Python libraries that manage sync logic and automated monitoring, providing the "Source of Truth" that our internal agents learn from.

Who You Are (The Profile)
  • The Optimizer: You are the person who finds the bottleneck in a query or a memory leak in a container. You think in terms of complexity classes and system reliability.

  • The Front-Line Veteran: You have empathy for the integration struggle. You understand that a platform is only as good as its usability in the field. Note: Our Core engineers start in a Forward Deployed capacity to earn the "battle scars" necessary to build truly useful abstractions.

  • Software Rigorist: You treat data engineering like software engineering. Modular code, CI/CD, and unit testing for data logic aren't "nice to haves", they are your baseline.

  • The Meta-Engineer: You enjoy building systems that build other systems. You see AI-generated pipelines as an engineering challenge in reproducibility and safety.

  • Dagster Ninja: You don’t just use orchestrators; you push them to their limits. You understand how to build dynamic, asset-based workflows that scale.

What You’ll Own
  • Core Engine & Agent Frameworks: Build the shared libraries and configuration schemas that power Hilbert’s data integrations and the Discovery Agent’s output logic.

  • ClickHouse Mastery: Take full ownership of our ClickHouse performance, optimizing storage, query speed, ingestion patterns, and how agents query tables on-demand.

  • Observability Architecture: Design the monitoring and alerting stack that detects data drift or pipeline failures before they impact the customer.

  • R&D Ingestion: Investigate new data sources and sync mechanisms (CDC, streaming, API patterns) to expand our integration capabilities.

Path from Integration-to-Core

We believe that the best platform tools are built by those who have felt the pain of the user. At Hilbert, we offer an Integration-to-Core pipeline.

  • Phase 1 (The Front Line): You can start as a Forward Deployed Engineer, mastering our stack while solving high-stakes enterprise data problems.

  • Phase 2 (The Architect): Once you've mastered the nuances of the data, you can transition into the Core team to turn those learnings into the foundational code that powers the entire company.

Bonus Points
  • Experience building a data orchestration or integration engine from scratch.

  • Deep understanding of E-commerce/Retail data structures (SKUs, Attribution, LTV).

  • Contribution to open-source data tools (Dagster, Airbyte, ClickHouse, etc.).

  • Experience working alongside ML Engineers to optimize data for model training/inference.

  • Having built an agentic workflow before.

Location

San Francisco, or Istanbul

At least 5 hours overlap with PST timezone (7am-5pm)

Compensation

Competitive salary + equity package, commensurate with experience.

Performance-based bonuses tied to project milestones and customer impact.

The Hiring Journey

Short form → Intro Call → Technical working session → Team conversations → Offer

Fast, human, no bureaucracy.

Top Skills

Ci/Cd
Clickhouse
Dagster
Druid
Monitoring
Observability
Pinot
Python
Unit Testing
HQ

Hilbert's AI San Francisco, California, USA Office

San Francisco, CA, United States

Similar Jobs

12 Minutes Ago
Hybrid
2 Locations
100K-150K Annually
Senior level
100K-150K Annually
Senior level
Artificial Intelligence • Semiconductor
Design, develop, and deploy UEFI-based firmware for hyperscale AI server platforms, ensure platform integration, and implement security features.
Top Skills: C/C++Ci/CdEdk IiTianocoreUefi
25 Minutes Ago
Remote or Hybrid
United States
21-28 Hourly
Junior
21-28 Hourly
Junior
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Assist customers in utilizing iSolved HCM software through phone, video calls, and emails, providing support and resolving issues effectively.
Top Skills: Isolved HcmMS Office
25 Minutes Ago
Remote or Hybrid
United States
88K-118K Annually
Senior level
88K-118K Annually
Senior level
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
The Procore Implementation Consultant leads Procore implementations, designs client-specific solutions, and mentors junior consultants while managing projects for clients in construction and real estate.
Top Skills: Power BIProcore

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account