Mercor Logo

Mercor

Data Engineer

Reposted 7 Days Ago
In-Office
San Francisco, CA
Mid level
In-Office
San Francisco, CA
Mid level
The Data Engineer will build and maintain data pipelines, ensuring data quality, reliability, and usability for the Data Science and Engineering teams. Responsibilities include processing data from various sources and improving pipeline performance.
The summary above was generated by AI
About Mercor

Mercor is at the intersection of labor markets and AI research. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.

Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $2 million a day.

Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society.

Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our new San Francisco headquarters.

About the Role

We’re looking for someone who wants to bring a full-stack perspective to data. As a Software Engineer supporting our Data function, you will be responsible for creating and maintaining pipelines that enable our Data Science, Engineering, and Product teams, and the wider Mercor organization.

Your focus will be on data reliability, availability, and timeliness, with a focus on collaboration (and significant operational crossover with) our Data Science team and the many partner functions.

What You’ll Work On

  • Building robust pipelines to ingest, transform, and consolidate data from diverse sources (e.g., MongoDB, Airtable, PostHog, production databases).

  • Designing dbt models and transformations to standardize and unify many disparate tables into clean, production-ready schemas.

  • Implementing scalable, fault-tolerant data workflows with Fivetran, dbt, SQL, and Python.

  • Partnering with engineers, data scientists, and business stakeholders to ensure data availability, accuracy, and usability.

  • Owning data quality and reliability across the stack, from ingestion through to consumption.

  • Continuously improving pipeline performance, monitoring, and scalability.

What We’re Looking For

  • Proven experience in data engineering, with strong knowledge of SQL, Python, and modern data stack tools (Fivetran, dbt, Snowflake or similar).

  • Experience building and maintaining large-scale ETL/ELT pipelines across heterogeneous sources (databases, analytics platforms, SaaS tools).

  • Strong understanding of data modeling, schema design, and transformation best practices.

  • Familiarity with data governance, monitoring, and quality assurance.

  • Comfort working cross-functionally with engineering, product, and operations teams.

  • Bonus: prior experience supporting machine learning workflows or analytics platforms.

Why Mercor

  • Impact: Your work powers how the world’s leading AI labs train and test their models.

  • Learning: Get early insights into frontier model capabilities months before the market.

  • Growth: Work on both infrastructure and research-adjacent projects with fast paths to ownership.

Benefits

  • Generous equity grant vested over 4 years

  • A $10K housing bonus (if you live within 0.5 miles of our office)

  • A $1K monthly stipend for meals

  • Free Equinox membership

  • Health insurance

Top Skills

Airtable
Dbt
Fivetran
MongoDB
Posthog
Python
Snowflake
SQL
HQ

Mercor San Francisco, California, USA Office

San Francisco, California , United States, 94105

Similar Jobs

Yesterday
Hybrid
Los Angeles, CA, USA
Internship
Internship
AdTech • Consumer Web • Digital Media • eCommerce • Insurance • Marketing Tech • SEO
As a Data Engineer Intern, you'll develop and maintain data pipelines, support ETL processes, analyze data, and complete a capstone project, while learning from experienced mentors.
Top Skills: ETLPythonSQL
Yesterday
Easy Apply
Remote or Hybrid
USA
Easy Apply
175K-225K Annually
Mid level
175K-225K Annually
Mid level
Fintech • Information Technology • Software • Financial Services
The Data Engineer will build real-time data pipelines for pricing algorithms, collaborate with teams, and contribute to batch data workflows.
Top Skills: Cloud-Based Distributed Data InfrastructureFlinkKafkaPythonSQL
Yesterday
Easy Apply
Remote or Hybrid
United States
Easy Apply
102K-154K Annually
Mid level
102K-154K Annually
Mid level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
As a Data Engineer II, you will build and optimize data pipelines in Databricks, ensure high-quality data delivery, and support generative AI applications. You will collaborate with data scientists and AI engineers to provide reliable data infrastructures and meet business needs.
Top Skills: DatabricksDbtGainsightGongOutreachPythonRdsRedshiftSalesforceSnowflakeSparkSQL

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account