Mercor Logo

Mercor

Data Engineer

Posted 17 Days Ago
In-Office
San Francisco, CA
Mid level
In-Office
San Francisco, CA
Mid level
The Data Engineer will build and maintain data pipelines, ensuring data quality, reliability, and usability for the Data Science and Engineering teams. Responsibilities include processing data from various sources and improving pipeline performance.
The summary above was generated by AI
About Mercor

Mercor is at the intersection of labor markets and AI research. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.

Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $1.5 million a day.

Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society.

Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our new San Francisco headquarters.

About the Role

We’re looking for someone who wants to bring a full-stack perspective to data. As a Software Engineer supporting our Data function, you will be responsible for creating and maintaining pipelines that enable our Data Science, Engineering, and Product teams, and the wider Mercor organization.

Your focus will be on data reliability, availability, and timeliness, with a focus on collaboration (and significant operational crossover with) our Data Science team and the many partner functions.

What You’ll Work On

  • Building robust pipelines to ingest, transform, and consolidate data from diverse sources (e.g., MongoDB, Airtable, PostHog, production databases).

  • Designing dbt models and transformations to standardize and unify many disparate tables into clean, production-ready schemas.

  • Implementing scalable, fault-tolerant data workflows with Fivetran, dbt, SQL, and Python.

  • Partnering with engineers, data scientists, and business stakeholders to ensure data availability, accuracy, and usability.

  • Owning data quality and reliability across the stack, from ingestion through to consumption.

  • Continuously improving pipeline performance, monitoring, and scalability.

What We’re Looking For

  • Proven experience in data engineering, with strong knowledge of SQL, Python, and modern data stack tools (Fivetran, dbt, Snowflake or similar).

  • Experience building and maintaining large-scale ETL/ELT pipelines across heterogeneous sources (databases, analytics platforms, SaaS tools).

  • Strong understanding of data modeling, schema design, and transformation best practices.

  • Familiarity with data governance, monitoring, and quality assurance.

  • Comfort working cross-functionally with engineering, product, and operations teams.

  • Bonus: prior experience supporting machine learning workflows or analytics platforms.

Why Mercor

  • Impact: Your work powers how the world’s leading AI labs train and test their models.

  • Learning: Get early insights into frontier model capabilities months before the market.

  • Growth: Work on both infrastructure and research-adjacent projects with fast paths to ownership.

Benefits

  • Generous equity grant vested over 4 years

  • A $20K relocation bonus (if moving to the Bay Area)

  • A $10K housing bonus (if you live within 0.5 miles of our office)

  • A $1K monthly stipend for meals

  • Free Equinox membership

  • Health insurance

Top Skills

Airtable
Dbt
Fivetran
MongoDB
Posthog
Python
Snowflake
SQL
HQ

Mercor San Francisco, California, USA Office

San Francisco, California , United States, 94105

Similar Jobs

6 Days Ago
Hybrid
San Francisco, CA, USA
120K-120K Annually
Internship
120K-120K Annually
Internship
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Data Engineer I role focuses on building scalable data platforms to create personalized experiences, requiring strong Python and PySpark skills.
Top Skills: MS OfficePysparkPython
13 Days Ago
Hybrid
San Francisco, CA, USA
150K-177K Annually
Mid level
150K-177K Annually
Mid level
Artificial Intelligence • Productivity • Software
The Data Engineer will build datasets and pipelines for marketing and sales reporting, collaborating with various teams to support data needs and ensure scalable solutions.
Top Skills: AWSAzureGCPHiveJavaNoSQLPythonRedshiftScalaSnowflakeSQL
18 Days Ago
Hybrid
San Francisco, CA, USA
170K-284K Annually
Senior level
170K-284K Annually
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
The Principal Data Engineer will architect and lead the development of a Multi-Agent ETL Platform, modernize data platforms, implement data pipelines, and drive innovation with AI integrations.
Top Skills: Apache AirflowSparkAutogenAws GlueAzure Data FactoryBigQueryClouderaCrewaiDagsterDbtFlinkHadoopKafkaLangchainLlmsNifiOraclePythonSnowflakeSQL

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account