Baselayer

Data Engineer

Posted 10 Days Ago
In-Office
San Francisco, CA, USA
122K-167K Annually
Junior
The Data Engineer will design and maintain data pipelines while ensuring reliability and data quality for analytics and machine learning. The role requires close collaboration with product and engineering teams and involves optimizing data workflows.
The summary above was generated by AI

About Baselayer
Trusted by 2,200+ financial institutions, Baselayer is the intelligent business identity platform that helps verify any business, automate KYB, and monitor real-time risk. Baselayer’s B2B risk solutions and identity graph network leverage state and federal government filings and proprietary data sources to prevent fraud, accelerate onboarding, and lower credit losses.


About the Role
We are looking for a Data Engineer to build and scale Baselayer’s data infrastructure. You will own the pipelines and data systems that power analytics, reporting, and machine learning across the company, with a focus on reliability, performance, and data quality.

This role is hands-on and highly cross-functional. You will work closely with Product and Engineering to ensure data is accessible, trusted, and delivered in a way that supports product capabilities in a regulated environment.


What You’ll Do

  • Design, build, and maintain scalable data pipelines that ingest, clean, validate, and transform data from internal systems and external sources

  • Own data reliability and quality through monitoring, alerting, lineage, and validation frameworks

  • Build and maintain data models and curated datasets that support analytics, dashboards, customer reporting, and downstream ML use cases

  • Partner with Engineering to define best practices for data architecture, storage, access controls, and performance

  • Implement orchestration and scheduling for batch and near-real-time workflows as needed

  • Optimize pipeline performance, cost, and scalability as data volumes grow

  • Develop and maintain documentation and runbooks for pipelines, datasets, and operational procedures

  • Identify data gaps and instrumentation needs, and work with engineering teams to improve event capture and logging


About You
You want to learn fast, take ownership, and do work that matters. You are not just doing this for the win. You are doing it because you have something to prove and want to be great.

You thrive in the details, care about correctness, and take pride in building robust systems that other teams can rely on. You operate with urgency, handle ambiguity well, and consistently raise the bar on data quality and reliability.


Required Experience and Skills

  • 1 to 3 years of experience in data engineering, analytics engineering, or backend engineering with significant data pipeline ownership

  • Strong Python skills and experience building production-grade data workflows

  • Strong SQL skills with experience designing data models and transforming large datasets

  • Experience building and maintaining ETL or ELT pipelines and working with data warehouses or analytics databases

  • Familiarity with orchestration tools and workflow scheduling (for example Airflow, Dagster, Prefect, or similar)

  • Strong understanding of data quality, testing, observability, and operational best practices

  • Comfort working with large-scale datasets and troubleshooting performance issues

  • Ability to communicate clearly with technical and non-technical stakeholders


What Sets You Apart

  • Experience working with identity, fraud, risk, compliance, or other regulated datasets

  • Experience integrating with external data sources, APIs, and government or registry data

  • Familiarity with streaming or near-real-time data patterns

  • Highly feedback-oriented with a desire for continuous improvement


Work Location

  • Hybrid in San Francisco, with 3 days per week in the office

Compensation and Benefits

  • Salary range of $122,000 to $167,000

  • Equity package

  • Unlimited vacation

  • Comprehensive health coverage

  • 401(k) with company match

Top Skills

Airflow
Dagster
ELT
ETL
Prefect
Python
SQL

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms, and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food, and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
