Build and own core SQL and Python pipelines, design golden datasets at scale, ensure data quality and performance, and collaborate cross-functionally to enable data-driven decisions.
We believe that the way people interact with their finances will drastically improve in the next few years. We’re dedicated to empowering this transformation by building the tools and experiences that thousands of developers use to create their own products. Plaid powers the tools millions of people rely on to live a healthier financial life. We work with thousands of companies like Venmo, SoFi, several of the Fortune 500, and many of the largest banks to make it easy for people to connect their financial accounts to the apps and services they want to use. Plaid’s network covers 12,000 financial institutions across the US, Canada, UK and Europe. Founded in 2013, the company is headquartered in San Francisco with offices in New York, Washington D.C., London and Amsterdam.
The main goal of the DE team in 2024-25 is to build robust golden data sets to power our business goals of creating more insights based products. Making data-driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while maintaining correct and complete data. We provide tooling and guidance to teams across engineering, product, and business and help them explore our data quickly and safely to get the data insights they need, which ultimately helps Plaid serve our customers more effectively. Data Engineers heavily leverage SQL and Python to build data workflows. We use tools like DBT, Airflow, Redshift, ElasticSearch, Atlanta, and Retool to orchestrate data pipelines and define workflows. We work with engineers, product managers, business intelligence, data analysts, and many other teams to build Plaid's data strategy and a data-first mindset. Our engineering culture is IC-driven -- we favor bottom-up ideation and empowerment of our incredibly talented team. We are looking for engineers who are motivated by creating impact for our consumers and customers, growing together as a team, shipping the MVP, and leaving things better than we found them.
You will be in a high impact role that will directly enable business leaders to make faster and more informed business judgements based on the datasets you build. You will have the opportunity to carve out the ownership and scope of internal datasets and visualizations across Plaid which is a currently unowned area that we intend to take over and build SLAs on. You will have the opportunity to learn best practices and up-level your technical skills from our strong DE team and from the broader Data Platform team. You will collaborate with and have strong and cross functional partnerships with literally all teams at Plaid from Engineering to Product to Marketing/Finance etc.
Responsibilities
- Understanding different aspects of the Plaid product and strategy to inform golden dataset choices, design and data usage principles.
- Have data quality and performance top of mind while designing datasets
- Advocating for adopting industry tools and practices at the right time
- Owning core SQL and Python data pipelines that power our data lake and data warehouse
- Well-documented data with defined dataset quality, uptime, and usefulness.
Qualifications
- 2+ years of dedicated data engineering experience, solving complex data pipeline issues at scale.
- You have experience building data models and data pipelines on top of large datasets (in the order of 500TB to petabytes)
- You value SQL as a flexible and extensible tool and are comfortable with modern SQL data orchestration tools like DBT, Mode, and Airflow.
- [Nice to have] You have experience working with different performant warehouses and data lakes; Redshift, Snowflake, Databricks
- [Nice to have] You have experience building and maintaining batch and real-time pipelines using technologies like Spark, Kafka.
Our mission at Plaid is to unlock financial freedom for everyone. To support that mission, we seek to build a diverse team of driven individuals who care deeply about making the financial ecosystem more equitable. We recognize that strong qualifications can come from both prior work experiences and lived experiences. We encourage you to apply to a role even if your experience doesn't fully match the job description. We are always looking for team members that will bring something unique to Plaid!
Plaid is proud to be an equal opportunity employer and values diversity at our company. We do not discriminate based on race, color, national origin, ethnicity, religion or religious belief, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, military or veteran status, disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local laws. Plaid is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance with your application or interviews due to a disability, please let us know at [email protected].
Please review our Candidate Privacy Notice here.
Top Skills
Airflow
Atlanta
Databricks
Dbt
Elasticsearch
Kafka
Mode
Python
Redshift
Retool
Snowflake
Spark
SQL
Plaid San Francisco, California, USA Office
San Francisco, CA, United States, 94105
Similar Jobs
Food • Software
The Senior Data Engineer handles ChowNow's data platform, collaborating with teams to enhance data availability and insights, supporting internal and customer-facing products.
Top Skills:
AWSDbtPythonSnowflakeSQL
Fintech • Information Technology • Financial Services
Design, build, and support data-centric integration services for portfolio construction and reporting, ensuring reliability and scalability across APIs and data pipelines.
Top Skills:
Aws SqsGraphQLGrpcKubernetesPythonRestful ServicesSQL
Fintech • Software
Design, build, and own robust batch and real-time data pipelines and golden datasets using SQL and Python. Ensure data quality, performance, schema design, and SLAs while partnering across engineering, product, and business teams. Advocate tooling and best practices, document datasets, and deliver reliable analytics infrastructure at petabyte scale.
Top Skills:
AirflowAtlantaDatabricksDbtElasticsearchKafkaModePythonRedshiftRetoolSnowflakeSparkSQL
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

.png)
