Design and build scalable data lakes, warehouses, and lakehouses; implement Python ETL/ELT pipelines and Airflow orchestration; ingest from third-party APIs; optimize columnar file formats and SQL performance; support ML initiatives; consult with stakeholders to translate business goals into data architecture roadmaps; work with Spark, Snowflake, and cloud infrastructure (AWS/GCP).
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.
WHY JOIN US
If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you!
ABOUT THE ROLE
We are looking for a Senior Data Engineer to design and build scalable data lakes, warehouses, and lakehouse architectures supporting a thematic research platform that processes large volumes of financial data daily. You will implement Python-based ETL/ELT pipelines, orchestrate workflows with Airflow, develop ingestion workflows from third-party APIs, and work with Snowflake, Spark, and AWS to deliver high-performance data infrastructure. The role combines hands-on engineering with technical consulting responsibilities, translating business goals into data architecture roadmaps.
WHAT YOU WILL DO
- Design and implement Python data engineering solutions;
- Design and build scalable Data Lakes, Data Warehouses, and Data Lakehouses;
- Design and implement robust ETL/ELT processes at scale using Python, incorporating modern pipeline orchestration tools like Airflow;
- Develop sophisticated ingestion workflows from diverse third-party APIs and data sources;
- Manage and optimize various file formats (Parquet, Avro, ORC) and columnar storage to ensure high-performance data retrieval;
- Work with AI development tools to support and accelerate ongoing development, machine learning initiatives, and advanced analytics;
- Act as a technical consultant for stakeholders and leadership to gather requirements, understand business goals, and translate them into technical roadmaps;
- Work with Terraform and other tools to build AWS and on-prem infrastructure.
MUST HAVES
- You must be authorized to work for ANY employer in the US (e.g., Green card holders, TN visa holders, GC EAD, H4 EAD, U4U with EAD), as we are unable to sponsor or take over employment visa sponsorship at this time;
- Bachelor’s degree in computer science, engineering, or another technical field, or equivalent experience;
- 5+ years of experience with Python;
- 5+ years of experience with data processing, manipulation, and analytics libraries such as Pandas, Polars, PySpark, or DuckDB;
- 2+ years of experience with Big Data technologies (Spark, Snowflake);
- Expert-level knowledge of pipeline orchestration using Airflow or similar industry-standard tools;
- Deep understanding of Medallion Architecture, columnar file formats, and diverse database technologies (SQL, NoSQL, and Lakehouse architectures);
- Proven ability to work with third-party APIs for complex data ingestion tasks;
- Proficiency with modern cloud platforms (AWS, GCP, Snowflake) and advanced SQL optimization;
- Exceptional soft skills with a proven ability to gather requirements from leadership and collaborate effectively across cross-functional teams;
- Excellence in optimizing complex data pipelines and troubleshooting data latency or consistency issues in massive datasets;
- A self-starter mindset, regularly investigating more efficient data architectures and AI development tools to improve pipeline performance;
- Pride in data integrity and in the accuracy of the end-to-end pipelines and architectures you build;
- Strong communication skills for seamless global collaboration with stakeholders and distributed teams;
- Upper-intermediate English level.
NICE TO HAVES
- Familiarity with the fintech industry, understanding of financial data, regulatory requirements, and business processes specific to the domain;
- Strong documentation skills covering data pipelines, architecture designs, and best practices for knowledge sharing and future reference;
- OpenSearch, Elasticsearch;
- AWS SageMaker Studio and Jupyter for data analysis;
- Terraform;
- Scala.
PERKS AND BENEFITS
- Professional growth: Mentorship, TechTalks, and personalized growth roadmaps.
- Competitive compensation: USD-based pay with education, fitness, and team activity budgets.
- Exciting projects: Modern solutions with Fortune 500 and top product companies.
- Flextime: Flexible schedule with remote and office options.