Design and implement AWS-based data architectures and lakehouse solutions, lead data engineering teams, build ETL and real-time pipelines (Kafka/Kinesis), use DBT and Airbyte for ingestion and transformation, orchestrate workflows with Airflow/MWAA, enforce security/compliance, and guide infra automation with Terraform/CloudFormation and CI/CD.
Title – AWS Data Architect
Job Summary
We are seeking a skilled AWS Data Architect for one of our clients; a leading international sports league. The Architect will play a crucial role in designing, implementing and maintaining data and technology solutions that align with client’s business goals and objectives. This role requires a deep understanding of AWS data services, good understanding of AWS Infra & Ops services, and the ability to translate business requirements into scalable and efficient solutions.
ResponsibilitiesKey Responsibilities
Data Architecture and Cloud Strategy:
- Develop and maintain a comprehensive data architecture and cloud strategy that aligns with the organization's goals and needs.
- Design, implement, and manage cloud-based data infrastructure on AWS, ensuring scalability, reliability, and cost-efficiency.
- Utilize AWS services (S3, Glue, EMR, Redshift, Lambda, Kinesis, MWAA, etc.) to build and optimize data pipelines and storage solutions.
- Champion the use of data lakehouse architecture and optimize its performance for analytical and operational workloads.
- Identify the gaps and opportunities in the current system and suggest/implement to optimise the processes and costs.
Data Engineering:
- Lead and guide data engineering teams to develop, maintain, and optimize ETL processes for data ingestion, transformation, and loading.
- Implement real-time data processing solutions using technologies such as Apache Kafka and AWS Kinesis.
- Collaborate with data scientists, business stakeholders and analysts to ensure data availability and quality, enabling effective analytics and reporting.
- Leverage DBT for data modelling and transformation to support self-service analytics and data governance.
Data Ingestion & Ingestion:
- Architect and implement data integration solutions for API ingestion, enabling data from diverse sources to be captured, transformed, and ingested into our data lakehouse.
- Utilize Airbyte and custom APIs to ensure efficient, reliable, and secure data transfers.
- Manage data integration pipelines to support real-time and batch data processing.
Workflow Orchestration:
- Design, configure, and maintain workflow orchestration using Apache Airflow to automate ETL processes and data pipeline executions.
- Monitor and optimize job scheduling, error handling, and performance of data workflows.
Security and Compliance:
- Implement data security protocols, access controls, and encryption to safeguard sensitive data, especially PIIs.
- Ensure compliance with data privacy regulations and industry standards.
Collaboration and Documentation:
- Collaborate with cross-functional teams to understand data requirements and provide data solutions to meet their needs.
- Maintain comprehensive documentation for data engineering and data architecture processes and solutions.
Infra & Operations:
- Guide the team in setting up cloud Infra and automate using tools like terraform, cloud formation, Jenkins etc
- Guide the operations team in setting up automated monitoring & alerts mechanism
Relevant Qualifications
- Bachelor's or higher degree in a relevant field.
- 6+ years of proven experience in data engineering, cloud architecture, and AWS services.
- Extensive knowledge of data lakehouse technologies, Hudi, DBT, Airbyte, Redshift, Glue, Kinesis and Apache Airflow.
- Strong expertise in programming languages like SQL, Python and processing frameworks like PySpark
- Strong expertise in real-time data processing.
- Excellent problem-solving and analytical skills.
- Strong communication and teamwork abilities.
- Passion for Sports/Gaming/Entertainment is preferred
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
