Nubank Logo

Nubank

Lead ML Data Engineer, AI Core

Posted 12 Days Ago
Easy Apply
In-Office
4 Locations
Senior level
Easy Apply
In-Office
4 Locations
Senior level
Lead design and build scalable data ingestion and feature pipelines for foundation models; implement data quality monitoring; model and integrate new data sources; run experiments measuring data impact; optimize ML training and hyperparameters; collaborate with ML, platform, and infra teams; lead technical initiatives and mentor team members.
The summary above was generated by AI
About Us

Nu is one of the largest digital financial platforms in the world, with more than 127 million customers across Brazil, Mexico, and Colombia. Guided by our mission to fight complexity and empower people, we are redefining financial services in Latin America and this is still just the beginning of the purple future we're building.

Listed on the New York Stock Exchange (NYSE: NU), we combine proprietary technology, data intelligence, and an efficient operating model to deliver financial products that are simple, accessible, and human.

Our impact has been recognized by global rankings such as Time 100 Companies, Fast Company's Most Innovative Companies, and Forbes World's Best Bank. Visit our institutional page [Careers at Nu - Join our team!](https://international.nubank.com.br/careers/)

About the Role

At Nu, data is the foundation that powers our AI and machine learning models, enabling millions of customers to access fair financial products. As a Machine Learning Engineer in AI Core, Data Intelligence, you’ll work across a broad spectrum — from building scalable data infrastructure and feature pipelines that feed our state-of-the-art foundation models to designing, training, and shipping transaction classification models that power critical customer experiences across the company.

You'll work at the intersection of data and applied machine learning, contributing across multiple stages of the ML lifecycle: ingesting and labeling data, training and evaluating models, and helping with deployment and production monitoring through robust quality controls. You’ll partner closely with product, compliance, and ML teams to ensure models are auditable, privacy-aware, and deliver measurable business value.

You'll join a team that manages the data engineering backbone of AI Core, ensuring data is accessible, healthy, and properly tracked across our entire ML ecosystem. Here, you'll combine your expertise in building scalable data systems with your passion for machine learning, creating solutions that enable our models to learn from better, richer data.

You can read more about the work in the AI Core team on our blog: https://building.nubank.com/understanding-our-customers-finances-through-foundation-models/

Key Responsibilities

As a Lead Machine Learning Engineer in AI Core Data Intelligence, you will:

  • Design and build scalable data ingestion pipelines that bring new datasets into our AI Core platform, ensuring reliable, efficient data flow from source to model training.
  • Implement data quality monitoring and validation systems that catch issues before they impact model performance, maintaining the health of datasets across our ML ecosystem.
  • Model new types of data into our foundation models.
  • Analyze the impact of new data sources on existing models, conducting experiments to measure performance improvements and guide data acquisition decisions.
  • Develop and maintain data preparation workflows that transform raw data into features ready for model training, working with distributed computing frameworks like Ray.
  • Tune and optimize machine learning models when new datasets are integrated, applying hyperparameter optimization and evaluating model performance improvements.
  • Collaborate with AI Core ML, Platform, and Infra teams to ensure seamless data flow across our ML infrastructure, from ingestion to model deployment.
  • Lead technical initiatives that improve our data engineering practices, setting standards for data quality, pipeline reliability, and model-data integration.
  • Mentor team members and contribute to hiring activities, helping build a strong and diverse team that drives innovation in AI infrastructure.
Basic Qualifications
  • Typically 6+ years of experience in machine learning engineering, data engineering, or related fields with a strong track record of building production data and ML systems.
  • Proven experience designing and building data ingestion pipelines at scale, with expertise in distributed computing frameworks (Ray, Spark, or similar).
  • Strong background in applied machine learning, including model training, hyperparameter tuning, and performance evaluation.
  • Experience analyzing how data changes impact model performance, with the ability to design and run experiments to measure improvements.
  • Proficiency in Python for data engineering and ML workflows, with experience working with large-scale data processing systems.
  • Solid understanding of data quality principles and experience implementing monitoring, validation, and alerting systems.
  • Strong problem-solving skills with the ability to address complex, ambiguous problems requiring coordination across multiple teams.
  • Excellent communication skills, capable of explaining technical concepts to both technical and non-technical stakeholders.
  • Demonstrated leadership experience, including mentoring team members and contributing to technical decision-making.
Preferred Qualifications
  • Experience with MLflow or similar model tracking and versioning systems.
  • Knowledge of foundation models, fine-tuning workflows, and transformer architectures.
  • Experience with data pipeline orchestration tools (Dagster, Airflow, or similar).
  • Background in financial services or fintech, understanding the unique data challenges in this domain.
  • Experience working in a fast-paced, high-growth environment with distributed teams.
  • Track record of reducing complexity in data systems and improving developer experience for ML teams.
Our Benefits
  • Opportunity of earning equity at Nu
  • Medical Insurance
  • Dental and Vision Insurance
  • Life Insurance and AD&D
  • Extended maternity and paternity leaves 
  • Nucleo - Our learning platform of courses
  • NuLanguage - Our language learning program
  • NuCare - Our mental health and wellness assistance program
  • Extended maternity and paternity leaves 
  • 401K
  • Saving Plans - Health Saving Account and Flexible Spending Account
  • Work-from-home Allowance
  • Relocation Assistance Package, if applicable.
Work Model for this Role

Hybrid 2-3 times/week: Our hybrid work model brings us to the office at least twice a week, on strategic days designed to maximize team connection and collaboration. For more details, visit https://building.nubank.com/nu-hybrid-work-model/

Locations: This role is available in any of our North American offices (Palo Alto, USA; Miami, USA; Durham, USA; Toronto, CAN)

Top Skills

Python,Ray,Spark,Mlflow,Dagster,Airflow,Transformer Architectures,Foundation Models

Similar Jobs

9 Hours Ago
Hybrid
14 Locations
124K-280K Annually
Expert/Leader
124K-280K Annually
Expert/Leader
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
The Senior Manager oversees AI and GenAI data science teams, driving complex analytics projects, coaching teams, and ensuring high-quality solutions. Responsibilities include managing GenAI development, collaborating with clients, presenting findings, and leading business development activities.
Top Skills: AWSAzureCi/CdGitGCPLangchainNltkNoSQLPandasPythonSemantic KernelSQL
9 Hours Ago
Hybrid
42 Locations
212K-244K Annually
Senior level
212K-244K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
As a Senior Developer, you'll lead the development of software solutions, manage a team, and ensure quality deliverables that meet client needs.
Top Skills: .Net/C#Ai PromptingAngularAuthenticationCloud TechnologiesDevOpsMicro Front EndsMicro ServicesRest ApisSQL Server
Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Safety Modeling Engineer will develop and analyze models to assess collision outcomes and severity for automated driving systems, using statistical and machine learning methods.
Top Skills: Ci/CdDockerGitJenkinsJIRAKubernetesPoetryPythonSQLTerraform

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account