Caterpillar Logo

Caterpillar

Senior Data Scientist - Artificial Intelligence R&D

Posted 5 Hours Ago
Be an Early Applicant
Hybrid
Chicago, IL
113K-183K Annually
Senior level
Hybrid
Chicago, IL
113K-183K Annually
Senior level
Lead applied AI research and prototyping across LLMs, multimodal models, RAG, and agentic workflows. Design experiments, prepare and label data, fine-tune and benchmark models, instrument experiment tracking, translate research to production-ready prototypes, mentor junior staff, and communicate results to stakeholders.
The summary above was generated by AI
Career Area:
Technology, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do - but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here - we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
Cat Digital is the digital and technology arm of Caterpillar Inc., leveraging the latest technologies to build industry leading digital solutions for our customers and dealers. With over 1.5 million connected assets worldwide, our teams use data, technology, advanced analytics, telematics and AI capabilities to help our customers build a better, more sustainable world.
Job Summary:
Join the AI Research & Development team of Cat Digital and play a central role in advancing the frontier of applied AI for one of the world's largest industrial enterprises. As a Senior Data Scientist, you will design, build, and evaluate cutting-edge AI systems, spanning generative AI, large language models (LLMs), multimodal intelligence, retrieval-augmented generation (RAG), and autonomous agents, delivering high-impact Proofs of Concept (POCs) with clear production intent while exploring longer-horizon research opportunities.
What You Will Do:
  • Design and execute AI experiments across the full model lifecycle: hypothesis formulation, data preparation, model development, evaluation, and iteration, maintaining research rigor in an ambiguous, fast-moving environment.
  • Develop, fine-tune, and benchmark LLMs and multimodal AI models (text, vision, speech), including systematic evaluation of quality, latency, cost, and safety tradeoffs across model variants and providers.
  • Explore and optimize knowledge retrieval systems (RAG pipelines, vector databases, hybrid search) and agentic workflows, ensuring relevance, accuracy, and scalability for enterprise use cases.
  • Lead data preparation workstreams for model training, fine-tuning, and validation, including dataset curation, labeling strategy, synthetic data generation, and quality assurance.
  • Instrument AI systems for observability and reproducibility using experiment tracking frameworks (e.g., Langfuse, MLflow), maintaining clear documentation of model versions, evaluation datasets, and performance baselines.
  • Translate research findings into production-ready prototypes, collaborating with Engineering and Product teams to define technical requirements, integration paths, and deployment readiness criteria.
  • Evaluate emerging AI capabilities and tools (open-source and commercial), providing structured assessments and recommendations to inform the team's technology strategy.
  • Mentor and coach junior Data Scientists, establishing best practices for experimentation, model evaluation, and responsible AI development across the team.
  • Communicate insights and results to technical and non-technical stakeholders, including product managers, engineers, and senior leadership, with clarity and business impact framing.

What You Will Have:
  • Applied Statistics & Quantitative Methods: Experience applying statistical thinking to experimentation, evaluation, and decision-making in ambiguous, research-driven environments.
  • Analytical Rigor & Attention to Detail: Proven ability to design precise experiments, validate assumptions, and ensure accuracy and reproducibility of results.
  • Advanced Machine Learning & AI: Knowledge of modern ML techniques, including deep learning, generative AI, NLP, computer vision, and multimodal systems, with hands-on implementation experience.
  • Model Evaluation & Optimization: Strong experience evaluating model quality and system-level tradeoffs across accuracy, latency, cost, and scalability dimensions.
  • Programming Expertise: Proficiency in Python for AI and ML development, including use of modern AI frameworks and tooling.
  • Data Engineering & Access: Strong understanding of data storage, retrieval, and processing systems required to support large-scale training and experimentation workflows.
  • Requirements & Systems Thinking: Ability to define technical and non-functional requirements that bridge research, engineering, and production concerns.

Considerations for Top Candidates:
  • Bachelor's, Master's, or PhD degree in Data Science, Computer Science, Machine Learning, Statistics, Applied Mathematics, Engineering, or a closely related technical field.
  • Proven experience building and deploying advanced ML models beyond traditional analytics use cases.
  • Extensive proficiency in Python (NumPy, Pandas, PyTorch, LangChain, etc.); ability to write clean, maintainable, production-oriented code and contribute to shared AI infrastructure.
  • Strong hands-on experience with generative AI, large language models, deep neural networks, and modern ML frameworks.
  • Demonstrated experience designing evaluation frameworks and benchmarks for AI systems.
  • Familiarity with AI infrastructure, cloud platforms (AWS, Azure), and scalable experimentation environments.
  • Advanced experience with version control, experiment tracking, and collaborative development (e.g., Git-based workflows).
  • Experience working in Agile, cross-functional product development environments.
  • Prior exposure to industrial, manufacturing, heavy equipment, or complex physical systems is a strong plus, but not required.

Summary Pay Range:
$112,710.00 - $183,140.00
Compensation and benefits offered may vary depending on multiple individualized factors, job level, market location, job-related knowledge, skills, individual performance and experience. Please note that salary is only one component of total compensation at Caterpillar.
Benefits:
Subject to plan eligibility, terms, and guidelines. This is a summary list of benefits.
  • Medical, dental, and vision benefits*
  • Paid time off plan (Vacation, Holidays, Volunteer, etc.)*
  • 401(k) savings plans*
  • Health Savings Account (HSA)*
  • Flexible Spending Accounts (FSAs)*
  • Health Lifestyle Programs*
  • Employee Assistance Program*
  • Voluntary Benefits and Employee Discounts*
  • Career Development*
  • Incentive bonus*
  • Disability benefits
  • Life Insurance
  • Parental leave
  • Adoption benefits
  • Tuition Reimbursement

* These benefits also apply to part-time employees
Visa Sponsorship is not available for this position.
Posting Dates:
June 29, 2026 - July 5, 2026
Any offer of employment is conditioned upon the successful completion of a drug screen.
Caterpillar is an Equal Opportunity Employer, Including Veterans and Individuals with Disabilities. Qualified applicants of any age are encouraged to apply.
Not ready to apply? Join our Talent Community.

Similar Jobs at Caterpillar

5 Hours Ago
Hybrid
100K-134K Annually
Junior
100K-134K Annually
Junior
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
Provide worldwide Tier 2/3 support for Caterpillar digital products: troubleshoot software and API issues, perform root cause analysis, monitor product performance, document and communicate resolutions, maintain knowledge base and training materials, and coordinate fixes with development and dealer teams.
Top Skills: Azure DevopsPostmanProduct LinkPythonRelational DatabasesSalesforceServicenowSos Web ServicesSQLVisionlink
5 Hours Ago
Hybrid
128K-193K Annually
Senior level
128K-193K Annually
Senior level
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
Lead delivery of software projects and programs for equipment management platforms. Manage planning, schedules, budgets, risk, communications, and cross-functional teams through SDLC stages to deliver scalable, high-quality solutions that improve fleet visibility and user experience.
Top Skills: Azure DevopsConfluencePower BI
5 Hours Ago
Hybrid
128K-209K Annually
Senior level
128K-209K Annually
Senior level
Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
Lead Data Scientist responsible for designing and delivering advanced analytics and ML solutions on large Service Events datasets. Lead data acquisition, modeling, NLP/text analytics, predictive modeling, and model productionization. Define problem statements with stakeholders, optimize algorithms and data models, communicate insights via visualization, and mentor teams to embed data-driven decision making across service operations.
Top Skills: Cat InteractData ModelingDatabase SystemsMachine LearningNatural Language ProcessingPythonQuery OptimizationRSQLText Analytics

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account