Coupa

Lead Data Platform Engineer - 11623

Posted An Hour Ago
In-Office or Remote
Hiring Remotely in New York, NY
125K-174K Annually
Senior level
Design, deploy, and operate scalable data pipelines and cloud infrastructure for ML/GenAI, manage day-2 LLM serving platforms, automate IaC and platform maintenance, ensure observability and reliability, and support production ETL and incident response.
The summary above was generated by AI
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.

Why join Coupa?

🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other. 

Learn more on the Life at Coupa blog and hear from our employees about their experiences working at Coupa.

The Impact of a Lead Data Platform Engineer at Coupa:

If you are passionate about new technologies, have a strong technical background, and are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At Coupa, the “Cloud team” is looking for an engineer who is ready to constantly question the status quo, combining system design, code development, deployment, automation, and networking skills with experience in managing big data, machine learning, and GenAI platforms.

What You'll Do:

  • Manage end-to-end data pipelines (ETL jobs) within agreed SLAs.
  • Manage AWS core and big data services (S3, IAM, EMR, Redshift, etc.).
  • Run applications in containers (ECS, Docker).
  • Lead the Day 2 operational lifecycle for ML and GenAI infrastructure. This includes designing, deploying, and maintaining high-availability production LLM serving platforms, and implementing automated scaling, self-healing, and infrastructure-as-code patterns. Focus on proactive reliability, model performance observability, and continuous cost optimization for high-compute AI workloads.
  • Collaborate closely with our product development and engineering teams to create AI-driven features
  • Drive cloud operations consistency by automating platform maintenance, standardizing infrastructure configurations (IaC), and implementing robust release management processes to minimize drift across multi-cloud environments.
  • Manage AWS infrastructure using code (Terraform, Chef, etc.).
  • Administer applications running on the Linux operating system.
  • Enable application and system monitoring for better observability.
  • Provide application and infrastructure support for ETL jobs and data pipelines, including participating in an on-call rotation for after-hours emergencies.
  • Collaborate with platform and Dev teams to plan and deploy product releases and patch Linux/ECS clusters.
  • Participate in design reviews, code reviews, and incident troubleshooting.
  • Operate in a high-pressure environment and troubleshoot complex issues quickly while successfully handling multiple priorities.
  • Record, write, and review RCAs.

What You Will Bring to Coupa:

  • Bachelor's degree and 8+ years of experience managing Big Data technologies and data pipelines.
  • Sound knowledge of and experience in Linux administration and troubleshooting.
  • 5+ years of experience in managing cloud infrastructure and platforms, such as AWS and Azure
  • Familiarity with the current engineering landscape in the generative AI space and a strong interest in AI and related technologies.
  • Strong expertise in MLOps and production-grade LLM operations. Proven track record in managing high-availability model inference clusters, automating model lifecycle management, and implementing advanced observability (latency, throughput, and error rate monitoring) specifically for AI workloads.
  • Bash or Python scripting experience
  • Experience with containerization (Amazon ECS, EKS, Azure AKS)
  • Experience with tools like Chef, Ansible, Jenkins, Rundeck, or equivalent
  • Experience with source control systems such as Git and operating in complex branching strategies
  • Experience with Infrastructure as Code tools like Terraform and Helm charts
  • Good understanding of DNS and load balancer setup and troubleshooting
  • Experience in Big Data platforms/data lakes and managing Business Intelligence tools (such as Looker)
  • Knowledge of Apache Spark architecture and troubleshooting Java applications
  • Basic understanding of MySQL Server and general database knowledge
  • Excellent written and verbal communication skills and a passion for problem solving
  • Confidence in your ability to own and deliver projects and issues to resolution on your own, and the ability to think and act globally
  • Deep experience in Day 2 cloud operations, including automated incident remediation, capacity planning, and managing large-scale production cloud environments with a focus on performance and reliability.

Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees. 

Please be advised that inquiries or resumes from recruiters will not be accepted.

By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.

HQ

Coupa Foster City, California, USA Office

950 Tower Ln, Foster City, California, United States, 94404 2121

Similar Jobs at Coupa

14 Days Ago
Remote
US
125K-174K Annually
Senior level
Artificial Intelligence • Fintech • Information Technology • Logistics • Payments • Business Intelligence • Generative AI
As a Manager of Cloud Software Engineering at Coupa, you will lead AI platform DevOps, manage global teams, and oversee agile delivery while ensuring technical quality and innovation.
Top Skills: AI, Amazon SageMaker, AWS, CI/CD, EKS, GenAI, IAM, Kubernetes, LanceDB, Machine Learning, S3, Terraform, Vector Databases
23 Days Ago
Remote
US
149K-208K Annually
Senior level
Artificial Intelligence • Fintech • Information Technology • Logistics • Payments • Business Intelligence • Generative AI
This role involves designing scalable data ingestion systems and building a centralized data lake on Apache Iceberg. Responsibilities include improving performance and reliability, collaborating with data engineers, and providing technical leadership.
Top Skills: Apache Iceberg, AWS, Azure, BigQuery, Databricks, GCP, Presto, Python, Snowflake, SQL, Trino

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms, and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food, and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
