High 5 Games Logo

High 5 Games

DevOps Engineer - ML & Data Infrastructure

Posted 18 Days Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
The DevOps Engineer will design, build, and optimize ML and data infrastructure on GCP, mentor a team, and ensure system reliability and performance.
The summary above was generated by AI

We’re looking for a DevOps Engineer to help design, build, and optimize the cloud infrastructure powering our machine learning operations. You’ll play a key role in scaling AI models from research to production — ensuring smooth deployments, real-time monitoring, and rock-solid reliability across our Google Cloud Platform (GCP) environment.

You’ll work hand-in-hand with data scientists, ML engineers, and other DevOps experts to automate workflows, enhance performance, and keep our AI systems running seamlessly for millions of players worldwide. We’re also looking for someone with strong leadership and team management capabilities who can mentor engineers, coordinate initiatives, and help drive operational excellence across the team.

What You’ll Do:

  • Manage, configure, and automate cloud infrastructure using tools such as Terraform and Ansible.
  • Implement CI/CD pipelines for ML models and data workflows, focusing on automation, versioning, rollback, and monitoring with tools like Vertex AI, Jenkins, and DataDog.
  • Build and maintain scalable data and feature pipelines for both real-time and batch processing using BigQuery, BigTable, Dataflow, Composer, Pub/Sub, and Cloud Run.
  • Set up infrastructure for model monitoring and observability — detecting drift, bias, and performance issues using Vertex AI Model Monitoring and custom dashboards.
  • Optimize inference performance, improving latency and cost-efficiency of AI workloads.
  • Ensure overall system reliability, scalability, and performance across the ML/Data platform.
  • Define and implement infrastructure best practices for deployment, monitoring, logging, and security.
  • Troubleshoot complex issues affecting ML/Data pipelines and production systems.
  • Ensure compliance with data governance, security, and regulatory standards, especially for real-money gaming environments.
  • Lead and mentor DevOps engineers, helping guide technical decisions and operational processes.
  • Support sprint planning, task prioritization, and cross-functional coordination across infrastructure and platform initiatives.
  • Conduct code reviews, share best practices, and contribute to building a high-performing engineering culture.
  • Collaborate closely with ML, Data, Product, and Security teams to align infrastructure strategy with business objectives.

What We’re Looking For:

  • 5+ years of experience as a DevOps Engineer, ideally with a focus on ML and Data infrastructure.
  • Experience leading projects, mentoring engineers, or managing technical teams.
  • Strong hands-on experience with Google Cloud Platform (GCP) — especially BigQuery, Dataflow, Vertex AI, Cloud Run, and Pub/Sub.
  • Proficiency with Terraform (and bonus points for Ansible).
  • Solid grasp of containerization (Docker, Kubernetes) and orchestration platforms like GKE.
  • Experience building and maintaining CI/CD pipelines, preferably with Jenkins.
  • Strong understanding of monitoring and logging best practices for cloud and data systems.
  • Scripting experience with Python, Groovy, or Shell.
  • Familiarity with AI orchestration frameworks (LangGraph or LangChain) is a plus.
  • Strong communication, collaboration, and stakeholder management skills.
  • Bonus points if you’ve worked in gaming, real-time fraud detection, or AI-driven personalization systems.

Similar Jobs

3 Hours Ago
In-Office or Remote
Junior
Junior
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Plan and execute high-intent events (roundtables, workshops, conference) to drive qualified pipeline. Manage venues, vendors, contracts, logistics, reporting, and sales alignment. Travel monthly, work closely with sales, customer, product, and marketing teams to deliver world-class attendee experiences.
Top Skills: ConfluenceJIRA
4 Hours Ago
Remote or Hybrid
63K-99K Annually
Junior
63K-99K Annually
Junior
Digital Media • Information Technology • News + Entertainment
Responsible for developing and maintaining client relationships in advertising sales to achieve annual sales goals, including market research, proposals, and revenue generation activities.
Top Skills: AdvertisingCustomer Relationship Management (Crm)Sales
4 Hours Ago
Remote or Hybrid
California, USA
84K-157K Annually
Senior level
84K-157K Annually
Senior level
Digital Media • Information Technology • News + Entertainment
Sell integrated networking, security, and communications solutions (SD-WAN, UCaaS, network security) to enterprise multi-site accounts. Prospect, develop territory relationships, deliver in-person presentations, meet revenue and quota targets, collaborate with internal teams, and manage complex sales cycles while supporting customers and partners.
Top Skills: EthernetNetwork SecuritySd-WanUcaasWireless

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account