Bright Machines Logo

Bright Machines

Senior Platform/MLOps Engineer

Posted 4 Days Ago
Be an Early Applicant
Hybrid
San Francisco, CA, USA
160K-190K Annually
Senior level
Hybrid
San Francisco, CA, USA
160K-190K Annually
Senior level
The Senior Platform/MLOps Engineer will design and maintain scalable AI/ML infrastructures, implement training pipelines, and support robotic manufacturing systems.
The summary above was generated by AI
RETHINK MANUFACTURING  

The only way to ignite change is to build the best team. At Bright Machines®, we’re innovators and experts in our craft who have joined together to create a new category of manufacturing that will help transform the industry. We believe software and data are the answer, thoughtfully applied to solve our customers’ unique challenges. Through intelligent automation, we give factories newfound flexibility, scalability, and resilience. We deliver products to meet the demands of today while building a platform to take advantage of what comes next.  

Working with us means you’ll have the opportunity to make lasting, impactful changes for our company and our customers. If you’re ready to apply your exceptional skills to create the factory of the future, we’d love to speak with you. 

ABOUT THE ROLE
 

Platform Engineers at Bright Machines are responsible for defining and implementing the systems that make Software Defined Manufacturing possible and that power our flexible robotic manufacturing lines. Our robots, and the software that controls them, are deployed in a variety of factory conditions and help support the manufacturing operations for some of the biggest names in the industry. 

As a Senior Platform/MLOps Engineer, you will build scalable systems that are foundational to the Bright Machines technology stack. With a focus on our AI/ML infrastructure, you will design, implement, and maintain our training pipelines, model deployments, and inference app workloads. Our computer vision and deep learning models are used for defect detection, classification, and visual validation, providing end-to-end inspection solutions that deliver consistent, accurate results under real-world factory conditions. You will collaborate with the Smart Robotics team and the Platform Engineering team to design, implement, and deploy our GPU workloads in kubernetes. If you are ready to apply exceptional engineering practices and build the platform that will define the next generation in manufacturing, this is your opportunity to “Be Bright”.

WHAT YOU WILL BE DOING

    • Design, implement, and maintain reliable, scalable, and secure infrastructure, applications, and tooling, with a focus on our ML/AI pipelines and workloads

    • Write clean, maintainable code, and perform peer code-reviews 

    • Write clear and concise documentation and engage in cross-team communication and knowledge sharing

    • Work with other team members to investigate design approaches, prototype new technology and evaluate technical feasibility

    • Pair with adjacent teams to understand how your frameworks and infrastructure are actually used in the field, continuously improving them and leveraging recent advances to improve developer velocity

WHAT YOU WILL BRING

    • At least 5+ years of experience in Platform Engineering, DevOps, or Site Reliability Engineering (SRE).

    • B.S. or M.S. degree (or equivalent) in Computer Science, Engineering, or a related field

    • Proficiency in at least one modern programming languages (Python, Javascript, C#, Go, etc)

    • Demonstrated industry best-practices in MLOps

    • Proficiency with CI/CD tools and GitOps workflows

    • Familiarity with running GPU workloads in kubernetes

    • Strong knowledge of Kubernetes (self-hosted and managed) and modern k8s paradigms (e.g. CNCF)

    • Proficiency with Infrastructure as Code tools (Terraform, etc) and configuration management tools (Ansible, etc)

    • Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry)

IT WOULD BE GREAT IF YOU HAD

    • Experience in air-gapped or extremely strict security environments

      • Experience communicating with users, technical leaders and management to collect requirements, describe system designs, and architecting software systems that meets your stakeholders needs

      • Knowledge and demonstrated application of software engineering best practices relating to the SDLC including code reviews, SCM, CI/CD, testing, and operations

      • Demonstrated ability to mentor and grow other team members

BE EMPOWERED TO CHANGE AN INDUSTRY 

Bright Machines is a next-generation, AI-enabled manufacturer focused on data center infrastructure assembly operations. Bright Machines uses its proprietary AI-based robotics and software to assemble AI infrastructure hardware products (i.e., data center servers) for hyperscalers and leading Original Equipment Manufacturers (OEMs). With its new AI factory, Bright Machines addresses increasing market demands for computing power due to the surge of AI and the U.S. national mandate to reshore manufacturing by building data center infrastructure at scale with higher quality and shorter time-to-market.

Bright Machines is headquartered in San Francisco, California, with an integration center in Guadalajara, Mexico. The company has been recognized as one of Forbes’ AI 50, awarded “Best AI-based Solution for Manufacturing” by AI Breakthrough, named a “Technology Pioneer” by the World Economic Forum, and highlighted by several other leading technology and innovation organizations.
 

Top Skills

Ansible
C#
Go
Grafana
JavaScript
Kubernetes
Opentelemetry
Prometheus
Python
Terraform

Bright Machines San Francisco, California, USA Office

585 Howard St, San Francisco, CA, United States

Similar Jobs

37 Minutes Ago
In-Office
2 Locations
165K-189K Annually
Senior level
165K-189K Annually
Senior level
Cloud • Information Technology • Machine Learning
The Senior Account Manager will drive retention and revenue growth, manage enterprise customer strategies, and ensure effective AI/ML product adoption; leveraging strong commercial judgment and technical curiosity.
Top Skills: AWSAzureGCPGitJupyterKubernetesPythonPyTorch
37 Minutes Ago
Hybrid
79K-135K Annually
Mid level
79K-135K Annually
Mid level
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
The DevOps Engineer will automate cloud services, manage on-premises infrastructure, develop CI/CD pipelines, and ensure system security and reliability.
Top Skills: AnsibleAzure StackBashChefCloud InfrastructureDevOpsDockerDocker SwarmElk StackGitlabKubernetesLinux/UnixNagiosOpenstackPowershellPrometheusPuppetPythonRancherVMware
37 Minutes Ago
Hybrid
133K-226K Annually
Senior level
133K-226K Annually
Senior level
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
The Site Reliability Engineer will deploy and monitor IaaS, PaaS, and SaaS solutions, focusing on automation, service delivery, and infrastructure support while collaborating with cross-functional teams.
Top Skills: AnsibleAzure StackCephHelm ChartsIaasJdfsKubernetesNfsObject StorageOpen StackPaasSaaSVMware

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account