Character.AI Logo

Character.AI

Machine Learning Infrastructure Engineer

Reposted 6 Days Ago
In-Office
Redwood City, CA
150K-350K Annually
Mid level
In-Office
Redwood City, CA
150K-350K Annually
Mid level
The role involves supporting ML infrastructure, building diagnostic tools, managing deployments, and optimizing GPU utilization for ML projects.
The summary above was generated by AI

About the role

We’re looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving infrastructure for ML research.

Responsibilities:

  • Provide infrastructure support to our ML research and product

  • Build tooling to diagnose cluster issues and hardware failures

  • Monitor deployments, manage experiments, and generally support our research

  • Maximize GPU allocation and utilization for both serving and training

Requirements:

  • 4+ years of experience supporting the infrastructure within an ML environment

  • Experience in developing tools used to diagnose ML infrastructure problems and failures

  • Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage)

  • Experience working with GPUs

Nice to have

  • Experience with large GPU clusters and high-performance computing/networking

  • Experience with supporting large language model training

  • Experience with ML frameworks like Pytorch/TensorFlow/JAX

  • Experience with GPU kernel development

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.


In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.


Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

Top Skills

Cloud Platforms
Cloud Storage
Compute Engine
Jax
Kubernetes
PyTorch
TensorFlow

Character.AI Menlo Park, California, USA Office

700 El Camino Real, Menlo Park, California, United States, 94025

Similar Jobs

2 Days Ago
Hybrid
5 Locations
178K-313K Annually
Senior level
178K-313K Annually
Senior level
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
The Software Engineer, ML Infrastructure will design, optimize, and manage systems for machine learning workloads, ensuring efficient AI model training and serving within Snapchat's infrastructure.
Top Skills: C++Caffe2FlinkJavaPythonPyTorchRayScalaScikit-LearnSparkTensorFlow
2 Days Ago
In-Office
Mountain View, CA, USA
160K-241K Annually
Mid level
160K-241K Annually
Mid level
Artificial Intelligence • Automotive • Information Technology • Robotics
The role involves optimizing machine learning models, developing infrastructure for model life cycles, and collaborating across teams to enhance Nuro's autonomy technology.
Top Skills: C++CudaJaxKerasPythonPyTorchTensorFlowTriton
4 Days Ago
In-Office
Santa Clara, CA, USA
150K-250K Annually
Senior level
150K-250K Annually
Senior level
Artificial Intelligence • Machine Learning
As a Senior Site Reliability Engineer, you will manage HPC cluster operations, deploy infrastructure-as-code solutions, support research teams, and develop automation tools.
Top Skills: AnsibleAWSAzureBashCephGCPGitopsGpudirectInfinibandKubernetesLinuxPythonRdmaTerraform

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account