
Together AI

Senior AI Infrastructure Engineer

Sorry, this job was removed at 04:09 a.m. (PST) on Tuesday, Jun 03, 2025
In-Office
San Francisco, CA, USA

About the Role

As a Senior AI Infrastructure Engineer, you will be responsible for building the next-generation, highly available, global, multi-cloud PaaS platform with open-source technologies to enable and accelerate Together AI’s rapid growth.

This system spans many diverse environments (Kubernetes, VMs, bare metal compute, and edge deployments) and provides a cohesive and reliable abstraction for running AI workloads in them. You will get to be a technology thought leader, evangelize new, cutting-edge technologies, and solve complex problems.

To be successful, you’ll need to be deeply technical and possess excellent communication, collaboration, and diplomacy skills. You have experience practicing infrastructure-as-code, including using tools like Terraform and Ansible. You have strong software development fundamentals and skills. In addition, you have strong systems knowledge and troubleshooting abilities.

Requirements
  • 5+ years of professional software development experience and proficiency in at least one backend programming language (Golang desired)
  • Demonstrated experience with high-performance or distributed cloud microservices architectures, ideally including building and operating them at global scale across multiple cloud providers such as AWS, Azure, or GCP
  • Excellent understanding of low-level operating system concepts, including multi-threading, memory management, networking and storage, performance, and scale
  • Pragmatic, methodical, well-organized, detail-oriented, and self-starting
  • Experience with Kubernetes and containerization, VPNs, AI workloads, and blockchain-based protocols is a plus
  • Knowledge of GPU programming, NCCL, and CUDA is a plus
  • Experience with PyTorch or TensorFlow is a plus
  • 5+ years of experience writing high-performance, well-tested, production-quality code
Responsibilities
  • Perform architecture and research work for decentralized AI workloads
  • Work on the core, open-source Together AI platform
  • Create services, tools, and developer documentation
  • Create testing frameworks for robustness and fault-tolerance
About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers on our journey to build the next generation of AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our privacy policy at https://www.together.ai/privacy  

Together AI San Francisco, California, USA Office

584 Castro St, #2050, San Francisco, California, United States, 94114

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
