xAI Logo

xAI

RDMA Engineer - Supercomputing

Reposted 16 Days Ago
Be an Early Applicant
Easy Apply
In-Office
2 Locations
180K-440K Annually
Mid level
Easy Apply
In-Office
2 Locations
180K-440K Annually
Mid level
The RDMA Engineer designs and optimizes low-latency networking solutions using NVIDIA technologies for high-performance supercomputing environments, collaborating with AI research teams and improving performance.
The summary above was generated by AI
About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Role

RDMA Engineers on xAI’s Supercomputing team design and optimize low-latency, high-bandwidth networking solutions using NVIDIA’s RDMA-capable technologies to support some of the world’s largest GPU supercomputing clusters. These clusters drive AI training and inference workloads, demanding cutting-edge performance and scalability.

Focus
  • Develop and tune RDMA-based communication systems leveraging NVIDIA GPUs and Mellanox NICs (InfiniBand, RoCE) for ultra-fast data transfer between nodes.
  • Implement and optimize GPUDirect RDMA to enable direct memory access between GPUs and network interfaces, minimizing CPU overhead.
  • Integrate RDMA solutions with Kubernetes-based workloads, ensuring seamless operation across distributed compute and storage systems.
  • Collaborate with AI researchers and infrastructure teams to accelerate data pipelines and collective communications using NCCL and MPI.
  • Troubleshoot and resolve performance bottlenecks in high-throughput, low-latency networking environments.
Ideal Experience
  • Hands-on experience with NVIDIA RDMA technologies (e.g., GPUDirect RDMA, RoCE, InfiniBand) in HPC or AI supercomputing environments.
  • Proficiency in programming with Rust, C, or C++ for low-level networking and system optimization.
  • Familiarity with NVIDIA’s networking stack, including Mellanox drivers, libraries (e.g., libibverbs), and tools (e.g., NVPeerMemory).
  • Experience optimizing distributed systems with MPI, NCCL, or similar frameworks for GPU-accelerated workloads.
  • Knowledge of Kubernetes networking and integrating RDMA into containerized environments.
  • Bonus: Background in AI/ML training workflows and their networking demands (e.g., large-scale parameter synchronization).
Tech Stack
  • NVIDIA GPUs and Mellanox networking (InfiniBand, RoCE)
  • RDMA protocols (e.g., GPUDirect RDMA, RoCEv2)
  • Kubernetes
  • Rust and C/C++
  • MPI (Message Passing Interface) and NCCL (NVIDIA Collective Communications Library)
Annual Salary Range

$180,000 - $440,000 USD

Benefits

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer.

California Consumer Privacy Act (CCPA) Notice

Top Skills

C
C++
Gpudirect Rdma
Infiniband
Kubernetes
Mpi
Nccl
Nvidia Rdma Technologies
Roce
Rust
HQ

xAI San Francisco, California, USA Office

3180 18th St., San Francisco, CA, United States

xAI Palo Alto, California, USA Office

1450 Page Mill Road, Palo Alto, CA, United States

Similar Jobs

5 Hours Ago
Hybrid
San Francisco, CA, USA
189K-351K Annually
Senior level
189K-351K Annually
Senior level
Cloud • Software
Lead and develop a talented SRE team while ensuring compliance with FedRAMP regulations and collaborating across teams for security and operations.
Top Skills: AIAutomationCloudDistributed SystemsFedrampSecurity
6 Hours Ago
Hybrid
Carmel, CA, USA
22-28 Hourly
Junior
22-28 Hourly
Junior
Fintech • Financial Services
As a Teller, you will support customer transactions, engage with clients, process operations, and ensure compliance with bank policies while building community relationships.
6 Hours Ago
Hybrid
Temecula, CA, USA
23-31 Hourly
Entry level
23-31 Hourly
Entry level
Fintech • Financial Services
The Associate Personal Banker will build relationships and provide financial solutions to customers, assist with account openings, and ensure compliance with regulations.

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account