Tencent Logo

Tencent

Sr. Cloud AI Infrastructure Engineer

Reposted 13 Days Ago
Be an Early Applicant
In-Office
Palo Alto, CA, USA
145K-273K Annually
Senior level
In-Office
Palo Alto, CA, USA
145K-273K Annually
Senior level
Responsible for researching AI hardware accelerators, optimizing performance for cloud computing environments, defining architecture, and analyzing technology trends.
The summary above was generated by AI
Business UnitWhat the Role Entails

1.Architecture Research: Conduct in-depth research into the underlying hardware logic of various AI accelerators; evaluate the power-efficiency ratio and suitability of different heterogeneous architectures in the context of Large Language Model (LLM) inference and training.

2.Operator & Performance Optimization: Design and optimize high-performance operator libraries for large-scale cloud computing environments; resolve long-tail latency issues in hardware scheduling, memory management, and distributed communication.

3.Interconnect Architecture Definition: Define the interconnect architecture ; drive the virtualization, standardized access, and efficient pooling of heterogeneous computing resources in the cloud.

4.Technology Trend Analysis: Monitor global trends in semiconductors and accelerators; perform feasibility studies and experimental validation for the implementation of emerging technologies within cloud infrastructure.

Who We Look For

1.Education: Master’s or Ph.D. degree in Computer Engineering, Electronic Engineering, Microelectronics, or a related field.

2.Core Expertise: Expertise in GPGPU architectures or other mainstream AI accelerator architectures.

3.Programming & Frameworks: Proficient in parallel computing frameworks; deep understanding of low-level operator development languages (e.g., CUDA, Triton).

4.Network & Distributed Systems: Solid understanding of large-scale distributed systems, cluster topologies (e.g., Fat-tree, Torus), and high-performance network protocols.

5.Industry Insight: Familiar with the architectural evolution of global leading computing enterprises; ability to objectively analyze the technical pros/cons and engineering challenges of different architectural paths.

6.Experience: Experience in the application, optimization, or architectural design of ultra-large-scale accelerator clusters is preferred.

7.Framework Optimization: Experience in the low-level adaptation and performance tuning of mainstream deep learning frameworks (e.g., PyTorch, TensorFlow) is preferred.

Location State(s)

US-California-Palo Alto

The expected base pay range for this position in the location(s) listed above is $145,100.00 to $273,200.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

HQ

Tencent Palo Alto, California, USA Office

2747 Park Blvd, Palo Alto, CA, United States, 94306

Similar Jobs

21 Days Ago
In-Office or Remote
3 Locations
184K-357K Annually
Senior level
184K-357K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Lead the optimization and performance analysis of distributed training and inference workloads on NVIDIA GPU platforms, with responsibilities including debugging, benchmarking, and ensuring reliability of large-scale AI systems.
Top Skills: C/C++Containerized EnvironmentsCudaInfinibandMegatronNcclNemoNsight SystemsNvlinkNvswitchPciePythonPyTorchRdmaRoceTensorrt-Llm
17 Days Ago
In-Office or Remote
Santa Clara, CA, USA
184K-357K Annually
Senior level
184K-357K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The role involves developing AI infrastructure for large-scale workloads, enhancing system reliability, and optimizing performance, requiring extensive experience in AI software systems.
Top Skills: C/C++DynamoElkJaxKubernetesLokiPrometheusPythonPyTorchRayTensorFlow
Yesterday
In-Office or Remote
3 Locations
184K-357K Annually
Senior level
184K-357K Annually
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The role involves developing and optimizing AI infrastructure for large-scale training and inference, ensuring system reliability and efficiency through software engineering practices.
Top Skills: C/C++ElkIb VerbsJaxLibfabricsLokiNcclPrometheusPythonPyTorchRdmaTensorFlowUcx

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account