d-Matrix

Senior Principal Architect

Reposted 7 Days Ago
In-Office
Santa Clara, CA
Senior level
Develop and optimize a high-performance inference runtime integrating with PyTorch, collaborating with diverse teams and ensuring code quality.
The summary above was generated by AI

At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration.

We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals who are passionate about tackling challenges and driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI.

Location:

Hybrid: working onsite at our Santa Clara, CA headquarters 3 days per week.

We are seeking a skilled and experienced Senior Principal Architect to architect our next generation of AI software stack. The ideal candidate will have a solid understanding of the AI inference landscape and model architectures, and a strong background in C++ development. They should also be able to take a holistic view of the software stack and drive designs toward high-performance execution. Experience in building distributed systems or high-performance computing (HPC) applications is therefore a strong plus, as is familiarity with the internals of frameworks such as PyTorch, Ray, MPI, vLLM, Pallas, SGLang, or similar machine learning frameworks.

What You Will Do:

  • Architect and Develop: Lead the design of a scale-up and scale-out inference software stack that leverages d-Matrix's advanced hardware capabilities.

  • Problem Solving: Bringing novel advancements requires touching multiple layers of the software stack, so the candidate is expected to think broadly and solve problems across those layers.

  • Collaborate: Work closely with cross-functional teams including hardware engineers, data scientists, and product managers to define requirements and deliver integrated solutions.

  • Optimize Performance: Develop and implement optimization techniques to ensure low latency and high throughput in distributed and HPC environments.

  • Code Quality: Ensure code quality and performance through rigorous testing and code reviews.

  • Documentation: Create technical documentation to support development, deployment, and maintenance activities.

What You Will Bring:

  • Education: Bachelor’s degree with 20+ years of professional experience in software development with a focus on C++; Master’s degree or PhD in Computer Science, Engineering, or a related field preferred, with 3+ years of professional experience in software development with a focus on C++.

  • Experience in architecting and building complex software systems.

  • Experience with distributed systems or high-performance computing (HPC) applications.

  • Familiarity with PyTorch internals or similar machine learning frameworks.

Technical Skills:

  • Strong proficiency in modern C++ (C++11 and above) and Python.

  • Experience with parallel and concurrent programming.

  • Proficient in CMake, Pytest, and other development tools.

  • Knowledge of GPU programming and acceleration techniques is a plus.

  • Proficient in using development tools and frameworks for building and deploying large-scale applications.

Soft Skills:

  • Excellent problem-solving and analytical skills.

  • Strong communication and interpersonal abilities.

Equal Opportunity Employment Policy

d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day.

d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individuals interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of all applicants. Thank you for your understanding and cooperation.

Top Skills

C++
CMake
GPU Programming
pytest
Python
PyTorch
HQ

d-Matrix Santa Clara, California, USA Office

5201 Great America Pkwy, Santa Clara, CA, United States, 95054
