NVIDIA Logo

NVIDIA

Senior Product Quality Engineer

Posted 8 Days Ago
Be an Early Applicant
In-Office
Santa Clara, CA, USA
116K-236K Annually
Senior level
In-Office
Santa Clara, CA, USA
116K-236K Annually
Senior level
Lead system-level power failure analysis for data center systems and compute modules, driving customer-return investigations from symptom confirmation to root cause and corrective action. Use lab instruments and Linux-based diagnostics to reproduce, isolate, and analyze complex power issues, correlate field data and telemetry, partner with cross-functional teams, and deliver concise technical and executive reports.
The summary above was generated by AI

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most experienced and hardworking people in the world working for us. If you're creative, autonomous, and energized by deep technical problem solving, we want to hear from you!

We are looking for a Product Quality Engineer to join our Systems Product Quality team as the system-level power debug domain expert for customer returns and field failures. This role will lead technical failure analysis for NVIDIA data center systems, compute trays, and compute modules, with special focus on customer-reported field issues involving power delivery, power sequencing, and intermittent power events.

Your hands-on debug capability, structured root-cause mindset, and ability to connect board-level signals with system-level behavior will be essential to driving customer-return investigations from symptom confirmation through technical root cause and corrective action closure. Own customer-return and field-failure power investigations from symptom confirmation through containment, root cause, corrective action, and quality learning closure.

What you'll be doing:

  • Lead system-level power failure analysis for customer returns and field failures across data center systems, compute trays, and compute modules.

  • Confirm, reproduce, and isolate complex power failures such as no power, intermittent boot, unexpected shutdown, brown-out, rail droop, over-current protection, under-voltage protection, sequencing faults, hot-plug events, and margin-related failures.

  • Analyze system power architecture from AC/DC input through PSU, PDU, hot-swap, eFuse, VR, regulator, current-sense, and board-level power rails to determine the true failure boundary.

  • Use oscilloscopes, current probes, DMMs, BMC-reported voltage/current readings, system event logs, and Linux-based diagnostics to build fact-based debug conclusions.

  • Correlate field return data, customer logs, firmware behavior, board schematics, PCB layout, BOM history, and telemetry trends to identify root cause and assess risk.

  • Partner with hardware design, power design, firmware, customer quality, reliability, manufacturing, and supplier quality teams to resolve critical customer and field issues.

  • Drive containment, failure analysis, corrective and preventive actions, and defect-prevention feedback with clear ownership and closure criteria.

  • Create concise technical reports, quality updates, and executive-ready summaries that communicate failure mechanism, impact, risk, mitigation, and next steps.

What we need to see:

  • Bachelor's degree or equivalent experience in Electrical Engineering, Electronic Engineering, or a related field; Master's degree preferred.

  • 5+ years of hands-on experience in hardware debug, customer return analysis, field failure analysis, or power electronics support for complex electronic systems.

  • Strong understanding of system power delivery, DC-DC converters, multiphase VRs, regulators, power sequencing, current sharing, sense circuits, protection circuits, and high-current low-voltage rails.

  • Proven ability to debug power issues at system, board, and component level by reading schematics, PCB layouts, power trees, design specifications, and test logs.

  • Experience with Linux systems, Linux shell scripts, BMC/IPMI/Redfish-style logs or telemetry, and basic automation for data collection and debug efficiency.

  • Strong analytical and problem-solving skills, including structured troubleshooting, design of experiments, root cause analysis, statistical process control, and quality data analysis.

  • Ability to work across engineering, customer quality, supplier, and customer-facing teams while maintaining clear technical ownership and urgency.

  • Excellent written and spoken English, strong documentation habits, and the ability to explain complex debug findings to both technical and non-technical audiences.

  • High sense of responsibility, self-motivation, collaborative working style, and comfort driving ambiguous technical issues to closure.

Ways to stand out from the crowd:

  • Experience debugging high-power server or data center platforms in customer-return or field-failure analysis workflows.

  • Hands-on familiarity with PSU/PDU behavior, rack-level power distribution, power capping, power transients, or data center deployment conditions observed in field returns.

  • Experience with board-level power design, hardware verification, power integrity measurement, or design-for-debug improvements.

  • Knowledge of quality and reliability concepts, 8D problem solving, customer failure reporting, RMA/FA workflow, and supplier corrective action processes.

  • Ability to confirm, bound, and translate power-related field failures into corrective actions, debug playbooks, and prevention feedback for design, customer quality, and supplier teams.

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, with a genuine passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 116,000 USD - 184,000 USD for Level 3, and 148,000 USD - 235,750 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 18, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

HQ

NVIDIA Santa Clara, California, USA Office

2701 San Tomas Expressway, Santa Clara, CA, United States, Santa Clara

Similar Jobs

12 Days Ago
In-Office
Santa Clara, CA, USA
192K-264K Annually
Senior level
192K-264K Annually
Senior level
Artificial Intelligence • Semiconductor • Manufacturing
Lead overall quality management for Component & Services divisions: manage quality personnel, embed quality in design, drive customer complaint investigations and CAPA, set performance metrics, support business improvement projects, and recruit/train the quality organization to ensure consistent manufacturing quality.
13 Days Ago
In-Office
San Jose, CA, USA
120K-155K Annually
Senior level
120K-155K Annually
Senior level
Aerospace
Lead quality engineering for airframe and composite components across design, validation, and production. Develop inspection plans, APQP artifacts, FAI per AS9102, SPC, root-cause/CAPA, metrics, and support FAA conformity and continuous improvement.
Top Skills: ApqpAs6500As9100As9102As9103As9145CadDfmErpFmeaGd&T Asme Y14.5MesMicrosoft BiMsaPfmeaPlmPpapSAPSigmaSpcSQL
4 Days Ago
Hybrid
114K-131K Annually
Senior level
114K-131K Annually
Senior level
Aerospace • Hardware • Machine Learning • Robotics • Software
Responsible for overseeing product quality from design to production for complex electro-mechanical assemblies, managing nonconformances, and implementing quality practices while collaborating with cross-functional teams.
Top Skills: As9100ExcelIso 9001Oracle ErpPower BIQms SoftwareTableau

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account