About this role:
Wells Fargo is seeking a Principal Engineer - GPU & LLM Infrastructure to lead the end-to-end strategy and operations of our enterprise GPU platforms within Digital Technology's AI Capability Engineering group. In this role, you will design and evolve GPU architecture across on-premises and cloud environments, guide POCs through production readiness, and oversee Day-2 operations for large-scale, multi-cloud deployments.
You will serve as the technical authority for Nvidia/Run:AI orchestration, drive alignment with OpenShift AI, and enable high-performance LLM/SLM inferencing using Triton and vLLM. A core part of the role is ensuring our GenAI platforms are secure, resilient, scalable, and fully observable to meet the demands of enterprise-grade AI workloads.
In this role, you will:
Reflected is the base pay range offered for this position. Pay may vary depending on factors including but not limited to demonstrated examples of prior performance, skills, experience, or work location. Employees may also be eligible for incentive opportunities.
$159,000.00 - $305,000.00
Benefits
Wells Fargo provides eligible employees with a comprehensive set of benefits, many of which are listed below. Visit Benefits - Wells Fargo Jobs for an overview of the following benefit plans and programs offered to employees.
5 Feb 2026
* Job posting may come down early due to volume of applicants.
We Value Equal Opportunity
Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.
Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit's risk appetite and all risk and compliance program requirements.
Applicants with Disabilities
To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo .
Drug and Alcohol Policy
Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.
Wells Fargo Recruitment and Hiring Requirements:
a. Third-Party recordings are prohibited unless authorized by Wells Fargo.
b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.
Wells Fargo is seeking a Principal Engineer - GPU & LLM Infrastructure to lead the end-to-end strategy and operations of our enterprise GPU platforms within Digital Technology's AI Capability Engineering group. In this role, you will design and evolve GPU architecture across on-premises and cloud environments, guide POCs through production readiness, and oversee Day-2 operations for large-scale, multi-cloud deployments.
You will serve as the technical authority for Nvidia/Run:AI orchestration, drive alignment with OpenShift AI, and enable high-performance LLM/SLM inferencing using Triton and vLLM. A core part of the role is ensuring our GenAI platforms are secure, resilient, scalable, and fully observable to meet the demands of enterprise-grade AI workloads.
In this role, you will:
- Act as an advisor to leadership to develop or influence GPU buildout for highly complex business and technical needs across multiple groups
- Lead the strategy and resolution of highly complex and unique challenges requiring in-depth evaluation across multiple areas or the enterprise, delivering solutions that are long-term, large-scale and require vision, creativity, innovation, advanced analytical and inductive thinking
- Translate advanced technology experience, an in-depth knowledge of the organizations tactical and strategic business objectives, the enterprise technological environment, the organization structure, and strategic technological opportunities and requirements into technical engineering solutions
- Provide vision, direction and expertise to leadership on implementing innovative and significant business solutions
- Maintain knowledge of industry best practices and new technologies and recommends innovations that enhance operations or provide a competitive advantage to the organization
- Strategically engage with all levels of professionals and managers across the enterprise and serve as an expert advisor to leadership
- Design and implement GPU cluster topologies (H100/H200, NVLink/NVSwitch), networking, and storage paths for high-throughput inferencing; publish sizing and performance baselines.
- 7+ years of Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
- 1+ years of experience with NVIDIA GPU and CUDA ecosystems, including CUDA, cuDNN, NVLink/NVSwitch, MIG, NCCL, GPU profilers, and performance tuning for H100/H200 architectures
- 1+ years of experience with LLM/SLM runtimes, such as vLLM, TensorRT-LLM, and Triton; hands-on work with model quantization (FP8, INT4 AWQ/GPTQ), KV-cache optimization strategies, and disaggregated prefill/decode pipelines
- 1+ years of experience in orchestration and GPU workload management, including GPU resource managers (collections/departments/projects/workloads), OCP/GKE administration, quota management, preemption and fair-share enforcement, GPU scheduling and timeslicing, Helm/Kustomize, upgrade validation, and admission controls
- 1+ years of experience with API and gateway platforms, including Apigee authentication/authorization, quota and rate-limit configuration, OpenAPI specifications, SDK generation, SLA operations, and API versioning/deprecation workflows
- 1+ years of experience in observability and evaluation tooling, including Arize-like systems for tracing and evaluations, SLO development, alerting design, retention/export workflows, and dashboard creation
- 1+ years of experience in performance engineering, including throughput and latency modeling (token/sec, batch shaping, cache policies) and cost/performance optimization strategies for LLM/SLM workloads
- Hybrid onsite at required locations
- No visa sponsorship available.
- No relocation assistance for this position.
Reflected is the base pay range offered for this position. Pay may vary depending on factors including but not limited to demonstrated examples of prior performance, skills, experience, or work location. Employees may also be eligible for incentive opportunities.
$159,000.00 - $305,000.00
Benefits
Wells Fargo provides eligible employees with a comprehensive set of benefits, many of which are listed below. Visit Benefits - Wells Fargo Jobs for an overview of the following benefit plans and programs offered to employees.
- Health benefits
- 401(k) Plan
- Paid time off
- Disability benefits
- Life insurance, critical illness insurance, and accident insurance
- Parental leave
- Critical caregiving leave
- Discounts and savings
- Commuter benefits
- Tuition reimbursement
- Scholarships for dependent children
- Adoption reimbursement
5 Feb 2026
* Job posting may come down early due to volume of applicants.
We Value Equal Opportunity
Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.
Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit's risk appetite and all risk and compliance program requirements.
Applicants with Disabilities
To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo .
Drug and Alcohol Policy
Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.
Wells Fargo Recruitment and Hiring Requirements:
a. Third-Party recordings are prohibited unless authorized by Wells Fargo.
b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.
Top Skills
APIs
Apigee
Cuda
Cudnn
Helm
Kustomize
Mig
Nccl
Nvidia Gpu
Nvlink
Nvswitch
Tensorrt-Llm
Triton
Vllm
Wells Fargo San Francisco, California, USA Office
420 Montgomery St, San Francisco, CA, United States, 94103
Similar Jobs at Wells Fargo
Fintech • Financial Services
The role involves leading the GPU infrastructure strategy, designing architectures, and overseeing operations for high-performance AI workloads. Responsibilities include serving as a technical authority, advising leadership, and ensuring the scalability, security, and performance of the GPU platform.
Top Skills:
APIsArizeCudaCudnnHelmKubernetesMigNcclNvidia GpuNvlinkNvswitchOpenapiTensorrt-LlmTritonVllm
Fintech • Financial Services
The Associate Customer Service Representative will support customers with inquiries about financial products through various communication channels, ensuring a positive experience while adhering to policies and guidelines.
Top Skills:
Computer SystemsCustomer ServiceFinancial ServicesInternetMobileSocial Media Technology
Fintech • Financial Services
The Senior Information Security Engineer will support production and test systems while leading incident response activities, conducting investigations, and providing security consulting. They will design and maintain security solutions, assess vulnerabilities and risk, and collaborate within the team to meet security standards.
Top Skills:
AppdynamicsAWSAzureElasticExcel VbaGCPGrafanaJavaScriptKubernetesLinuxOpenshiftPowershellPythonSplunk
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

