Architect - Platform Engineer

Quantiphi

Posted 10 Days Ago

Remote

Hiring Remotely in USA

Expert/Leader

Design and scale GenAI/LLM infrastructure for multi-GPU environments. Perform GPU profiling and optimization, manage Slurm/OpenShift/Kubernetes clusters, enable NVIDIA GPU stack, build IaC templates, CI/CD automation, and support production deployments and client engagements for GenAI workloads.

The summary above was generated by AI

While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people, and we take pride in offering them a culture built on transparency, diversity, integrity, learning, and growth.
If working in an environment that encourages you to innovate and excel, not just in your professional life but in your personal life too, interests you, you would enjoy your career with Quantiphi!

About Quantiphi:

Quantiphi is an award-winning, AI-first global digital engineering company that helps the world’s leading Fortune 1000 organizations transform bold ideas into measurable business impact. We go beyond building innovative AI technologies: we solve the problems that matter most to our clients.

Since our founding in 2013, Quantiphi has built a proven track record of turning complex challenges into meaningful outcomes across industries.

Headquartered in Boston, with more than 4,000 professionals worldwide, we partner with global enterprises to deliver large-scale digital, cloud, and AI-driven transformation. #SolvingWhatMatters

We are an Elite and Premier partner to Google Cloud, AWS, NVIDIA, Snowflake, and other leading technology platforms, and our work has been recognized across the industry, including:

3 NVIDIA Partner of the Year awards
3 AWS AI/ML Partner of the Year awards
21x Google Cloud Partner of the Year awards in the past 10 years
3 Snowflake Partner of the Year awards
Rated Leaders by Gartner, Forrester, IDC, ISG, Everest Group and other leading analyst firms

Quantiphi delivers first-in-class AI solutions across Life Sciences, Healthcare, Banking, Financial Services, CPG, Manufacturing, Energy, High-Tech, Telecommunications, and other industries, powered by cutting-edge Generative AI and Agentic AI accelerators.

For more details, visit: Website or LinkedIn Page.

Role: Architect - Platform Engineer

Experience Level: 10+ yrs

Work Location: US East/Canada (Remote)

Role Overview:

We are looking for a highly skilled Architect - Platform Engineer to design, optimize, and scale infrastructure for GenAI and LLM workloads. This role is ideal for someone with deep hands-on experience in GPU profiling, distributed training, and high-performance compute environments. You will work with architects from other specialties, such as data engineering, software engineering, and ML engineering, to create platforms, solutions, and applications that keep pace with the latest trends.

You’ll play a key role in building out GenAI platform foundations, supporting production-grade deployments, and partnering closely with data science, MLOps, and application teams to bring cutting-edge AI solutions to life.

Key Responsibilities:

Design and implement scalable infrastructure for LLM and GenAI workloads across multi-GPU environments
Perform GPU profiling, benchmarking, and performance optimization for distributed training workloads
Manage and schedule compute-intensive jobs using Slurm-based clusters and OpenShift/Kubernetes environments
Enable and optimize the NVIDIA GPU stack (CUDA, cuDNN, NCCL, Triton, RAPIDS, etc.)
Collaborate with cross-functional teams to deploy models in research and production environments
Build and support GenAI pipelines (fine-tuning, RAG, multi-modal inferencing, LLMOps)
Develop reusable infrastructure templates using tools like Terraform and Helm
Contribute to internal innovation (PoCs, workshops) and support client-facing delivery engagements
Develop and deliver automation software required for building & improving the functionality, reliability, availability, and manageability of applications and cloud platforms
Champion and drive the adoption of Infrastructure as Code (IaC) practices and mindset
Design, architect, and build self-service, self-healing, synthetic monitoring and alerting platforms and tools
Automate development and testing processes through CI/CD pipelines (Git, Jenkins, SonarQube, Artifactory, Docker containers)
Build a container hosting platform using Kubernetes
Introduce new cloud technologies, tools, and processes to keep innovating in the commerce area and drive greater business value
Lead technical discussions with clients on architecture design and troubleshooting, and proactively provide solutions as required

Basic Qualifications:

Strong experience with Slurm and distributed training environments
Hands-on expertise with Red Hat OpenShift and/or Kubernetes
Deep knowledge of the NVIDIA GPU ecosystem (CUDA, cuDNN, NCCL, Nsight, Triton/TensorRT)
Strong foundation in Linux systems, performance tuning, and multi-GPU optimization
Experience deploying GenAI workloads (LLM fine-tuning, RAG pipelines, multi-modal systems)
Familiarity with Infrastructure-as-Code tools (Terraform, Ansible)
Experience with cloud GPU environments (GCP, Azure, AWS, OCI) and/or on-prem GPU clusters
Ability to serve as a mentor or guide for senior resources and team leads
Lead the technical discussion regarding architecture design

Other Qualifications (OQs):

Experience with NVIDIA NIMs, DGX systems, or GPU-accelerated containers
Knowledge of LLMOps frameworks and MLOps integration
Familiarity with vector databases and retrieval systems for RAG architectures
Comfortable working in client-facing environments and collaborating with AI solution teams

Healthcare Domain Experience (Nice to Have):

Experience working with FHIR R4, HL7 v2, or SMART on FHIR
Integration with EHR systems (e.g., Epic)
Understanding of HIPAA compliance and healthcare data privacy
Exposure to clinical workflows, CDS Hooks, or patient-facing applications
Experience building clinical decision support systems or healthcare interoperability solutions

What’s in it for YOU at Quantiphi:

Make an impact at one of the world’s fastest-growing AI-first digital engineering companies.
Up-skill and discover your potential as you solve complex challenges in cutting-edge areas of technology alongside passionate, talented colleagues.
Work where innovation happens - work with disruptive innovators in a research-focused organization with 60+ patents filed across various disciplines.
Stay ahead of the curve, immerse yourself in breakthrough AI, ML, data, and cloud technologies and gain exposure working with Fortune 500 companies.

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!
