HP IQ Logo

HP IQ

Senior Machine Learning Engineer – Fine-Tuning and On-device AI

Reposted 14 Days Ago
In-Office
Palo Alto, CA, USA
120K-215K Annually
Mid level
In-Office
Palo Alto, CA, USA
120K-215K Annually
Mid level
As an AI Engineer, contribute to HP's intelligent operating system by building orchestration architecture for LLM agents, integrating APIs, and collaborating on innovative AI features.
The summary above was generated by AI

Who We Are

HP IQ is HP’s new AI innovation lab. Combining startup agility with HP’s global scale, we’re building intelligent technologies that redefine how the world works, creates, and collaborates.

We’re assembling a diverse, world-class team—engineers, designers, researchers, and product minds—focused on creating an intelligent ecosystem across HP’s portfolio. Together, we’re developing intuitive, adaptive solutions that spark creativity, boost productivity, and make collaboration seamless.

We create breakthrough solutions that make complex tasks feel effortless, teamwork more natural, and ideas more impactful—always with a human-centric mindset.

By embedding AI advancements into every HP product and service, we’re expanding what’s possible for individuals, organisations, and the future of work.

Join us as we reinvent work, so people everywhere can do their best work.

About the Role 

We are seeking a Senior Machine Learning Engineer to lead the fine-tuning, optimization, and deployment of AI models for diverse tasks, with a strong emphasis on on-device inference. You will work on cutting-edge applications such as orchestration, planning, multi-agent coordination, and other intelligent decision-making systems. 

You will be responsible for adapting foundation models (LLMs, multimodal models) to specialized domains, making them fast, accurate, and efficient for resource-constrained environments—while ensuring robustness and safety. 

What You Might Do

  • Model Fine-Tuning & Adaptation 
  • Fine-tune large language models, multimodal models, and task-specific models for orchestration, planning, and any other workflows as defined. 
  • Design and run experiments to improve task accuracy, robustness, and generalization. 
  • Explore and apply methods like full fine-tuning, LoRA, QLoRA and other types of parameter-efficient fine-tuning. 
  • Employee advanced techniques such as QAT, DPO, GRPO to further improve the model quality. 
  • On-Device Optimization 
  • Prune, quantize and compress models (e.g., INT8, INT4, mixed-precision) for CPU, GPU, NPU and edge accelerators. 
  • Optimize models for low-latency inference using frameworks like OpenVINO, ONNX Runtime, QNN etc..
  • Data Pipeline & Deployment 
  • Build robust data pipelines for domain-specific datasets, including synthetic data generation and annotation. 
  • Define evaluation metrics. Perform evaluations and analyze results. 
  • Establish best practices for versioning, reproducibility, and continuous improvement of model performance. 
  • AI Orchestration & Planning 
  • Develop and refine models to support multi-step reasoning, tool orchestration, and decision planning. 
  • Work with stakeholders on orchestrator architecture. 
  • Collaborate with product and research teams to design intelligent, context-aware assistant capabilities. 

Essential Qualifications

  • 7+ years of experience in applied machine learning, including at least 3 years in LLM fine-tuning. 
  • Proficiency in Python and ML frameworks ecosystem (HuggingFace, PyTorch). 
  • Strong understanding of transformer architectures, attention mechanisms, and PEFT techniques. 
  • Experience with on-device inference optimization (OpenVINO, ONNX, QNN). 
  • Familiarity with orchestration/planning architectures and techniques for AI assistants. 
  • Track record of delivering production-ready ML solutions in latency-sensitive environments. 

Preferred Qualifications

  • Experience with multi-agent systems or AI assistant orchestration. 
  • Familiarity with advanced inference optimization techniques such as KV cache paging , flash attention. 
  • Knowledge about common inference engines, including but not limited to llama.cpp, vLLM. 

Salary Range:  $120,000 - $215,000

Compensation & Benefits (Full-Time Employees)

The salary range for this role is listed above. Final salary offered is based upon multiple factors including individual job-related qualifications, education, experience, knowledge and skills.

At HP IQ, we offer a competitive and comprehensive benefits package, including:

  • Health insurance
  • Dental insurance
  • Vision insurance
  • Long term/short term disability insurance
  • Employee assistance program
  • Flexible spending account
  • Life insurance
  • Generous time off policies, including; 
    • 4-12 weeks fully paid parental leave based on tenure
    • 11 paid holidays
    • Additional flexible paid vacation and sick leave (US benefits overview)

Why HP IQ?

HP IQ is HP’s new AI innovation lab, building the intelligence to empower humanity—reimagining how we work, create, and connect to shape the future of work.

  • Innovative Work
    Help shape the future of intelligent computing and workplace transformation.
  • Autonomy and Agility
    Work with the speed and focus of a startup, backed by HP’s scale.
  • Meaningful Impact
    Build AI-powered solutions that help people and organisations thrive.
  • Flexible Work Environment
    Freedom and flexibility to do your best work.
  • Forward-Thinking Culture
    We learn fast, stay future-focused, and imagine what comes next—together.

Equal Opportunity Employer (EEO) Statement

HP, Inc. provides equal employment opportunity to all employees and prospective employees, without regard to race, color, religion, sex, national origin, ancestry, citizenship, sexual orientation, age, disability, or status as a protected veteran, marital status, familial status, physical or mental disability, medical condition, pregnancy, genetic predisposition or carrier status, uniformed service status, political affiliation or any other characteristic protected by applicable national, federal, state, and local law(s).

Please be assured that you will not be subject to any adverse treatment if you choose to disclose the information requested. This information is provided voluntarily. The information obtained will be kept in strict confidence.

If you’d like more information about HP’s EEO Policy or your EEO rights as an applicant under the law, please click here: Equal Employment Opportunity is the Law Equal Employment Opportunity is the Law – Supplement

HQ

HP IQ Palo Alto, California, USA Office

1501 Page Mill Road, Palo Alto, United States, 94304

Similar Jobs

12 Minutes Ago
Remote or Hybrid
Santa Clara, CA, USA
221K-387K Annually
Expert/Leader
221K-387K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead strategy and execution for the Vault data & AI security product bundle, owning roadmap, cross-functional coordination, regulatory compliance, encryption, code signing, log export, and AI-native security features to scale monetizable, enterprise-grade security capabilities and drive adoption.
Top Skills: Agentic SystemsAICode SigningEncryptionIdentity And AuthenticationLog ExportProcess AutomationSecopsServicenow PlatformVault
12 Minutes Ago
Remote or Hybrid
Santa Clara, CA, USA
264K-449K Annually
Expert/Leader
264K-449K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead CEGs global partner strategy and execution for a $300M+ partner portfolio. Own partner governance, commercial management, vendor relationships, and partner-enabled delivery. Drive partner performance, capacity planning, executive relationships, strategic programs, and AI/automation-enabled service models while advising senior leadership and aligning cross-functional stakeholders.
Top Skills: AIAutomationServicenow
12 Minutes Ago
Hybrid
Mountain View, CA, USA
Mid level
Mid level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Design, train, evaluate, and productionize LLM-based NLU and agentic AI systems. Improve model quality, latency, reliability, and safety; fine-tune models, implement agent architectures, and collaborate across teams to deploy scalable, privacy-preserving conversational AI.
Top Skills: DpoGoGraph Of ThoughtsHybrid Vector DatabasesLarge Language ModelsMac Development EnvironmentMultimodal Foundation ModelsPythonRlaifRlhfTree Of Thoughts

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account