
SK hynix

Sr. Staff AI Engineer - On-Prem AI Infrastructure & Agentic Systems

Posted 2 Days Ago
In-Office
San Jose, CA, USA
140K-165K Annually
Senior level
Design, deploy, and maintain on-prem AI infrastructure and agentic systems. Build GPU clusters, model serving, vector DB-backed RAG pipelines, fine-tune and deploy models, implement governance (MCP), and automate CI/CD and monitoring to integrate AI capabilities into enterprise workflows.
The summary above was generated by AI

About the Company:

At SK hynix Memory Solution, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we advance mobile technology, empower cloud computing, and pioneer future technologies. Our cutting-edge memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape.

We're looking for innovative minds to join our mission of shaping the future of technology. At SK hynix Memory, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change; we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to join us in creating the next generation of memory solutions that will define the future of computing.


Why Join Us?

  • Build foundational AI infrastructure that powers next-gen enterprise systems.
  • Work on cutting-edge agentic AI — not just chatbots, but autonomous systems that reason, plan, and act.
  • Influence AI strategy, deployment, and governance in a high-impact environment.

About the Role:

We are seeking a hands-on AI Engineer to design, deploy, and maintain on-prem AI infrastructure and build agentic AI systems that drive real-world automation. You’ll be responsible for setting up scalable AI environments, implementing RAG pipelines, fine-tuning embedded models, and architecting AI agents that operate autonomously in enterprise settings. This role sits at the intersection of AI systems engineering and applied ML — you’ll bridge infrastructure, model deployment, and agent logic.


Responsibilities:

  • Design and deploy on-prem AI infrastructure — including GPU clusters, model serving (e.g., vLLM, TGI, Triton), vector DBs (e.g., Milvus, Qdrant, FAISS), and orchestration (Kubernetes, Helm, Docker).
  • Build and optimize RAG pipelines — including document chunking, retrieval strategies (hybrid, re-ranking), and evaluation of retrieval accuracy and latency.
  • Develop agentic AI systems — design stateful agents with memory, tool use, and planning capabilities (e.g., using LangGraph, AutoGen, or custom frameworks).
  • Fine-tune and deploy embedded models — work with LoRA, QLoRA, or full fine-tuning for domain-specific tasks; optimize for edge/on-device inference.
  • Implement Model Control Protocols (MCP) — ensure model governance, versioning, access control, and monitoring for production AI systems.
  • Collaborate with product and engineering teams to integrate AI capabilities into enterprise workflows — especially in storage, QA, or systems engineering contexts.
  • Automate and monitor AI pipelines — build CI/CD for model deployment, logging, and performance tracking.

Minimum Qualifications:

  • 2+ years of experience in AI/ML engineering, with hands-on deployment of AI systems on-prem or private cloud.
  • Proven experience building agentic AI systems — including state management, tool integration, and multi-step reasoning.
  • Strong working knowledge of RAG architectures — chunking, retrieval, re-ranking, evaluation metrics.
  • Experience with model fine-tuning (LoRA, QLoRA, full fine-tuning) and embedding models for retrieval.
  • Familiarity with Model Control Protocols (MCP) or similar governance frameworks (model versioning, access control, audit trails).
  • Proficiency in Python, Linux, Docker/Kubernetes, and vector databases (e.g., Milvus, Qdrant, Pinecone).
  • Experience with AI serving frameworks (vLLM, TGI, Triton, Ollama, etc.).

Preferred Qualifications:

  • Experience deploying AI in enterprise storage or hardware-adjacent environments.
  • Background in systems engineering or QA automation — bonus if you’ve used AI to automate testing or validation.
  • Familiarity with embedded AI or edge inference (ONNX, TensorRT, GGUF, etc.).
  • Experience with AI agent frameworks (LangGraph, AutoGen, BabyAGI, etc.).
  • Knowledge of AI observability tools (LangSmith, Weights & Biases, Prometheus/Grafana for AI).
  • Because we are a storage company, knowledge of the storage area or NVMe is a plus.

Education Requirement:

  • Bachelor of Science in CS, EE, ME, or another applicable engineering field.

COMPENSATION: $140,000/yr - $165,000/yr


REGARDING COMPENSATION:

SK hynix memory solutions America Inc. offers you the opportunity to apply your skills to exciting projects while working with innovative teams. Our compensation package is complemented by a generous benefits package including medical, dental, vision, and life insurance and a company 401(k) match, as well as a cafeteria, an onsite gym, and much more. If you are motivated by technical challenges, we offer a collaborative work environment that encourages career growth.

The salary offered to a selected candidate will be tailored based on several factors, including location, job grade, relevant knowledge, skills, and experience. We also take into account internal equity among our current team members to ensure fairness and competitiveness.

SK hynix Santa Clara, California, USA Office

Santa Clara, United States
