Staff Software Engineer, Anywhere Cloud - AI Systems & Runtimes

Cloudera

Reposted 8 Days Ago

In-Office

San Jose, CA, USA

184K-230K Annually

Mid level

The Staff Software Engineer will lead the development of cloud-native AI platforms, optimizing AI workload deployment in Kubernetes environments and collaborating with cross-functional teams.

The summary above was generated by AI

Business Area:

Engineering

Seniority Level:

Mid-Senior level

Job Description:

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

Ready to take cloud innovation to the next level? Join Cloudera’s Anywhere Cloud team and help deliver a true “build your own pipeline, bring your own engine” experience — enabling data and AI workloads to run anywhere, without friction or vendor lock-in.

We bring the best of public cloud — cost efficiency, scalability, elasticity, and agility — to wherever data lives: public clouds, private data centers, and the edge. Powered by Kubernetes, our hybrid architecture separates compute and storage to maximize flexibility and optimize infrastructure usage.

This isn’t just cloud management — it’s about building a consistent, secure, and compliant cloud experience that gives organizations full access to all their data, anywhere.

With the acquisition of Taikun, we’re simplifying Kubernetes and cloud management even further, creating a unified, scalable, future-ready platform. If you’re passionate about Kubernetes — not just using it, but building it at the core, managing workloads across hybrid clouds and data centers, and obsessing over performance and DevOps — this is where you belong.

We are seeking a Staff Software Engineer to lead the architecture and delivery of our cloud‑native AI platform. In this high‑impact role, you will bridge the gap between cutting‑edge AI research and production‑grade Kubernetes environments. You will build the “nervous system” of our AI stack—optimizing how we run and manage open‑source models (Llama, Qwen, etc.) using K8s‑native patterns like Custom Resources (CRDs) and Operators, enabling agentic AI to thrive, and designing integration patterns that let our product teams and customers consume AI capabilities seamlessly.

As a Staff Software Engineer, you will:

Enterprise AI Services: Design and implement elegant, scalable application services (Go/Node.js) that wrap AI capabilities for enterprise use.
K8s-Native AI Orchestration: Lead the deployment of inference servers (vLLM, Triton) using KServe, KubeRay, or Knative to ensure serverless-style scaling for AI workloads.
Developer Velocity: Build internal tooling, SDKs, and "AI Gateways" that enhance team agility and simplify the integration of Foundation Models (Llama, GPT) into product features.
RAG & Prompt Engineering: Architect robust Retrieval-Augmented Generation (RAG) pipelines and prompt management services that integrate seamlessly with vector databases and enterprise data sources.
Cross-Functional Collaboration: Partner with UI engineers, UX designers, and Product Management to ensure the AI platform is not just powerful, but highly usable for internal developers.
Infrastructure & Security: Ensure AI workloads are secure, multi-tenant, and optimized for GPU resource scheduling (MIG, fractional GPUs) within Kubernetes.

We’re excited about you if you have:

Bachelor’s degree with 6+ years of software engineering experience (or equivalent Master’s/PhD tenure), including at least 2 years focused on AI/ML systems.
Expert proficiency in Python (for the AI ecosystem) and strong competence in a systems language such as Go, Rust, or C++ (for high-performance serving layers).
Deep understanding of LLM deployment challenges and runtimes (e.g., vLLM, ONNX, TorchServe, Triton). Familiarity with quantization techniques (AWQ, GPTQ) to optimize model size/speed.
Experience building complex workflows using tools like LangChain or LlamaIndex, and deploying them on containerized infrastructure (Docker/Kubernetes).
Ability to navigate the rapidly changing AI landscape, filtering hype from practical engineering solutions, and driving technical alignment across teams.

You May Also Have:

Model Fine-Tuning: Experience with efficient fine-tuning techniques (PEFT, LoRA/QLoRA) on custom datasets.
GPU Optimization: Familiarity with CUDA programming or GPU performance profiling (Nsight Systems).
Open Source: Contributions to open-source AI projects (Hugging Face Transformers, vLLM, etc.).

Why this role matters:

This is more than cloud management; it’s about building the foundation for a consistent, secure, and compliant cloud experience that gives organizations 100% access to 100% of their data, anywhere.

With the recent acquisition of Taikun, we are simplifying Kubernetes and cloud management even further, creating a platform that is unified, scalable, and future-ready.

If you are passionate about Kubernetes (not just using it, but building it at the core), about managing workloads across hybrid clouds and data centers, and about performance and DevOps, this is where you belong.

This role is not eligible for immigration sponsorship.

The anticipated annual base salary range for this position is:

California: $184,000 - $230,000

Individual compensation within the published range is determined by the candidate's skills, experience, qualifications, and primary work location. In addition to base pay, sales roles are eligible for Cloudera's commission plan, while non-sales roles are eligible for the corporate incentive plan. All employees receive a comprehensive benefits package.

What you can expect from us:

Generous PTO Policy
Support for work-life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Paid Volunteer Time
Employee Resource Groups

EEO/VEVRAA

Santa Clara, CA, United States

6220 America Center Dr, 5th Floor, San Jose, California, United States, 95002
