Astera Labs Logo

Astera Labs

AI/ML Engineer

Posted 7 Days Ago
In-Office
San Jose, CA, USA
140K-165K Annually
Mid level
In-Office
San Jose, CA, USA
140K-165K Annually
Mid level
Build and deploy production AI/LLM systems and agentic workflows for engineering productivity, diagnostics, search, and automation. Design retrieval, tool integration, evals, failure handling, and reusable skills; collaborate with infrastructure and domain teams to monitor and improve systems in production.
The summary above was generated by AI

Astera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizations to unlock the full potential of modern AI. Astera Labs’ Intelligent Connectivity Platform integrates CXL®, Ethernet, NVLink, PCIe®, and UALink™ semiconductor-based technologies with the company’s COSMOS software suite to unify diverse components into cohesive, flexible systems that deliver end-to-end scale-up, and scale-out connectivity. The company’s custom connectivity solutions business complements its standards-based portfolio, enabling customers to deploy tailored architectures to meet their unique infrastructure requirements. Discover more at www.asteralabs.com.

AI/ML Engineer

Location: San Jose, CA
Experience: 1–5 years
Team: Applied AI


The role

We’re hiring an AI/ML Engineer to build production AI systems for technical users. This is an applied engineering role for someone who can take modern model capabilities and turn them into reliable systems that people actually use.

The core problems in this role are the same ones that matter in modern applied AI: getting the right context into the system, making tool use reliable, designing useful abstractions around skills and workflows, building evals that reflect real tasks, and iterating until the system is good enough to become part of a team’s daily workflow.

In practice, you might work on coding agents in terminal and IDE environments, verification and debug assistants, log-analysis systems tied to real product diagnostics, documentation and spec-comparison agents, or internal assistants that operate over company knowledge and engineering data. You will be expected to think end-to-end: prompt and context design, retrieval quality, tool interfaces, evals, failure modes, deployment, and ongoing improvement.


What you’ll do
  • Build AI applications and agentic workflows for engineering productivity, diagnostics, search, documentation, and workflow automation.
  • Design systems that combine LLMs with retrieval, tool use, structured outputs, and evaluation loops.
  • Integrate models with internal tools, APIs, CLIs, MCP interfaces, and operational workflows so they can do useful work in real environments.
  • Improve system quality through eval design, prompt and context iteration, model selection, failure analysis, and human feedback.
  • Build reusable skills, workflows, and abstractions so useful capabilities can be shared across agents and teams instead of rebuilt from scratch.
  • Work closely with infrastructure and domain teams to deploy, monitor, and continuously improve AI systems in production.
What we’re looking for
  • 1–5 years of experience in software engineering, applied AI, ML engineering, or related backend/platform roles.
  • Strong Python skills and strong production engineering fundamentals.
  • Hands-on experience building AI/LLM applications, agents, retrieval-backed systems, or workflow automation.
  • Comfort working with tool-using systems where correctness depends on context quality, tool integration, and careful failure handling.
  • Experience with AWS or GCP and the realities of deploying and debugging production AI services.
  • Good judgment around evals, failure modes, latency/cost tradeoffs, and safe rollout of non-deterministic systems.
  • Clear communication and the ability to turn ambiguous technical workflows into robust product behavior.
What strong candidates often look like

They have built more than demos. They have worked on systems where retrieval quality matters, where tool use can fail in subtle ways, where evaluation changes engineering decisions, and where product usefulness depends as much on system design as on model choice. They usually care about the details that separate a clever prototype from a dependable system.


Why this role is interesting

The team’s direction is very concrete: enterprise search, coding agents, workspace automation, customized skills, and agentic applications for specific engineering problems, all measured against real usage and outcomes. This role sits directly in that path. If you want to build applied AI systems that are ambitious but grounded in real workflows, technical users, and fast feedback loops, this is that job.

 

The base pay range for this position is $140,000 - $165,000 

We know that creativity and innovation happen more often when teams include diverse ideas, backgrounds, and experiences, and we actively encourage everyone with relevant experience to apply, including people of color, LGBTQ+ and non-binary people, veterans, parents, and individuals with disabilities.

HQ

Astera Labs Santa Clara, California, USA Office

Santa Clara, CA, United States, 95054

Similar Jobs

Yesterday
In-Office
Mountain View, CA, USA
160K-240K Annually
Senior level
160K-240K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Machine Learning • Software • Business Intelligence • Data Privacy
Build and productionize ML algorithms and high-performance pipelines for large-scale structured, tabular, relational, and graph data. Translate research into prototypes and production-ready systems, design benchmarks and evaluation harnesses, optimize training/inference (cost, latency, throughput), and collaborate across research and engineering to iterate from prototype to production.
Top Skills: C++CudaDistributed SystemsJaxPythonPyTorchRustTensorFlow
18 Days Ago
Remote or Hybrid
Sunnyvale, CA, USA
189K-321K Annually
Senior level
189K-321K Annually
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Lead end-to-end ML and perception efforts for future sensing in autonomous vehicles: design sensor architectures, multi-modal fusion, model development, metrics, simulation-driven evaluation, system integration, and production readiness while mentoring engineers and building scalable data and training infrastructure.
Top Skills: SparkCameraLidarNumpyPandasPythonPyTorchRadarSimulationSynthetic Data
18 Days Ago
Remote or Hybrid
Sunnyvale, CA, USA
182K-251K Annually
Senior level
182K-251K Annually
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Develop and evaluate ML and perception models for future sensing and autonomous driving. Work on detection, tracking, classification, sensor fusion, training pipelines, validation, robustness metrics, and integrate models into onboard systems while collaborating with cross-functional teams.
Top Skills: SparkCameraLidarMulti-Modal Sensor FusionNumpyPandasPythonPyTorchRadar

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account