Applied Intuition Logo

Applied Intuition

ML Runtime Optimization Engineer

Reposted 2 Days Ago
In-Office
Sunnyvale, CA, USA
159K-199K Annually
Mid level
In-Office
Sunnyvale, CA, USA
159K-199K Annually
Mid level
Optimize machine learning models for embedded environments, focusing on performance, efficiency, and deployment strategies across various platforms.
The summary above was generated by AI
About Applied Intuition
Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon Valley company is creating the digital infrastructure needed to bring intelligence to every moving machine on the planet. Applied Intuition services the automotive, defense, trucking, construction, mining and agriculture industries in three core areas: tools and infrastructure, operating systems, and autonomy. Eighteen of the top 20 global automakers, as well as the United States military and its allies, trust the company’s solutions to deliver physical intelligence. Applied Intuition is headquartered in Sunnyvale, California, with offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Learn more at applied.co.

We are an in-office company, and our expectation is that employees primarily work from their Applied Intuition office 5 days a week. However, we also recognize the importance of flexibility and trust our employees to manage their schedules responsibly. This may include occasional remote work, starting the day with morning meetings from home before heading to the office, or leaving earlier when needed to accommodate family commitments.

About the role

We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production-grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, Triton).

At Applied Intuition, you will:
  • Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms 
  • Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers
  • Work on model pruning and quantization, and support deployment on memory constrained platforms
  • Collaborate closely with ML engineers and software developers on technical efforts to find and optimize efficient model architecture solutions
  • Set up methodologies to profile the model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration 
We're looking for someone who has:
  • Bachelors in Electrical Engineering or Computer Science, OR B.Sc. in Computer Science, Mathematics, Physics or a related field
  • 3+ years of experience with ML accelerators, GPU, CPU, SoC architecture and micro-architecture
  • Strong software development skills with the focus on embedded programming
  • Experience profiling and optimizing model performance on embedded compute platforms
  • Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX, etc.)
Nice to have:
  • M.Sc or PhD in a ML related area
  • Built an ML optimization framework from scratch before
  • Deployed ML solutions to embedded chips for real time robotics applications

Compensation at Applied Intuition for eligible roles includes base salary, equity, and benefits. Base salary is a single component of the total compensation package, which may also include equity in the form of options and/or restricted stock units, comprehensive health, dental, vision, life and disability insurance coverage, 401k retirement benefits with employer match, learning and wellness stipends, and paid time off. Note that benefits are subject to change and may vary based on jurisdiction of employment.

Applied Intuition pay ranges reflect the minimum and maximum intended target base salary for new hire salaries for the position. The actual base salary offered to a successful candidate will additionally be influenced by a variety of factors including experience, credentials & certifications, educational attainment, skill level requirements, interview performance, and the level and scope of the position.

Please reference the job posting’s subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the location listed is: $159,053 - $199,295 USD annually. 

Don’t meet every single requirement? If you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.

Applied Intuition is an equal opportunity employer and federal contractor or subcontractor. Consequently, the parties agree that, as applicable, they will abide by the requirements of 41 CFR 60-1.4(a), 41 CFR 60-300.5(a) and 41 CFR 60-741.5(a) and that these laws are incorporated herein by reference. These regulations prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities, and prohibit discrimination against all individuals based on their race, color, religion, sex, sexual orientation, gender identity or national origin. These regulations require that covered prime contractors and subcontractors take affirmative action to employ and advance in employment individuals without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status or disability. The parties also agree that, as applicable, they will abide by the requirements of Executive Order 13496 (29 CFR Part 471, Appendix A to Subpart A), relating to the notice of employee rights under federal labor laws.

Applied Intuition Sunnyvale, California, USA Office

157 S Murphy Ave, Sunnyvale, CA, United States

Similar Jobs

8 Days Ago
In-Office
Mountain View, CA, USA
213K-263K Annually
Senior level
213K-263K Annually
Senior level
Automotive
Lead collaboration to improve ML workloads on cloud and self-driving cars, optimize models, and mentor junior engineers.
Top Skills: C++CudaJaxJaxPythonPyTorchTensorFlowTritonXla
An Hour Ago
In-Office
180K-231K Annually
Senior level
180K-231K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Information Technology • Software • Defense • Manufacturing
Lead design and implementation of an AI-first enterprise platform (Hyperdrive) to automate aerospace operations. Build full-stack, data-dense React interfaces, scalable distributed systems, data pipelines, APIs, and AI integrations (LLMs, agents, RAG). Mentor engineers, set architecture, and collaborate with hardware, supply chain, and finance to turn operational bottlenecks into automated workflows.
Top Skills: Agentic WorkflowsAPIsAWSAzureCloud-NativeData LakeETLGoJavaScriptKubernetesLlmsMicroservicesPostgresPythonRagReactReal-Time ProcessingSnowflakeTypescript
An Hour Ago
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
201K-279K Annually
Expert/Leader
201K-279K Annually
Expert/Leader
Fintech • Machine Learning • Mobile • Security • Software
Lead creative strategy and execution for Growth and Product Marketing, managing a multidisciplinary team to produce performance-driven paid social, video, web, and lifecycle creative. Build scalable toolkits, AI-enabled workflows, and production systems that increase speed, personalization, and measurement while maintaining brand quality and creative excellence.
Top Skills: Agent-Powered SystemsAi Creative ToolsDisplay AdvertisingDrtvPaid SocialPmm ToolkitsSemStreamingVideo Production

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account