Founding AI Frameworks Engineer

Sorry, this job was removed at 08:11 a.m. (PST) on Wednesday, Jul 16, 2025

Be an Early Applicant

In-Office

San Francisco, CA

In-Office

San Francisco, CA

Similar Jobs

Boeing

Financial Analyst

58 Seconds Ago

In-Office

Long Beach, CA, USA

71K-122K Annually

Junior

71K-122K Annually

Junior

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing

As a Procurement Financial Analyst, you will analyze supplier proposals, assess costs, ensure compliance with regulations, and collaborate with the supply chain team. Your role is vital for evaluating bids on military programs and providing insights on pricing strategies.

Boeing

Senior Software Architect

A Minute Ago

In-Office

170K-246K Annually

Senior level

170K-246K Annually

Senior level

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing

The Senior Software Architect will lead software design, development, and analysis for satellite systems, focusing on modernization efforts and aligning with technical standards and customer needs.

Top Skills: AWSC++CSSHibernateHTMLJavaJavaScriptJenkinsJqueryMongoDBOpenapiRdbmsReactRestSpring Framework

Boeing

Design Engineer

A Minute Ago

In-Office

El Segundo, CA, USA

139K-224K Annually

Senior level

139K-224K Annually

Senior level

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing

Design and develop digital electronics for aerospace applications, including circuit design, testing, and compliance with specifications, while providing technical support through the product lifecycle.

Top Skills: Digital CircuitsDigital Control SystemsDigital ElectronicsEmbedded SystemsFirmwareMicroprocessor

Tensorlake is building a distributed data processing platform for developers building Generative AI applications. Our product, Indexify(https://getindexify.ai), enables building continuously evolving knowledge bases and indexes for Large Language Model applications by allowing structured data or embedding extraction algorithms on any unstructured data.

We are building a server-less product on top of Indexify that allows users to build real time extraction pipelines for unstructured data. The extracted data and indexes would be directly consumed by AI Applications and LLMs to power business and consumer applications.

As an AI Frameworks Engineer, you will be responsible for optimizing our AI infrastructure, developing high-performance inference engines, and maximizing GPU utilization. You’ll work on the critical backend architecture that powers our platform’s scalability and performance, collaborating with both researchers and product engineers to ensure Tensorlake’s models run efficiently on a variety of hardware configurations.

Responsibilities

As an AI Frameworks Engineer, your focus will be on optimizing and building high-performance AI systems. You will:

Design and build custom inference engines optimized for high throughput and low latency.
Optimize GPU usage across our platform, ensuring that deep learning models run efficiently at scale.
Write and optimize custom CUDA kernels and other low-level operations to accelerate deep learning workloads.
Develop and implement techniques for model compression, including quantization and pruning, to make models more efficient for real-world deployment.
Collaborate with research scientists and engineers to integrate new models into Tensorlake’s platform while ensuring peak performance.
Utilize cuDNN, cuBLAS, and other GPU-accelerated libraries to optimize computational workloads.
Troubleshoot and debug performance bottlenecks using tools like nvprof and Nsight, and implement fixes to improve throughput and memory usage.
Work on scaling AI models to multiple GPUs and nodes using NCCL and other parallel computing techniques.

Basic Qualifications

5+ years of experience in building and optimizing AI models for performance at scale.
Strong knowledge of deep learning frameworks such as TensorFlow, PyTorch, or JAX, with experience optimizing them for hardware.
Proficiency in GPU programming with CUDA, OpenCL, or similar parallel computing frameworks.
Expertise in writing custom CUDA kernels to optimize deep learning operations.
Experience with inference engines such as TensorRT, and understanding of model deployment optimization.
Software engineering proficiency in C/C++, Python, and low-level system components like memory management and concurrency.
Experience in using profiling tools like nvprof, Nsight, and other debugging tools for performance tuning.

Benefits

- Ability to save in 401(k) plans

- Comprehensive Healthcare and Dental Benefits

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Google, Apple, Salesforce, Meta
Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine