Tensorlake

Distributed Systems Engineer

Posted 4 Days Ago
In-Office
San Francisco, CA
Senior level
Build core infrastructure for a durable application runtime: implement schedulers, runtimes, and data plane components in Rust; optimize cluster scheduling, resource utilization, and performance; extend SDKs; collaborate across product and engineering from research to production.
The summary above was generated by AI

Our core mission at Tensorlake is to unlock your data wherever it is. We believe that people should have access to the best tools to parse, extract, and manipulate data and to run data applications, so they can spend more time putting knowledge into action.

We’re looking for engineers who want to build the operating system for AI Data Applications and Workflows.

About the role

We're looking for experienced distributed systems engineers to build the core infrastructure for our durable application runtime. This is a systems programming role: you'll be writing the schedulers, runtimes, and data plane components that other engineers build applications on top of.

Some of the things you'll work on in this role:

  • Build and evolve our durable application runtime to support advanced data processing and machine learning workflows
  • Design and implement core components of our cluster scheduler to improve resource utilization, reduce costs, and maximize performance
  • Write systems-level code in Rust for our data plane and execution engine
  • Design and build new capabilities for our SDKs
  • Work closely with the rest of the engineering team to take something from an idea to a polished product

About you

  • You have 5+ years of experience building distributed systems infrastructure—not configuring or operating it, but designing and implementing it from scratch
  • You've written production systems in Rust or other systems programming languages (C, C++, Go at the systems level)
  • You understand how cluster schedulers, databases, and runtimes work at the implementation level because you've built or contributed to them
  • You can autonomously lead, design, and build fault-tolerant systems
  • You enjoy diving deep into performance challenges at the systems level—memory allocation, concurrency primitives, network protocols
  • You want to be part of the entire product development process, from customer research to implementation

This role is not a fit if...

  • Your experience is primarily in DevOps, SRE, or platform operations (Terraform, Kubernetes administration, CI/CD pipelines)
  • You're looking for a role focused on automation, tooling, or infrastructure-as-code rather than building core systems
  • You haven't written substantial code in a systems programming language

Things you should know

  • We’re a startup, and we expect people to be able to wear multiple hats at any given time.
  • We’re distributed across the US and Europe, and everyone is self-sufficient to get work done even when nobody else is around.
  • We do not expect people to work all the time, but we expect everyone to follow up on their commitments.
  • We’re a small team with high ownership and we’re passionate about what we do.
  • Our tech stack is somewhat diverse. You’d mostly be working with Rust, Python, and FoundationDB day to day, but you’ll also need to understand TypeScript, Go, and Terraform well enough to touch parts of our backend infrastructure.


Top Skills

Rust, Python, FoundationDB, TypeScript, Go, Terraform
