Relace

Infrastructure Engineer

Reposted 6 Days Ago
In-Office
San Francisco, CA, USA
Junior
Design and operate systems for high-performance inference and training infrastructure, focusing on reliability, speed, and cost-efficiency. Collaborate with research and product teams.
The summary above was generated by AI

About Us

Relace is building the models and infrastructure that code agents reach for. We power the fastest model on OpenRouter (10,000 tok/s) and deliver optimized small language models designed for retrieval, application, and core code generation functions.

Our technology supports some of the world’s fastest-moving companies — including Lovable, Figma, and Vercel — as they deploy and scale code generation to hundreds of millions of users. We recently raised our Series A from a16z, and we’re growing quickly.

Our team is made up of mathematicians, physicists, and computer scientists who are deeply passionate about their craft. If you thrive on ambitious technical problems, care about elegant systems design, and want to build the foundation of how code gets written at scale, this is the place for you.

The Role

As an Infrastructure Engineer at Relace, you’ll design and operate the systems that power our high-performance inference and training infrastructure. You’ll work closely with our research and product teams to ensure our models run at scale with reliability, speed, and cost-efficiency. This is a hands-on engineering role where you’ll shape how we build and scale the backbone of modern code generation.

You’ll have the opportunity to:

- Architect and manage the infrastructure powering our ultra-fast inference and training stack.

- Build reliable, efficient systems for deploying and scaling ML workloads globally.

- Work on GPU scheduling, distributed systems, and high-performance cloud deployments.

- Optimize performance and cost across compute, networking, and storage layers.

- Collaborate with world-class engineers to push the limits of what small models can do.

Requirements

- 2+ years of experience writing high-quality production code.

- Strong experience with cloud infrastructure (AWS, GCP, Azure, or equivalent).

- Experience with data science and systems optimization.

- Familiarity with ML infrastructure, GPUs, etc. is a plus.

- Work out of our SF office in FiDi.

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms, and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food, and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
