Roboflow Logo

Roboflow

Infrastructure Engineer

Reposted 4 Days Ago
In-Office or Remote
2 Locations
165K-200K Annually
Mid level
In-Office or Remote
2 Locations
165K-200K Annually
Mid level
As an Infrastructure Engineer, you will secure, scale, and maintain core infrastructure, collaborate across teams, and optimize machine learning workflows.
The summary above was generated by AI
Who We Are

Our mission is to make the world programmable. Sight is one of the key ways we understand the world, and soon this will be true for the software we use, too.

We’re building the tools, community, and resources needed to make the world programmable with artificial intelligence. Roboflow simplifies building and using computer vision models. Today, over 1M+ developers, including those from half the Fortune 100, use Roboflow’s machine learning open source and hosted tools. That includes counting cells to accelerate cancer research, improving construction site safety, digitizing floor plans, preserving coral reef populations, guiding drone flight, and much more.

Roboflow is supported by great customers and investors, having raised over 63 million from Y Combinator, Google Ventures, Craft Ventures, Sam Altman, Lachy Groom, amongst other leading software investors.

Roboflowers love building great things with passionate teammates. We value ownership, accountability, and a bias toward action—whether it's a big initiative or a small fix. You’re naturally curious, hands-on with new tech (maybe even played with ChatGPT or AI products early on), and prefer to show your work over talking about it. Many of us have founder mindsets and thrive in Roboflow’s high-autonomy environment—some even started as side hustlers in school.

What We're Looking For

Primarily, you like to make great things with passionate colleagues. You are someone that likes to own outcomes, not only inputs. You’re motivated by having responsibility and accountability. You’re eager to ‘do the work,’ big and small.

You’re curious and learning about new technologies, perhaps an early tinkerer with MLOps products. You show more than you tell.

You’re motivated by the question, “How can I improve this?” and have a track record of doing so, even in ways adjacent to your role. Much of our current team is made up of former founders and thrive in the level of autonomy at Roboflow. Maybe you had a side hustle in high school or college.

Many Roboflowers have used our tools before joining. One of the best ways to stand out amongst other applicants is to write about something you have built with Roboflow or contribute to one of ouropen source projects.Likewise we highly value users with meaningful contributions to successful open source devtool and security projects.

What You'll Do

As a member of our infrastructure team, you'll be at the heart of a fast-paced startup environment. Your primary focus will be on striking the right balance between rapid delivery, high reliability, and robust security. This isn't a traditional, siloed role; you'll need to wear many hats—acting as an infrastructure engineer one moment, and a developer, or even a security analyst.

You will be securing, scaling, and maintaining the core infrastructure that powers our product. This includes our cloud architecture, databases, file storage, search clusters, microservices, and machine learning pipelines. You'll work closely with our product team and collaborate across the company on product, operations, and customer-facing projects, constantly context-switching to solve the next critical challenge.

Skillset

We're looking for a versatile engineer excited by high-impact challenges. At Roboflow, we are AI-native: we expect our team to use AI to accelerate everything from writing code and fixing bugs to analyzing security, cost, and performance. Experience in some or all of the following areas will be crucial:

  • Production experience with Kubernetes: Building and managing containerized applications at scale.

  • Infrastructure-as-Code (IaC): Using Terraform, Helm charts, bash scripting, and Python to automate everything.

  • Scale & Site Reliability: Operating, monitoring, and scaling large-scale applications (especially in ML/AI) in AWS and/or GCP.

  • Development Skills: Proficiency in Node.js and Python, with the ability to collaborate with full-stack developers on designing and operating SaaS applications.

  • ML/Big Data Ops: Hands-on experience with the infrastructure required for machine learning at scale (GPUs, Docker, Kubernetes) and familiarity with libraries like PyTorch or Tensorflow.

  • CI/CD Automation: Experience with tools like GitHub Actions or Spacelift to build and deploy code efficiently.

  • Pragmatic Security: Awareness of security best practices for cloud operations and how they can be applied to startup environments.

  • AI-Native Engineering: Leveraging LLMs and AI tools to accelerate the development lifecycle—from writing and refactoring code to identifying security vulnerabilities and optimizing infrastructure costs.

 
A Glimpse of Your Work

No two days will be the same. Your tasks will be a blend of strategic projects and hands-on implementation. Examples include:

  • Running and optimizing a high-availability machine learning inference service.

  • Collaborating with customer security teams to ensure secure integration.

  • Developing creative IaC solutions to scale our platform cost-effectively.

  • Working with the engineering team to define SLOs/SLAs and participating in incident response.

  • Improving the Observability and Alerting stack and the processes built around it.

  • Diving deep into our stack to identify and act on cost-optimization opportunities.

  • Contributing code (Python, JavaScript, etc.) as part of a team designing and deploying new product features.

  • Fixing security vulnerabilities and bugs

  • Hardening our systems and processes to meet SOC 2, HIPAA, and GDPR requirements, making us audit-ready.

  • Participating in an on-call rotation to ensure platform reliability.

📅 Within one week, you will…

  • Learn all about computer vision, our product, company, customers, and vision.

  • Ship something substantial to an end user

  • Start learning our infrastructure and security practices.

📅 Within one month, you will…

  • Onboard in person with your manager

  • Build your first computer vision project with Roboflow (if you haven't already)

  • Start contributing to infra-as-code

  • Start working with customers to help with their security questions and onboarding

  • Understand the architecture of Roboflow

📅 Within six months, you will…

  • Attend your first all company onsite

  • Be ramped up on other relevant parts of the Roboflow product.

Who You'll Be Working With

Our team of ~100 attracts talent like executives that wanted to return to building, founders with a 100M+ exit, Roboflow users turned team members, open source contributors, a cyclist who biked across the United States, prolific high school hackers, a CTO from 100+ engineering organization, amongst many exceptional others.

You will directly be working with our Engineering Lead and a team of product, infrastructure and security engineers.

Where You'll Work

Roboflow is distributed across the US and Europe. We currently have Hubs in New York City and San Francisco (and plan to open more as we grow density in new cities). We provide opportunities (like team on-sites in different cities) and resources (like a $4000/yr travel stipend) to work in person with other team members as much as you'd like, while also supporting remote team members. You can work from one of our Hubs (we offer a relocation bonus), work from home, work at co-working spaces, etc. We want you to work where you work best!

When You'll Work

Roboflow primarily operates during the daytime hours in the US and there are some synchronous meetings you’ll be expected to attend each week. Apart from that, we have a flexible schedule that allows you to work collaboratively with other team members and asynchronously when needed.

What You'll Receive

To determine your salary, we use a number of market and data-driven salary sources. We review all salaries every six months to ensure we stay in line with the market.

💰 The target compensation for this role is USD $165,000 base - $200,000 base.

📈 In addition to our cash compensation, we offer generous perks and benefits. Below are some of the highlights:

  • $4000/yr Travel Stipend to travel anywhere anytime to work alongside other Roboflowers

  • $350/mo Productivity stipend to spend on things that make your work environment more productive, like high-speed internet at home or a co-working space

  • Cover up to 100% of your health insurance costs for you and your partner or family

  • Equity in the company so we are all invested in the future of computer vision

Interview Process (~5 hours)

Below is the interview process you can expect for this role. We are all motivated to work with an exceptional team and don't currently have in-house recruiters. You will be speaking directly with our team about what it's like to work and thrive at Roboflow. We like to be decisive and work fast, so don't be surprised if all the below conversations happen over a day or two.

Before the Interview:

  • We’ll review your application, LinkedIn, Github, etc.

  • The best way to stand out is to write about something you’ve built with Roboflow or contribute to one of our open source projects, or highlight your contributions to devtools/infrastructure/security engineering open-source projects.

  • We may send you a technical screen if applicable.

Introduction Phase:

  • [45m] Meet with hiring manager for introduction, Sachin Agarwal, to assess overall mindset and skillset. This first interview is a time to get to know more about the role, allow us to get to know you better, and ensure it's a good fit for both parties to continue moving forward in the process

Team Interview Phase:

  • [45m] Meet with our CTO, Brad Dwyer

  • [90m] Meet with hiring manager and team for a technical infrastructure hands-on interview

Ask questions!

Final Interview Stage:

  • [45m] Meet with Kate Wagner, Head of Operations for a culture discussion

  • [60m] Meet with Joseph Nelson, CEO

  • We check references and conduct a background check

Note: you are welcome to request additional conversations with anyone you would like to meet and we will accommodate as best we can.

Learn More About Us

We are building a diverse Distributed team that is distributed across the globe. Roboflow is an equal opportunity workplace; we welcome people from all backgrounds, communities, and experiences.

We provide competitive compensation and stellar benefits to accelerate your personal and work life. Learn more about what it is like to work at Roboflow by reading these blog posts.

See our careers page for all open listings.

Similar Jobs

8 Days Ago
Remote or Hybrid
USA
140K-215K Annually
Senior level
140K-215K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Design, implement, and maintain scalable hybrid multi-cloud Kubernetes platforms at massive scale. Ensure high reliability, integrate open-source observability tools, provide technical direction, operate large Linux environments across cloud and data centers, handle on-call duties, and mentor junior engineers.
Top Skills: AlertmanagerAWSGCPGoGrafanaKubernetesLinuxOciPrometheusThanos
15 Days Ago
Remote
United States
235K-275K Annually
Senior level
235K-275K Annually
Senior level
Software • Defense
Build, operate, and secure cloud-native and edge infrastructure including Kubernetes, artifact pipelines, application streaming, and air-gapped appliances. Partner with product, GRC, and mission owners to design, deploy, and harden systems, remediate vulnerabilities, support audits (IL5/IL6/JWICS), and ensure trusted artifact delivery across environments.
Top Skills: Air-Gapped AppliancesApplication StreamingArtifact PipelineAWSAws GovcloudAzure GovernmentCve RemediationFedrampGoJwicsKubernetesKubernetes OperatorsAzureMulti-Cluster KubernetesPythonRustSoc 2StigTypescript
15 Days Ago
Remote
United States
180K-235K Annually
Senior level
180K-235K Annually
Senior level
Software • Defense
Design, build, and run secure production infrastructure across cloud-native and air-gapped deployments. Own end-to-end platform outcomes, harden the artifact pipeline for signed releases, embed with teams to advise on security and deployment best practices, and partner with GRC on STIGs, CVE remediation, and audit readiness for classified environments.
Top Skills: Air-Gapped DeploymentsAmazon AwsApplication StreamingArtifact Pipeline (Ci/Cd)Aws GovcloudAzure GovernmentContainer/Image SigningCve RemediationFedrampGoJwicsKubernetesAzureNetwork SegmentationPythonRustSecrets ManagementSoc 2StigTypescript

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account