Sight Machine Logo

Sight Machine

Infrastructure Engineer

Posted 11 Days Ago
Be an Early Applicant
In-Office
San Francisco, CA, USA
175K-200K Annually
Senior level
In-Office
San Francisco, CA, USA
175K-200K Annually
Senior level
As an Infrastructure Engineer, you'll enhance our Kubernetes-based cloud platform, implement CI/CD pipelines, build AI automation, and ensure strong observability, collaborating closely with development teams to improve infrastructure.
The summary above was generated by AI

Team Culture

Great things happen when people can bring their authentic selves to work. We empower all of our employees to share their perspectives, passions and experiences because collectively we make a better, stronger team. Our team members collaborate closely with peers & cross functional stakeholders throughout the business, our clients on the forefront of digital transformation, and the cutting edge of digital manufacturing thought leadership. 

We take pride in our self-starter culture where employees are enabled and encouraged to achieve their professional goals through leadership guidance, learning and development. Our philosophy is that careers are continuous journeys, and we dedicate time and offer resources so that employees can reach their full potential.

Benefits + Perks

We value you at and outside of work and know your loved ones are important. Our benefits are designed to support you and your family’s health through life’s expected and unexpected events.

Our Benefits Include:

  • Competitive Salary + Stock Options
  • Health Care Coverage + Life Insurance + Health Savings Account + Flexible Spending
  • Account (includes spouse + children)
  • Flexible Vacation Policy
  • Adaptable Working Schedule and Environment
  • Our Perks Include:
  • Casual Dress Attire
  • Hybrid work flexibility
  • Catered Lunches, Snacks and Beverages
  • Commuter Savings Program
  • Company Outings
  • Designated Volunteering Hours + Group Volunteer Events

Sight Machine is proud to be an equal opportunity employer and considers candidates regardless of age, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. Sight Machine also considers qualified applicants regardless of criminal histories, consistent with legal requirements.

About Sight Machine, Inc.

Sight Machine strengthens manufacturers by providing the industry’s only standard data model and system-level visualization capabilities. By integrating all crucial data into a single innovative platform, everyone involved in the fabrication process can visualize, contextualize and examine data in one intuitive interface.

Sight Machine is committed and mission-driven to improve lives, strengthen communities and make the world cleaner through continuously re-envisioning manufacturing processes - making them more efficient, sustainable and absolute. Founded in Michigan in 2011 and expanded to San Francisco in 2012, Sight Machine blends the spirit of technology innovation and the down to earth style of Detroit manufacturing. Our team includes early leadership from Yahoo, Tesla Motors and Oracle. Together, we share wide industry knowledge and a commitment to advance manufacturing to a more sustainable future.

At Sight Machine, you will work with manufacturing leaders in the automotive, medical device, apparel, construction, and pharmaceutical industries. You will have access to, and work with massive amounts of factory floor data to help uncover insights on how customers make products and develop solutions to pressing business problems. The platform solves problems like Extract Transform Load (ETL), information retrieval, data aggregation and analytics, factory automation, distributed computing, and security. 

We place great value on professional, technical, and personal growth in an inclusive, collaborative environment. The ideal candidate will have a passion for technology and a strong can-do attitude.

About the role

Most infrastructure roles ask you to maintain what exists. This one asks you to rethink how it should work.

Sight Machine runs AI and analytics systems on top of some of the most demanding industrial data on the planet: millions of sensor events per minute, dozens of interconnected machines, production environments where downtime has real consequences. The infrastructure holding all of that together is not a solved problem. It is a living system, and we are looking for an engineer who sees that as an opportunity rather than a burden.

As an Infrastructure Engineer, you will own the systems that deploy, monitor, and operate Sight Machine's cloud platform across a global customer base. You will be deep in Kubernetes, Terraform, and CI/CD pipelines, and you will bring AI into that work not as an afterthought but as a core part of how you reduce toil, accelerate automation, and stay ahead of failure. We are not looking for someone who will hold the line. We are looking for someone who will move it.

This is a role for an experienced infrastructure engineer who has made real mistakes in production, learned from them, and built the instincts that only come from that. If you have also been pushing AI tools into your workflow in ways that actually change what you can ship, we want to talk.

What You’ll Actually Work On

In your first year, you can expect to work on problems like these:

  • Owning and evolving our Kubernetes-based cloud infrastructure across Azure and other providers, including fleet management, networking, and cluster operations at scale.
  • Designing and implementing CI/CD pipelines that let the engineering team ship faster and with more confidence, including automated testing, progressive delivery, and rollback capability.
  • Building AI-assisted automation for operational tasks: runbook generation, anomaly triage, alerting logic, and anywhere else we can eliminate repetitive human intervention without sacrificing control.
  • Driving Infrastructure as Code discipline across the platform (Terraform, Helm, FluxCD) so that every environment is reproducible, auditable, and fast to recover.
  • Building and maintaining monitoring and observability infrastructure that gives the team real signal across our stack, from container health to database performance to customer-facing SLAs.
  • Participating in on-call rotation and using every incident as a forcing function to improve the system: better runbooks, better alerting, better automation.
  • Collaborating closely with Development Engineering to close the gap between what gets built and what gets operated well in production.

You will work across a mix of mature systems and active greenfield development. Both require care. We want engineers who can operate what exists reliably while finding the leverage points to make it better.

What We’re Looking For

We care more about what you have operated than where you have worked. Here is what actually matters:

  • 5+ years of professional infrastructure or DevOps engineering experience, with at least some of that at meaningful scale in a cloud-native environment.
  • Deep hands-on experience with Kubernetes and Docker in at least one major cloud provider (Azure, GCP, AWS). You have run clusters in production and have the scars to prove it.
  • Strong IaC fluency with Terraform, Helm, FluxCD, or similar. You write infrastructure the way developers write code: versioned, reviewed, and tested.
  • Real fluency with AI development tools. Not just autocomplete. You have used AI to write automation scripts, draft runbooks, accelerate incident triage, or build internal tooling. Show us how it has actually changed your output.
  • Solid coding ability in at least one scripting or systems language (Python, Go, or similar). You write tools, not just configs.
  • Strong Linux fundamentals and a working knowledge of networking: TCP/IP, DNS, load balancing, and how things break when they should not.
  • Experience with monitoring and alerting stacks: Prometheus, Sentry, Opsgenie, or equivalent. You build observability that gives people real signal, not noise.
  • A track record of on-call participation and a philosophy around incident response that leads to improvement, not just resolution.
  • Clear, direct communication. You can write a postmortem, a runbook, or a design doc that people actually read.
  • A bias for action. You have made decisions under uncertainty, taken the risk, and adjusted when you were wrong. Endless planning is not your style.

Nice to Have

  • Familiarity with our current stack: Kubernetes, FluxCD, Terraform, Helm, Prometheus, Elasticsearch, Kafka, PostgreSQL, Jenkins.
  • Experience with Python and Java in the context of platform tooling or automation.
  • Prior work in industrial IoT, manufacturing, or operational technology environments.
  • Experience managing infrastructure for multi-tenant SaaS platforms.
  • An active GitHub or open-source presence that shows how you approach technical problems when no one is watching.

Growth & Mentorship

Infrastructure work at Sight Machine gives you access to an uncommon set of problems: real-time data pipelines at an industrial scale, heterogeneous customer environments, and the challenge of building systems reliable enough that plant operators can trust them. The problems here transfer.

You will work alongside engineers who take craft seriously, push back in design reviews, and invest in making each other better. Senior engineers here move fast because they have earned trust, not because they have seniority. If you know where you want to be in two to three years, we want to help you get there.

HQ

Sight Machine San Francisco, California, USA Office

243 Vallejo St, San Francisco, United States, 94111

Similar Jobs

6 Days Ago
Hybrid
San Francisco, CA, USA
135K-240K Annually
Senior level
135K-240K Annually
Senior level
Blockchain • Cloud • Fintech • Information Technology • Software • Cryptocurrency • Web3
Design and manage scalable cloud infrastructure for a blockchain developer platform, focusing on reliability, observability, and automation while mentoring team members.
Top Skills: ArgocdAWSClaude CodeCodexCursorDnsGCPGrafanaIpamIstioKubernetesN8NOpentelemetryPrometheusTerraformVpc
4 Minutes Ago
Remote or Hybrid
California, USA
Senior level
Senior level
Artificial Intelligence • Information Technology • Robotics • Defense
Design and operate secure internal infrastructure across Azure and AWS, focusing on GitOps, Kubernetes, observability, IoT, and ML workloads.
Top Skills: AWSAzureAzure GovernmentEvent ProcessingGitopsGrafanaHelmIotKinesisKubernetesMl WorkloadsOpentofuTerraform
32 Minutes Ago
In-Office
100K-130K Annually
Senior level
100K-130K Annually
Senior level
Software • Energy • Solar • Renewable Energy
The IT Infrastructure Engineer is responsible for the design, deployment, configuration, and support of IT and OT infrastructure for SCADA systems, ensuring reliability and security across various platforms and collaborating with multiple teams.
Top Skills: Active DirectoryFirewallsHyper-VScada SystemsTcp/IpVlansVMwareVpnsWindows Server

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account