Bretton AI Logo

Bretton AI

Software Engineer, Infrastructure

Reposted 24 Days Ago
Be an Early Applicant
In-Office
San Francisco, CA, USA
168K-213K Annually
Senior level
In-Office
San Francisco, CA, USA
168K-213K Annually
Senior level
The Senior Infrastructure Engineer will own and evolve the Kubernetes infrastructure, design deployment pipelines, manage customer engagements, and ensure compliance with SOC 2 standards while automating secure infrastructure solutions.
The summary above was generated by AI
About Bretton AI

Bretton AI is the leading AI agent platform for financial services. Companies like Robinhood, Mercury, and Gusto trust us to automate mission-critical work, starting with anti-money laundering (AML) and counter-terrorist financing investigations.

We've raised over $95M from Greylock, Y Combinator, Thomson Reuters Ventures, and other top-tier investors. We're based in downtown San Francisco and our team comes from world-class organizations like SpaceX, Google, Netflix, Stripe, Plaid, and more.

The Role

As a Senior Infrastructure Engineer, you will own the foundation that enables us to deploy secure, compliant AI systems at major financial institutions fighting financial crime at a massive scale. Our infrastructure is built on a modern, container-native architecture, leveraging Docker and Kubernetes to deliver consistent, auditable deployments across diverse customer environments.

You will work directly with our largest customers—institutions serving over a billion people—to architect, automate, and harden our on-premises and cloud environments to meet the strictest regulatory and performance requirements, including SOC 2 compliance. Your work will be informed by real customer needs and will ship to everyone, so you must build enterprise-grade systems, work effectively with engineering and customer teams, understand financial services compliance, and adapt quickly.

What You’ll Do
  • Own and evolve our Kubernetes infrastructure, including cluster management, service mesh configuration, and container security policies.

  • Design and implement progressive delivery pipelines with canary deployments, automated rollbacks, and deployment health validation.

  • Build and maintain our observability infrastructure in Datadog, including dashboards, monitors, SLOs, and distributed tracing.

  • Drive incident response for high-severity outages and proactively model capacity needs for low-latency AI inference.

  • Architect and automate secure infrastructure using Infrastructure-as-Code for VPCs, IAM policies, Kubernetes manifests, and private cloud deployments.

  • Maintain and improve the infrastructure controls that support our SOC 2 compliance posture.

  • Lead customer engagements for enterprise rollouts and mentor mid-level engineers on infrastructure best practices.

What We’re Looking For

Must-Haves:
  • 8+ years in infrastructure engineering or DevOps at high-growth or hyperscale companies.

  • Experience with Docker and Kubernetes, including production cluster management, Helm, and service mesh technologies.

  • A proven track record of architecting and operating AWS (preferred), GCP, or Azure at an enterprise scale.

  • Experience with observability platforms, preferably Datadog (metrics, logs, APM, distributed tracing).

  • A strong background in Infrastructure-as-Code (Terraform, Helm, Kustomize) and safe deployment practices (progressive delivery, canary deployments, GitOps, automated rollbacks).

  • "Battle scars" from leading outages, capacity events, and large-scale incident reviews.

  • Strong programming skills in Python.

Bonus Points:
  • Familiarity with TypeScript.

  • Direct involvement in SOC 2 or other compliance audit preparation or remediation.

  • Direct experience with private-cloud or on-premises deployments for regulated customers.

  • Previous experience at startups scaling infrastructure from the early stages to the enterprise level.

  • A background in fintech or building systems for highly regulated industries.

  • Experience with AI/ML infrastructure and model deployment at scale.

Why You’ll Love Working Here
  • Build for Scale: You thrive at the intersection of technical leadership and customer impact, building systems that enable rapid development while maintaining the highest standards of security, compliance, and reliability.

  • Infrastructure as a Product: You see infrastructure as a product for your engineering peers and understand the value of platform automation in enabling developer velocity.

  • High-Impact Work: Your contributions will have a direct, measurable impact on how financial institutions adopt AI to fight crime.

  • Mentorship and Leadership: You are comfortable balancing technical excellence with mentoring others and leading customer engagements.

Compensation & Benefits
  • $168k - $213k + equity

  • Comprehensive healthcare, 401k matching, commuter benefits

  • 15 days PTO + holidays, unlimited sick days

  • Flexible leave options

  • Working late? We’ve got you covered with DoorDash and an Uber home

Join us in building AI that protects the global financial system from financial crimes that fund terrorism, human trafficking, and other serious threats.

HQ

Bretton AI San Francisco, California, USA Office

San Francisco, California, United States, 94110

Similar Jobs

5 Days Ago
Hybrid
San Francisco, CA, USA
260K-330K Annually
Senior level
260K-330K Annually
Senior level
Artificial Intelligence • Productivity • Software
As a core engineer on the Web Infrastructure team, you will enhance Notion's web client performance and development speed by improving load times, interaction latency, and providing tooling for product engineers.
Top Skills: ReactWebpack
5 Days Ago
Hybrid
Palo Alto, CA, USA
133K-235K Annually
Junior
133K-235K Annually
Junior
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
The Software Engineer will optimize ML infrastructure for training and inference, develop scalable systems, and work closely with ML engineers on producing high-performance models.
Top Skills: C++Caffe2FlinkJavaPythonPyTorchRayScalaScikit-LearnSparkSpark MlTensorFlow
5 Days Ago
Hybrid
Mountain View, CA, USA
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Senior Software Engineer will design and maintain search infrastructure services, optimize performance, and collaborate with other engineering teams.
Top Skills: AWSAzureC++DockerElasticsearchGCPGoJavaKafkaOpensearchPython

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account