Rational Dynamics Logo

Rational Dynamics

Senior Infrastructure Engineer

Posted 6 Days Ago
Be an Early Applicant
Hybrid
Berkeley, CA, USA
180K-250K Annually
Senior level
Hybrid
Berkeley, CA, USA
180K-250K Annually
Senior level
The Senior Infrastructure Engineer designs and builds cloud infrastructure for AI platforms, ensuring reliability and security while managing dynamic team requirements and deployments.
The summary above was generated by AI
The Company

Rational Dynamics builds customized AI reasoning systems for tasks of high cognitive complexity.

Our initial market is the world’s leading institutional asset owners. We work very closely with these customers to create specialized, rigorous benchmark datasets encompassing their most valuable and difficult knowledge work. Then we use the benchmarks to construct agentic large reasoning models, applying the same rigor to prove that the models correctly do the work. Customers access the models through a tailored application service, making their most skilled, expensive workers dramatically more productive.

We are an early-stage startup. Our founders previously started Voleon, now one of the world’s largest systematic investment managers, and recognized as a longstanding industry leader in applied machine learning. They bring to Rational Dynamics the same research discipline and data-driven focus that succeeded in the unforgiving, high-stakes setting of financial markets.

Job Opportunities

We are looking for entrepreneurial researchers and engineers who want to work on cutting-edge agentic AI methods and build out a best-in-class core technical infrastructure. Our work environment is highly collaborative. Your colleagues will be accomplished experts in AI/ML, statistics, and systems engineering.

Job Description

As a Senior Infrastructure Engineer, you will design and build cloud infrastructure that powers Rational Dynamics' AI platform and customer deployments. Reporting to the Director of Software Engineering, you will join a small, fast-moving team. Your job is to build high impact systems to accelerate our systems that accelerate research and customer deployments across domains including model training, data acquisition and curation, cloud orchestration, and security. You will also flex beyond pure infrastructure when the situation calls for it, supporting research iteration, customer deployments, and security needs as they arise. This role is a means to make a difference: the infrastructure you build and maintain will determine whether Rational Dynamics can deliver high cognitive complexity systems that enterprises trust to drive their most critical workflows. We are building a team of people motivated by the future of speed and productivity that will be unlocked that agentic AI will unlock high complexity domains.

Duties
  • Own, extend, and improve cloud infrastructure that powers both production customer systems and internal research platforms, including compute, networking, storage, and deployment environments

  • Build and maintain CI/CD pipelines, developer tooling, and observability that keep the team shipping fast and catching problems early

  • Support GPU workloads and ML infrastructure needs in close collaboration with the research & ML team

  • Drive security posture and compliance efforts, including standards relevant to enterprise customers such as SOC 2

  • Build deployment and operations infrastructure that enables Forward Deployed Engineers to reliably build solutions quickly in heterogeneous enterprise cloud environments

  • Make pragmatic, well-reasoned infrastructure decisions that balance speed now with scalability later

  • Continuously improve system reliability, deployment simplicity, uptime, and cost efficiency through monitoring, feedback loops, and disciplined engineering

Requirements
  • Proven experience designing and deploying reliable production infrastructure

  • 5+ years of experience designing and operating cloud infrastructure in production across multiple cloud providers (AWS, GCP, Azure)

  • Strong command of Kubernetes, Terraform, and modern CI/CD tooling

  • Security-conscious mindset with experience navigating enterprise compliance requirements such as SOC 2, PCI, DSS, or equivalent

  • Strong programming skills, with experience building infrastructure tooling in Go or an equivalent systems language

  • Familiarity with data pipeline tooling for batch and long-running workflows (e.g. Airflow, Temporal, etc.)

  • Comfort operating on a small team with dynamic requirements, threading the needle of speed and scalability, willing to take any task critical to customers and the team

Preferred qualifications
  • Prior experience in a regulated or high-consequence industry such as finance, healthcare, or defense strongly preferred

  • Experience supporting GPU or ML-specific infrastructure workloads

  • Experience deploying solutions in enterprise customer cloud environments

  • Experience building infrastructure for the training and deployment of AI agents

  • Prior early-stage or small-team experience where you made critical end-to-end infrastructure decisions

“Friends of Rational Dynamics” Candidate Referral Program

If you have a great candidate in mind for this role and would like to have the potential to earn $7,500 to $15,000 if your referred candidate is successfully hired and employed by Rational Dynamics, please use this form to submit your referral. For more details regarding eligibility, terms and conditions please make sure to review the Rational Dynamics Referral Bonus Program.


Equal Opportunity Employer

Rational Dynamics is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Top Skills

Airflow
AWS
Azure
Ci/Cd
GCP
Go
Kubernetes
Temporal
Terraform

Similar Jobs

3 Days Ago
Hybrid
140K-215K Annually
Senior level
140K-215K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The role involves building and managing large-scale distributed data processing systems, focusing on Flink/Spark infrastructure across various platforms while ensuring reliability and security.
Top Skills: AWSDockerFlinkGoHiveJavaKafkaKotlinKubernetesMinioPinotS3ScalaSparkTerraformTrino
10 Days Ago
In-Office
165K-242K Annually
Senior level
165K-242K Annually
Senior level
Cloud • Information Technology • Machine Learning
As a Senior Site Reliability Engineer, you'll ensure the reliability and performance of a Kubernetes-based data platform, focusing on scaling infrastructure, enhancing security, and optimizing deployment processes.
Top Skills: AirflowArgo CdFlinkGithub ActionsGrafanaHelmIstioKafkaKubernetesLinkerdOpentelemetryPrometheusPulumiSparkTerraform
24 Days Ago
Easy Apply
Remote or Hybrid
2 Locations
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will lead security design and implementation for cloud infrastructures, mentor teams, and automate security solutions.
Top Skills: AnsibleAWSAzureCloud Security ToolsCloudFormationGCPGoTerraform

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account