At Docker, we make app development easier so developers can focus on what matters. Our remote-first team spans the globe, united by a passion for innovation and great developer experiences. With over 20 million monthly users and 20 billion image pulls, Docker is the #1 tool for building, sharing, and running apps—trusted by startups and Fortune 100s alike. We’re growing fast and just getting started. Come join us for a whale of a ride!
You are a backend‑leaning Software Engineer excited to apply software engineering to platform and infrastructure problems. This is an automation‑first role: you’ll design and ship services and agents that eliminate toil, auto‑remediate issues, and accelerate how teams at Docker build, ship, and run software. You’ll work primarily in Go; experience with other languages is welcome.
What You’ll Do
Design, build, and operate backend services and automation tooling in Go on AWS.
Create automation and agentic systems: event‑driven workers, Kubernetes operators/controllers, policy agents (OPA), GitHub Apps/bots, and ChatOps.
Develop safe, observable LLM‑augmented runbooks/agents with guardrails (approvals, rate limits, audit logs).
Evolve platform foundations: multi‑tenant EKS, Envoy Gateway‑based ingress (ALB/NLB), networking, observability, and CI/CD.
Codify infrastructure with Terraform and drive GitOps workflows for fast, safe delivery.
Raise reliability: define SLOs, participate in on‑call, lead blameless incident reviews, and automate remediation and prevention.
Level‑up developer experience: templates, job/workflow orchestration, dashboards, and paved‑road deployment patterns.
Partner with Product, Security, and other engineering teams.
You likely have many of the following:
3+ years building and operating SaaS or large‑scale backend systems, with strong proficiency in Go.
Solid API design skills; familiarity with microservices or event‑driven architectures.
Experience running workloads in AWS (or another major cloud) and automating with Terraform or similar.
Practical understanding of Linux, networking, and production security (least privilege, secrets management, identity/IAM).
Familiarity with CI/CD and modern monitoring/logging/metrics.
Strong written communication; comfortable working remotely across time zones.
Bonus (nice‑to‑have):
Automation & Agents: controllers/operators, GitHub Apps/bots, ChatOps, policy engines (OPA), queues/streams (SQS, SNS, Kafka), and auto‑remediation.
LLM‑augmented systems: tool‑using agents or runbooks with clear guardrails.
Kubernetes ecosystem (EKS, ingress, CNI, service mesh) and Envoy/Envoy Gateway.
Observability tooling (OpenTelemetry, Prometheus, Grafana) with emphasis on measuring automation efficacy and safety.
CI/CD & release automation (GitHub Actions, Argo CD) and GitOps practices.
Cost‑aware design and FinOps mindset for running at scale.
Containers and Go‑based platform tooling; exposure to distributed systems.
Meet the team and understand our mission, architecture, and ways of working.
Set up your dev environment and ship a small change (service feature, Terraform module, or reliability improvement).
Shadow on‑call; learn our SLOs and incident practices; identify top automation opportunities.
Own a service or platform component and deliver a meaningful project from design to production.
Rotate fully into on‑call; lead incident response when needed.
Deliver your first automation/agent (e.g., auto‑remediation bot, Kubernetes operator, GitHub App) with measurable reliability or velocity impact.
Demo your work at internal Product Development demos; contribute improvements to paved‑road patterns.
Lead the design and rollout of a significant platform or infrastructure initiative that measurably improves reliability, security, or developer velocity.
Become a go‑to engineer for backend/platform questions; mentor others and influence engineering culture through high‑quality designs, reviews, and documentation.
We use Covey as part of our hiring and/or promotional process for jobs in NYC, and certain features may qualify it as an AEDT. As part of the evaluation process, we provide Covey with job requirements and candidate-submitted applications. We began using Covey Scout for Inbound on April 13, 2024.
Please see the independent bias audit report covering our use of Covey here.
Perks
Freedom & flexibility; fit your work around your life
Designated quarterly Whaleness Days
Home office setup; we want you comfortable while you work
16 weeks of paid parental leave
Technology stipend equivalent to $100 net/month
PTO plan that encourages you to take time to do the things you enjoy
Quarterly, company-wide hackathons
Training stipend for conferences, courses and classes
Equity; we are a growing start-up and want all employees to have a share in the success of the company
Docker Swag
Medical benefits, retirement and holidays vary by country
Docker embraces diversity and equal opportunity. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better our company will be.
Due to the remote nature of this role, we are unable to provide visa sponsorship.



