Design, build, and operate reliable, scalable cloud infrastructure. Maintain AWS/GCP and Linux systems, manage Kubernetes clusters, implement IaC (Ansible/Puppet/Terraform), automate CI/CD (Jenkins), monitor with Prometheus/ELK, triage alerts, participate in design/reviews, migrate apps to Kubernetes, and improve operational automation.
Based in NY/SF or willing to relocate (in-person collaboration is critical). In-person component is important (if in NYC/SF, 3 days a week; if within 30 miles outside the city - 1 day a week)
The Company:
We are a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient than most blockchain-based systems. It’s designed so Stellar’s ecosystem can make a real-world, lasting impact.
As one of the first engineers, you will help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability of our systems, design and improve the infrastructure behind our production environments, and automate operational work so developers can focus on building great products.
Key Responsibilities:Matching & Scoring Systems
- Maintain, improve, scale and secure our AWS/GCP infrastructure and Linux systems.
- Assist our development teams in running, packaging, deploying and troubleshooting applications
- Work with developers on streamlining deployment processes with Jenkins and other CI/CD tooling.
- Build, maintain, monitor and improve our Kubernetes clusters.
- Work with development teams on migrating applications to Kubernetes.
- Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes, Prometheus, ELK.
- Monitor, triage and respond to alerts in our high availability environments.
- Participate in design and code reviews, and ensure that the foundation for our services is best in class.
- Evaluate new technologies, design and implement as appropriate.
- Identify automation opportunities and implement by creating custom or by using off the shelf solutions.
Qualifications:
Required Experience
- 5+ years of experience of working in cloud-based systems operations, as a SRE or DevOps engineer.
- First-hand experience with configuration management and infrastructure as code (Ansible, Puppet, Terraform).
- Proficient in utilizing SRE methodologies like capacity planning and disaster recovery testing to ensure the scalability, resilience, and availability of critical services.
- Production experience building and maintaining Kubernetes clusters.
- Will need to know how to code
Preferred Attributes:
- Ability to understand Go, Rust, C++ and TypeScript source code
- Experience experimenting with AI-driven approaches to operations
- Comfortable with participating in on-call rotations and conducting thorough root cause analyses to keep systems running smoothly.
- Experienced in managing production workloads and skilled in using monitoring tools to detect issues early.
- A strong understanding of computer networking, TCP/UDP, load balancing, distributed computing, web services, and the fundamental protocols used by the internet (HTTP, HTTPS, DNS, etc.).
- No blockchain needed
- Experience using AI is a plus
The base pay range for this role is $205,000 – $225,000 per year.
Similar Jobs
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Develops and implements manufacturing processes for avionics assemblies, creates work instructions/MBOMs, reviews designs for manufacturability, troubleshoots production issues, drives corrective actions, and supports NPI to full-scale production in cross-functional teams.
Top Skills:
Altium DesignerIpc-2221Ipc-7711C/7721CIpc-A-600Ipc-A-610Lean ManufacturingMbomNasa-Std-8739.1Pcb ManufacturingPcba ManufacturingSolidworks
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Partner with managers and employees to deliver HR programs and coaching, drive employee engagement and performance, resolve workplace concerns, support leadership development, and integrate HR initiatives across operations and HR Centers of Excellence.
Top Skills:
MS Office
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Administer and implement U.S. industrial security standards (NISPOM, SCI/SAP rules). Manage personnel and classified material records, conduct security briefings, reconcile inventories, support security CONOPs, and communicate with Boeing teams and USG customers to meet contract security requirements.
Top Skills:
Defense Information System For Security (Diss)Enterprise Security Systems (Sims)MS OfficeScattered Castles
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

