Blackpoint Cyber Logo

Blackpoint Cyber

Senior SRE

Reposted 8 Days Ago
Remote
Hiring Remotely in United States
145K-170K Annually
Senior level
Remote
Hiring Remotely in United States
145K-170K Annually
Senior level
The Senior SRE will design and maintain cloud/on-premise infrastructure and CI/CD pipelines, focusing on automation, scalability, and reliability while collaborating with cross-functional teams.
The summary above was generated by AI

Blackpoint Cyber is the leading provider of world-class cybersecurity threat hunting, detection and remediation technology. Founded by former National Security Agency (NSA) cyber operations experts who applied their learnings to bring national security-grade technology solutions to commercial customers around the world, Blackpoint Cyber is in hyper-growth mode,  fueled by a recent $190m series C round. 


Job Overview:

We are seeking an experienced Senior Site Reliability Engineer to join our dynamic team. As a Senior SRE Engineer, you will be responsible for designing, implementing, and maintaining our Cloud, On-Premise infrastructure and CI/CD pipelines, with a focus on automation, scalability, and performance.

You will collaborate with cross-functional teams to ensure the reliability and efficiency of our systems while fostering a culture of continuous improvement.

Key Responsibilities:

  • Infrastructure Management & Automation: Design, develop, and maintain highly scalable infrastructure utilizing Infrastructure as Code (IaC) methodologies, with primary focus on Terraform and Terragrunt for automated cloud resource provisioning and orchestration.

  • Cloud Platform Administration: Oversee and optimize cloud environments, with a specialized focus on Amazon Web Services (AWS), ensuring adherence to cost optimization strategies, security best practices, and high-availability standards.

  • Container Orchestration & Continuous Delivery: Manage and optimize Kubernetes cluster environments utilizing Helm, ArgoCD, Istio, and Kustomize to support continuous delivery pipelines and infrastructure-as-code practices.

  • Data Streaming Platform Operations: Administer and scale data streaming infrastructure using Confluent Cloud and Apache Kafka to support enterprise-level data processing requirements.

  • Caching & Real-Time Data Solutions: Deploy, configure, and maintain Redis instances to facilitate caching mechanisms and real-time data processing capabilities.

  • Observability & Incident Management: Implement and maintain comprehensive monitoring, alerting, and incident response frameworks utilizing Prometheus, Grafana, Alert Manager, and OpsGenie/PagerDuty to ensure optimal system reliability and performance.

  • Feature Management & Release Engineering: Facilitate controlled feature deployments and progressive rollouts through LaunchDarkly/PostHog platform integration and management.

  • Cross-Functional Collaboration: Partner with software development teams to ensure seamless integration of new services, applications, and features into existing infrastructure ecosystems.

  • Technical Issue Resolution: Diagnose and resolve complex system-level issues, implementing solutions that maintain high performance standards and maximize system uptime.

  • Process Optimization & Enhancement: Drive continuous improvement initiatives for automation tools, operational processes, and engineering methodologies to enhance system scalability, reliability, and maintainability.

  • Technical Innovation & Knowledge Management: Maintain current knowledge of emerging Site Reliability Engineering trends, tools, and technologies, ensuring organizational adoption of relevant industry advancements and best practices.

Skills & Qualifications:

  • Professional Experience: Minimum of eight (8) years of demonstrated experience in a Senior Site Reliability Engineer role or equivalent position, with substantial emphasis on cloud infrastructure management and automation technologies.

  • Infrastructure as Code Proficiency: Expertise in Infrastructure as Code (IaC) methodologies, specifically utilizing Terraform and Terragrunt for enterprise-scale deployments.

  • Cloud Architecture Expertise: Comprehensive knowledge of Amazon Web Services (AWS) cloud platform, including demonstrated proficiency in designing, implementing, and maintaining secure, scalable, and resilient cloud architectures aligned with industry best practices.

  • Distributed Streaming Systems: Extensive hands-on experience architecting and managing distributed data streaming solutions utilizing Confluent Cloud and Apache Kafka platforms.

  • Data Storage & Caching Technologies: Proven experience implementing and managing Redis for high-performance caching solutions and Amazon RDS for relational database management.

  • Search & Analytics Platforms: Proven experience with enterprise search and analytics solutions, including OpenSearch, Elasticsearch, and ChaosSearch platforms.

  • Observability & Monitoring Systems: Proficiency in designing and implementing comprehensive monitoring and alerting infrastructures utilizing Prometheus, Grafana, Alert Manager, and OpsGenie/PagerDuty.

  • Feature Management Platforms: Practical experience implementing and managing feature flag systems using LaunchDarkly/PostHog for controlled release management.

  • Container Orchestration Expertise: Extensive experience administering production-grade Kubernetes clusters, including package management via Helm, continuous deployment using ArgoCD, and service mesh implementation with Istio.

  • Configuration Management: Working knowledge of Kustomize for Kubernetes resource configuration and customization.

  • Excellent problem-solving skills with the ability to troubleshoot complex systems in production.

  • Strong communication and collaboration skills, with experience working in agile environments.

Nice to Have:

  • Multi-Cloud Platform Experience: Demonstrated experience architecting and managing infrastructure across multiple cloud service providers, including Google Cloud Platform (GCP) and Microsoft Azure.

  • Security & Compliance Expertise: Comprehensive understanding of security frameworks, compliance standards, and best practices applicable to cloud-native and containerized infrastructure environments.

  • Serverless & CI/CD Proficiency: Working knowledge of serverless computing architectures and continuous integration/continuous deployment (CI/CD) pipelines, including practical experience with Jenkins and GitHub Actions platforms.

  • Software Development Capabilities: Technical proficiency in software development using Node.js, Python, and/or Go programming languages.

Blackpoint Cyber welcomes and encourages applications from qualified individuals of all races,  colors, religions, sex, sexual orientation, gender identity or expression, national origin, age, marital  status, or any other legally protected status. We are committed to equality of opportunity in all  aspects of employment.  For eligible employees in the US, Blackpoint offers competitive Health, Vision, Dental, and Life Insurance plans, a robust 401k plan, Discretionary Time Off, and other minor perks.

Top Skills

Alert Manager
Apache Kafka
Argocd
AWS
Confluent Cloud
Github Actions
Go
Grafana
Helm
Istio
Jenkins
Kubernetes
Kustomize
Launchdarkly
Node.js
Opsgenie
Pagerduty
Posthog
Prometheus
Python
Redis
Terraform
Terragrunt

Similar Jobs

Yesterday
Easy Apply
Remote or Hybrid
United States
Easy Apply
140K-170K Annually
Senior level
140K-170K Annually
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will enhance system reliability, develop production-grade code, implement observability tools, conduct root cause analyses, and collaborate on system design for scalability.
Top Skills: ArgocdCi/CdDockerGitopsGoGrafanaHoneycombJenkinsKubernetesOpentelemetryPrometheusPythonTerraform
5 Days Ago
Remote or Hybrid
Los Angeles, CA, USA
130K-160K Annually
Senior level
130K-160K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
The Unified Communication Engineer manages and improves telecom systems, provides technical support, and integrates new UC technologies while ensuring stability of voice networks.
Top Skills: AWSCiscoMicrosoftUcs ServersVcenterVMwareVoipZoom
16 Days Ago
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills: AnsibleAWSAzureC#Cloud Identity ProvidersDockerGCPGoInfrastructure As CodeJavaKubernetesPythonRubyTerraform

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account