The Voleon Group

Site Reliability Engineer

Reposted 6 Days Ago

In-Office or Remote

Hiring Remotely in Berkeley, CA

115K-135K Annually

Mid level

As a Site Reliability Engineer, you will enhance and monitor production systems, automate workflows, and respond to incidents to maintain system reliability.

The summary above was generated by AI

Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. For nearly two decades, we have led our industry and worked at the frontier of applying AI/ML to investment management. We have become a multibillion-dollar asset manager, and we have ambitious goals for the future.

Your colleagues will include internationally recognized experts in artificial intelligence and machine learning research as well as highly experienced finance and technology professionals. The people who shape our company also come from other backgrounds, including concert music performance, humanitarian aid, opera singing, sports writing, and BMX racing. You will be part of a team that loves to succeed together.

In addition to our enriching and collegial working environment, we offer highly competitive compensation and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more.

As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor production-critical infrastructure and data pipelines. At Voleon, many SREs serve together on a Production Operations team tasked with improving shared production infrastructure. Others are embedded with teams of software engineers to improve specific production systems owned by those teams. Voleon SREs work on important real-world problems and collaborate with passionate and talented colleagues in an empowering, results-driven environment. This role is a way to make a real difference: your contributions will make our critical systems more reliable, lower operational risk, and increase the efficiency of our engineering effort.

Responsibilities

Improve fault-tolerance and maintainability of code in proprietary data pipelines and trading systems
Diagnose and fix bugs in code
Lead complex deployments
Automate manual workflows
Track and prioritize outstanding production-related issues
Share an on-call rotation responding to incidents to ensure the continuous operation of production-critical systems

Requirements

Experience with coding and debugging Python
Experience with Linux
Familiarity with Relational Databases & SQL
Sharp analytical and problem-solving skills and a persistent drive to make things work (better)
Strong growth mindset and a passion for learning
Strong technical communication skills
Attention to detail
2 years of relevant industry experience
An undergraduate degree or comparable training in a quantitative field, or equivalent relevant industry experience

Preferred Qualifications

Familiarity with best practices concerning code maintainability, documentation, quality assurance, and continuous integration and deployment
Experience supporting production systems
Experience with any of the following: gRPC microservices, Postgres, Pandas, Golang, R, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes

The base salary for this position is $120,000 to $160,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation such as bonus compensation and other benefits. Our benefits package includes medical, dental and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match.

“Friends of Voleon” Candidate Referral Program

If you have a great candidate in mind for this role, you could earn $7,500 to $15,000 if your referred candidate is successfully hired and employed by The Voleon Group. Please use this form to submit your referral. For details on eligibility and terms and conditions, please review the Voleon Referral Bonus Program.

Equal Opportunity Employer

The Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Top Skills

Airflow
Bazel
Git
Grafana
gRPC
Jenkins
Kubernetes
Linux
Pandas
Postgres
Prometheus
Python
Relational Databases
SQL

Downtown, Berkeley, CA, United States, 94704
