Build and maintain ML infrastructure to accelerate model development and deployment: scale model evaluation, optimize GPU utilization, automate staging/deployment, migrate workflows to orchestration tools, and improve Python monorepo tooling and CI/CD.
Gridmatic is a high-growth startup and a new kind of energy company, delivering affordable, clean power by optimizing renewable energy and grid-scale batteries. With offices in the Bay Area and Houston, we bring together Silicon Valley–style innovation with deep, hands-on expertise in real-world power markets and energy retail.
As solar and wind become the fastest-growing sources of electricity, variability from weather and grid conditions makes energy prices more volatile. Gridmatic tackles this challenge with industry-leading forecasting and optimization—and gives our team the opportunity to work on problems that truly matter. Forecasting and trading energy are the foundation of what we do. We ingest large-scale data—weather, prices, load, and grid conditions—to build probabilistic machine learning forecasts that drive real operational decisions. Our work directly determines when power is bought, stored, or deployed, turning uncertainty into value for customers and the grid.
Our impact is measurable. Gridmatic is the most profitable participant in ERCOT’s wholesale market and operates the top-performing battery asset in CAISO. Profitable without venture capital, we offer a collaborative, low-ego environment where rigorous thinking, autonomy, and continuous learning are core to how we work.
The Role
We’re looking for a strong backend engineer to work closely with our ML engineers to help speed up their work through better infrastructure and tooling.
What you might work on:
- Scaling model evaluation to handle large timeseries data
- Measuring and improving utilization of GPUs
- Automating staging and deployment of ML models
- Moving complex workflows to orchestration tools like Airflow/Flyte
- Improving Python monorepo tooling (code sharing, Docker, CI/CD)
What we're looking for:
- Strong backend software engineer who has worked with ML engineers and helped solve their problems
- Strong distributed systems and infrastructure skills: comfortable standing up services in AWS/GCP, scaling and debugging Kubernetes services, writing Terraform, and working with orchestration tools like Flyte, Airflow, or Temporal
- Strong software engineering skills: able to write easy-to-extend, well-tested code
- Experience with large-scale data and good judgment on data storage and schema design (relational databases, data warehouses, object storage, timeseries data)
Join our team and make a difference! Click below or email us at [email protected].
Top Skills
Python, Kubernetes, AWS, GCP, Terraform, Airflow, Flyte, Temporal, Docker, CI/CD, GPUs, Relational Databases, Data Warehouses, Object Storage, Timeseries
Gridmatic Cupertino, California, USA Office
20450 Stevens Creek Blvd, Suite 100, Cupertino, CA, United States, 95014


