Similar Jobs
Artificial Intelligence • Software
The Data Reliability Engineer will enhance the resilience and scalability of data infrastructure, focusing on automation and reliability. Responsibilities include managing data pipelines, operating Kubernetes clusters, and defining observability standards.
Top Skills:
GrafanaKubernetesPrometheusPythonRayTerraform
Artificial Intelligence • Information Technology • Robotics • Software
Architect and develop robust, petabyte-scale data pipelines, provide technical leadership, and collaborate with ML Engineers at Watney Robotics.
Top Skills:
GoJavaPythonRustScala
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
The Senior Scientist conducts food ingredient research, oversees ingredient quality control, and collaborates with cross-functional partners for innovation and compliance.
Top Skills:
Experimental DesignFood Ingredient ResearchRegulatory Compliance
Build the Future of AI Infrastructure with Kumo!
Companies invest millions in storing terabytes of data in data lakehouses, yet only a small fraction is leveraged for predictive insights. Traditional machine learning pipelines are slow and complex, requiring months of engineering effort for data preparation, feature engineering, and model training.
At Kumo, we are redefining AI infrastructure for data lakehouses, enabling businesses to harness the power of Graph Neural Networks with minimal effort. Our platform eliminates the complexities of traditional ML pipelines, allowing users to train high-performance models directly on their relational data with just a few lines of Predictive Query Language (PQL).
We are looking for Data Infrastructure Engineers to join our team and help build a scalable, high-performance ML platform. If you thrive in designing robust, cloud-native infrastructure, optimizing data pipelines, and building scalable services, we’d love to hear from you!
As a Data Infrastructure Engineer at Kumo, you will:
- Design and optimize scalable, cloud-native infrastructure for high-performance ML workloads.
- Develop and maintain efficient data ingestion pipelines and connectors for large-scale datasets.
- Build and enhance resilient ETL pipelines to transform, process, and store data for analytics and ML.
- Implement best practices for data security, governance, and sharing within distributed environments.
- Optimize performance of data processing frameworks, including Spark, Presto, and Hive.
- Automate deployment of infrastructure using Kubernetes, Terraform, and CI/CD tools.
- Work closely with data scientists and ML engineers to bridge infrastructure with machine learning applications.
Your Foundation:
- 1+ years of experience as an Infrastructure Engineer, Data Engineer, or related role in SaaS/Enterprise environments.
- Strong expertise in building, scaling, and maintaining cloud infrastructure (AWS, GCP, or Azure).
- Hands-on experience with data storage, ingestion, and processing in distributed environments.
- Proficiency in ETL development and building high-performance data pipelines.
- Solid understanding of databases, storage formats (Parquet, Avro, Arrow, JSON), and schema designs.
- Experience working with orchestration tools such as Temporal, Airflow, or Luigi.
- Strong programming skills in Python, Scala, or Java.
- Knowledge of containerization and orchestration (Docker, Kubernetes).
- Experience with Infrastructure as Code (Terraform, CloudFormation, Pulumi).
- Ability to debug performance bottlenecks and optimize distributed computing workloads.
- Excellent communication skills, with the ability to collaborate effectively across teams.
Bonus Points:
- Expertise in Spark, Presto, or Hive for large-scale data processing.
- Experience with serverless architectures and event-driven processing (AWS Lambda, Kinesis, Kafka).
- Familiarity with Databricks, Azure Data Factory (ADF), or cloud ML solutions.
- Understanding of high-availability, fault tolerance, and observability in cloud environments.
Why Join Kumo?
- Be part of a cutting-edge AI and ML infrastructure team revolutionizing how companies leverage their data.
- Work with top engineers and data scientists on solving complex, large-scale infrastructure challenges.
- Competitive salary, equity, and benefits in a fast-growing AI company.
- Flexible work environment with opportunities to shape the future of AI-powered data platforms.
Ready to build the next-gen AI infrastructure? Apply today!
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Kumo Mountain View, California, USA Office
357 Castro St, Suite 200, Mountain View, CA, United States, 94041
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine



