DataPelago Logo

DataPelago

Principal Data Processing Engineer

Reposted 13 Days Ago
Be an Early Applicant
In-Office
Mountain View, CA, USA
Expert/Leader
In-Office
Mountain View, CA, USA
Expert/Leader
Lead the architecture, design, and implementation of a high-performance data processing engine focused on large-scale data processing.
The summary above was generated by AI

Principal Data Processing Engineer
Mountain View, CA

About DataPelago:

DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. We are building an innovative data processing engine that is transforming how Apache Spark, Apache Flink, Ray and others operate on diverse, large-scale data. Our team of engineers drive and adopt advances in hardware-accelerated computing, parallel processing of large-scale data, query optimization, distributed systems, compilers, machine learning, and cloud-native computing. We are looking for world-class engineers to join our team and shape the future of accelerated data processing.


The Role:

As a Principal Data Processing Engineer, you will be a key technical leader in the development of the core execution components of our data processing engine. You will lead architecture, design, and implementation that will enhance the functional breadth, performance, scale, and reliability of the engine to deliver a product that will redefine how users extract intelligence from their data. This is a unique opportunity to make a significant impact on a category-defining product and work with a talented team of engineers.


What You'll Do:

  • Drive the evolution of our parallel and distributed execution engine architecture, with a strong focus on leveraging accelerated computing technologies.
  • End-to-End Ownership: Lead the execution engine team in the complete lifecycle of design,
    implementation, and rollout of an enterprise-grade product.
  • Individually design, implement, test, and maintain critical components of the data processing execution engine.
  • Innovation and Differentiation: Analyze technology advances from industry and academia to identify opportunities for the engine to enhance technology and product leadership.
  • Collaboration: Partner effectively with engineering, product management,
    and customer success teams. 
  • Guide and mentor engineers on the execution engine team.
  • Foster best practices in design and code reviews, testing, CI/CD, and issue resolution to maintain the highest product quality, security, efficiency, and productivity.


What You'll Bring:

  • BS/MS in  Computer Science (or a related field) with 10+ years of relevant experience 
  • 7+ years of deep technical experience in developing core components of enterprise-grade database or analytics execution engines designed for large-scale data processing.
  • Proven expertise in developing high-performance parallel implementations of data processing operators and functions on rich data types.
  • Significant experience developing for platforms such as Apache Spark, Apache Flink, Apache Doris, Apache Gluten, Velox, Apache DataFusion/Comet preferred.
  • Previous experience working as technical lead/architect with teams of 10+ engineers in the design, development, and successful release of high-performance data processing engines for large production deployments.
  • Proficiency in C, C++, and Rust programming.
  • Extensive development experience in Linux environments.
  • Strong analytical and problem-solving skills with a passion for performance optimization.
  • Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to both technical and non-technical audiences.


Why Join DataPelago?

  • Technical Leadership: Take a leadership role in shaping the architecture and development of how our core engine works with open source data processing platforms
  • Cutting-Edge Innovation: Work on challenging problems at the forefront of accelerated
    computing and data processing.
  • Significant Impact: Your contributions will directly impact the performance and scalability of our mission-critical platform.
  • Mentorship and Growth: Mentor and guide other talented engineers while expanding your own technical expertise.
  • Competitive compensation, stock options, comprehensive benefits package, and leadership development opportunities
HQ

DataPelago Mountain View, California, USA Office

100 View Street, Suite 102, Mountain View, CA, United States, 94041

Similar Jobs

8 Days Ago
In-Office
Mountain View, CA, USA
6-6 Annually
Senior level
6-6 Annually
Senior level
Software
As a Principal Data Processing Engineer, you will enhance a data processing engine, collaborate on open-source platforms, and optimize performance.
Top Skills: Apache FlinkSparkCC++JavaLinux
2 Hours Ago
Hybrid
Oakland, CA, USA
123K-223K Annually
Mid level
123K-223K Annually
Mid level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Territory Account Executive will oversee sales in their area, engage local businesses, conduct demos, and close deals while building partnerships and generating leads.
Top Skills: Salesforce
2 Hours Ago
Hybrid
San Francisco, CA, USA
123K-223K Annually
Mid level
123K-223K Annually
Mid level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Territory Account Executive will engage local businesses, driving sales through in-person visits, demos, and building strong relationships in the community to meet sales targets.
Top Skills: Salesforce

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account