DataPelago Logo

DataPelago

Principal Data Processing Software Engineer-OSS

Reposted 5 Days Ago
Be an Early Applicant
In-Office
Mountain View, CA, USA
6-6 Annually
Senior level
In-Office
Mountain View, CA, USA
6-6 Annually
Senior level
As a Principal Data Processing Engineer, you will enhance a data processing engine, collaborate on open-source platforms, and optimize performance.
The summary above was generated by AI

Principal Data Processing Engineer - OSS
Mountain View, CA 

About DataPelago:

DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. We are building an innovative data processing engine that is transforming how Apache Spark, Apache Flink, Ray, and others operate on diverse, large-scale data. Our team of engineers drive and adopt advances in hardware-accelerated computing, parallel processing of large-scale data, query optimization, distributed systems, compilers, machine learning, and cloud-native computing. We are looking for world-class engineers to join our team and shape the future of accelerated data processing.

The Role:

As a Principal Data Processing Engineer (OSS), you will be a key individual contributor in
adopting and advancing the capabilities of open-source software (OSS) platforms such as Apache

Gluten, Velox, Apache Spark, and Apache Flink in the context of DataPelago’s data processing engine. You will enhance the functional breadth, performance, scale, and reliability of the DataPelago engine through downstream and upstream contributions. You will have the opportunity to engage with the community working on these platforms. This is a unique opportunity to make a significant impact on a category-defining product and work with a talented team of engineers.

What You'll Do:

  • Influence the architecture of how our data processing engine interfaces with open-source platforms and engines.
  • Lead the design of functional and performance enhancements to open source platforms such as Apache Gluten and Velox, and their integration with our data processing engine.
  • Individually design, implement, test, optimize, and maintain components of the data processing engine.
  • Analyze the technology roadmap of Apache Gluten, Velox, and equivalent platforms and identify opportunities for our engine to enhance technology and product leadership.
  • Collaboration: Partner with engineering, product management, the open-source community and customer success teams.
  • Foster best practices in design and code reviews, testing, CI/CD, and issue resolution to maintain the highest product quality, security, efficiency, and productivity.

What You'll Bring:

  • BS/MS in  Computer Science (or a related field) with 6+ years of relevant experience 
  • 3+ years of deep technical experience in instrumenting, analyzing, and optimizing the performance of data processing engine components on benchmark and customer workloads.
  • Sound knowledge of the architecture and internal operation of one or more of Apache Spark,
    Apache Flink, Presto/Trino.
  • Demonstrated experience in the design, development, and successful release of high-performance data processing engines for large production deployments.
  • Exceptional programming skills in C, C++, and Java.
  • Extensive development experience in Linux environments.
  • Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to both technical and non-technical audiences.
  • Strong analytical and problem-solving skills with a passion for performance optimization.

Location Considerations:

We value face-to-face collaboration, but recognize that talent can be found anywhere. Our engineering team works at our headquarters in Mountain View, CA, at our India office in Hyderabad, and at remote locations.

Why Join DataPelago?

  • Technical Leadership: Take a leadership role in shaping the architecture and development of how our core engine works with open source data processing platforms
  • Cutting-Edge Innovation: Work on challenging problems at the forefront of accelerated
    computing and data processing.
  • Significant Impact: Your contributions will directly impact the performance and scalability of our mission-critical platform.
  • Mentorship and Growth: Mentor and guide other talented engineers while expanding your own technical expertise.
  • Competitive compensation, stock options, comprehensive benefits package, and leadership development opportunities

Top Skills

Apache Flink
Spark
C
C++
Java
Linux
HQ

DataPelago Mountain View, California, USA Office

100 View Street, Suite 102, Mountain View, CA, United States, 94041

Similar Jobs

An Hour Ago
Hybrid
17-28 Hourly
Junior
17-28 Hourly
Junior
eCommerce • Fashion • Other • Retail • Sales • Wearables • Design
The Supervisor supports store leadership by driving sales and productivity, coaching team members, enhancing client relationships, and overseeing operational excellence.
Top Skills: ExcelMicrosoft OutlookMicrosoft PowerpointMicrosoft Word
An Hour Ago
Hybrid
Livermore, CA, USA
15-22 Hourly
Entry level
15-22 Hourly
Entry level
eCommerce • Fashion • Other • Retail • Sales • Wearables • Design
The Sales Support Associate role focuses on providing customer service, handling transactions at the POS, organizing stock, and supporting store efficiency.
Top Skills: Basic Computer SkillsInternetIpadMobile PosPos Systems
An Hour Ago
Hybrid
15-22 Hourly
Junior
15-22 Hourly
Junior
eCommerce • Fashion • Other • Retail • Sales • Wearables • Design
The Sales Support Associate provides exceptional customer service, manages sales floor organization, operates POS, and supports inventory management.
Top Skills: Basic Computer SkillsCash Register SystemsIpadLaptopMobile PosPos SystemsWalkie Talkie

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account