Enformion

Senior Data Engineer

Posted 19 Days Ago
In-Office
Sacramento, CA
110K-125K Annually
Senior level
The Senior Data Engineer will build and maintain a big data processing platform, develop ETL processes, and optimize data pipelines using various technologies.

Enformion is a dynamic and innovative data and analytics company that assists digital marketplaces in fraud prevention, risk management, seamless user onboarding, and fostering trust between shoppers and merchants. Our AI-powered solutions leverage extensive data intelligence and advanced behavioral analysis, enabling continuous monitoring for emerging risk indicators.


Who We Want

Do you live for working on challenging big data problems at massive scale? Do you know the ins and outs of big data development, with the expert knowledge and experience to push your hardware to its limits? If so, we want you.

We are looking for a Senior Data Engineer to help our engineering team build a modern data processing platform using Spark, EMR, and relational and NoSQL databases. We are investing resources into setting up a more flexible and scalable data infrastructure to support the addition of new data sets and improve overall data quality. The ideal candidate will be excited to work at a small company with a startup mindset that moves quickly on a constant flow of ideas, and will be able to weed through the maze of big data tools and potential approaches to find the best possible solution and architecture.

Salary: $110K-$125K

Responsibilities

  • Implement and maintain big data platform and infrastructure
  • Develop, optimize and tune MySQL stored procedures, scripts, and indexes
  • Develop Hive schemas and scripts, Spark jobs in PySpark and Scala, and UDFs in Java
  • Design, develop and maintain automated, complex, and efficient ETL processes to do batch records-matching of multiple large-scale datasets, including supporting documentation
  • Develop and maintain pipelines using Airflow or other tools to monitor, debug, and analyze data pipelines
  • Troubleshoot Hadoop cluster and query issues, evaluate query plans, and optimize schemas and queries
  • Apply strong interpersonal skills to resolve problems professionally, lead working groups, and negotiate consensus

Qualifications & Skills

  • BS, MS, or PhD in Computer Science or related field
  • At least 5 years of experience in languages such as Java, Scala, PySpark, Perl, shell scripting, and Python
  • Working knowledge of Hadoop ecosystem applications (MapReduce, YARN, Pig, HBase, Hive, Spark, and more)
  • Strong experience working with data pipelines in multi-terabyte data warehouses, including dealing with performance and scalability issues
  • Strong SQL (MySQL, Hive, etc.) and NoSQL (MongoDB, HBase, etc.) skills, including writing complex queries and performance tuning
  • Knowledge of data modeling, partitioning, indexing, and architectural database design.
  • Experience using source code and version control systems such as Git
  • Experience with continuous build and test processes using tools such as GitLab, SBT, and Postman
  • Experience with Search Engines, Name/Address Matching, or Linux text processing

Preferred:

  • Knowledge of cluster configuration, Hadoop administration, and performance tuning is a huge plus
  • Understanding of distributed computing principles and experience with big data technologies, including performance tuning
  • Machine Learning

Location

Remote

Top Skills

Airflow
EMR
Git
GitLab
Hadoop
Hive
Java
MongoDB
MySQL
PySpark
Scala
Spark
