Sayari Logo

Sayari

Senior Data Engineer

Posted 12 Days Ago
Remote
Hiring Remotely in United States
140K-160K Annually
Senior level
Remote
Hiring Remotely in United States
140K-160K Annually
Senior level
Build and maintain scalable ETL pipelines using Python, Spark, and Airflow; collaborate with AI/ML and Product teams to deliver AI-native data products; identify and resolve ETL bottlenecks; ensure code quality through reviews and tests; own sprint deliverables and contribute to roadmap planning and major epics.
The summary above was generated by AI
About Sayari: 

Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. The Sayari Commercial World Model resolves 11.7B+ primary-source records from 250+ jurisdictions forming the ground truth of global commerce. A Judgment Ontology, encoding over a decade of investigative tradecraft, and Superconductor, an agentic orchestration platform, deliver AI that reasons like an expert analyst, shows its work, and traces every finding to its source. Trusted by U.S. Customs and Border Protection, HM Revenue & Customs, and Fortune 500 enterprises, Sayari is used by thousands of professionals across 35+ countries to secure supply chains and dismantle illicit networks. Headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.

POSITION DESCRIPTION

As a Data Engineer at Sayari, you will be the engine behind the world’s most comprehensive commercial world model. You will join a high-autonomy team responsible for building and scaling the complex orchestration systems that transform billions of primary-source records into actionable intelligence. This is a role for a "builder" who respects the complexity of large-scale ETL and graph databases and is "PhD-curious" about the future of AI-native data products and modern orchestration.

JOB RESPONSIBILITIES
  • Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow to support our core data acquisition and entity resolution engines.
  • Collaborate cross-functionally with AI/ML and Product teams to implement new features and AI-native products.
  • Proactively identify and resolve bottlenecks in our complex ETL processes, bringing a fresh perspective to refine and optimize our existing codebase.
  • Contribute to a robust engineering culture through rigorous code reviews, unit testing, and clear communication of design decisions.
  • Own the end-to-end delivery of roadmap tasks within two-week sprints, ensuring work meets high standards for quality, documentation, and performance.
  • Participate in roadmap planning and story refinement, eventually taking ownership of major epics that drive our long-term product defensibility.
SKILLS & EXPERIENCE

Required

  • 5 or more years of production data engineering experience, with clear ownership of systems you built and operated end to end
  • Strong Python, with meaningful experience in a JVM language (Scala preferred) or willingness to ramp quickly
  • Hands-on Snowflake experience, or equivalent depth in BigQuery or Redshift with demonstrated ability to transfer
  • Experience deploying and operating AI or ML applications in production, including output validation, monitoring, and cost management at scale
  • Orchestration experience with Apache Airflow or a comparable workflow tool
  • Track record of operating production systems reliably, with comfort navigating failure, monitoring, and recovery

Preferred

  • Experience with Spark on Dataproc Serverless or other serverless Spark environments
  • Familiarity with Kubernetes for deployment
  • Experience with data quality tooling such as deequ, Great Expectations, or equivalent
  • GCP experience (BigQuery, Dataproc, Cloud Storage)
  • Experience leading or contributing to a data warehouse migration
  • Background in team mergers or migrating a team onto a new operating process

The target base salary for this position is $140,000-$160,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above.


Benefits: 
  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities
 
Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.
Pay Range
$140,000$160,000 USD

Similar Jobs

5 Hours Ago
Remote or Hybrid
Richmond, CA, USA
124K-280K Annually
Senior level
124K-280K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead data engineering efforts within Technology Consulting: design data architecture and pipelines, implement AWS/Redshift and ETL solutions, support BI (QlikView/Oracle BI), coach teams, manage client relationships and SLAs, apply systems thinking to optimize outcomes and validate solutions with stakeholders.
Top Skills: AWSDatastageDb2ETLJavaManaged ServicesOracle BiPythonQlikviewRedshiftSlasSQL ServerWorkload Orchestration And Scheduling
Yesterday
Remote or Hybrid
99K-232K Annually
Senior level
99K-232K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead data engineering engagements to design, build, and maintain ETL/ELT pipelines and cloud data architectures. Manage client accounts and mentor teams, leverage tools like DataStage, AWS/Redshift, DB2/SQL Server, GoldenGate, and BI/visualization platforms to deliver analytics, performance tuning, and scalable reporting solutions.
Top Skills: AWSBirtCdcDatastageDb2Etl/EltGlueGoldengateJavaPythonQlikviewRedshiftS3SpotfireSQL Server
4 Days Ago
In-Office or Remote
92K-164K Annually
Senior level
92K-164K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and operate scalable, secure cloud-based data platforms and pipelines across the full data engineering lifecycle. Instrument and monitor pipelines, optimize performance, troubleshoot production issues, reduce technical debt, drive cloud and open-source adoption, and maintain documentation and governance for federal and military healthcare data solutions.
Top Skills: AzureCi/CdDevOpsGoogle Cloud Platform (Gcp)OraclePostgresSQLSQL Server

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account