AbsenceSoft

AI Data Engineer

Reposted 4 Days Ago
Remote
Hiring Remotely in United States
152K-190K Annually
Senior level
The AI Data Engineer will build and manage data pipelines and infrastructure for ML/LLM applications, ensuring data quality and compliance, and collaborating with data scientists and AI teams.
The summary above was generated by AI

At AbsenceSoft, we’re transforming the employee experience. Our secure, intuitive technology helps employers bring humanity, certainty, and efficiency to some of the most complex moments in the workplace. Built by HR professionals for HR professionals, we’re proud of where we’ve been and even more excited about where we’re going.


We’re seeking a Data Engineer to design and manage the data pipelines, platforms, and tools that power intelligent AI applications. You will work closely with data scientists, AI software engineers, and product teams to ensure our ML and LLM workloads are backed by scalable, secure, and high-performance data infrastructure. This is a hands-on, high-impact role where the reliability and flexibility of the data architecture are paramount.



What you'll do

  • Design, build, and maintain data pipelines for structured, unstructured, and semi-structured data sources.
  • Develop and optimize data models, ETL processes, and batch/streaming data infrastructure.
  • Partner with data scientists to support training, evaluation, and deployment of ML and LLM models.
  • Implement scalable architectures for embeddings, vector databases, and retrieval pipelines.
  • Enable real-time and offline analytics workflows using best-in-class data engineering practices.
  • Ensure data quality, lineage, observability, and governance across all data products.
  • Deploy secure, cloud-native data infrastructure (AWS, Azure, GCP) for high-volume AI workloads.
  • Contribute to the design of feature stores and MLOps platforms for continuous learning and model updates.
  • Collaborate on Responsible AI workflows to ensure compliant data usage and access controls.
  • Continuously evaluate new tools and technologies for improving performance, reliability, and agility. 

What you'll bring

  • 5+ years of experience as a Data Engineer building large-scale, production-grade data pipelines.
  • Strong command of SQL, Python, and distributed data processing frameworks (Spark, Flink, Beam).
  • Hands-on experience with ETL/ELT tools and orchestration systems (Airflow, dbt, Prefect, Dagster).
  • Familiarity with cloud-native data platforms (Snowflake, BigQuery, Redshift, Databricks).
  • Experience supporting ML/AI workloads and collaborating with model development teams.
  • Knowledge of vector databases (FAISS, Pinecone, Weaviate) and embeddings management.
  • Understanding of data privacy, access control, and compliance in regulated environments.
  • Proficiency in modern DevOps tooling for data infrastructure (Docker, Terraform, CI/CD).
  • Ability to work autonomously and thrive in a fast-paced, collaborative environment.

Nice to Have

  • Cloud: AWS (Redshift, S3, Lambda), Azure (Data Lake, Synapse), GCP (BigQuery, Cloud Functions)
  • Streaming: Kafka, Kinesis, Pub/Sub, Spark Streaming, Apache Flink
  • Workflow Tools: dbt, Airflow, Dagster, Prefect
  • Storage & Processing: Snowflake, Databricks, Parquet, Delta Lake
  • Vector Search: FAISS, Pinecone, Weaviate, txtai 

Why join us


At AbsenceSoft, we LEAD with our values:

Lead with Innovation - We create meaningful change through intelligence, focus and passion.  We embrace curiosity, data, and insight to shape the future of our industry. Always innovating, learning and evolving.

Elevate Every Voice - Every perspective matters. We listen, learn, and build a culture where diversity of thought and experience drives better solutions and smarter decisions. 

Achieve Together - The customer fuels everything we do. We share knowledge, collaborate, celebrate wins, and face challenges as one team because success is always a collective achievement.

Drive Outcome - Every action we take delivers measurable value to our teams, our customers, and the employees they support. Accountability is non-negotiable. We honor our commitments, take responsibility for results, and see every success and setback as a chance to grow stronger.

 

We offer:

  • Impact that matters. You’ll do work that shapes the future of the modern workplace.
  • Flexibility and trust. We’re remote-first and results-driven. You’ll have the freedom and flexibility to do your best work, wherever you do it best.
  • Growth and development. We believe the best work happens when people are growing. You’ll have access to learning resources, leadership programs, and real opportunities to take on new challenges and expand your impact.  
  • Competitive rewards. We offer comprehensive benefits, a performance-based bonus program, and equity opportunities – because when we grow, you should too.
  • Time for life. Recharge and reconnect with flexible time off, paid holidays, and flexible leave programs designed to support every season of life.
  • Belonging and balance. We’re building an inclusive culture where every voice is valued, collaboration is celebrated, and success is shared.

 

We’re committed to building a team as diverse as the customers we serve. If your experience doesn’t align perfectly with every qualification, we still encourage you to apply; you might be exactly what we’re looking for. If this sounds like a fit, apply today. We’d love to meet you!

Top Skills

Airflow
Beam
BigQuery
Dagster
Databricks
dbt
Docker
FAISS
Flink
Pinecone
Prefect
Python
Redshift
Snowflake
Spark
SQL
Terraform
Weaviate
