RevenueBase Logo

RevenueBase

Senior Data & AI Platform Engineer (AWS, Snowflake, Vector Search)

Reposted Yesterday
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
Build production-grade AI-powered data tooling: extract data from Snowflake, generate and store embeddings, enable semantic search, design enrichment pipelines using LLM APIs, optimize AWS infrastructure, and create reusable services and SDKs for scalable, observable data and AI workflows.
The summary above was generated by AI
RevenueBase:
  • We're building the data infrastructure that makes AI agents trustworthy instead of error-prone.

  • We provide continuously refreshed, verified B2B data for autonomous AI agents and GTM workflows.

  • We've tripled growth while maintaining 100% gross dollar retention and staying cashflow positive.

  • We power AI agents for Clay, Zoominfo, Dun & Bradstreet, and the next generation of AI GTM tools.

About the Role

We are looking for a Senior Data & AI Platform Engineer to build internal tools and services on top of our large-scale data infrastructure. Your primary focus will be developing systems that leverage vector embeddings, LLM APIs, and semantic search to unlock value from structured and unstructured data.

This is a hands-on engineering role for someone who enjoys building practical AI-powered tools — not just experiments — and shipping them into production in a fast-moving startup environment.

What You’ll Do
  • Design and build data-driven tools that operate on large datasets stored in S3 and Snowflake

  • Implement pipelines that:

    • Extract specific columns or datasets from Snowflake

    • Generate vector embeddings via APIs such as OpenAI

    • Store and manage embeddings in vector databases like Pinecone

    • Enable semantic search and similarity-based retrieval

  • Develop enrichment workflows that:

    • Query structured data

    • Use LLM APIs to generate new derived columns

    • Write enriched results back into Snowflake

  • Build reusable internal services and SDKs around embedding generation, prompt orchestration, and data augmentation

  • Optimize performance and cost across AWS infrastructure

  • Work closely with product and data teams to turn use cases into scalable engineering solutions

  • Ensure reliability, observability, and maintainability of AI-powered pipelines

Example Projects
  • Tool to extract a single Snowflake column, generate embeddings, push to Pinecone, and expose a semantic search API

  • Batch enrichment pipeline that queries records from Snowflake, calls OpenAI APIs for structured enrichment, and writes new columns back

  • Internal framework for LLM-based data transformation and validation

  • Query abstraction layer to make AI-enhanced analytics accessible to non-engineering teams

Required Qualifications
  • 5+ years of software engineering experience

  • Strong backend engineering skills (Python preferred; other modern languages acceptable)

  • Solid experience with:

    • AWS (IAM, Lambda, ECS/EKS, S3, networking, security best practices)

    • Data warehousing (Snowflake preferred)

    • API design and distributed systems

  • Hands-on experience working with LLM APIs (e.g., OpenAI) and embedding workflows

  • Experience with vector databases (Pinecone or similar)

  • Strong understanding of data modeling, ETL/ELT patterns, and performance optimization

  • Production experience in at least one startup environment

  • Ability to operate independently and ship high-impact systems end-to-end

Nice to Have
  • Experience building internal developer platforms or data tooling

  • Familiarity with prompt engineering and evaluation pipelines

  • Experience with orchestration frameworks (Airflow, Prefect, Dagster)

  • Exposure to retrieval-augmented generation (RAG) systems

  • Infrastructure-as-code experience (Terraform, CDK)

  • Experience managing large-scale embedding refresh and re-indexing workflows

What Success Looks Like
  • Engineers and analysts can easily leverage AI-powered data enrichment

  • Embedding-based search works reliably at scale

  • New AI use cases can be implemented quickly using shared internal tooling

  • Systems are robust, observable, and cost-efficient

Why Join Us?
  • Work on practical, production-grade AI systems

  • Direct impact on how data is leveraged across the company

  • Startup speed with real ownership and autonomy

  • Opportunity to define the internal AI platform from the ground up

Top Skills

AWS
Ecs
Eks
Embeddings
Iam
Lambda
Llm Apis
Openai
Pinecone
Python
S3
Semantic Search
Snowflake
Vector Databases

Similar Jobs

11 Minutes Ago
Remote
NY, USA
100K-130K Annually
Mid level
100K-130K Annually
Mid level
Conversational AI • Generative AI
Lead cross-functional initiatives for an AI-powered writing platform, overseeing project lifecycles and ensuring alignment across teams.
Top Skills: AgileAISaaSScrum
31 Minutes Ago
Remote
United States
Senior level
Senior level
AdTech • Big Data • Digital Media • Marketing Tech • Database • Automation
The Director of Product Management will lead product strategy and outcomes for Adstra's data and identity solutions, mentor product teams, and drive product discovery and delivery that aligns with customer needs and business outcomes.
Top Skills: AdtechAIAPIsDataIdentityMartechMl
31 Minutes Ago
In-Office or Remote
United States
Mid level
Mid level
AdTech • Big Data • Digital Media • Marketing Tech • Database • Automation
The Brand Experience Lead at Adstra will develop and execute brand-led programs, manage content across channels, and support B2B marketing initiatives.
Top Skills: Basic AnalyticsCanvaCmsEmail PlatformsFigmaSocial Schedulers

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account