LiveKit Logo

LiveKit

Senior Data Engineer

Sorry, this job was removed at 02:35 p.m. (PST) on Tuesday, May 26, 2026
Remote
Hiring Remotely in U.S.
Remote
Hiring Remotely in U.S.

Similar Jobs

10 Days Ago
Easy Apply
Remote or Hybrid
Easy Apply
136K-160K Annually
Senior level
136K-160K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Design, build, and maintain SparkSQL/PySpark data pipelines in the central data lake to ingest IoT, product, and unstructured data (video/audio). Produce reliable computed tables for analytics, model training, and dashboards; integrate external datasets; ensure high data quality and uptime; and collaborate with Data Science, ML, and cross-functional teams.
Top Skills: AirflowAWSAzureDagsterData LakeDatabricksDelta LakeETLGCPGitGitPrefectPysparkPythonRest ApisSparkSparksqlSQL
Yesterday
In-Office or Remote
120K-160K Annually
Senior level
120K-160K Annually
Senior level
Fintech • Payments • Financial Services
Build, own, and evolve Flinks' data and ML platform: BigQuery/dbt pipelines, Airflow ingestion, Kubeflow/Vertex training, model serving, governance, observability, and cross-team data contracts to support production ML and analytics at scale.
Top Skills: AirflowAzure DevopsBashBigQueryCi/CdCloud ComposerCloud FunctionsDbtDockerFastapiGcp LoggingGoogle Cloud Platform (Gcp)GrafanaKubeflowMlflowProtocol BuffersPub/SubPythonSQLTerraformVertex Ai
2 Days Ago
Remote
130K-150K Annually
Senior level
130K-150K Annually
Senior level
Software
Lead optimization and scaling of Snowflake architecture, build resilient ELT/ETL and MLOps pipelines, implement data observability, automate workflows with AI-assisted tools, and design enterprise semantic models to support analytics and commercial data products.
Top Skills: Aws GlueAws LambdaBi ToolsCursorDatadogDynamic TablesEltFivetranMaterialized ViewsMlopsPythonRbacSnowflakeSnowflake CortexSnowpipeSQLStored ProceduresUipath

LiveKit is building the infrastructure layer for the voice-driven era of computing. Our platform gives developers everything they need to build, test, deploy, scale, and observe agents in production. Founded in 2021, LiveKit powers voice AI applications for OpenAI, xAI, Salesforce, Coursera, Spotify, and thousands of others, collectively facilitating billions of calls each year.

You'll thrive at LiveKit if you:
  • obsess with crafting code that is fast, reliable and practical for the problem

  • are known as the go-to person for tackling tough technical problems

  • work hard and can build and ship fast

  • can clearly explain complex technical concepts to others

  • are a fast learner, frequently picking up new languages and tools

The best way to impress us is with thoughtful Issues and/or PRs on our Github repos 😊

About This Role:

As a Senior Data Engineer at LiveKit, you'll own the analytics infrastructure that powers our business intelligence and data analysis capabilities. Working closely with the Head of Data and analytics peers, you'll design and implement scalable GCP-based data pipelines — from ingestion through transformation to delivery — maximizing the GCP ecosystem for cost-effective solutions while integrating additional services or homegrown tooling where appropriate. While analytics infrastructure is the core focus, you'll also engage with the broader application data infrastructure, contributing your data pipeline expertise to support product and engineering needs. This is a foundational IC role with significant ownership over the architecture and direction of our analytics stack as the team grows.

What You’ll Do:

Own the Analytics Infrastructure: You are the end-to-end owner of our GCP-based data infrastructure — including ingestion, movement, storage, security, and availability. You build and operate reliable, scalable pipelines that power analytics, and partner closely with the Analytics team on downstream transformation and BI.

Maximize the Cloud Ecosystem: Build cost-effective solutions primarily within GCP-native services, while bringing transferable cloud infrastructure expertise. Know when to extend with third-party tooling or homegrown solutions, and make pragmatic tradeoffs.

Contribute Across Data Infrastructure: While analytics is the primary focus, you'll bring broad data pipeline expertise to application data needs in collaboration with the product engineering team.

Managed Services First: Favor managed solutions over self-hosting. Evaluate build vs. buy with cost and operational burden in mind.

Engineering Standards: This role reports to the Head of Data within the Engineering org. Expect PR reviews, automated testing, proper change management, and production-grade standards.

AI-First Development: Work extensively with AI coding assistants and contribute to evolving our AI development workflows and infrastructure.

Startup Pace: Priorities shift quickly. Balance long-term architectural thinking with the tactical execution the moment requires.

Who You Are:
  • 8+ years of experience in data engineering with strong Python and SQL expertise. You've built analytics data infrastructure from scratch — ideally more than once — and owned the architecture end-to-end

  • Experience with cloud-native data infrastructure (GCP preferred; strong AWS builders who can translate cloud concepts welcome). Familiarity with BigQuery, Dataflow, Cloud Storage, or equivalent services

  • Proven ability to design and implement production-grade data pipelines and aggregation layers for BI and analysis

  • AI-first development mindset with hands-on experience building AI-driven workflows and effectively using AI coding assistants

  • Strong understanding of data modeling, transformation patterns, and working with dbt

  • Experience with data movement tools (Estuary, Airbyte, Fivetran, or similar)

  • Solid infrastructure and DevOps fundamentals: Terraform or similar IaC, CI/CD, Git workflows, and change management

  • Experience implementing observability and monitoring for data systems (DataDog, Grafana, or similar)

  • Strong communication skills and ability to work cross-functionally with engineering and business stakeholders

  • Self-directed and comfortable with ambiguity in a fast-paced startup environment

  • Located in the US or Canada

Bonus
  • Experience coordinating with dbt and analytics engineering teams

  • Background with AI workflow tools (n8n or similar)

  • Background with AI coding assistants

  • Prior experience as an early infrastructure hire building from the ground up

  • Prior experience building on GCP/BigQuery in production

Our Commitment to You:
  • An opportunity to build something truly impactful to the world

  • Contribute to open source alongside world-class engineers

  • Competitive salary and equity package

  • Health, dental, and vision benefits

  • Flexible vacation policy

LiveKit is an equal opportunity employer and does not discriminate on the basis of any characteristic protected by applicable law. If you require a reasonable accommodation during the application or interview process, please contact [email protected].

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account