Maximum of 25 job preferences reached.
Top Staff Data Engineer Jobs in San Francisco, CA
Big Data • Machine Learning • Software • Analytics • Big Data Analytics
The role involves deep troubleshooting, root cause analysis, and architectural optimization in the Data and AI ecosystem to enhance platform reliability and supportability.
Top Skills:
Delta LakeHiveJavaPythonScalaSparkSQL
Big Data • Healthtech
Lead design, build, and support of cloud-based ETL, data warehousing, and reporting solutions. Develop/optimize ETL pipelines, data models, and queries; deploy containerized services on Kubernetes; automate workflows; mentor teammates; collaborate with architects, data scientists, and stakeholders to deliver scalable analytics and reporting products.
Top Skills:
APIsAWSAws GlueChatgptCopilotDbtDockerETLJIRAKubernetesMySQLOraclePostgresPythonRedshiftRubyStarburst
Automotive • Robotics • Software • Transportation
Build and automate large-scale ML data pipelines and tooling for autonomous trucking. Improve dataset curation, deduplication, labeling, evaluation, and metrics infrastructure. Collaborate with robotics, autonomy, and infra teams to support continuous learning, deployment, and scalable AI infrastructure for Kodiak's fleet.
Top Skills:
Apache AirflowData LakeEltLlmsMetaflowPythonSQL
Artificial Intelligence • Information Technology • Robotics • Software
Architect and develop robust, petabyte-scale data pipelines, provide technical leadership, and collaborate with ML Engineers at Watney Robotics.
Top Skills:
GoJavaPythonRustScala
Information Technology • Software • Analytics
The Staff Data Engineer will design and maintain data infrastructure using Azure and Databricks, optimize data workflows, ensure data quality, and collaborate with teams for data-driven decisions.
Top Skills:
SparkAzureCi/CdDatabricksDelta LakeGitPower BIPythonSQL
Cloud
Design and implement data-intensive platform components, mentor engineers, and optimize streaming infrastructure to enable scalable data services at Okta.
Top Skills:
AWSBeamFlinkHadoopJavaKafkaKinesisKubernetesSnowflakeSpark
Automotive
You will create large datasets and training recipes, develop methods for data mining, create scalable infrastructure solutions, and collaborate with ML infrastructure teams.
Top Skills:
C++Python
Big Data • Machine Learning • Software • Analytics • Big Data Analytics
As a Staff Software Engineer, you will build distributed data systems, focusing on reliability and performance for large-scale data processing, using technologies like Apache Spark™ and Delta Lake.
Top Skills:
C++JavaScala
Software
Design and deploy scalable data infrastructure and models, maintain CDC pipelines, build reporting tools and dashboards, write high-performance SQL, partner with product and engineering to enable enterprise-wide analytics and data-driven decision making.
Top Skills:
AirflowAWSBigQueryCdcDagsterDbtDebeziumFivetranLookerModeRedshiftSigmaSnowflakeSQLStitchTerraform
Fintech • Real Estate • Software
As a Staff Software Engineer, you'll design and deliver a core data platform, focusing on architecture, pipeline components, and AI integration. This hands-on role emphasizes creating production code and establishing technical standards within a small team.
Top Skills:
AWSEltETLLlmsRedshift
Automotive
Own and develop concurrent C++ backend services for Webviz to stream time-series and sensor data, integrate offboard storage and WebRTC, optimize latency and throughput, build APIs for automated triage/evaluation, plan technical roadmaps, and mentor engineers.
Top Skills:
BoqBorgC++CnsRpcSpannerWebrtc
Transportation
Build scalable ML data pipelines for Waabi's autonomous driving platform. Design, optimize, and manage datasets and training processes while collaborating with scientists and engineers.
Top Skills:
Apache AirflowApache BeamApache HadoopSparkAws Step FunctionsGoogle Cloud DataflowJaxPythonPyTorchTensorFlow
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Gaming • Hardware
Design and build large-scale data pipelines, lakes, and warehouses; develop ETL, data models, monitoring, and governance; collaborate with product and cross-functional teams; lead and mentor engineers while driving technical strategy for advertising data platforms.
Top Skills:
Apache FlinkSparkBigtableCassandraData LakeData WarehouseETLGoogle Cloud Platform (Gcp)HadoopHbaseJavaKafkaMySQLNoSQLPostgresPythonRabbitMQSQL
Big Data • Information Technology • Software • Analytics
Design, build, and operate petabyte-scale data infrastructure: real-time ingestion, storage (open table formats), and serving. Architect scalable Iceberg-based data lakes, optimize Spark pipelines and streaming (Kafka/Flink), drive reliability, cost efficiency, and data contracts; collaborate with product and platform teams and establish best practices and tooling.
Top Skills:
AirflowAmazon S3Apache FlinkApache IcebergApache KafkaSparkAws GovcloudKubernetesPythonScala
Big Data • Information Technology • Software • Analytics
Architect and build core data governance systems (fine-grained permissions, auditability, policy enforcement, metadata, labeling, compliance) for a multi-tenant platform. Drive projects end-to-end with Product, balance security and usability, own large components, mentor engineers, and ensure scalable, secure handling of billions of data points.
Top Skills:
AirflowAWSBedrockCeleryDjangoElasticsearchKafkaKubernetesMapboxPostgresPulumiPythonReactReduxSagemakerTerraform
Fintech • Payments
Design and build scalable, semantically consistent 360 entity data models and transformation pipelines. Implement cleansing, enrichment, KPIs, scoring, and business rules in SQL and Python/Scala, ensure data quality, lineage, and performance at scale. Use AI coding assistants and SDD to accelerate development, create tests and documentation, and collaborate with domain experts, data scientists, and product teams to deliver trusted, reusable data assets.
Top Skills:
Ai AgentsClaudeContext EngineeringCursorGithub CopilotLlmsMdmPrompt DesignPythonScalaSpec-Driven Development (Sdd)SQL
Real Estate • Travel • PropTech
As a Senior Staff Machine Learning Engineer, you will drive AI product development, collaborate with cross-functional teams, and enhance ML models at scale.
Top Skills:
Agile MethodologiesArtificial IntelligenceDeep LearningMachine LearningNlpSoftware Engineering
Software
Design and implement scalable backend systems for analytics, optimizing performance and resource efficiency while mentoring engineers and managing petabytes of data.
Top Skills:
C++JavaRubyScala
Artificial Intelligence • Information Technology • Machine Learning
Build and scale AI data and ML infrastructure: evaluation systems, fine-tuning pipelines, agent-first product surfaces, high-throughput data workflows, and integrations of new models into production.
Top Skills:
GCPGraphQLJavaKafkaKotlinKubernetesMySQLNode.jsPostgresPubsubPythonReactReduxSpannerTypescript
Cloud • Software
The Staff Software Engineer will design, build, and maintain scalable data infrastructure, ensure reliability, mentor engineers, and enhance data workflows in a collaborative environment.
Top Skills:
AirflowBashChefEmrGithub ActionsGoGrafanaHive MetastoreKubernetesPinotPythonSQLStarrocksTerraformTrinoVault
Social Media
Lead architecture and technical direction for Pinterest's conversion data privacy platform. Own design and operation of de‑identification pipelines, access controls, policy enforcement, deletion workflows, monitoring, and tooling. Partner cross‑functionally with product, data science, legal, and infra to translate privacy requirements, balance privacy and utility, and drive rollouts. Mentor engineers, lead reviews, and define privacy‑by‑design best practices for large‑scale data systems used in ads reporting and monetization.
Top Skills:
JavaKotlinPythonScalaSQL
Social Media
Lead architecture and strategic direction for Pinterest's data warehouse, analytics tools, and data governance. Drive cross-functional initiatives to build scalable data platforms, AI-assisted pipeline tooling, and analytics capabilities. Mentor engineers, define policies and tooling, and deliver measurable adoption and business impact across the company.
Top Skills:
AirflowClaude CodeCodexCursorDatahubExadataFlinkQuerybookSparkSupersetTrino
Social Media
Lead design and operation of identity resolution and data governance systems. Build batch and streaming pipelines, APIs, and tooling for identity ingestion, matching, lineage, quality, access controls, and privacy compliance (GDPR/CCPA). Collaborate with product, analytics, legal, and security to ensure privacy-by-design, monitoring, and documented runbooks.
Top Skills:
APIsAWSBatchCloud WarehousesData LakesScalaSparkStreaming
AdTech • Marketing Tech
Lead design and implementation of identity resolution and data governance systems. Build batch and streaming pipelines, APIs, and services to ingest, normalize, link, and version identity data. Ensure auditable matching, data lineage, quality checks, schema enforcement, access controls, and privacy-by-design. Collaborate cross-functionally with product, analytics, legal, privacy, and security teams and create documentation, monitoring, and runbooks.
Top Skills:
SparkAPIsAWSBatch And Streaming Data PipelinesCloud Data WarehouseData LakeScalaStorage Formats
Productivity • Software • Conversational AI
Design and maintain a dbt-based business data layer for GTM metrics, centralize and reconcile multi-source data, implement automated data quality and reconciliation checks, align and document metrics with stakeholders, mentor analytics team, and lead root-cause investigations into complex data anomalies.
Top Skills:
Ci/CdData ObservabilityDbtSQL
Let Your Resume Do The Work
Upload your resume to be matched with jobs you're a great fit for.
Success! We'll use this to further personalize your experience.
Top San Francisco Companies Hiring Staff Data Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results
.png)












_1.png)




_0.png)











