Develop and maintain data infrastructure for AV safety by building data pipelines, conducting analyses, and assessing events to inform safety metrics and regulatory compliance.
As an AV Safety Data Engineer, you will play a critical role in developing and maintaining robust data infrastructure to surface safety-relevant interactions, assess AV events, and inform autonomy development through data-driven insights. You will build scalable pipelines and metrics that ensure we receive strong, reliable signals from data, directly influencing how we assess system readiness and accelerate safe deployment, while also supporting the crucial task of event triage and analysis.
In this role you will:
- Develop and maintain miners that identify events of interest from fleet and simulation data.
- Improve the accuracy and efficiency of existing miners through advanced statistical and ML techniques, ensuring high signal quality.
- Build scalable data pipelines and dashboards to monitor miner performance, signal quality, and key safety metrics.
- Conduct exploratory data analyses to uncover trends, failure modes, and opportunities to improve vehicle behavior, specifically focusing on critical safety events and system anomalies.
- Triage and assess AV events including both system behavior and hardware related anomalies, for potential safety/legal/regulatory concerns or system gaps, using contextual judgment grounded in safety, legal, and engineering principles.
- Develop and maintain clear triage guidelines and criteria for both onshore and offshore triage teams.
- Support regulatory reporting processes by providing data and analysis as needed.
Requirements
- Master’s Degree in Computer Science, Data Science, Statistics, Applied Mathematics, or a related quantitative or Engineering field such as Transportation Engineering, Systems Engineering, Civil Engineering, Mechanical Engineering, or Electrical Engineering.
- 6-8 years of experience working with safety-critical systems in AV, Automotive, Transportation Engineering, or a closely related industry.
- Strong programming skills in Python and SQL.
- Experience with large datasets and distributed data processing frameworks (e.g., Spark, Databricks).
- Strong data engineering skills, including data wrangling and expertise in building maintainable, scalable, and efficient data pipelines.
- Familiarity with statistics and probability, and experience with sampling and estimation methodologies.
- Proven ability to design metrics, run analyses, and translate findings into actionable recommendations.
- Strong understanding of behavioral safety concepts and the ability to interpret vehicle behavior in real-world contexts, along with knowledge of AV requirements, road user behavior, and traffic regulations.
- Experience working with cross-functional teams and clearly communicating technical content to both engineering and non-technical audiences.
- Strong organizational skills with attention to detail, especially in process development and documentation related to data pipelines and safety analysis.
Bonus Qualifications
- PhD in Engineering or a related technical field.
- Prior experience with AV safety or system behavior triage.
- Experience with geospatial data analysis.
- Experience with business intelligence tools (e.g., Looker) to support analysis.
- Familiarity with ISO 26262 or other functional safety standards
Benefits
- Health Care Plan (Medical, Dental & Vision)
- Life Insurance (Basic, Voluntary & AD&D)
- Paid Time Off (Vacation, Sick & Public Holidays)
- Training & Development
- Retirement Plan (401k, IRA)
- Free breakfast and lunch
Top Skills
Databricks
Python
Spark
SQL
Sigma Software Vertex San Jose, California, USA Office
San Jose 75E Santa Clara St, San Jose, California, United States, 95113
Similar Jobs
Fintech • Machine Learning • Payments • Software • Financial Services
As a Distinguished Data Engineer, you will lead data architecture, manage high-availability data solutions, and collaborate with various teams to enhance customer experiences. You will leverage cutting-edge technologies while mentoring others and driving engineering best practices.
Top Skills:
AWSKafkaPythonScalaSnowflakeSQL
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Principal AI Data Engineer is responsible for designing and operating AI-driven data platforms, focusing on Snowflake and LLM integrations, while leading technical initiatives and mentoring teams.
Top Skills:
AWSDbtGeminiGitlabGleanSalesforce AiSnowflakeTerraform
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
The role involves architecting and maintaining marketing databases, implementing data infrastructure for AI/ML, managing data pipelines, and collaborating with stakeholders to develop data-driven solutions.
Top Skills:
DatabricksDomoGoogle AnalyticsGoogle BigqueryKubeflowLookerMarketoMixpanelMlflowPythonRdsRedshiftSalesforceSnowflakeSQLTableau
What you need to know about the San Francisco Tech Scene
San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine



