Genentech

Data Scientist

Posted 9 Days Ago
In-Office
2 Locations
127K-236K Annually
Expert/Leader
The Data Scientist will develop and deploy ML models, automate workflows, and handle unstructured data while communicating effectively with stakeholders. A strong background in ML, programming, and business acumen is required.
The summary above was generated by AI
The Opportunity

As a Data Scientist, you will have a strong foundation in machine learning (ML), data science, and software engineering. You will have practical experience in building and deploying ML models and developing AI agents, particularly for tasks involving structured and unstructured data and workflow automation.

Key Responsibilities:

  • Machine Learning and Deep Learning: The candidate must be proficient in a wide range of ML algorithms, from traditional models like linear regression and decision trees to more advanced deep learning architectures such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). They should understand the principles behind model training, validation, and hyperparameter tuning.

  • Natural Language Processing (NLP): Strong NLP skills are essential for extracting information from unstructured text. The candidate should have experience with techniques such as tokenization, sentiment analysis, named entity recognition, and topic modeling, and with pre-trained language models such as BERT, GPT, or others from the Hugging Face ecosystem.

  • Data Handling and Feature Engineering: They should be adept at working with various data formats and have experience in data cleaning, preprocessing, and transforming raw data into useful features for ML models. This includes handling missing values, encoding categorical data, and scaling numerical features.

  • Programming and MLOps: Proficiency in Python is a must, along with a solid understanding of key libraries like Scikit-learn, Pandas, TensorFlow, and PyTorch. Experience with MLOps (Machine Learning Operations) practices, including model versioning, monitoring, and deployment on cloud platforms (AWS, Azure, or GCP), is crucial for building and maintaining robust solutions.

  • AI Agent Architectures: The candidate should understand the components of an AI agent, including a Large Language Model (LLM) as the brain, tools for specific tasks, and a logical structure for decision-making.

  • Workflow Automation: The candidate should have practical experience in designing and implementing automated workflows. This involves integrating AI agents and ML models into existing business processes. They should be able to identify bottlenecks, map out a solution, and build the necessary connectors or APIs to execute tasks automatically.

  • Unstructured Data: The candidate needs to demonstrate expertise in handling various forms of unstructured data, including text, images, and audio. This involves building pipelines to ingest, process, and analyze this data to extract meaningful insights or trigger actions.

Who you are

  • Problem-Solving: The ability to break down complex business problems into manageable, data-driven solutions is key. They should be able to think critically and creatively to solve real-world challenges.

  • Communication: A great candidate can clearly articulate technical concepts to non-technical stakeholders, explaining the "why" and "how" of their solutions. This is vital for collaborating with different teams and ensuring the project meets business goals.

  • Business Acumen: The best candidates understand the business context of their work. They should be able to connect their technical solutions directly to a positive impact on the company's bottom line or operational efficiency.

Education & Academic Background

  • Minimum Requirement: A Bachelor’s degree in a highly quantitative field (Computer Science, Data Science, or a related field).

  • Preferred: A Master’s in a specialized domain such as Machine Learning, Computational Statistics, Operations Research, or a related quantitative discipline.

  • Proven Track Record: At least 7 years of professional experience in data science, with a clear history of taking AI applications from conceptualization to production environments.

  • Data Handling: Expertise in handling unstructured data

  • Advanced ML Expertise: Experience with supervised/unsupervised learning, deep learning (CNNs, Transformers), and reinforcement learning; proficiency in building agentic workflows, including RAG integration and LLM orchestration

  • Data Infrastructure: Expertise in SQL and experience working with cloud platforms (AWS, GCP, or Azure)

  • Large Language Model expertise required

  • Experience with Diagnostics and/or Pharmaceutical data is a plus

The Pleasanton location (where the team resides) is highly preferred. The position can be remote for exceptional candidates.

Relocation benefits are not available for this posting.

The expected salary range for this position based on the primary location of California is $127,200 - $236,200. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.

Benefits 

Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including but not limited to discrimination on the basis of protected veteran status or disability status, consistent with all federal, state, and local laws.

If you have a disability and need an accommodation in relation to the online application process, please contact us by completing the Accommodations for Applicants form.

Top Skills

AWS
Azure
Deep Learning
GCP
Machine Learning
Natural Language Processing
Pandas
Python
PyTorch
Scikit-Learn
SQL
TensorFlow
HQ

Genentech South San Francisco, California, USA Office

1 DNA Way, South San Francisco, CA, United States, 94080
