Databricks

PhD GenAI Research Scientist Intern

Reposted 19 Days Ago
In-Office
San Francisco, CA, USA
$54-$60 Hourly
Internship
Assist the research team in developing and evaluating domain adaptation methods for LLMs and AI systems focused on enterprise domains.
The summary above was generated by AI

Company Description:

At Databricks, we are obsessed with enabling data teams to solve the world’s toughest problems, from security threat detection to cancer drug development. We do this by building and running the world’s best data and AI platform, so our customers can focus on the high value challenges that are central to their own missions.

The Mosaic AI organization enables companies to develop AI models and systems using their own data, with technologies ranging from fine-tuning LLMs for enterprise domains, to a platform for building compound AI systems that use retrieval and agents. Mosaic AI is committed to the belief that a company’s AI models are just as valuable as any other core IP, and that high-quality AI models should be available to all.

Job description:

Most of the world's data+AI problems lie in enterprise domains, behind closed doors. Our research team's goal is to push the frontier of "domain adaptation": how can we develop LLMs and AI systems that work well for custom domains? To do this, we are tackling open research problems on a range of topics, including scaling and automating evaluation, fine-tuning with synthetic data, retrieval augmentation, and fast, efficient inference.

You will work with our research team on projects focused on adapting LLMs and AI systems towards enterprise domains. This may include:

  • Adapting, improving, and evaluating a method from the literature.
  • Designing an entirely new method for domain adaptation.
  • Composing multiple methods to create new recipes for efficient post-training.
  • Evaluating LLMs and AI systems.

Your qualifications and qualities:

  • Required:
    • Research experience in and proficiency with the fundamentals of deep learning.
    • Pursuing a PhD in computer science or a related field (electrical engineering, neuroscience, physics, math, etc.).
    • Proficiency in software engineering, including with PyTorch.

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in, visit our page here.


SF Bay Area Hourly Rate
$54-$60 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

Benefits

At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, click here.

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Databricks San Francisco, California, USA Office

160 Spear Street, San Francisco, CA, United States, 94105

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology, thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
