
Wikimedia Foundation

Senior Data Scientist (Contract)

Posted Yesterday
Remote
Hiring Remotely in USA
51-80 Hourly
Senior level
The contractor will analyze traffic sources and engagement patterns for Wikimedia projects, producing reports and documentation while collaborating with internal teams.
The summary above was generated by AI

Senior Data Scientist (4-month, full-time)

Contract Duration: 16 weeks

Location: Remote (but with the ability to take meetings between 9:00 am and 5:00 pm EST)

Summary

Project Overview

The Wikimedia Foundation is undertaking a research initiative to better understand how people reach Wikimedia projects, how different traffic sources relate to on-wiki engagement, and how changes in visibility affect content quality and contributor activity. This work will support future improvements to search visibility, content reuse partnerships, and movement-wide understanding of contributor pathways.

Scope of Work

The contractor will support analytical components of this initiative, focusing on:

  • Traffic Health & Visibility Trends: Producing descriptive analyses of human traffic patterns, referrers, and indicators of traffic stability over time.
  • Engagement & Attribution Exploration: Supporting preliminary assessments of how traffic sources relate to on-wiki engagement (e.g., likelihood of exploring additional pages or initiating contribution).
  • Natural Experiments & Content Quality: Assisting in analyses of cases where sudden changes in visibility allow us to study downstream impacts on editing activity or content quality.

Specific methods and models will be selected based on data feasibility, privacy guidance, and consultation with internal teams.

Deliverables

Analytic Deliverables
  • Cleaned/joined datasets, summary tables, and Jupyter notebooks supporting analyses
  • Time-series analyses for Traffic Health indicators and content reusers
  • Natural experiment analyses with interpretable visuals and written summaries
Documentation Deliverables
  • Method documentation describing assumptions, data limitations, and analytical decisions
  • Short briefs or memos explaining key findings for internal stakeholders
  • Clear handover materials enabling reproducibility
Potential Deliverables (Depending on Time & Feasibility)
  • Early prototype views for dashboards (Superset/Turnilo)
  • Early specification drafts for indicators or experimental frameworks
Qualifications

Technical Skills
  • Advanced SQL (large-scale distributed datasets)
  • Python expertise (pandas, numpy, statsmodels, scikit-learn, Jupyter)
  • Ability to work collaboratively in GitLab repositories
  • Time-series modeling experience
  • Applied causal inference (Diff-in-Diff, event studies, lag analysis)
  • Experience working with log-level or large behavioral datasets
Analytical Skills
  • Ability to evaluate data feasibility and design methodological approaches
  • Ability to interpret and communicate analytical uncertainty
  • Strong documentation practices and reproducibility mindset
Soft/Collaborative Skills
  • Ability to work independently in a fast-moving, ambiguous research environment
  • Strong communication with non-technical stakeholders
  • Ability to manage competing priorities across multiple research modules
  • Knowledge of the Wikimedia movement and ecosystem a plus
Collaboration & Reporting

The contractor will collaborate with the Product & Technology department and work closely with Data Engineering, Research & Decision Science, and relevant program teams. They will report to the project’s Staff Data Scientist.

Timeline

4 months (full-time), with sequencing of work dependent on data access, privacy reviews, and research needs.

About the Wikimedia Foundation

The Wikimedia Foundation is the nonprofit organization that operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge freely. We host Wikipedia and the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive. 

The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive donations from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.

As an equal opportunity employer, the Wikimedia Foundation values having a diverse workforce and continuously strives to maintain an inclusive and equitable workplace. We encourage people with a diverse range of backgrounds to apply. We do not discriminate against any person based upon their race, traits historically associated with race, religion, color, national origin, sex, pregnancy or related medical conditions, parental status, sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, or any other legally protected characteristics.

The Wikimedia Foundation is a remote-first organization with staff members, including contractors, based in 40+ countries*. Salaries at the Wikimedia Foundation are set in a way that is competitive, equitable, and consistent with our values and culture. The anticipated pay range of this position for applicants based within the United States is US$51/hour to US$80/hour, with multiple individualized factors, including cost of living in the location, determining the offered pay. For applicants located outside of the US, the pay range will be adjusted to the country of hire. We neither ask for nor take into consideration the salary history of applicants. The compensation for a successful applicant will be based on their skills, experience, and location.

*Please note that we are currently able to hire in the following countries: Australia, Austria, Bangladesh, Belgium, Brazil, Canada, Colombia, Costa Rica, Croatia, Czech Republic, Denmark, Egypt, Estonia, Finland, France, Germany, Ghana, Greece, India, Indonesia, Ireland, Israel, Italy, Kenya, Mexico, Netherlands, Nigeria, Peru, Poland, Singapore, South Africa, Spain, Sweden, Switzerland, Uganda, United Kingdom, United States of America, and Uruguay. Our non-US employees are hired through a local third-party Employer of Record (EOR).

We periodically review and streamline this list to ensure alignment with our hiring requirements.

All applicants can reach out to their recruiter to understand more about the specific pay range for their location during the interview process.

If you are a qualified applicant requiring assistance or an accommodation to complete any step of the application process due to a disability, you may contact us at [email protected] or +1 (415) 839-6885.

More information

Applicant Privacy Policy

Wikimedia Foundation

What does the Wikimedia Foundation do?

What makes Wikipedia different from social media platforms?

Our Projects

Our Tech Stack

News from across the Wikimedia movement

Wikimedia Blog

Wikimedia 2030

 

Top Skills

Jupyter
Numpy
Pandas
Python
Scikit-Learn
SQL
Statsmodels

Wikimedia Foundation San Francisco, California, USA Office

1 Montgomery Street, San Francisco, CA, United States, 94104


