Outreachy/Past projects

This page tries to keep up with the current status of all past Outreach Program for Women/Outreachy projects.

See also Google Summer of Code/Past projects.

Quantitative summary of past Outreachy projectsEdit

Completed Outreachy projects since 2013:

In the 18 Outreachy rounds between 2013 and 2021, contributors joined from 22 countries: India, United States, Brazil, United Kingdom, Sri Lanka, Canada, Israel, Romania, Germany, Turkey, Cameroon, Kenya, Nigeria, Vietnam, Taiwan, Nepal, Bangladesh, Russia, Malaysia, Uganda, France, Pakistan.

Develop a web app for editing Toolhub recordsEdit

  • Mentees: Nicole Barnabee-Burns, Hannah Waruguru Njoroge
  • Mentors: Slavina Stefanova, Damilare Adedoyin
  • Outcome: Over the course of the internship, we developed a full-stack web application that could be used to improve discoverability of other Wikimedia tools. The tool identifies gaps in the Toolhub records of other tools, and presents a user-friendly interface for filling in the missing information.
  • Tech stack: The application was built with Vue.js on the front-end and Flask on the back-end, and is connected to a MariaDB database. Task queuing is handled by Celery, with Redis as a broker.
  • Relevant links: Toolhunt, Phabricator workboard, Frontend repository, Backend repository
  • Blog: Nicole's blog, Hannah's blog

Hybrid event production for QueeringWikipedia 2023Edit

  • Mentee: André Rodrigues
  • Mentors: Željko Blaće, Owen Blacker
  • Outcome: After investigating various FLOSS options and considering time commitments, we decided to use Zoom for regular meetings, Jitsi for unconference style sessions, and BigBlueButton for workshops and explanatory sessions. In addition, I conducted outreach and held office hours to promote the event during the internship period.
  • Relevant links: Phabricator page
  • Blog: André's blog

Develop features for Wiki Loves Monuments AppEdit

Develop a web app for patrolling based on the new ML-based service to predict revertsEdit

Rewrite Imagebulk tool to scale upEdit

  • Mentees: Enow97
  • Mentors: Jay Prakash and Sudhanshu
  • Outcome: The project involved rewriting the existing web app codebase using Vue.js and Flask, along with integrating Celery to improve the scalability and performance of the system. The resulting system will be able to handle large volumes of traffic and complex user interactions while remaining responsive and efficient. Although, code has been written under this project but deployment is still being left and will be handle by mentor (Jay Prakash).
  • Tech stack:
  1. Vue.js on the front-end
  2. Flask on the back-end
  3. Task queuing in Celery along with Redis as the broker
  4. Docker

Add support for tracking specific namespaces to Programs & Events DashboardEdit

  • Student: Vaidehi Atpadkar
  • Mentors: Sage Ross
  • Outcome: Dashboard now has a new feature of selecting specific wiki-namespaces for tracking and displaying the stats for them.
  • Relevant links: source code
  • Blog: Vaidehi's Blog

Build Python library to work with html-dumpsEdit

  • Student: Nazia Tasnim
  • Mentors: Martin Gerlach, Isaac Johnson
  • Outcome: mwparserfromhtml, a python-library to parse the Wikipedia HTML dumps.
  • Relevant links: source code
  • Blog: Nazia's Blog

What's in a name? Automatically identifying first and last author names for Wikicite and WikidataEdit

Automatically matching new Wikipedia articles with Wikidata items using PythonEdit

Automatically matching new Wikipedia articles with Wikidata items using PythonEdit

Develop learning toolkits and videos to demonstrate the use of essential tools for WikimediaEdit

Improve Wikidata support on Programs & Events DashboardEdit

  • Student: Ivana Novakovic-Lekovic
  • Mentors: Sage Ross
  • Outcome: Integrated Wikidata edit analysis into the Dashboard’s data update system; it now shares Wikidata edits details about merges, aliases, labels, claims, and more.
  • Relevant links: source code
  • Blog: Ivana's Blog

Refactor Mediawiki tests to use WebdriverIO AsyncEdit

  • Student: Osama Tahir
  • Mentors: Soham Parekh, Željko Filipin
  • Outcome: Refactored MediaWiki tests in wide range of extensions (such as Math, Newsletter, VisualEditor) to use WebdriverIO Async
  • Relevant links: source code
  • Blog: Osama's Blog


  • Student: Muniza A.
  • Mentors: Martin Gerlach and Isaac Johnson
  • Outcome: Developed WikiNav, a tool that processes the Wikipedia clickstream data to generate statistics and visualizations that help make this data more accessible to folks with varying levels of programming and data wrangling experience.
  • Relevant links: Phabricator task, demo application
  • Blog: Muniza's Blog

Developing mwsql: A Python package for working with Wikimedia SQL dumpsEdit

Synchronising Wikidata and Wikipedias using pywikibotEdit

Modules Research ToolEdit

Wiki-Reliability: A Large Scale Dataset for Content Reliability on WikipediaEdit

Wiki Country Inference Tool: A Model that Infers countries from Wikipedia ArticlesEdit

Developing a lightweight and efficient Content Filtration module for Wikimedia CommonsEdit

Review and improve Lua documentation on meta and mediawikiEdit

Enhancements to gdrive-to-commons uploader toolEdit

Productionize Wikidata-based Topic Model on ORESEdit

WikiContrib: Gather and analyze user contributions on Wiki and GitHubEdit

  • Student: Raymond Ndibe
  • Mentors: Srishti Sethi and Rammanoj potla
  • Outcome: 1) Implemented feature to count contributions made to Wikimedia repositories on GitHub 2) Implemented contributions caching feature 3) Implemented persistent URL feature 4) Fixed all outstanding issues and bugs 5) Improved the tool's UI/ UX.

Converting Campaign pages to ReactEdit

  • Student: Lalitha Reddy
  • Mentors: Sage Ross, Khyati Soneji
  • Outcome: Created the campaign navbar and the home tab component in React.
  • Relevant links: project task, bi-weekly reports

Improvements and User Testing of Wiki Education Dashboard Android AppEdit

A system for releasing data dumps from a classifier detecting unsourced sentences in WikipediaEdit

Documentation improvements to the ~20 top 100 most viewed MediaWiki Action API pages on-wikiEdit

Create regression automated tests for Special:Homepage functionality testingEdit

Improve MediaWiki Action API Integration TestsEdit

Documentation improvements to the ~20 top 70 most viewed MediaWiki Action API pages on-wikiEdit

Improve Programs & Events Dashboard for use in the #1lib1ref campaignEdit

  • Student: Khyati Soneji
  • Mentors: Sage Ross, Wes Reid
  • Outcome: Added support for counting references added to English Wikipedia articles in Programs & Events Dashboard, along with improved data download options and support for scoping via PetScan PSIDs.
  • Relevant links: Internship blog posts, project task

Research project on the editing patterns of users of wiki CX translation toolEdit

  • Student: Doris Zhou
  • Mentor: Isaac Johnson, Jonathan Morgan
  • Outcome: Did research analyzing the editing patterns, article selection, and article writing quality of users who initiated article translation using the CX Translation tool. Looked specifically at English to French in depth and did some English to Chinese analysis.
  • Relevant links: bi-weekly reports, research meta page

Improve top 50 viewed pages of the MediaWiki Action API & create a demo app to educate usersEdit

Update MediaWiki Action API docs, add Python code to repo, create a demo app, and write a tutorial for the demo which showcases several APIs.

Add a new Linter Category: Links-in-LinksEdit

Write code in Parsoid to detect links inside links and in PHP Linter extension to add this category.

Provide Test Support for Various Wikimedia ProjectsEdit

Apply exploratory testing principles to test weekly maintenance releases of Content Translation tool and Visual Editor.

QA: Testing Automation - port Echo Notification tests to Node.jsEdit

Created automated tests to check that updates to the changes made to the code base do no break existing components.

Create an event setup wizard for Programs & Events DashboardEdit

Design, create and test a wizard which helps to make it easy for users to set up an event with exactly the settings they need, which is an interface that walks through all the main options and describes what they do and what they are for to help configure an event.

Improve support for photo/media contribution campaigns on Wikimedia Programs & Events DashboardEdit

Made media contributions a first class citizen in the Wikimedia Programs & Events Dashboard. The project included building dedicated user-friendly pages for viewing and assessing the metadata of uploads from a specific campaign, and adding upload contribution statistics in other views alongside article statistics.

Automatically detect spambot registration using machine learning like invisible reCAPTCHAEdit

Create a captcha which is friendlier to humans and harder for bots to crack

Improvements to Grants review and Wikimania scholarships web appsEdit

Improve scholarships and grant review applications by important bug fixes and feature additions

Refactoring of MassMessage ExtensionEdit

Fix technical depth cleaning on MassMessage

Translation outreach: User guides on MediaWiki.orgEdit

Create, test and document new strategies to recruit technical translators

User Contribution Summary ToolEdit

Create a tool that's optimized for presenting one's activity on wikipedia in a CV-like manner

Improve Programs & Events Dashboard support for Art+Feminism 2018Edit

Improve the Program & Events Dashboard from WikiEducation based on the feedback from the Art+Feminism campaign of 2018.

Remind me of this article in X daysEdit

Make it possible for logged-in user to get a reminder of an article after a few

days. Possibility to enter a short comment.

Documentation on how to develop Zotero translators at translation-serverEdit

Document the process of writing Zotero web translators on server side and on Scaffold and how to get them in production.

Allow Programs & Events Dashboard to make automatic edits on connected wikisEdit

  • Student: Medha Bansal
  • Mentors: Sage Ross and Jonathan Morgan
  • Status: All tasks as mentioned in the proposal and in the timeline have been completed. Project is live with all supporting documentation.
  • Link to project task on Phabricator: T158678
  • Link to weekly reports archives: Weekly reports

Creating User Profile Pages for Wiki Ed Dashboard and providing cumulative statistics for all programs a user has participated in.Edit

Added customizable Profile pages to the Wiki Education Dashboard and generated contribution statistics of the users, providing them a brief overview of all the contributions they made to encourage them to do more.

Easier categorization of pictures in Upload to Commons Android appEdit

This project improves the image categorization functionality of the app by offering relevant category suggestions based on geolocation, and making category search more flexible.

Reinvent Translation SearchEdit

The objective of this project is to offer a search tool to empower translators to find messages they want to translate and maintain consistency between translations.

Wikipedia article translation metricsEdit

"This project aims at building a model that would estimate whether a page is translated or not, using statistical analysis and machine learning tools."

Pywikibot compat to core migrationEdit

"The purpose of this project is to improve all the documentation including getting started guides and project documentation in Pywikibot."

Wikipedia Education Program need-finding researchEdit

"The task is to improve the function, usability and design of the course pages for both professors and students."

Collaborative spelling dictionary building toolEdit

"The project aims at developing a collaborative dictionary which shall also have an additional feature of checking spellings of the words."

Adding Performance Instrumentation to ParsoidEdit

"This project will develop a dashboard of metrics that will allow users to, at-a-glance, understand Parsoid's performance. It will provide a resource for application tuning, quick assessments of production readiness, and troubleshooting sources of performance problems."

  • Student: Christy Okpo
  • Mentors: Subramanya Sastry
  • Wrap-up blogpost: Link
  • Phabricator Evaluation task: T92244
  • Status: Dashboards have been created, here and here. A glossary of metrics and guide to performance instrumentation using Graphite, have also been created.

Extending PyWikiBot support to sites on IWMEdit

"PyWikiBot currently supports only a few wiki projects. At the end of this project, the benefits of automation of tasks by PWB will be provided to all MediaWiki sites on the meta:Interwikimap, and provide the basis for support of non-MediaWiki wiki sites and non-wiki sites."

  • Student: Manpreet Kaur
  • Mentors:John Mark Vandenberg, Fabian Neundorf
  • Wrap-up blogpost: Link
  • Phabricator Evaluation task: T92246
  • Status: Final report can be found here. Further work to be done on non-mw sites.

Improving URL citations on WikimediaEdit

Aims to make citing sources in VisualEditor easier by generating a citation given a unique identifier such as a URL or DOI.

Enhancing Wikimaps/OpenHistoricalMaps ProjectEdit

  • Student: Jaime Lyn
  • Mentors: Dr. Rob Warren
  • Wrap-up blogpost: Link
  • Final report:
  • Status:

Welcome to labs - Welcoming new contributors to Wikimedia Labs and Tool LabsEdit

Finding the best and making them better: Evaluating, documenting, and improving MediaWiki web API client librariesEdit

Feed the Gnomes - Wikidata OutreachEdit

Template Matching for RDFIOEdit

WikiHunt the 'Property': Wikidata Outreach InitiativeEdit