Wikimedia Hackathon 2019/Outcomes

"Logo for Wikimedia Hackathon"

17–19 May, 2019 | Prague

Event metrics edit

We are collecting metrics from the event here:


Event blogging edit

Add your own posts or check back for others here: phab:T219198

Completed projects edit

The projects presented at the showcase, plus 20 other projects that were highlighted as being worked on (but not presented), can be seen below (taken from the larger list at phab:T216260).

See the phabricator workboard for the event for more than 300 software projects, trainings, discussion sessions, and activities that were documented as occurring.

Also keep in mind all the undocumented and/or tangential work that attendees participated in. Thank you, all!

Projects presented at the showcase session at Wikimedia Hackathon 2019 Prague:

No Name Interesting for Developers (D) / Interesting for Editors (E) Link to phabricator Description
1 Do Log Msg for Toolforge D phab:T222244 With the new “dologmsg” command on Toolforge, tool developers can now post updates to their tool’s server admin log straight from the command line – no need to go on IRC.


2 Tesseract OCR E phab:T223533 Tesseract OCR is a new Wikisource gadget that derives text from a scan of a document on Wikisource, using optical character recognition (OCR) technology, in almost 60 languages.


3 Scribe E phab:T223661 Scribe will help Wikipedia editors create new articles by automatically recommending broad, high-level outlines and suggesting references from newspapers and web searches.


4 Secure log out E phab:T222626 On wikis, a cross-site request forgery (CSRF) token was not required to log out of the wikis. This was considered a security issue, so CSRF tokens are now required and logouts without CSRF tokens are deprecated.

GPS-friendly download of query result & Download of RDF Format

Both phab:T216601 & phab:T211177 Users will be able to download Wikidata Query Service results in a variety of new file formats: GeoJSON, GPX, and KML for data containing geolocation, and RDF Formats for CONSTRUCT queries.
6 Wikigrade E phab:T223735 WikiGrade is a tool to make judging article-writing contests more efficient by automatically calculating scores and ranking contributors.


7 Password-less Login E phab:T220418 This proof-of-concept tool could allow users to log-in without requiring the user to enter a password by pressing a button on a pre-registered smartphone.


8 MediaWiki Apps Gallery D phab:T223748 This is a new showcase of interactive demo apps that have been built using the MediaWiki Action API.


9 ShEx Tools Documentation for Wikidata E phab:T223622 Wikidata recently implemented Shape Expressions (ShEx), a new feature to define schemas within Wikidata. Substantial documentation will be needed for this new feature. We started documenting a potential workflow for developing schemas.


10 Voice Assistant editing for Wikidata E phab:T223510 Voice assistants like Alexa, Siri, or Google Assistant are all read-only. What if users could talk back and become first-class participants in our projects? This tool facilitates two-way interaction with Wikidata by accepting qualitative and quantitative feedback. You can say, "hey, that doesn't sound true," and it will post a comment to the talk page on the talk page accordingly. You will be able to save edits directly. (Screenshot of the interaction)
11 Wikichat E phab:T223807 An experiment to support real-ish time chat in MediaWiki, making full use of MediaWiki’s native infrastructure. This could potentially be used to provide user-to-user support within MediaWiki.
12 Languageinfo API module D phab:T220415 The MediaWiki Action API gains a new module, action=query&meta=languageinfo, to get information about languages supported by MediaWiki, such as fallback languages, variants, and writing direction.


13 Wikidocumentaries E phab:T223532 A service for discovering openly licensed media through Wikidata items. In particular, we succeeded at developing Indonesian and Russian translations and adding Creative Commons search. We also filtered out certain sources that were not useful to the project. In the future we plan on supporting Structured Data on Commons and importing files to Commons directly.

14 Integraality for Wikidata E phab:T220219 This tool helps you identify missing Wikidata statements by generating reports of “property coverage” on existing Wikidata items. This could be used to find items that are missing images or geographical coordinates.


15 Echo Push Notifications E phab:T113125 This adds in-browser push notifications to MediaWiki for both mobile and desktop users. This adds to existing email and on-wiki notifications.
16 From Google Drive to Wikimedia Commons E phab:T223541 This tool enables users to upload pictures directly from Google Drive to Wikimedia Commons! Coming soon:
17 Wikidata Quality Score Display E phab:T223590 This is a new gadget that adds information to the top of a Wikipedia article, indicating the quality level of the related Wikidata item as calculated by ORES.
18 Common templates in the visual editor E phab:T55590 Communities can now define the most common templates on their wiki, giving easier access via the "insert template" menus.

19 ISBN Scanner E phab:T216775

Use your phone’s camera to scan ISBNs to automatically generate references for Wikipedia articles. Uses the visual editor.

20 Commons Mobile App refreshed UI E phab:T223159 Developed a new home screen and Explore UI for the Commons mobile app. This improved the usability and discoverability of features within the app, and reduced the background color to make the images shine more.



21 Search page sorting options E phab:T197525 This new interface feature in the Advanced Search page helps users sort their search results by date-created, date-last-edited, or by relevance (default).


22 THICC E phab:T220765 Experimental work on a content-model structured-voting system and structured-discussion system. It attempts to solve the issues with threading of comments and replies.
23 Namescript Improvements E phab:T223806 Namescript is a Wikidata user-script that assists with cleaning up items for names, such as given names or family names. It now has a documentation page. Additionally, the descriptions for Kanji names were improved.


24 Score Extension refactoring E phab:T135597

Now when you upload .midi files, it transcodes automatically on Commons and Wikipedia pages, so you don't need to download the file and run it through a synthesizer in order to hear it.

25 Newcomer Homepage improvements E phab:T220145 and phab:T220146 A new feature for the Newcomer Homepage extension which will allow mentors to customize their short description. We also created a GuidedTour (walkthrough tutorial) to help users understand how to follow-up on their questions with notifications.


26 Cite Unseen E phab:project/profile/4050/ Cite Unseen is a user script to help identify and flag sources based on origin or ideological bias. There are different classifications for sources, including biased, government controlled, press release, newspaper, book, opinion piece, etc.


27 Using a bot to import structured data to Commons Both phab:T223746 Structured data on Commons has been introduced, but we don't have any bots yet that can migrate data. We built some proof of concepts, and tested it by importing the existing multilingual captions for Picture Of The Day, giving us 5,000 nice multilingual captions without any extra work.
28 Technical Conference 2019 D phab:T220212 Announcement of this year's Technical Conference, Nov 12 - 15 in Atlanta, Georgia, USA.


29 Advanced Tutorial for OOUI D phab:T223810 We wrote a tutorial on how to create a larger-scale application with the OOUI front-end development library that follows the Model-View-Controller pattern to make it more maintainable. In a series of refactoring steps we show how the "Todo List" application we created in the basic tutorials goes from a collection of Widgets to a full-fledged application with proper separation of concerns.
30 Wikidata Query Service in Blocky E phab:T223811 A new drag-and-drop interface for building Wikidata queries. No SPARQL queries required!


31 Topic-works matrix E phab:T223809 Added a new visualization experiment for Scholia, to have a query for applications of research and implied topics, placed in an air plot.


Additional projects pulled out of the Non-Presented projects list (bottom half of phab:T216260)

1 Flickr Dashboard E phab:tag/flickrdash Edit photos on Commons and Flickr at the same time.

2 Wiki Loves iNat E You can now upload much needed photos from iNaturalist to commons


3 VideoCutTool E phab:T217503 This tool can be used to trim (crop) videos in Wikimedia Commons and uploads the results back to Commons.


4 Mortar E phab:T223608 Created and documented a unified, user-friendly workflow for GLAM uploads on both Wikidata and Commons. We created a workflow using Pattypan and OpenRefine, defined the missing steps needed, and started the documentation. -

5 Wiki Education Dashboard E After meeting with Czech Wikipedians who use Programs & Events Dashboard for classroom projects, we identified 14 bugs and feature requests, then we tackled many of the highest priority ones:
  • Campaign organizers can now make edits to any programs in their campaign.
  • We added an API that can be used by a mediawiki gadget to display information about courses from Special:Contributions (gadget in progress).
  • We made it easier to copy previous programs for a new academic term.
  • We made a number of UI improvements and bug fixes.
  • We made the features for downloading detailed campaign metrics more discoverable.
  • We upgraded several key dependencies, and improved build speed.
  • Wikidata labels show up in more places.

All of these improvements are live now on

6 Audio subtitles for video.js E phab:T121387 For our next generation video player based on video.js, I created a plugin to show subtitles/captions to users for audio clips
7 Global Search Tool Released cloudelastic E phab:T223519 Provides keyword and source regex queries against all wmf wikis in a single search request. Powered by cloudelastic.

like mysql replicas in wmf cloud, except elasticsearch.

8 Scrape all the pictures of the digital image repository of the National Archive of Curaçao with the Copyright status 'negative' E phab:T223050 Attempt to scrape the image URLs and metadata out of the National Archive of Curaçao's imagebank for images that have a copyright status of "Negatief". So far the metadata of all 1103 images with this status has been cleaned; the URLS are still underway. Once the CSV file is ready, Yupik will send it to Ecritures, who will batch upload the images to Commons.
9 Fix longstanding Bug in Wikidata-Lua-Module in German Wikipedia E When using data from Wikidata in German Wikipedia duplicated sources were shown multiple times, they now get reused.
10 Add copy button to URL shortener result E After shortening, you need to select and copy the link from Special:UrlShortener. It would be amazing if we had a button and when you clicked it, boom, magically it stores it in the clipboard.
11 Normalize homoglyphs in mixed-script tokens E phab:T222669 Oзон and Озон look the same, but the first one starts with a Latin O rather than a Cyrillic О. Searching for either will not find the other. Working on an Elasticsearch plugin to attempt to map homoglyphs in mixed-script tokens and index any single-script variants that can be generated.
12 Wikidata Stream E A nice view of the Wikidata edit stream to use as an eye-catcher for people interested in Wikidata

13 Wikidata autocomplete gadget for external-id properties E Created a gadget that provides autocomplete for selected external-id properties (like VIAF) by querying external search APIs.
14 Host Primerpedia on Toolforge and add multilanguage support E Migrated the tool to Toolforge, complete with auto-update and everything! Now also works in multiple languages.

15 Collect information on SPARQL query builders E phab:T192967#5194136 All the currently working SPARQL query builders are now listed in a single place:
16 Conduct usability experiments of SPARQL query builders D The various existing tools for visual building of SPARQL queries incorporate different design decisions, with different usability consequences. User testing these tools helped identify areas to improve in these or future tools.
17 Research and document how to set up auto-updating of Toolforge tools from a GitHub repository. D
18 Improve Toolforge documentation at D Updates were made to various help pages, including Portal:Toolforge, Help:Toolforge, Help:Getting Started, Help:Toolforge/Web, and others.
19 Improve documentation for WMDE's Wikibase Docker images repository D The README for was a bit sparse and confusing for a beginner, which hinders its usage. It was improved so that newcomers can easily understand and follow it, with more in progress.
20 Cite web helper Tool E The tool which will make it easier to put references into articles. User have only to input an URL-address of online newspaper article, and it will return Cite web template with all parameters filled.
21 AccessControl improvements Both Work was started on a comprehensive rewrite of Extension:AccessControl to have greater security in some circumstances, and to enable (optionally) Extension:Page Forms to manage the user-access lists.