Accuracy review
This was a Google Summer of Code 2016 project, hosted at https://arowf.toolforge.org/.
Synopsis
This application gives editors a peer-review system for finding and reviewing inaccurate content in Wikipedia articles. Purpose-built algorithms flag suspect content, which editors then review through a three-way review mechanism. The reviewer bot finds articles in given categories, category trees, and lists. For each such article, the bot creates questions from the following sets:
- Passages with facts and statistics which are likely to have become out of date and have not been updated in a given number of years.
- Passages which are likely unclear.
- Student edits.
- Content from Wikipedia Backlog categories
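One way to flag passages that are likely unclear is a readability formula; the task list on this page names Flesch-Kincaid scores. The sketch below computes the standard Flesch-Kincaid grade level for a passage. It is an illustration only, not the project's script: the function names are hypothetical, and the syllable counter is a rough vowel-run heuristic chosen here for brevity.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption): one syllable per run of vowels."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid grade level; higher values mean harder text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        return 0.0
    syllables = sum(count_syllables(w) for w in words)
    return (
        0.39 * len(words) / len(sentences)
        + 11.8 * syllables / len(words)
        - 15.59
    )


if __name__ == "__main__":
    passage = "The bot finds articles. It then creates questions."
    print(round(flesch_kincaid_grade(passage), 2))
```

A passage could then be treated as a question candidate when its grade exceeds some cutoff; the source does not state which cutoff, if any, the project used.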
Question candidates from each set are ranked, taking into account the article's pageview count, and the highest-ranking candidates are made into questions. These questions are then opened to reviewers for resolution. A three-way peer-review mechanism ensures that questions are resolved by consensus. Two reviewers work on each question; if they disagree, the question goes to a third reviewer. The first reviewer proposes a solution to the question. The second reviewer chooses 'Endorse' or 'Oppose' to mark the proposed solution as valid or invalid. In case of a conflict, the third reviewer sides with either the first or the second reviewer. Reviewer reputation scores are computed from whether a reviewer's reviews are accepted by other peer reviewers. Reviewers can optionally be shown the reviews that lowered their scores.
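The three-way review flow described above can be sketched as a small state function. This is a minimal illustration under stated assumptions, not the application's code: the `Question` class, its field names, and the status strings are all hypothetical, and reputation scoring is not modeled because the source does not specify how it is computed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Question:
    """Hypothetical record of one question and its reviews."""
    text: str
    answer: Optional[str] = None       # first reviewer's proposed solution
    second_vote: Optional[str] = None  # "endorse" or "oppose"
    third_vote: Optional[str] = None   # "first" or "second", only on conflict


def resolve(q: Question) -> str:
    """Return the current status of a question in the three-way flow."""
    if q.answer is None:
        return "open"
    if q.second_vote is None:
        return "awaiting second review"
    if q.second_vote == "endorse":
        return "accepted"
    if q.second_vote != "oppose":
        raise ValueError(f"unknown second vote: {q.second_vote!r}")
    # The second reviewer opposed: a third reviewer breaks the tie.
    if q.third_vote is None:
        return "awaiting third review"
    if q.third_vote == "first":
        return "accepted"
    if q.third_vote == "second":
        return "rejected"
    raise ValueError(f"unknown third vote: {q.third_vote!r}")


if __name__ == "__main__":
    q = Question("Is this statistic still current?", answer="Updated the figure")
    q.second_vote = "oppose"
    q.third_vote = "first"
    print(resolve(q))
```

The third reviewer is consulted only when the second reviewer opposes, so an endorsed answer is settled after two reviews.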
The app is up and running at https://arowf.toolforge.org/.
Tasks completed
- Designed the architecture of the system
- Took a NoSQL approach to the data storage system
- Implemented the /ask, /answer, /recommend, /inspect, /register, /token and /help endpoints
- Optional registration system
- Logging functionality
- Wrote scripts to create questions from the following:
  - Articles containing the word 'recent'
  - Poor Flesch-Kincaid readability scores
  - Student edits
  - Wikipedia Backlog categories
- Ranked extracted article candidates based on standardized scores that include the pageview count
- Deployed the app on ToolLabs and PythonAnywhere
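The ranking item in the list above mentions standardized scores that include the pageview count. A minimal sketch of one such scheme follows: each feature is converted to a z-score across all candidates and the z-scores are summed. The field names `signal` and `pageviews`, the equal weighting, and the function names are assumptions made for illustration; the source does not describe the exact formula.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize values to zero mean and unit variance."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All values are equal, so none stands out.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def rank_candidates(candidates, top_n=None):
    """Rank candidate dicts by the sum of their standardized features.

    Each candidate is assumed to have 'title', 'signal' (for example a
    readability grade), and 'pageviews' keys.
    """
    if not candidates:
        return []
    signal_z = zscores([c["signal"] for c in candidates])
    views_z = zscores([c["pageviews"] for c in candidates])
    scored = [
        (c["title"], s + v)
        for c, s, v in zip(candidates, signal_z, views_z)
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored if top_n is None else scored[:top_n]


if __name__ == "__main__":
    sample = [
        {"title": "A", "signal": 1.0, "pageviews": 100},
        {"title": "B", "signal": 3.0, "pageviews": 300},
        {"title": "C", "signal": 2.0, "pageviews": 200},
    ]
    for title, score in rank_candidates(sample):
        print(title, round(score, 3))
```

Standardizing first keeps a feature with a large raw range, such as pageviews, from dominating the combined score.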
Next steps
- Internationalization and localization: provide language support for the app
Quick Links
Mentors
Primary mentor: James Salsman
Co-mentor: Fabian Flock
Contact information
See also
- Biancani, Susan (2014) "Measuring the Quality of Edits to Wikipedia" OpenSym '14 Proceedings (Berlin, Germany: August 2014.)
- meta:Research:Revision scoring as a service
- Strategy proposal from 2009
- Yu Suzuki and Masatoshi Yoshikawa (2012) "Mutual Evaluation of Editors and Texts for Assessing Quality of Wikipedia Articles" WikiSym '12 Proceedings (Linz, Austria: August 2012.)
- Outreach programs/Selection process