Forbedringer i søkeresulteter på tvers av wikier
Et nytt mål
Discovery-teamet ønsker å gjøre det mulig å vise søkeresultater fra andre Wikimedia-prosjekter på samme språk på en enkelt wiki.
For eksempel, hvis du er på fransk Wikivoyage og søkte etter «milk», så ville du også fått resulteter fra fransk WIkipedia og fra andre franske Wikimedia prosjekter som Wiktionary og Wikiquote.
Denne siden prøver å oppsumere noen av tankene våre og vi oppfordere alle bidragsytere til diskutere denne endringen på diskusjonssiden.
Rasjonalet vårt
The way search currently works for all Wikimedia projects is this: each language project has its own separate index to search in. This means that there is currently no way to search across all wiki projects for relevant results at the same time. However, on the backend, the language is the same for a search (regardless of project) and visitors may be interested in information that could be in a sister project in their language.
Providing search results across projects (within the same language) provides more information to visitors, easier visibility into other projects, and increases the value Wikimedia visitors receive out of searching on-wiki, which includes the potential for less zero result searches.
Displaying additional search results across projects will not only increase visibility to those other sister projects but it could also increase discovery into more articles of interest and possibly even encouraging additional contributions.
Why haven't we done this already?
Why not have one giant index to search across all projects in all languages?
- Technical complexity is one reason: the size of the resulting index would be in the hundreds of gigabytes.
For example, the English Wikipedia database index of all article pages is around 200 gigabytes. Providing search results quickly would be negatively impacted if having just one very large database index.
- Another reason for this 'one language' search results approach is that it probably wouldn't be very useful to search for a topic across all languages.
Searching for "Paris, France" and seeing generally the same article in French, German, and English wouldn't help very much in discovering new information.
The English language wiki projects are quite large:
- English Wikipedia index for all English article pages is approximately 200 GB
- English Wikinews is approximately 15 GB
- English Wikisource is approximately 6 GB
This might impact various existing wiki workflows by displaying additional search results.
- This could possibly affect bots, contributors, researchers, readers, etc.
The Plan
In the first quarter (July - September 2016) the engineering team will continue to work on the following four main steps:
First
- Combine indexes within the same languages.
(task T139498) Partially done
- Ask for the help from the community - which includes the discussion on this page.
(task T137312) Done
- The Search and Design teams at WMF will create mockups of how these new search results might be displayed.
Done
Second
By the end of the next quarter (October - December 2016) the team will compare search satisfaction. This will be done:
- after the cross-wiki index is ready for several pre-selected languages
(task T121541) Done
- after running A/B tests to collect real time data with which to analyze
(task T121546) Not done
- and get the UI mocks finalized for front end testing here:
Cross-wiki Search Result Improvements/Testing Done
- while also creating a Labs instance for early testing.
(task T151344) Done
Third
In the this quarter (January - March 2017), we will:
- begin A/B testing for the front end of this new feature for analysis and feedback from the community.
(task T145917) In progress
- continue to refine and enhance the back end for this feature
In progress
- First A/B/C test was run and found to be not very conclusive
(task T149806) Done
- Second A/B test will be run, after fixing a minor UI bug and adding four more Wikipedias to test on, for a total of eight.
(task T160004) Partially done
Fourth
April - June 2017:
- wrap up analysis of the second A/B test for the sister projects snippets in the search results page
(task T160008) Partially done
- release translated note to Village Pumps announcing the production deployment
(task T162276) In progress
- release blog post about search
- release into production on all Wikipedias
Current Search Examples
There are already a few community-led solutions to provide additional discovery of other Wikimedia projects and articles in the same language. Here is a small collection of examples:
Current search results pages for a few language wikis:
- Hatian Creole:
- https://ht.wikipedia.org/wiki/Espesyal:Chache/Milk
- Entries at the page-bottom are added via mw:Extension:ArticlePlaceholder
- French:
- https://fr.wikipedia.org/wiki/Spécial:Recherche/Milk
- Entries at the page-bottom are added via d:User:Yair rand/WikidataInfo.js
- Italian:
- https://it.wikipedia.org/w/index.php?title=Speciale:Ricerca/Milk&fulltext=1
- Entries at the page-bottom are added via d:User:Yair rand/WikidataInfo.js
- Navbox at the page-side is added via w:it:MediaWiki:Search-interwiki-custom
External Search Gadget:
- mw:MediaWiki:Gadget-externalsearch.js
- This gadget will search a custom list of technical sites, giving a multi-tab result list
and is further explained at mw:Wikimedia technical search- Here is a screenshot of this custom action (unfortunately, searching for "google.com/cse" can trigger a spam filter message for some browsers)
- This gadget will search a custom list of technical sites, giving a multi-tab result list
How could these additional search results be displayed?
The appearance of search results is open for discussion and we have some rough drafts for you to look at below with more design possibilities on Design. Here are a few examples of what a new search results page could look like based on existing solutions on other language wikis:
-
Example of a wiki page with an addition of a box on the right hand side that shows sister project links that might be of interest and is related to the original search
-
Example of a wiki page with an addition of a listing of article links for sister projects that might be of interest and is related to the original search (also, a history of the page).
-
Example search results page with an added tab for wiki projects
-
Example of a tabbed interface - see more at /Design
Help us choose the solution
Please provide your feedback now!
- Two quarters are needed, at a minimum, to architect and design the technical implementation.
- The team would like to have something to test and to show to the community sometime in late 2016.
- We've decided on the mocks in Cross-wiki Search Result Improvements/Testing that will be tested in the first quarter of 2017.
The Questions
The team has many questions and this is what we'd like to request feedback on from the community:
- Should the results from whatever wiki you're on to be shown first and then have an option to show more from other wikis?
- Should the additional results be inter-mixed with the local wiki results?
- Should the additional results be displayed off to the side (or maybe the bottom) of the results page?
- Should we have the option to turn off these other relevant search results (a user and/or project opt-out)?
- This could be a keyword search term or maybe a button for a visitor to click
- This could also be similar to the
local:
keyword that will only search for images on the local wiki and not Commons files, for instance.
- This could also be similar to the
- Would the additional results be best displayed as a list or a grid design?
- Should we include relevant metadata (images and/or a short description) with the search results?
- Do the results need to have the size of the article (i.e.:
848 bytes (104 words)
) and the date it was created/modified?
- Do the results need to have the size of the article (i.e.:
- Should we indicate that clicking on a result will take you to another wiki project?
- How many results from other wikis should we show - 1, 2, 3, or more?
- Should we limit the existing method of displaying results from the wiki that you searched on?
- We currently show up to 10,000 results in a paginated manner, but testing shows that generally only the first 3 results are ever acted upon.
- Do we want these new search results to work across all Wikimedia projects?
- For example, if I'm on Wikiquote, do I want to also see relevant search results from Wikivoyage, Wikipedia or Wikinews?
- Or, if I'm on Wikipedia, just show me results from other projects?
- Would these other relevant search results be useful and encourage deeper exploration into various topics?
- Is it annoying to see the other wiki search results?
- Conversely, does it encourage a user to discover more knowledge?
- How much weight do we give results from other wiki projects in the results?
- Will the display of the additional search results from other wikis encourage contributions from editors?
- i.e.: if you search for
Piazza del Duomo
and don't see a Wikivoyage article about it (while I'm searching on Wikiquote), would that encourage you to start an article for it?
- i.e.: if you search for
- Should we limit the amount of languages we search in?
- i.e.: only use the top 50 languages to implement this in?
- Or, only use the languages that we are detecting queries in an other language than the wiki the user is on?
See also: Explore similar, Wiktionary widget, thumbnail icons in search results
- Cross-wiki Search Result Improvements/Design - Design notes and illustrations on how search results might appear
- Explore Similar links on the search results page
- A/B testing information
- Self-guided testing step-by-step instructions
- Wiktionary widget on the search results page
- A/B testing information
- Self-guided testing step-by-step instructions
- Adding thumbnail icons to search results
- A/B testing information
Phabricator tickets:
- https://phabricator.wikimedia.org/T137312
- https://phabricator.wikimedia.org/T136639
- https://phabricator.wikimedia.org/T139310
Discussion notes:
After taking into account community feedback and design team recommendations, we'll start A/B testing soon. View this page for more information.
Cross-wiki Search Result Improvements is maintained by the Discovery department.
Get help:
|
This page was created to encourage users to do their own testing, via a self-guided testing page with examples for those not-so-technical and those that have a Wikipedia account and are a little more experienced.