Community Wishlist Survey 2021/Search/Use Wikidata to improve search

Use Wikidata to improve search

  • Problem: The current search function is quite poor unless you know exactly what article you are searching for, and what it's title is. If you're searching based on a category (say, "French films of the 1970s"), the search is quite useless.
  • Who would benefit: Everybody using the search function.
  • Proposed solution: Use Wikidata to find search results. This would work by comparing terms in the search query against the Wikidata profiles of pages. So searching for "French films of the 1970s" in Wikipedia would find articles which are categorized as a "film", as "French", and as related to the "1970s". Those articles which would have all three would then be in the top search results if no article had a title with a close match.
  • More comments:
  • Phabricator tickets:
  • Proposer: Oijsdaio (talk) 03:34, 30 November 2020 (UTC)

Discussion

  • Having an interface to easily make semantic queries to execute on Wikidata would be very useful. Silver hr (talk) 02:12, 1 December 2020 (UTC)
  • If this were possible, we would already be doing it at Wikidata Query Service. Mapping English language phrases into Wikidata claims is impossible with some very serious AI. And even if it were possible, it would be much too slow for the regular search interface. Kaldari (talk) 02:47, 9 December 2020 (UTC)
    Do any support voters have a response to this? I see that this request is gaining a lot of traction, but if that's just since we all wish search functioned better rather than since this would actually be a good idea, it might not be the best thing to push to the top. {{u|Sdkb}}talk 10:32, 9 December 2020 (UTC)
    • only because who's working on this right now couldn't figure out a way to do it doesn't mean it's impossible. if enough people think this should be done, there might be a threshold of collective brain power which is able to find a solution. and, if not, at least we'll have a better case for the impossibility of it today. that being said, i don't know enough of the technical policies, but i also don't believe there's any such search problem that can't be solved with a bigger index and less updated results. it is, after all, how every web search work in the end. (ps: i'm probably unable to support this idea any further. wished to vote for integration with fossil because of offline search among other things, but got in too late. this is my only contribution for the 2021 wishlist, though. cheers! 😘) --cacawee (talk) 07:55, 13 December 2020 (UTC)
  • WMDE is making a UI for 'easy queries' right now so I wonder if that suits the need. --Izno (talk) 14:57, 11 December 2020 (UTC)
  • I am running a website (German, non-profit) covering some 60000 pages on basic maths and physics. A search field takes a query and first looks for exact matches based on file names. The query then looks for similiar matches, mixing two algorithms (Levenshtein and the PHP similar-function). Finally, a list of search results with short preview texts is displayed. The search-engine is self-programmed in PHP, but not at a professional level. One (big) shortcoming is lack of speed. I'd gladly share any ideas and experiences. --Rhetos (talk) 12:51, 14 December 2020 (UTC)

Voting

  •   Support Dr747 (talk) 18:40, 8 December 2020 (UTC)
  •   Support Imetsia (talk) 18:56, 8 December 2020 (UTC)
  •   Support MarioSuperstar77 (talk) 20:17, 8 December 2020 (UTC)
  •   Support urgently needed. --Braveheidi (talk) 20:52, 8 December 2020 (UTC)
  •   Support Ssstela (talk) 21:20, 8 December 2020 (UTC)
  •   Support Pmau (talk) 21:42, 8 December 2020 (UTC)
  •   Support شادي (talk) 22:30, 8 December 2020 (UTC)
  •   Support YFdyh000 (talk) 23:03, 8 December 2020 (UTC)
  •   Support Martinkunev (talk) 23:09, 8 December 2020 (UTC)
  •   Support Silver hr (talk) 00:06, 9 December 2020 (UTC)
  •   Support Wikidata also lists alternative names, so they could also be used to help direct search queries to the correct article 5225C (talkcontributions) 00:18, 9 December 2020 (UTC)
  •   Support Eric0892 (talk) 01:10, 9 December 2020 (UTC)
  •   Support BALA. RTalk 01:30, 9 December 2020 (UTC)
  •   Support Keepcalmandchill (talk) 02:00, 9 December 2020 (UTC)
  •   Support Pamzeis (talk) 02:56, 9 December 2020 (UTC)
  •   Support NMaia (talk) 03:16, 9 December 2020 (UTC)
  •   Support JopkeB (talk) 05:46, 9 December 2020 (UTC)
  •   Support Pinerineks (talk) 07:00, 9 December 2020 (UTC)
  •   Support Philbutler (talk) 07:23, 9 December 2020 (UTC)
  •   Support Karbohut (talk) 09:49, 9 December 2020 (UTC)
  •   Support Xavi Dengra (MESSAGES) 12:57, 9 December 2020 (UTC)
  •   Support TheImaCow (talk) 17:14, 9 December 2020 (UTC)
  •   Support Monozigote (talk) 17:18, 9 December 2020 (UTC)
  •   Support Петър Петров (talk) 17:35, 9 December 2020 (UTC)
  •   Support Rafael (stanglavine) msg 18:36, 9 December 2020 (UTC)
  •   Oppose If data from Wikidata is incorporated into the search engine protocol, it's going to end up skewing the sequence of the search results and preclude people from quickly and easily locating and accessing existing material on the site. Tyrekecorrea (talk) 18:56, 9 December 2020 (UTC)
  •   Support TheAmerikaner (talk) 20:33, 9 December 2020 (UTC)
  •   Support Thomas Kinz (talk) 21:12, 9 December 2020 (UTC)
  •   Support - Darwin Ahoy! 02:04, 10 December 2020 (UTC)
  •   Weak support Unbeatable101 (talk) 04:15, 10 December 2020 (UTC)
  •   Comment This proposal is less clear. Shall Wikidata search results be used as snippets, like other crosswiki search results? Shall Wikidata be amongst all other Wikipedia (or specific project) search results? Shall Wikidata results be JavaScript pop-ups? How else can Wikidata results be well executed? George Ho (talk) 07:49, 10 December 2020 (UTC)
  •   Support - yona B. (D) 08:13, 10 December 2020 (UTC)
  •   Support Nonahg (talk) 09:01, 10 December 2020 (UTC)
  •   Support Crocodile2020 (talk) 09:27, 10 December 2020 (UTC)
  •   Support Euro know (talk) 11:23, 10 December 2020 (UTC)
  •   Support Tim bates (talk) 15:43, 10 December 2020 (UTC)
  •   Support Libcub (talk) 20:28, 10 December 2020 (UTC)
  •   Support NaBUru38 (talk) 21:08, 10 December 2020 (UTC)
  •   Support Titore (talk) 23:56, 10 December 2020 (UTC)
  •   Support Higa4 (talk) 05:01, 11 December 2020 (UTC)
  •   Support Cryout (talk) 10:05, 11 December 2020 (UTC) great AI addition for better use experience
  •   Support Paucabot (talk) 12:30, 11 December 2020 (UTC)
  •   Support Susanna Giaccai (talk) 16:38, 11 December 2020 (UTC)
  •   Support Szalax (talk) 17:18, 11 December 2020 (UTC)
  •   Support BoldLuis (talk) 18:14, 11 December 2020 (UTC)
  •   Support It wouldn't have to be complicatedRosičák (talk) 18:25, 11 December 2020 (UTC)
  •   Support Anaxial (talk) 18:55, 11 December 2020 (UTC)
  •   Support MathieuMD (talk) 19:06, 11 December 2020 (UTC)
  •   Support Pinnermck (talk) 20:24, 11 December 2020 (UTC)
  •   Support Somej (talk) 21:12, 11 December 2020 (UTC)
  •   Support Stevenliuyi (talk) 22:26, 11 December 2020 (UTC)
  •   Support Tom Ja (talk) 10:04, 12 December 2020 (UTC)
  •   Support Kon Gl (talk) 14:16, 12 December 2020 (UTC)
  •   Support. Meiræ 21:51, 12 December 2020 (UTC)
  •   Support Consulnico (talk) 01:00, 13 December 2020 (UTC)
  •   Support Kew Gardens 613 (talk) 02:48, 13 December 2020 (UTC)
  •   Support Le moteur de recherche actuel est trop médiocre, donc tout ce qui est faisable techniquement pour l'améliorer (et en plus profitons de Wikidata)... (exemple, résultat logique attendu). Nemo Le Poisson (talk) 10:09, 13 December 2020 (UTC)
  •   Support Geagea (talk) 14:25, 13 December 2020 (UTC)
  •   Support Sounds great. Anything to improve search. Bodysurfinyon (talk) 02:34, 14 December 2020 (UTC)
  •   Support Fra4481 (talk) 10:41, 14 December 2020 (UTC)
  •   Support Sadads (talk) 11:52, 14 December 2020 (UTC)
  •   Support Rhetos (talk) 12:33, 14 December 2020 (UTC)
  •   Support good thing to keep on the table for brainstorming (assuming it is as complex/"impossible" as some folks say); it would be useful (as an OPTION) Philiptdotcom (talk) 13:45, 14 December 2020 (UTC)
  •   Support Nurtenge (talk) 06:54, 15 December 2020 (UTC)
  •   Support Anything that improves our search is a good idea, and this would be very useful (often more so than exact-text matches, but it depends on what you're doing). Should be able to turn it off, both in the search form and its results pages, and as a permanent setting.  — SMcCandlish ¢ >ʌⱷ҅ʌ<  08:18, 15 December 2020 (UTC)
  •   Support β16 - (talk) 09:57, 15 December 2020 (UTC)
  •   Support MTheiler (talk) 15:20, 15 December 2020 (UTC)
  •   Support. Shalomori123 (talk) 17:41, 15 December 2020 (UTC)
  •   Support Utopes (talk) 19:21, 15 December 2020 (UTC)
  •   Neutral I would like to use an improved search engine for Wikipedia, but not specifically via Wikidata, although it is not excluded. TechAcquisitor (talk) 19:42, 15 December 2020 (UTC)
  •   Support Jstalins (talk) 04:25, 16 December 2020 (UTC)
  •   Support Rhymes (talk) 18:18, 17 December 2020 (UTC)
  •   Support Kocgs (talk) 20:46, 17 December 2020 (UTC)
  •   Support ~~ Alex Noble - talk 14:49, 18 December 2020 (UTC)
  •   Support Mmitchell10 (talk) 20:10, 18 December 2020 (UTC)
  •   Support Patsagorn Y. (Talk) 05:01, 19 December 2020 (UTC)
  •   Support 常规搜索页面搜索不到相应内容,需要加强 郑洲扬 (talk) 12:45, 20 December 2020 (UTC)
  •   Support BradChim (talk) 14:51, 20 December 2020 (UTC)
  •   Support S8321414 (talk) 14:37, 21 December 2020 (UTC)
  •   Support Nachtbold (talk) 15:38, 21 December 2020 (UTC)