Community Wishlist Survey 2019/Wiktionary/Insert attestation exploiting Wikisource as a corpus
Insert attestation exploiting Wikisource as a corpus
- Problem: Wiktionaries definitions relies on attestations, sentences from corpora illustrating the usages and meanings of a word. Wikisource is an excellent corpus for Wiktionaries but it is uneasy to search into the texts for a specific word. Now, the reference of the sentence had to be copy/paste by hand and it's a long and unfunny way to contribute, the result being few quotation from Wikisources (less than 3 % for French Wiktionary).
- Who would benefit: Readers of Wiktionaries would find more examples of usages and a way to access the whole source directly in Wikisource. Contributors of Wiktionaries would have a fancy and enjoyable way to add attestations, similarly as Insert media tool that dig into Wikimedia Commons, and the community may grow with new people that like to add sentences from their readings. Editors of Wikisource would have a new way to shed light on their patient work. Both projects visibility would increase in search engines with more links between them. The global audience of both projects may increase with more connectivity.
- Proposed solution: This feature is inspired by Insert media but targeting Wikisource instead of Wikimedia Commons. So, instead of an instant search offering pictures, Insert attestation would display a list of sentences from Wikisource that include the targeted sequence of characters (no meaning requirement). That's a snippets of results that you can choose from. An editor would just grab a sentence with a single click and it will be added with the adequate sources. The feature would copy the sentence (no transclusion) and the source of the sentence (with the information of the page in the original manuscript optimally). This feature may need a specific parser to identify limits of sentences and to bold the targeted sequence of characters.
- More comments: This feature/tool/functionality should be accessible through WikiText editor and VisualEditor. It may be interesting to keep track of the reuses of Wikisource content in other project with a specific What's link here from Wiktionary to Wikisource, similarly as Wikimedia Commons indication of reuses in others projects, but this could be part of a second step of development. This idea was suggested last year and supported by 32 people, a draft was suggested the year before with 19 supports and this idea was coined first in a MediaWiki discussion.
- Proposer: Noé (talk) 08:57, 30 October 2018 (UTC)
Discussion
The problem of few quotaions should be too in archaicity of texts - wikisource texts are mostly old, by authors which died before 1948. Proposed solution should not be limited to wikitionary↔wikisource. The same use (sentence from wikisource) shoud be useful for wikipedia or wikiquote too. JAn Dudík (talk) 12:30, 30 October 2018 (UTC)
- Wiktionary not only describe the language as it is in use nowadays, texts can illustrate archaic meanings. And some texts are published recently directly in a compatible licence. Similarly as Insert media, this tool would be accessible in Wikipedia and other projects as well. I am wondering how it can be use elsewhere, as I am mostly contributing to Wiktionary and Wikisource, so you are welcome if you have some idea to share Noé (talk) 18:50, 30 October 2018 (UTC)
- The GitHub project wcorpus extracts data from Russian Wikisource and transforms data to the database format more convenient to search text in corpus. At this moment there is no user interface in wcorpus, it can be used only by programmers. I hope that this program can be used to create something more perfect. Welcome to read our paper about research based on Wikisource and Wiktionary data. -- Andrew Krizhanovsky (talk) 15:33, 4 November 2018 (UTC)
- This tool should allow users to choose also the languages of the Wikisource they want. For example I need quotations from Italian Wikisource for the French Wiktionary. Otourly (talk) 10:10, 20 November 2018 (UTC)
Voting
- Support Tom Ja (talk) 19:56, 16 November 2018 (UTC)
- Support Consulnico (talk) 23:54, 16 November 2018 (UTC)
- Support Libcub (talk) 11:51, 17 November 2018 (UTC)
- Support Giovanni Alfredo Garciliano Diaz (talk) 17:12, 17 November 2018 (UTC)
- Support JogiAsad (talk) 17:58, 17 November 2018 (UTC)
- Support JAn Dudík (talk) 20:36, 17 November 2018 (UTC)
- Support Liuxinyu970226 (talk) 01:01, 18 November 2018 (UTC)
- Support Psychoslave (talk) 03:47, 18 November 2018 (UTC)
- Support HLHJ (talk) 07:01, 18 November 2018 (UTC)
- Support Andrew Krizhanovsky (talk) 08:04, 18 November 2018 (UTC)
- Support NMaia (talk) 12:02, 18 November 2018 (UTC)
- Support <3 Ninovolador (talk) 20:16, 18 November 2018 (UTC)
- Support Pamputt (talk) 21:05, 18 November 2018 (UTC)
- Support DaraDaraDara (talk) 09:36, 19 November 2018 (UTC)
- Support -Xbony2 (talk) 16:41, 19 November 2018 (UTC)
- Support We need this. Otourly (talk) 10:07, 20 November 2018 (UTC)
- Support Vulphere 13:10, 20 November 2018 (UTC)
- Support Thibaut120094 (talk) 14:17, 20 November 2018 (UTC)
- Support Automatik (talk) 14:47, 20 November 2018 (UTC)
- Support Unsui (talk) 14:53, 20 November 2018 (UTC)
- Support Very usefull ! Lyokoï (talk) 14:56, 20 November 2018 (UTC)
- Support Pom445 (talk) 16:34, 20 November 2018 (UTC)
- Support CAPTAIN RAJU(T) 23:00, 20 November 2018 (UTC)
- Support Acer11 (talk) 07:34, 21 November 2018 (UTC)
- Support Novak Watchmen (talk) 15:23, 21 November 2018 (UTC)
- Support Manseng (talk) 22:30, 22 November 2018 (UTC)
- Support Satdeep Gill (talk) 05:49, 23 November 2018 (UTC)
- Support Nuevo Paso (talk) 07:35, 24 November 2018 (UTC)
- Support Sorcrosc (talk) 03:37, 25 November 2018 (UTC)
- Support Krokus (talk) 12:52, 25 November 2018 (UTC)
- Support TheIgel69 (talk) 11:12, 27 November 2018 (UTC)
- Support Viticulum (talk) 20:27, 27 November 2018 (UTC)
- Support Nemo 22:24, 27 November 2018 (UTC)
- Support Daniel Case (talk) 05:04, 29 November 2018 (UTC)
- Support Edhral 07:28, 30 November 2018 (UTC)