Community Wishlist Survey 2019/Wiktionary/Insert attestation exploiting Wikisource as a corpus

Insert attestation exploiting Wikisource as a corpus

  • Problem: Wiktionary definitions rely on attestations: sentences from corpora that illustrate the usages and meanings of a word. Wikisource is an excellent corpus for Wiktionaries, but it is difficult to search its texts for a specific word. Currently, the sentence and its reference have to be copied and pasted by hand, which is a long and tedious way to contribute. As a result, few quotations come from Wikisource (less than 3% for French Wiktionary).
  • Who would benefit: Readers of Wiktionaries would find more usage examples and a way to access the whole source directly in Wikisource. Wiktionary contributors would have an enjoyable way to add attestations, similar to the Insert media tool that digs into Wikimedia Commons, and the community may grow with new people who like to add sentences from their reading. Wikisource editors would have a new way to shed light on their patient work. Both projects' visibility in search engines would increase with more links between them, and the global audience of both projects may increase with more connectivity.
  • Proposed solution: This feature is inspired by Insert media but targets Wikisource instead of Wikimedia Commons. Instead of an instant search offering pictures, Insert attestation would display a list of sentences from Wikisource that include the targeted sequence of characters (no meaning requirement), presented as snippets of results to choose from. An editor would grab a sentence with a single click, and it would be added with the appropriate source. The feature would copy the sentence (no transclusion) together with its source (ideally including the page information from the original manuscript). This feature may need a specific parser to identify sentence boundaries and to bold the targeted sequence of characters.
  • More comments: This feature should be accessible through both the wikitext editor and VisualEditor. It may be interesting to keep track of reuses of Wikisource content in other projects with a specific What links here from Wiktionary to Wikisource, similar to how Wikimedia Commons indicates reuses in other projects, but this could be part of a second step of development. This idea was suggested last year and supported by 32 people; a draft was suggested the year before with 19 supports, and the idea was first coined in a MediaWiki discussion.
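
As a rough illustration of the parsing step described in the proposed solution, the sketch below splits a text into sentences, keeps those containing the targeted character sequence, and bolds the match using wikitext triple apostrophes. It is only a minimal sketch: the function name `find_attestations`, the punctuation-based sentence splitter, and the shape of the returned records are assumptions made for this example, not part of the proposal or of any existing MediaWiki tool. Real Wikisource texts would need a more careful parser (abbreviations, quotation marks, language-specific punctuation).

```python
import re

# Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
# This is a deliberately naive boundary rule for illustration only.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def find_attestations(text, target, source):
    """Return candidate quotations from `text` that contain `target`.

    Matching is on the raw character sequence (no meaning requirement),
    so a search for "cat" also matches "catalogue".
    `source` is a free-form reference string copied alongside each quote.
    """
    if not target:
        return []
    results = []
    for sentence in _SENTENCE_END.split(text):
        sentence = sentence.strip()
        if target in sentence:
            # Bold every occurrence of the target with wikitext markup.
            bolded = sentence.replace(target, "'''" + target + "'''")
            results.append({"quote": bolded, "source": source})
    return results


if __name__ == "__main__":
    sample = "The cat sat on the mat. A dog barked! Was the catalogue lost?"
    for hit in find_attestations(sample, "cat", "Example source, p. 1"):
        print(hit["quote"], "|", hit["source"])
```

Because the quote and its source are returned together as plain copies, this mirrors the "copy, no transclusion" behaviour described above; how the snippets would be fetched from Wikisource and shown in the editor is left open here.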

Discussion

The problem of few quotations may also be due to the age of the texts: Wikisource texts are mostly old, by authors who died before 1948. The proposed solution should not be limited to Wiktionary↔Wikisource. The same use (a sentence from Wikisource) should be useful for Wikipedia or Wikiquote too. JAn Dudík (talk) 12:30, 30 October 2018 (UTC)

Wiktionary does not only describe the language as it is used nowadays; texts can also illustrate archaic meanings. And some texts are published recently directly under a compatible licence. Like Insert media, this tool would be accessible in Wikipedia and other projects as well. I am wondering how it could be used elsewhere, as I mostly contribute to Wiktionary and Wikisource, so you are welcome to share any ideas you have. Noé (talk) 18:50, 30 October 2018 (UTC)

Voting