Community Wishlist Survey 2019/Wiktionary/Multiple collations per site

Multiple collations per site

  • Problem: It is extremely common, on Wiktionary projects, to display entries of multiple languages on the same page. But, only one collation can be used on a particular Wikimedia project. That means: if a website uses a language-compliant collation, e.g. uca-default which is a English- and Portuguese-friendly collation, all categories concerning e.g. Swedish words, will sort words starting with Å under A, because Å is considered in English to be the same letter than A with a diacritic, while it is a whole new letter in Swedish (where it is sorted at the near end of the alphabet). Categories' headers are therefore incorrect for many languages with the current solution used on Wiktionary projects.
    Currently a way to circumvent the problem is to use the default Mediawiki collation (namely uppercase), but this implies that sort keys are added in all English/French/etc. entries with a diacritic in the title, as Å, É, etc., as all diacritic letters are considered as first-entry headers in categories, and this implies a huge amount of sort keys in pages to bypass this behavior (and thus sort Å under A for e.g. English), and makes Wiktionary projects less readable and editable for newcomers.
  • Who would benefit: users of Wiktionary categories, and new editors to all Wiktionary projects
  • Proposed solution: allow multiple collations per site, and therefore collation to be specified per category: uca-sv should be used for Swedish-related categories, uca-es for Spanish cats, uca-default for English (and similar), etc.
  • More comments: Liangent and Bawolff have been working on this in the past, but feasability seems also to depend on sysadmins (for increased system load).
  • Phabricator tickets: phab:T30397
  • Proposer: Automatik (talk) 12:18, 11 November 2018 (UTC)



  •   Support Leiem (talk) 05:35, 17 November 2018 (UTC)
  •   Support Libcub (talk) 11:56, 17 November 2018 (UTC)
  •   Support Urhixidur (talk) 13:14, 17 November 2018 (UTC)
  •   Support Noé (talk) 16:16, 17 November 2018 (UTC)
  •   Support Liuxinyu970226 (talk) 01:02, 18 November 2018 (UTC)
  •   Support Psychoslave (talk) 04:01, 18 November 2018 (UTC)
  •   Support Pamputt (talk) 21:00, 18 November 2018 (UTC)
  •   Support My bot is barely treating a dozen languages by adding or removing defaultsort every day on the French Wiktionary, whereas we have more than 4,500 languages. JackPotte (talk) 22:09, 18 November 2018 (UTC)
  •   Support Peter Bowman (talk) 22:34, 19 November 2018 (UTC)
  •   Support Otourly (talk) 10:00, 20 November 2018 (UTC)
  •   Support absolutely needed on multilingual projects like Wiktionaries to make category sorting correct and the {{DEFAULTSORT:}} keyword close to useless. — Automatik (talk) 13:04, 20 November 2018 (UTC)
  •   Support Vulphere 13:05, 20 November 2018 (UTC)
  •   Support Absolutely needed for a good online multilingual dictionary ! Lyokoï (talk) 14:58, 20 November 2018 (UTC)
  •   Support Pom445 (talk) 16:34, 20 November 2018 (UTC)
  •   Support Really needed. An additional feature would be the kind of sorting wanted: two sorts are in use in dictionaries: skipping spaces, commas, apostrophes, etc. or skipping only apostrophes, and changing characters such as ",", "-", "/" and other punctuation to a space (and skipping redundant spaces). This second way is used at least in fr.wikt, and would probably be the best one for all wiktionary readers. Lmaltier (talk) 18:39, 20 November 2018 (UTC)
  •   Support Thibaut120094 (talk) 21:00, 20 November 2018 (UTC)
  •   Support A good idea. We need it for the intended Konkani Wiktionary in Goa. Fredericknoronha (talk) 21:20, 20 November 2018 (UTC)
  •   Support Novak Watchmen (talk) 15:23, 21 November 2018 (UTC)
  •   Support KaMan (talk) 16:53, 23 November 2018 (UTC)
  •   Support Metaknowledge (talk) 23:37, 23 November 2018 (UTC)
  •   Support Krokus (talk) 12:59, 25 November 2018 (UTC)
  •   Support Erutuon (talk) 22:13, 27 November 2018 (UTC)
  •   Support Curious (talk) 21:20, 28 November 2018 (UTC)