Wikimedia CEE Meeting 2022/Programme/Submissions/Modeling our languages with Lexeme

Modeling our languages with Lexeme

Title of the submission
Type of submission (lecture, panel, workshop, lightning talk, roundtable, poster)
  • workshop
Author(s) of the submission

Asaf Bartov (Wikimedia Foundation)


Asaf (WMF)


Wikimedia Foundation

  • Capacity development
Level of advancement (basic, medium, advanced)
  • medium (basic familiarity with Wikidata would help)
Abstract (up to 100 words)

Lexeme is the name for the lexicographical layer of Wikidata, that is, the features for documenting words in Wikidata. This is an exciting opportunity to document CEE languages as structured data, more precisely and at higher quality than Wiktionary ever provided, and in a way that will enable many innovative uses. This workshop will walk you through contributing to the documentation of your own language, both manually and with the help of some tools.

How will this session be beneficial for the communities in the region of Central and Eastern Europe?

CEE has many languages, most of them under cultural pressure from larger/imperial languages; creating structured-data representations of their lexica can go a long way toward enabling innovative tools for education and reference.

Special requirements
Slides or further information
File:Documenting our languages with Lexeme.pdf

Interested attendees

If you are interested in attending this session, please sign with your username below. This will help reviewers decide which sessions are of high interest. Sign with a hash and four tildes (# ~~~~).