WikiIndaba 2023/Submissions/Wikimedia Lexicography: Lexemes and Beyond

Indaba 2023 Logo ID : Wikimedia Lexicography: Lexemes and Beyond
Author(s): Masssly, Ruky Wunpini Username(s): Mohammed Sadat (WMDE) Type of submission: workshop
Affiliation: Wikimedia Deutschland Theme(s): Community Engagement, Diversity, Technology
Abstract:

Since 2018, Wikidata has also stored a new type of data: words, phrases and sentences, in many languages, described in many languages. This information is stored in new types of entities, called Lexemes (L), Forms (F) and Senses (S). Lexicographical data will serve as the basis for Abstract Wikipedia's natural language generation capabilities. A few languages including Dagbani, Hausa and Igbo were selected to become focus languages for the development of this new project.

This workshop session will walk participants through editing the Lexicographical data namespace.

Also, participants will learn how to use the Spell4Wiki app to produce audio recordings of words in their language. We will walk them through the installation of the app, and how they can add their language, record audio files in Wikimedia Commons and connect the pronunciations to Lexemes.

Level of advancement: medium
Special requirements: Participants need
  • to have already a basic understanding of Wikidata (i.e. how Properties, Values, and Qualifiers come together to form Statements
  • a laptop to participate
  • Also, a Wikimedia user account. (You can create one here)
Extra information: Etherpad, Slides
How will this session be beneficial for the communities in the region?

Attendees will get a better understanding of how Lexemes work, learn how to contribute to the Lexicographical data namespace in their native languages, and ask questions.


Interested participants edit

(register below and ask your questions now to the session organizer)

  • ...