User talk:NPRB/Archive 8
This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Tech News: 2023-51
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Tech News
- The next issue of Tech News will be sent out on 8 January 2024 because of the holidays.
Changes later this week
- The new version of MediaWiki will be on test wikis and MediaWiki.org from 19 December. It will be on non-Wikipedia wikis and some Wikipedias from 20 December. It will be on all wikis from 21 December (calendar). There is no new MediaWiki version next week. [1][2]
- Starting December 18, it won't be possible to activate Structured Discussions on a user's own talk page using the Beta feature. The Beta feature option remains available for users who want to deactivate Structured Discussions. This is part of Structured Discussions' deprecation work. [3]
- There will be full support for redirects in the Module namespace. The "Move Page" feature will leave an appropriate redirect behind, and such redirects will be appropriately recognized by the software (e.g. hidden from Special:UnconnectedPages). There will also be support for manual redirects. [4]
Future changes
- The MediaWiki JavaScript documentation is moving to a new format. During the move, you can read the old docs using version 1.41. Feedback about the new site is welcome on the project talk page.
- The Wishathon is a new initiative that encourages collaboration across the Wikimedia community to develop solutions for wishes collected through the Community Wishlist Survey. The first community Wishathon will take place from 15–17 March. If you are interested in a project proposal as a user, developer, designer, or product lead, you can register for the event and read more.
Tech news prepared by Tech News writers and posted by bot • Contribute • Translate • Get help • Give feedback • Subscribe or unsubscribe.
Wikidata weekly summary #607
- Discussions
- New requests for permissions/Bot:
- So9qBot 7. Task: Add not found in (P9660) --> Q123739672 to Danish Lexemes
- So9qBot 8. Task: Add missing names of European legal documents to labels and aliases of items with a CELEX identifier
- LccnBot. Task: Adds Library of Congress authority ID (P244) to bibliographic entities base on library authority records.
- Other discussions: How to handle concepts of trans people on Wikidata? Should {privacy at wikidata.org} be redirected to {privacy at wikimedia.org} or should it be monitored by Wikidata volunteers? Join the discussion!
- New requests for permissions/Bot:
- Events
- Upcoming: Next Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour December 18th, 2023: Over the summer and into the fall the LD4 Wikidata Affinity Group will be offering a series of Wikidata Working Hours to give folks an opportunity to try out various Wikidata-related skills and tools by assembling a data set of diverse library and information science (LIS) materials (articles, conference proceedings, books) and adding it to Wikidata. Wikidata Working Hours provide hands-on Wikidata experience in a supportive space. We hope you will join us if you are interested in learning more about Wikidata, exploring LIS literature, and have been looking for a fun Wikidata project to contribute to.The ninth and final Wikidata Working Hour in the series will be using SPARQL and Scholia to query and visualize the data we’ve added to Wikidata during our series. This session will be recorded and the recording shared on the event page
- Ongoing: Weekly Lexeme Challenge #121: Pottery
- Press, articles, blog posts, videos
- Blogs: #LD42023. Part I: The Future of Wikidata + Libraries (A Workshop) - This blog series explores how libraries engage with Wikidata and Linked Data in the face of AI challenges. Led by Silvia Gutiérrez and Giovanna Fontenelle from the Wikimedia Foundation, the series summarizes insights from a collaborative session at the 2023 LD4 Conference, using Design Thinking strategies to connect the Library-Wikidata community with WMF, focusing on Wikidata, Wikibase, and Structured Data on Commons (SDC) in libraries. By Silvia Gutiérrez & Giovanna Fontenelle
- Papers
- Wikipedia gender gap: a scoping review - This review analyzes Wikipedia's gender gap from 2007 to 2022, revealing a slight majority of female authors, addressing key themes, and exploring strategies to mitigate the gap, providing valuable insights into the research landscape in this domain. By Núria Ferran-Ferrer, Juan-José Boté-Vericad and Julia Minguillón.
- Ten years of Wikidata: A bibliometric study - This research delves into scholarly publications about Wikidata from its inception in 2012 to late 2022, revealing 945 relevant papers, primarily from conferences. The analysis highlights a concentration of experts and contributors from the Global North, as well as governmental institutions as predominant funders. The study calls for enhanced networking and outreach to promote diversity and inclusion within the Wikidata research community. Emphasizing computer science perspectives, the research focuses on methods for developing and utilizing open knowledge graphs, notably Wikidata, with a narrower but significant interest in application-oriented studies in digital humanities, biology, and healthcare. (Turki, et al)
- Videos
- Duplicating Everywhere All at Once | Cebuano Wikipedia - Five years ago, Lsjbot's Wikipedia articles caused duplicate Wikidata items, notably impacting geographic places on Cebuano Wikipedia. This video by User:Canley at Wikimania 2023 delves into the history, visualizes the issue, and suggests cleanup strategies for Wikidata and Wikipedia, emphasizing Aotearoa New Zealand and parts of Australia, with implications for the global challenge of bot-created duplicates.
- Useful Authorities for Data-Driven Collection Research with Alicia Fagerving - Alicia Fagerving, Wikimedia Sverige, introduces the project "Useful Authorities for Data-Driven Collection Research" and Wikidata. The project, spanning 2021-2023, links vocabularies from the databases of Nationalmuseum and Statens historiska museer to Wikidata, exploring it as a platform for semantic interoperability among cultural heritage institutions and providing tools and visualizations for similar projects.
- 2023: OSM-Wikidata Map Framework. Combining OpenStreetMap and Wikidata allows to leverage the strengths of the two projects to create richer maps. This talk explores how OSM-Wikidata Map Framework simplifies this process. By Daniele Santini
- Press: Adriano Rutz wins the Swiss National Open Research Data (ORD) Prize for “The LOTUS Initiative” project. LOTUS explores new ways of promoting the re-use of data in the fields of biology and chemistry and thus of sharing knowledge in natural products research. More coverage
- Notebooks
- It's not bad! Measuring Gérard Depardieu's mark on French cinema (in French) - The analysis centers on Gérard Depardieu's impact on French cinema amid legal issues and sexual assault allegations. Despite difficulties in addressing these accusations, the author leverages Wikidata to measure Depardieu's influence by querying films from directors born after 1930 to assess his involvement.
- How to Become a Billionaire: A Billionaire's Occupations Network Analysis - This network analysis investigates billionaires’ primary sources of income with a network graph—based on their occupations—connecting billionaires from all over the world and uncovering some of the biggest industries in the world.
- Documentation: User:Mahir256 statred Lexemes documentation pages about Lemmata and Lexeme languages. Your contributions are welcome.
- Tools of the week
- Drama Corpora Project (DraCor) is a digital database of plays, primarily from Europe. It collects and organizes texts of plays in a way that allows researchers and others to extract and analyze information from those texts. This could include details about the characters, the dialogue, the stage directions, and more. The data is being pulled from Wikidata.
- Magnus Manske added a new game to the Wikidata game to identify duplicate Items for researchers.
- Mike Peel set up a new Distributed Game to add links to Wikiquote to Wikidata.
- Other Noteworthy Stuff
- Wikibase Cloud has a new website. Check it out: https://www.wikibase.cloud/
- Did you know?
- Newest properties:
- General datatypes:
- battery life (length of time after a full charge that a device can continue to work under normal use before it needs its battery to be recharged)
- External identifiers: Atari-8-bit Forever game ID, Museum of Fine Arts of Rennes object ID, FNAC artwork ID, Akadem person ID, Game Classification game ID, Game Classification machine ID, Game Classification creation tool ID, TaiCOL ID (new version), a8.fandal.cz ID, Stadium 64 ID, Filmweb.no film ID, SixtyFour Originals DataBase game ID, BUGZ ID
- General datatypes:
- New property proposals to review:
- General datatypes:
- Substances in the reference (Substances studied in reference works such as papers, reports, etc.)
- Book format (Page size of a historical book, manuscript, or artwork on paper, based on folding sheets into leaves)
- beneficial owner ()
- External identifiers: DraCor ID, PCSX2 Wiki ID, Twitch numeric channel ID, identifiant article ORBi, identifiant auteur ORBi, TheTVDB IDs, RPCS3 Wiki ID, Retskrivningsordbogen ID, Kanjipedia word ID, MAMCS ID, Citra compatibility database ID, IGN wiki article ID, AreWeAntiCheatYet ID, RPGFan game ID, Swissubase ID, goalzz.com team ID, MilliBase taxon ID, Digicarmel ID, Arcade Hub ID, Great American Business Leaders of the 20th Century ID, Consortium of Lichen Herbaria taxon ID, Biota of New Zealand ID, NientePopCorn IDs, HistoriaGames game ID
- General datatypes:
- Query examples:
- Newest database reports: User:Pasleim/Unsupported sitelinks - Found 279 items
- Showcase Items: lion (Q140) - species of big cat
- Showcase Lexemes: cevap (L1124154) - Turkish noun for 'answer' derived from the Arabic noun جَواب
- Newest properties:
- Development
- Wikibase REST API:
- We finished adding the endpoints for adding aliases in a given language for a Property (phab:T343721) and removing a Property's label in a given language (phab:T342983)
- We started working on the endpoint for removing a Property's description in a given language (phab:T342985)
- We are fixing an issue with incorrect handling of lowercase statement IDs in edit requests (phab:T352644)
- Special:PrefixIndex now shows label/lemma for Properties and Lexemes (phab:T343115)
- Language codes: We changed where Wikidata is getting its languages from for Lexemes and Monolingual text statements and thereby resolved many tasks requesting another language being added to them (phab:T341409)
- Wikibase REST API:
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
The Signpost: 24 December 2023
- Special report: Did the Chinese Communist Party send astroturfers to sabotage a hacktivist's Wikipedia article?
- News and notes: The Italian Public Domain wars continue, Wikimedia RU set to dissolve, and a recap of WLM 2023
- In the media: Consider the humble fork
- Discussion report: Arabic Wikipedia blackout; Wikimedians discuss SpongeBob, copyrights, and AI
- In focus: Liquidation of Wikimedia RU
- Technology report: Dark mode is coming
- Recent research: "LLMs Know More, Hallucinate Less" with Wikidata
- Gallery: A feast of holidays and carols
- Comix: Lollus lmaois 200C tincture
- Crossword: when the crossword is sus
- Traffic report: What's the big deal? I'm an animal!
- From the editor: A piccy iz worth OVAR 9000!!!11oneone! wordz ^_^
- Humour: Guess the joke contest
Wikidata weekly summary #608
- Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here: What changes would you like to see in the newsletter in 2024?"
- Discussions
- Import sitelinks, labels, descriptions from ku wikipedia pages which use the template w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
- Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
- Events
- Upcoming: Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
- Ongoing: Weekly Lexeme Challenge #122: Rock-forming minerals
- Press, articles, blog posts, videos
- Blogs
- African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
- Papers: Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs by (Conia et al, 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
- Videos
- Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
- Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
- Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi, about Wikidata for African Librarians during the Wiki Indaba conference, that took place between 3-5 November 2023 in Agadir, Morocco.
- No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well enable writing information back to source and federating with other linked institutions.
- Wiki(s)data #5: Wikidata Live editing (in Italian) --> The ontology of Wikidata: how to interact with it for a better quality, by Epìdosis
- Notebooks
- Map of K-Pop Idols --> An interactive map where each red dot represents a K-pop Idol (a singer or musician in South Korean Pop music) you are able to click on.
- Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
- The Gender-Equality Gap in STEM Awards --> A network graph and multiple data visualizations on UCLA's alumnni awards based on gender.
- Exploring The Belichick Coaching Tree --> This analyses details the coaching tree of the prolific American Football coach Bill Belichick.
- State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
- An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
- Blogs
- Tool of the week
- Cersei - is a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on eg JavaScript to navigate. It can therefore access data sources that can not be accessed via eg Mix'n'match. The data from sources can be updated regularly, either for everything, or just changed entries (if the source has a "recent changes" equivalent).
- Wikidata:Zotero/Cita - is a Wikidata addon for Zotero that adds citations (i.e., what other items an item cites) metadata support to this open source reference management software, using cites work (P2860) information available from Wikidata, and enabling users to easily contribute missing data.
- Other Noteworthy Stuff =
- Job opening: Data Scientist / Knowledge Engineer to use Wikidata as a foundational layer for an US National Science Foundation (NSF) funded Prototype Open Knowledge Network.
- Did you know?
- Newest properties:
- General datatypes: none
- External identifiers: WHDLoad database ID, Shanghai Library movie ID, PCSX2 Wiki ID, KRS number, Twitch numeric channel ID, RPCS3 Wiki ID, Black Games Archive ID, Citra compatibility database ID, DraCor ID, ORBi article ID, IGN wiki article ID, AreWeAntiCheatYet ID, RPGFan game ID, Arcade Hub ID
- New property proposals to review:
- General datatypes:
- Laws of Malaysia URL (Uniform Resource Locator for laws of Malaysia)
- production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
- External identifiers: Schnittberichte.com ID, National Library of Malaysia OPAC ID, HistoriaGames series ID, Kemono Games game ID, Internet Game Database event ID, GamesMeter ID, Walk Score ID, Malaysia company new number, Am Faclair Beag ID, xemu compatibility database ID, Sofascore player ID, GameGear.jp ID, RPGWatch IDs, Team England ID, TORCH taxon ID, ScummVM ID, Abandonware France IDs
- General datatypes:
- Query examples:
- Newest WikiProjects: WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
- Newest database reports: children of dead mothers - List of mother-children pairs, where death date of parent < birth date of child
- Showcase Items: Esperanto (Q143) - international auxiliary language designed by L. L. Zamenhof
- Showcase Lexemes: L1222568 (বড়দিন) - Bengali noun for 'Christmas'
- Newest properties:
- Development
- Due to the winter holidays, the development team is taking a break and no deployment is happening for Wikidata at the moment.
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
Wikidata weekly summary #609
- Discussions
- Open request for adminship: WikiBayer (RfP scheduled to end after 8 January 2024 12:01 UTC)
- Closed request for adminship: EPIC (closed as successful). Welcome onboard \o/
- New requests for permissions/Bot: HVSH-Bot . Task: Import data about politicians from the Q119949776, now only partially online available.
- Events
- Upcoming: The next Wikidata+Wikibase office hours will take place on Wednesday, 17:00 UTC, 17th January 2023 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
- Ongoing: Weekly Lexeme Challenge #123: Ologist
- Press, articles, blog posts, videos
- Papers: Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
- Videos: WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video Lydia Pintscher takes a moment to review the major events of Wikidata over the past few years. Then turns to look forward and predict what Wikidata's prospects will be over the next year.
- Tool of the week
- WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
- Did you know?
- Newest properties:
- General datatypes: none
- External identifiers: Shamela book edition ID, HistoriaGames series ID, Schnittberichte.com title ID, Kemono Games game ID, Internet Game Database event ID, Xemu compatibility database ID, GamesMeter game ID, GameGear.jp ID, Walmart product ID, Swissubase person ID, RPGWatch game ID, RPGWatch company ID, RPGWatch press ID, Indie DB company ID, NIWA article ID, turismoroma.it place ID, ScummVM ID, ORBi author ID, Abandonware-France video game series ID, Abandonware-France video game compilation ID, Abandonware-France person ID, Abandonware-France company ID, Abandonware-France magazine ID, Abandonware-France award ID, Kanjipedia word ID, Moviefone movie ID, South African NPO number, Nigerian registered company ID, Abandonware-France video game ID, AFJV directory ID
- New property proposals to review:
- General datatypes:
- Nonprofit Status (Indicating the legal and tax status of a non-profit organization (specific to served legal areas, aka. Countries). Addition to {{P|1454}}. {{P|1628}} to [https://schema.org/nonprofitStatus nonprofitStatus] from schema.org. Organizations can have multiple Nonprofit Status from different countries.)
- International Classification of Nonprofit Organizations ({{Q|2976602}} for {{Q|163740}} created by the {{Q|193727}} and adapted by the {{Q|1065}}.)
- creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
- television judge ()
- External identifiers: SERNEC taxon ID, Consortium of Bryophyte Herbaria taxon ID, Rhineland-Palatinate school ID, nebula channel id, Deutsche Bahn station number, ISzDb series ID, BG localisation unit ID, Cathopedia article ID, Native Plants Hawaii ID, Taiwan Biographical Database ID, Penstemon Database ID, Wikisage ID
- General datatypes:
- Query examples:
- Newest database reports: Merge candidates: Identical birth and death dates
- Showcase Items: Team Fortress 2 (Q382108) - team-based first-person shooter multiplayer video game.
- Showcase Lexemes: ورھا لگّݨ / ਵਰ੍ਹਾ ਲੱਗਣ (L907713) - Punjabi verb expressing the setting in of a new year.
- Newest properties:
- Development
- The development team is just returning from the winter holidays so there is no development update at the moment.
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
Wikidata weekly summary #610
Discussions
- Closed request for adminship: WikiBayer (closed as successful). Welcome onboard \o/
- New requests for permissions/Bot: So9qBot 9. Task: Add DDO identifier to Danish lexemes.
- Upcoming:
- The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
- Wiki Mentor Africa (WMA) Hackathon, 19th to 21st January 2024
- Forschungsdatenmanagement: Wikidata as a collaborative information resource on research data management (German), takes place online, Wednesday 10th January 2024, 10-11am (CET).
Press, articles, blog posts, videos
- Blogs: PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
- Papers
- Linked data: un’opportunità per il riuso (Q124079430) "scientific article published in 2023" (paper in Italian) - deals with linked data in library catalogues, with many mentions of Wikidata.
- Automatically Constructed Indonesian Question Answering Dataset by Leveraging Wikidata by K. Doxolodeo & A.A. Krisnadhi - researchers have created a new Indonesian Question Answering dataset that is produced automatically end-to-end using Context Free Grammar, the Wikipedia Indonesian Corpus, and the concept of the proxy model
- LIS Journals’ Lack of Participation in Wikidata Item Creation by Eric Willey & Susan Radovsky, discusses the gap of Wikidata items being created for scholarly articles by the scholar's themselves and if this can lead to inconsistent or inaccurate data model.
- Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
- Videos
- Map Kerala Initiative is an opendata portal geospatial map powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
- This video on Biodiversity Explorations with Machine Learning: Biodiversity Data Access Functions shows how Wikidata is being used to populate species entity profiles at Wolfram U, presented by Jofre Espigulé-Pons.
- Notebooks: Wikipedia article as a timeline - This tool transforms a Wikipedia article in a timeline by parsing all internal links in a Wikipedia article and retrieving the date corresponding to each internal link using the point in time (P585) property in Wikidata.
Tool of the week Map your list of created articles - a notebook display of geolocated articles on a map created by a user per chosen project and batch (featured/good article).
Other Noteworthy Stuff Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project Software Collaboration for Wikidata prematurely. Read their joint statement here.
Newest properties and property proposals to review
- Newest General datatypes:
- Flora of the Hawaiian Islands URL (URL of the entry for a plant genus, species, subspecies, or variety in the Flora of the Hawaiian Islands website)
- (Montana Plant Life URL (URL for a plant family, genus, or species on the Montana Plant Life website)
- plate(s) (plate number(s) in the reference source being cited to support the statement being made)
- Newest External identifiers: Abandonware-France book ID, MilliBase taxon ID, Monasticon Hibernicum database ID, Rhineland-Palatinate school ID, Enciclopedia di Roma monument ID, Enciclopedia di Roma street ID, Mid-Atlantic Herbaria Consortium taxon ID, The Criterion Collection spine number
- New General datatypes property proposals to review:
- Water bottle volume (Volume of the water bottle)
- Is it metric? (To check if it's a metric.)
- Anti-Cheat software used (anti-cheat solution used by this multiplayer video game)
- New External identifier property proposals to review: turismo.marche.it place ID, Joseph Smith Papers person ID, Team Scotland ID, Globoplay ID, DoblajeVideojuegos game ID, National Natural Parks System ID, Commonwealth Games Australia ID, Adventure-Treff game ID, TouchArcade game ID, Mod.io ID, The Models Resource game ID, The Models Resource entity ID, tourist information point number, Jinji Koshinjyo ID, Bandcamp track ID
Did you know?
- Query examples:
- Newest WikiProject: Podcast Episodes 2024 - The goal of this project is to add episode pages for individual podcasts.
- Newest database report: children of unborn parents
- Showcase Item: Helsinki (Q1757) - capital and most populous city of Finland
- Showcase Lexeme: Allah korusun (L1226849) - Turkish for 'God forbid'
Development
- IP masking/temporary accounts: We are adjusting Wikibase to be prepared for the upcoming changes to no longer expose IP addresses for non-logged-in users (phab:T351968)
- Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (phab:T305660)
- Wikibase REST API:
- We finished the work on making it possible to get all sitelinks of an Item (phab:T344041)
- We are working on getting a sitelink for a given wiki (phab:T344039)
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
Weekly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to the showcase Item and Lexeme above.
- Participate in this week's Lexeme challenge: Ologies
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
Tech News: 2024-02
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Recent changes
- mediawiki2latex is a tool that converts wiki content into the formats of LaTeX, PDF, ODT, and EPUB. The code now runs many times faster due to recent improvements. There is also an optional Docker container you can install on your local machine.
- The way that Random pages are selected has been updated. This will slowly reduce the problem of some pages having a lower chance of appearing. [5]
Changes later this week
- The new version of MediaWiki will be on test wikis and MediaWiki.org from 9 January. It will be on non-Wikipedia wikis and some Wikipedias from 10 January. It will be on all wikis from 11 January (calendar). [6][7]
Tech news prepared by Tech News writers and posted by bot • Contribute • Translate • Get help • Give feedback • Subscribe or unsubscribe.
The Signpost: 10 January 2024
- From the editor: NINETEEN MORE YEARS! NINETEEN MORE YEARS!
- Special report: Public Domain Day 2024
- Technology report: Wikipedia: A Multigenerational Pursuit
- News and notes: In other news ... see ya in court!
- WikiProject report: WikiProjects Israel and Palestine
- Obituary: Anthony Bradbury
- Traffic report: The most viewed articles of 2023
- Comix: Conflict resolution
Wikidata weekly summary #611
Discussions
- New request for comments: Community request for the development team to access inverse properties on client wikis. (Summary: We currently cannot access inverse property values on Wikipedia. This can be a data management issue on Wikipedia as we must always ask ourself if we must introduce an inverse property for cases where we need them. So I think it’s useful to gather the usecases community would want and draft a request for an API to the devteam to do that.)
- Upcoming: The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
- Past
- Provenance Loves Wiki (PLW24), Jan 12th - 14th, research and data on the origin of artworks and cultural heritage and how Wikibase and Wikidata can support this.
- WikiLovesWomen #SheSaid campaign wrapped up the 2023 campaign by visiting Kinshasha and Kisangani, where local Wikimedians improved quotes from women on FR Wikipedia and Wikidata.
Press, articles, blog posts, videos
- Blogs
- Building Connected Libraries in Nigeria: --> Reflections from the Wikibase Journey on collaboration and resouce sharing between Nigerian Libraries.
- Wikidata and ChatGPT integration failure --> read about Finn Årup Nielsen's attempts to integrate LLM's with Wikidata.
- QLever: a new way to query OpenStreetMap --> Discussion of the new opportunities offered by QLever to query OpenStreetMap and to run federated queries with Wikidata
- Wikidata for authority control: 3 years of work --> The three-year Wikidata for authority control project, a collaboration between Wikimedia Sverige and Swedish museums, concluded in December 2023. It equipped museum staff with tools and skills to integrate their authority databases with Wikidata, resulting in added identifiers, SPARQL query proficiency, and enhanced knowledge sharing within the GLAM sector.
- Go-ahead for Wikidata Project of GLAM institutions from Baden-Württemberg --> The GLAM-BW project, under "GLAM goes OpenData," connects major collections in Baden-Württemberg, focusing on the württembergische Kunstkammer. With over 3,000 objects, the project integrates information on collectors, histories, and objects into a knowledge graph for semantic searches, contributing to the broader realm of linked open data, akin to Wikidata.
- Swiss GLAM Programme --> Wikimedia CH imported the Museum of Natural History of Neuchâtel's urchin fossil casts to Wikimedia Commons, connecting structured data on Wikidata. The project involved data cleaning, adding missing elements, and file imports via OpenRefine, highlighting seamless integration between Wikidata and Commons.
- Papers
- Reflections on the PCC Wikidata Pilot at UCLA Library: --> Undertaking the PCC Learning Objectives. Discusses the 14-month Pilot programme for cooperative cataloguing of UCLA Library and Museum Collections. By E. Zhang, P. Biswas & I. Dagher.
- Few-Shot Event Classification in Images using Knowledge Graphs for Prompting --> How can Wikidata and Wikipedia help Vision-Language Models improve their classification of images. Tahmasebzadeh et al., 2024.
- Videos
- SMWCon 2023: Semantics, Wikis, and AI --> Day 1, Keynote by Prof. Markus Krötzsch who explores origins and principles of semantic wikis and key challenges that lie ahead in managing knowledge.
- GLAM on Tour 2023 im Museum Barberini (German) --> find out what Museums have got to do with Wikipedia, Wikimedia Commons and Wikidata. More Info Here.
- Interactive notebooks: GLAM : Geolocated and Labelled Articles Map - explore Featured and Good Wikipedia articles through a map, powered by Wikidata.
Tool of the week
- Brian M Sperlongano released US boundary QA checker, a quality assurance tool for finding issues with boundary data in the United States by using Wikidata, OpenStreetMap, and US Census Bureau data.
- The Surrounding Ocean (available at vrandezo.github.io/TheSurroundingOcean) - is a tool that allows you to browse lexicographical data. You can use the tool to explore words and their meanings, translations, and synonyms. The tool is currently under development, and the developer, Danny, would appreciate feedback to fix any issues with the tool. More info: Wikidata:The Surrounding Ocean.
Other Noteworthy Stuff
- OS-Wikidata Map Framework List of tools and maps which combines OSM and Wikidata.
- Call for Projects and Mentors for Google Summer of Code 2024 and Outreachy Round 28 is OPEN!
- Got an idea for a project to reclaim the public nature of the internet? With Wikidata? NLNet has a new fund you could apply to.
Newest properties and property proposals to review
- Newest General datatypes: none
- Newest External identifiers: Walk Score ID, HistoriaGames game ID, Deutsche Bahn station number, Legends Tour player ID, Moscow Cultural Heritage ID, USOPC Hall of Fame ID, Cathopedia article ID, TouchArcade game ID, DoblajeVideojuegos game ID, Adventure-Treff game ID, Biota of New Zealand ID, Consortium of Bryophyte Herbaria taxon ID, Consortium of Lichen Herbaria taxon ID, Native Plants Hawaii ID, SERNEC taxon ID, TORCH taxon ID, Penstemon Database ID, Digicarmel ID, Retskrivningsordbogen ID, MAMCS artwork ID, Sofascore player ID, Mod.io game ID, Sina Chinese Basketball player ID, turismo.marche.it place ID
- New General datatypes property proposals to review:
- describes (Data objects that are described by this entity (e.g. an encyclopedia or topic-related book; intended for input of several data objects.))
- memory type (specifies the type of working memory of this data object)
- filial church (church which acts as the less important temple of a parish)
- TOPO id (unique code to identify topographical features of France (department, city, thoroughfare...))
- New External identifier property proposals to review: Bluesky handle, Merkur author ID, Bavarian school ID, Rugby Database ID, Playstation Store Concept ID, ArchDaily Architecture Office ID, Il Nuovo De Mauro ID, Bluesky DID
Did you know?
- Query examples:
- Newest WikiProjects: WikiProject Decolonise Wiki --> intends to focus on decolonising text, depictions and media within all relevant Wikipedia articles.
- WikiProject Highlights: Ontology Cleaning Task Force: A group of people have started a task force to discuss problems with the Wikidata ontology and how to clean them up. Anyone interested in participating is welcome. The task force maintains Wikidata:WikiProject Ontology/Cleaning Task Force as a record of its activities. You can add yourself to the participants list there and find out how to join group meetings or otherwise participate in the group. (Got something noteworthy happening in your WikiProject? Share it in the upcoming issue!)
- Newest database reports: Lexicographical data/Reports/Empty lexemes - Lexemes with no statements, no forms and no senses. (Do you see Lexemes from your language in the list that you can fix?)
- Showcase Items: January 15, 2018 (Q45919591) - Monday in January 2018
- Showcase Lexemes: در جنگ حلوا بخش نمیکنند (L1081423) - Persian with a meaning similar to "all's fair in love and war"
Development
- IP masking: We are working on adjusting Wikibase to handle the upcoming introduction of IP masking, which will give editors who are not logged in a temporary account name instead of using their IP to attribute edits to (phab:T351968)
- Lexicographical data: We are changing how empty Senses and Forms are represented in the dumps (phab:T305660)
- mul language code: We are doing user testing for the current implementation to see if it is understandable for people.
- Mismatch Finder: We are continuing the work on migrating it to the Codex design system.
- REST API:
- We improved the handling of lower-case statement IDs (phab:T354262)
- We are working on getting a sitelink for a given wiki (phab:T344039)
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
Weekly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to the showcase Item and Lexeme above.
- Participate in this week's Lexeme challenge: Pigs
- Summarize your WikiProject's ongoing activities in one or two sentences.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!