WikiCite 2016/Report/Group 4

Group 4: (Semi-)automated ways to add references to Wikidata statements

Room 124, 4:00 - 6:00 pm • Etherpad: Room 124

Goal

Improve tools for semi-automated statement and reference creation (e.g. StrepHit, ContentMine)

Participants

  1. Adam Shorland (Wikimedia Deutschland, Wikidata), Thursday
  2. Alex Kalderimis (RefMe), Wednesday
  3. Marco Fossati (Fondazione Bruno Kessler (FBK)), both days
  4. Scott Chamberlain (rOpenSci), Wednesday
  5. Thomas Arrow (ContentMine), Thursday
  6. Till Sauerwein (Universität Würzburg (University of Wurzburg)), both days

Summary

This work group had very specific goals that were met. The report is included in wikidata:Wikidata:Requests_for_comment/Semi-automatic_Addition_of_References_to_Wikidata_Statements.

Introduction

Quoting from the StrepHit project page:

The trustworthiness of Wikidata assertions plays the most crucial role in delivering a high-quality, reliable Knowledge Base: in order to assess their truth, assertions should be validated against third-party resources, and few efforts have been carried out under this perspective. One form of validation can be achieved via references to external (i.e., non-wiki), authoritative sources. This has motivated the development of the primary sources tool: it will serve as a platform for users to either accept or reject new references and/or assertions coming from third-party datasets. We argue that there is a need for datasets which guarantee at least one reference for each assertion.

Recommendations

The StrepHit and ContentMine teams have a common vision for Wikidata. If we manage to join forces through the support of the Wikimedia Grants program (both StrepHit and ContentMine have proposals for project grants, cf. #Resources), we will produce protocols for Wikimedia that leverage semi-automated approaches to extract facts from reliable Web sources.

Discussion

The discussion focused on the usability of the primary sources tool, a platform for data curation in Wikidata. The outcomes can be browsed at wikidata:Wikidata:Requests_for_comment/Semi-automatic_Addition_of_References_to_Wikidata_Statements. Previous discussion on the tool is available at wikidata:Wikidata_talk:Primary_sources_tool.

Resources

Appendix: workgroup notes

Raw notes from group 4.