WikiCite/2020 Virtual conference/Align your Open Access Journal with Wikidata - using Python and OpenRefine
![](http://upload.wikimedia.org/wikipedia/commons/thumb/b/b8/WikiCite_Wikidata_8th_Birthday_logo.png/300px-WikiCite_Wikidata_8th_Birthday_logo.png)
Open citations & linked bibliographic data | 26-28 October 2020 | #WikiCite
Part of Celebrating Wikidata's 8th Birthday | #WikidataBirthday
12:15 UTC
15min |
Summary
editOpen Access Repositories like journal websites offer free accessible APIs like OAI-PMH to get access to the bibliographical metadata. In this talk I will present a lightweight python script (as a jupyter notebook) available under gitlab.com/LibrErli1/parse_ojs_oai_2_wikidata. This script allows the scraping of any OAI2 conform site and extract all the necessary bibliographic values in a serialized json-file for a wikidata ingest. This json-output will be used for further processing in OpenRefine (e.g. linking and disambiguate authors or main subjects with Wikidata) and to prepare the upload to Wikidata.
Links
edit- Recording of talk online at TIB-AV-Portal.
- gitlab.com/LibrErli1/parse_ojs_oai_2_wikidata OAI-PMH Parser - Jupyter Notebook
Bio
editChristian Erlinger works as systems librarian at Vienna Public Libraries. Twitter: @LibrErli