Event:WikiCon Australia 2024/Submissions/Using OpenRefine & IRMNG to improve Australian Biodiversity
Using OpenRefine & IRMNG to improve Australian Biodiversity
editAbstract/description
editWe
- demonstrate how to download a Darwin core csv file from IRMNG which may represent the taxa named by a particular taxonomist. The list will not be complete as IRMNG is very incomplete with respect to Australian Faunal Directory and World Register of Marine Species taxon databases.
- import this file into openRefine and create a project.
In openRefine, we learn to
- reconcile columns... with taxon names (Accept only perfect matches NOT synonyms)
- create new columns
- by splitting a column
- by copying a column
- by using GREL functions such as substring, replace, indexOf ...
- subset for further processing (and using flags and stars)
An alternative approach
editUsing the following queries for APNI and AFD taxa:
- For genera with APNI ids (and no authority) plus taxon author citation
- For species with APNI ids (and no authority)
- For genera with AFD ids (and no authority) plus taxon author citation
- for AFD arachnid genera (limiting a query)
- For species with AFD ids (and no authority)
Modify these queries
edit- to pick a family, genus, order
and download the query result as a CSV file
The tasks thereafter closely match those discussed above and include
- forming links to the APNI and AFD pages for the taxon
- grabbing the authority and the publication from these links
to create lists of authors, taxon year of publication, publication name and page, and again, creating a schema to upload the reconciled authors and publications to wikidata.
What I am hoping to achieve
editAt the end of the session, participants will have learned
- how to create a project in openRefine
- why & how to facet
- how to split a column (and how to undo an action)
- how to reconcile a column with its wikidata
- some useful GREL functions
- how to create a schema for uploading data to wikidata
to ultimately create Wikidata entries like that for Illawarra wisharti.
Relationship to Wiki skills or to the theme
editLearning how to use openRefine to import statements and items into Wikidata
Username/s
edit- MargaretRDonald (talk) 01:27, 26 September 2024 (UTC)
Session type & duration
edit4 x two hour online sessions