Wikisource Handbook/Uploading and Indexing

Wikisource Handbook
 Introduction Check Copyright Status Uploading and Indexing OCR Proofreading and Transclusion Wikidata Linkage 

After finding out the copyright status of a work, the next step is to upload it to Wikimedia Commons. There are many ways to upload a book there.

Upload tools edit

Upload Wizard

Upload Wizard is the default upload tool for Wikimedia Commons, where you can upload upto 50 files at a time from your computer. For detailed instructions on how to use Upload Wizard, it is recommended to follow the Upload Wizard page .

IA-upload tool

IA-upload [1] tool is a tool to transfer files from Internet Archive to Wikimedia Commons. The tool will only upload files with DjVu format to Commons.

To upload via this tool, you have to
  • Step 1:Go to IA-Upload and log in. It will request an “OAuth” (permission to have restricted access) from your Wikimedia Commons user account.
  • Step 2:Insert the archive.org identifier-access (the $ID portion of the URL as in https://archive.org/details/$ID) in the first field.
  • Step 3:Insert the desired filename for the file to be uploaded on Commons in the second field, without the File: prefix or .djvu suffix, and proceed.
  • Step 4:Click on ‘Get metadata’ button.
  • Step 5:Review the automatic metadata, changing it as and when needed. It will be based on Commons’ {{book}}
  • Step 6:Internet Archive has stopped creating DjVu files from March 2016. So, you can find that while some of the works there have DjVu format files, some don’t. IA-upload tool will allow you to create the DjVu format.
  • If you opt to create the DjVu from either JP2 or PDF, then your request will be placed in a queue and will usually take some time to convert. You can check the queue displayed on the tool homepage.
  • If the Archive already has created DjVu format of the work, you can select DjVu as the source and the file will be uploaded immediately.
URL2Commons

URL2Commons [2] is an uploader tool to transfer a file of any format from another website to Wikimedia Commons. While IA-upload tool can only upload DjVu files from Internet Archive, URL2Commons can upload any permissible format from any website.

  • Step 1: To use this tool, first authorise OAuth Uploader to upload in your name.
  • Step 2: In the URLs field (i.e. the first field)
  • Eenter file URLs one per line.
  • Give a space after each URL and add the desired filename for the file to be uploaded on Commons.
  • Step 3:In the Description field, it is recommended to use the {{book}} template

Check this link [3] to authorise


[Note: The books can be mass-uploaded to Commons, provided the copyright status of the books are compatible to Commons. Files can be deleted and users can be blocked if such violation occurs. Make sure to double check each file before a mass upload.]

Note
  • The file name and the index page created later will have the same title, so it is recommended that the correct title of the file be provided.
  • Provide correct description of the file, like author, publisher, publication date, license etc.
  • It is a best practice to provide the title and description in the same script as the book. For example, books with Bengali scripts are given a title and description with Bengali script.
  • Provide correct copyright license of the file.
  • Keep the files in a specific tracking category. For example: Bengali books in Commons are kept in Category:Books in Bengali

For Copyrighted works edit

Copyrighted works can also be uploaded to Wikimedia Commons, but to do so, the copyright holder will need to release his/ her work under free license by the following process

  • Step 1: Upload the book to Commons
  • Step 2: Add the free license template in which you want to release the book.
  • Step 3: Send an email to permissions-commons@wikimedia.org with evidence of permission to publish the file under a free license. This can be obtained using the Wikimedia OTRS release generator tool
  • Step 4: Add {{subst:OP}} template in the description of the file, which will add date-stamped version of the {{OTRS pending}} notification in the file.


Check for OTRS release generator tool [4] , {{subst:OP}} template, and {{OTRS pending}} The request will be handled by a team of volunteers of Open-source Ticket Request System (OTRS).

  1. They will assign a unique ticket number for the request,
  2. They will review the request and ask for more evidence if required,
  3. They will replace the {{OP}} tag with {{OTRS received}} if more evidence is required or with {{PermissionOTRS}} if they are satisfied.

Indexing (in NS:Index) edit

  • Step 1:The next step is to create Index pages (A page with Index namespace i.e. Index:) of the file in respective Wikisource
  • Step 2: If you have used the {{book}} template, you will see a Wikisource logo at the upper right corner in the file description, which when clicked, will land on your language Wikisource Index page of the file
  • Step 3:Fill up the form and save it

For backend proofreaders of Wikisource, Index pages are the main pages to work upon. An Index page will show the links to all the individual pages in the book, the progress of the proofreading status of the book by different color codes, and a quick summary of the text’s details (such as title, author etc).

 
Wikisource page proofread

The parameters of the Index page may vary in each language Wikisource, depending on the variables used in MediaWiki:Proofreadpage index data config and MediaWiki:Proofreadpage index template, but basic parameters like Title, Volume, Author, Translator, Editor, Publisher, Publication year, Cover Image, Progress, Pages and Table of Content should be consistent across all Wikisource projects. If you find these missing, or you want to add more parameters, you can discuss with this your community and ask your project admins.

Below is the short details of the basic parameters, stated above:

Parameters Explanation
Title Title of the work, should be wikilinked
Volume Volume of the work, if any
Author Author of the work, should be wikilinked with Author namespaces
Translator Translator of the work, if any, should be wikilinked with Author namespaces
Editor Editor of the work, if any
Publisher Publisher of the work
Publication year Mandatory
Cover Image Image of the page to be displayed in the index page (default to page number 1)
Progress This shows the progress of proofreading. (see the table below)[1]
Pages Get a list of pages by adding . Then index them.
Table of Content Add Table of Content, if possible
  1. Progress
Parameters Explanation
Done Validation completed for each and every pages.
To be validated All pages has been proofread, time to validate.
To be proofread OCR has been done, time to proofread
Source file needs an OCR text layer Book is ok, OCR needs to be done.
Source file is incorrect If there is any missing page or unordered page or duplicate page etc.
Pagelist needed Create a pagelist.

References edit