Wikisource Handbook/Uploading and Indexing
Introduction | Check Copyright Status | Uploading and Indexing | OCR | Proofreading and Transclusion | Wikidata Linkage |
After finding out the copyright status of a work, the next step is to upload it to Wikimedia Commons. There are many ways to upload a book there.
Upload tools
edit- Upload Wizard
Upload Wizard is the default upload tool for Wikimedia Commons, where you can upload upto 50 files at a time from your computer. For detailed instructions on how to use Upload Wizard, it is recommended to follow the Upload Wizard page .
- IA-upload tool
IA-upload [1] tool is a tool to transfer files from Internet Archive to Wikimedia Commons. The tool will only upload files with DjVu format to Commons.
- To upload via this tool, you have to
- Step 1:Go to IA-Upload and log in. It will request an “OAuth” (permission to have restricted access) from your Wikimedia Commons user account.
- Step 2:Insert the archive.org identifier-access (the $ID portion of the URL as in https://archive.org/details/$ID) in the first field.
- Step 3:Insert the desired filename for the file to be uploaded on Commons in the second field, without the File: prefix or .djvu suffix, and proceed.
- Step 4:Click on ‘Get metadata’ button.
- Step 5:Review the automatic metadata, changing it as and when needed. It will be based on Commons’ {{book}}
- Step 6:Internet Archive has stopped creating DjVu files from March 2016. So, you can find that while some of the works there have DjVu format files, some don’t. IA-upload tool will allow you to create the DjVu format.
- If you opt to create the DjVu from either JP2 or PDF, then your request will be placed in a queue and will usually take some time to convert. You can check the queue displayed on the tool homepage.
- If the Archive already has created DjVu format of the work, you can select DjVu as the source and the file will be uploaded immediately.
- URL2Commons
URL2Commons [2] is an uploader tool to transfer a file of any format from another website to Wikimedia Commons. While IA-upload tool can only upload DjVu files from Internet Archive, URL2Commons can upload any permissible format from any website.
- Step 1: To use this tool, first authorise OAuth Uploader to upload in your name.
- Step 2: In the URLs field (i.e. the first field)
- Eenter file URLs one per line.
- Give a space after each URL and add the desired filename for the file to be uploaded on Commons.
- Step 3:In the Description field, it is recommended to use the {{book}} template
Check this link [3] to authorise
[Note: The books can be mass-uploaded to Commons, provided the copyright status of the books are compatible to Commons. Files can be deleted and users can be blocked if such violation occurs. Make sure to double check each file before a mass upload.]
- Note
- The file name and the index page created later will have the same title, so it is recommended that the correct title of the file be provided.
- Provide correct description of the file, like author, publisher, publication date, license etc.
- It is a best practice to provide the title and description in the same script as the book. For example, books with Bengali scripts are given a title and description with Bengali script.
- Provide correct copyright license of the file.
- Keep the files in a specific tracking category. For example: Bengali books in Commons are kept in Category:Books in Bengali
For Copyrighted works
editCopyrighted works can also be uploaded to Wikimedia Commons, but to do so, the copyright holder will need to release his/ her work under free license by the following process
- Step 1: Upload the book to Commons
- Step 2: Add the free license template in which you want to release the book.
- Step 3: Send an email to permissions-commons@wikimedia.org with evidence of permission to publish the file under a free license. This can be obtained using the Wikimedia OTRS release generator tool
- Step 4: Add {{subst:OP}} template in the description of the file, which will add date-stamped version of the {{OTRS pending}} notification in the file.
Check for OTRS release generator tool [4] , {{subst:OP}} template, and {{OTRS pending}} The request will be handled by a team of volunteers of Open-source Ticket Request System (OTRS).
- They will assign a unique ticket number for the request,
- They will review the request and ask for more evidence if required,
- They will replace the {{OP}} tag with {{OTRS received}} if more evidence is required or with {{PermissionOTRS}} if they are satisfied.
Indexing (in NS:Index)
edit- Step 1:The next step is to create Index pages (A page with Index namespace i.e. Index:) of the file in respective Wikisource
- Step 2: If you have used the {{book}} template, you will see a Wikisource logo at the upper right corner in the file description, which when clicked, will land on your language Wikisource Index page of the file
- Step 3:Fill up the form and save it
For backend proofreaders of Wikisource, Index pages are the main pages to work upon. An Index page will show the links to all the individual pages in the book, the progress of the proofreading status of the book by different color codes, and a quick summary of the text’s details (such as title, author etc).
The parameters of the Index page may vary in each language Wikisource, depending on the variables used in MediaWiki:Proofreadpage index data config and MediaWiki:Proofreadpage index template, but basic parameters like Title, Volume, Author, Translator, Editor, Publisher, Publication year, Cover Image, Progress, Pages and Table of Content should be consistent across all Wikisource projects. If you find these missing, or you want to add more parameters, you can discuss with this your community and ask your project admins.
Below is the short details of the basic parameters, stated above:
Parameters | Explanation |
---|---|
Title | Title of the work, should be wikilinked |
Volume | Volume of the work, if any |
Author | Author of the work, should be wikilinked with Author namespaces |
Translator | Translator of the work, if any, should be wikilinked with Author namespaces |
Editor | Editor of the work, if any |
Publisher | Publisher of the work |
Publication year | Mandatory |
Cover Image | Image of the page to be displayed in the index page (default to page number 1) |
Progress | This shows the progress of proofreading. (see the table below)[1] |
Pages | Get a list of pages by adding . Then index them. |
Table of Content | Add Table of Content, if possible |
- Progress
Parameters | Explanation |
---|---|
Done | Validation completed for each and every pages. |
To be validated | All pages has been proofread, time to validate. |
To be proofread | OCR has been done, time to proofread |
Source file needs an OCR text layer | Book is ok, OCR needs to be done. |
Source file is incorrect | If there is any missing page or unordered page or duplicate page etc. |
Pagelist needed | Create a pagelist. |