User:Jayanta (CIS-A2K)/Midterm report (22-23)
Digitisation support to Maharashtra and Goa projects edit
CIS-A2K is providing continuous support for the digitisation program run at three centres - Vigyan Ashram Pabal, Pune Nagar Vachan Mandir library, Pune and Lek Ladki Abhiyan Training Centre at Satara in Maharashtra. The reviews for quality issues, skill building, future planning and process improvements are conducted regularly. The similar technical support was given to Digitization project of O Bharat in Goa. The digitised volumes are being uploaded on Wikimedia Commons in O Bharat category after processing. On the 75th Independence day, the Chief Minister of Goa inaugurated the digitized volumes of 100-year-old O Bharat newspaper and collection of selected editorial from O Bharat on Wikimedia Commons. The different media houses covered the event in the newspapers. The news mentions the support of CIS-A2K and the Wikimedia Foundation. Other details may be found in project page.
Front page of O Bharat in Portuguese
Front page of O Bharat in Marathi
Gujarati Wikisource Community skill-building workshop edit
What, When, Where, How:
Gujarati Wikisource Community skill-building workshop was conducted by CIS-A2K in collaboration with the Gujarati Wikisource community from 24-26 September 2022. A total of 10 participants attnded this workshop.
After 3 years, the Wikisource community came together to boost their engagement and learned lot of recently developed tools. The visits to explore the digitisation partnerships were also done to develop the future plan.
- Although all the participants were active in Wikisource, they were not aware about a few things regarding the tool, PDF file process and Pagelist Gadget.
- Scanning facility is required for the Guajarati language content.
- Limited PD books are available in Gujarati Wikisource. Presently, only 193 books are available in Gujarati Wikisource.
Indic Wikisource Community Online meetup 12 November 2022 edit
- CIS-A2K conducted the first all India level online Indic Wikisource Proofreadthon on Wikisource to enhance the skills of Indian language Wikisource contributors. for the follow up on the process we have conducted this.
- CIS-A2K conducted the first all India level online Indic Wikisource Proofreadthon on Wikisource to enhance the skills of Indian language Wikisource contributors. This was ten days Online Proofreadthon where the participants contested among their language-specific and all India basis. We have conducted Wikisource Online Webinar May 2020 just after finish this contest for a followup and next future plan. We have covered the transclusion part after the work of Proofread/validation. We share a few future Goal plans for Indic languages Wikisource like below.
- Check the most top visited. It should be as per the Wikisource standard.
- Do not encourage adding pages without a scanned page in the Main namespace.
- Organized local language Wikisource proofreading contests
When and where?
- This online webinar was conducted on 16 May 2020. Total 25 participants attended this webinar.
- Reaction and the response of participants not viewed.
- Internet connectivity
- Webinar software ( like, Google meet, zoom, etc) limitation.
- We have already initiated Indic Wikisource Proofreadthon II in this year.
Indic Wikisource proofread-a-thon November 2022 edit
What, When, Where, How:
CIS-A2K conducted 4th all India level online Indic Wikisource Proofreadthon in November 2022 on Wikisource to enhance the skills, awareness, and encourage to work on Wikisource of Indian language Wikisource contributors. This was fifteen days of Online Proofreadthon in November 2022, where the participants contested among their language-specific an all-India basis. Before start this contest we conducted one online Wikisource meetup on 12 November 2022.
We have 12 Indic language base Wikisource. But, all communities don't have any experience with the online proofreadthon and never organized such events. So CIS-A2K conducted the all-India level online proofreadthon to support the community to grow at the same level as others. But every language community has a different quality standard as per its policies. Most of the communities are at the infant level compared to others with respect to proofreading (MOS of layout and checking the error of OCR output). So this online edit-thon had helped the community to compare with other qualities and rectify to set the proper Wikisource standard.
We announced the same format for this session. Each community selected their books to be proofread by the 92 participants. A total of 162 books were proofread in this contest. A total of 12,241 pages and 3592 pages were proofread and validated respectively. Below is the statistics graph of the contest. In this contest total of 12 Indic Wikisource communities participated.
- Total Proofread 22688
- Total Validate 7465
- Total Participant 113
- Total Book 179
We organized three Wikisource Proofreadthons during lockdown period from year 2020 to 2022. The contest was very healthy and competitive to continue to approach and support the community. This contest was a completely online mode and generated significant contributions to creating their native language resource. The community realised their strength and weakness, what need to do, and what was the drawback. out of 12 Indic Wikisource communities, 10 communities contributed to this contest.
- There is only one tool to generate the stats of this contest. Still, that tool is in the developing stage, this contest tool is called indic-wscontest. And another wscontest tool is not working properly. So it is so difficult to show and compare the running stats and progress of each communities.
- Before the November 2022 edit-thon, The IndicTechTeam developed/improvised a good contest tool that is workable for the contest. But still, there is some issue we are facing.
- The stats are not updated spontaneously or in real-time mode ( as other Wikipedia contest tools like [fountain] work), because Wikisource data schema differs from Wikipedia. There is one phabricator task T173012 is listed to fix this issues.
- Coordination between the 12 language communities is too complex in nature. Every community has unique issues regarding each section, like the selection of books, reviewer action, and maintaining the quality text proofread/validation.
- Maintaining the quality control of Wikisource was a major concern during the contest time.
Plan for second half of program year edit
Minimum 4 activities for Wikisource will be conducted which include -
- One national-level proofread-a-thon
- Three skill-building workshops two are finalised Hindi advance training and Tamil community advance training
- Wikisource of the month with 4 communities: Assamese, Hindi, Punjabi, Tamil