CIS-A2K//Work plan July 2014 - June 2015/Making the Santali Wikipedia Live
Santali is an Indian indigenous language in the Munda subfamily of Austroasiatic languages. Around 6.2 million living in India, Bangladesh, Bhutan and Nepal speak Santali. Majority of its speakers live in India, in the states of Jharkhand, Assam, Bihar, Odisha, Tripura, and West Bengal.  Santali is written in Ol Chiki, Roman and Devanagari script.
About the Project edit
Santali Wikipedia is in incubation since September 2010. Currently Roman script is used as there is no free licensed Ol Chiki font available . There are totally 30 articles contributed by 19 editors over time. This project aims to make Santali Wikipedia into a live project. Towards this CIS-A2K will undertake two major activities. First, it will address the technical challenges facing the Santali script and will work towards bringing out a free Ol Chiki unicode font. Second, it will cultivate a small group of enthusiastic Santali editors who could contribute regularly to the Santali Wikipedia project. The project will be based at Kalinga Institute of Social Sciences (KISS) which is home to about 20,000 tribal students out of which about 1,000 students speak Santali as a native language.
Work done so far edit
- Preliminary interactions with Santali speaking students at KISS to know their knowledge in reading and writing in Santali using Ol Chiki.
- Discussion with Google webfont project about free font project.
- Preliminary survey on potential resources (Textbooks, other literary books in Santali).
- Native speakers of Santali (students at KISS for this project) are conversant but do not have formal reading and writing skills in Santali.
- KISS library currently has hardly any book written in Ol Chiki script.
- Santali Wikipedia incubation project has articles in only Roman script. None of the current operating systems have Ol Chiki support.
- Though there are few fonts created by the Wesantals project there is no free licensed Unicode font. So, webfont support can not be provided.
Implementation Plan edit
Development of Free licensed Unicode Ol Chiki font edit
To facilitate a project for developing a free licensed Unicode Ol Chiki font.
- Facilitating development of an OFL Ol Chiki font.
- Testing of the font by native speakers and getting bugs fixed.
- Liaison with Wikimedia Foundation's Language Engineering team for webfont support.
- One free licensed Unicode Ol Chiki font to be made available.
- Making available one free licensed Ol Chiki Webfont on ULS.
Reviving Santali Wikipedia incubator project edit
To cultivate a community of enthusiastic Santali Wikipedia editors at KISS.
- Creating a group of faculty and students to contribute to Santali Wikipedia
- Organize training workshops. One per month.
- Finding educational resources in Santali
- Regular mentoring and support
- Santali Wikipedia training to about 80-100 Santali students of which 10-15 could turn in to active contributors
- To result in 100 new articles on Santali Wikipedia
|Expenditure Item||FDC Support (INR)||FDC Support (US$)||Other Sources and in kind Support (INR)||Other Sources and in kind Support (US$)|
|1 PO x 10% *||82,560/-||1,335.89||-||-|
|Travel and Stay **||-||-||100,000/-||1,618.08|
|Research and typeface design||100,000/-||1,618.08||100,000/-||1,618.08|
* 10% of Subhashish Panigrahi's time as Program Officer, who will anchor this plan at CIS-A2K
** Expenses incurred by the CIS-A2K program team towards executing the plan.
- "Santali: A Language of India". Ethnologue: Languages of the World. SIL International. Retrieved 16 January 2012.
- Analysis of editor activity of Santali Wikipedia incubator
- Indigenous Languages Rising: Digital Activism, Research and Training
- Meijssen, Gerard. “The #Santali language and the Ol Chiki script II”.