Grants:Project/0x010C/LinguaLibre/Midpoint


Report under review
This Project Grant midpoint report has been submitted by the grantee, and is currently being reviewed by WMF staff. You may add comments, responses, or questions to this report's discussion page.


Welcome to this project's midpoint report! This report shares progress and learning from the grantee's first 3 months.

Summary edit

  • The core elements are developed and working well;
  • I'm on time, no delays are expected;
  • Finances are ok.

Methods and activities edit

As of today (7 May 2018), that's the progress of Lingua Libre's development.

Description Estimated duration
(in day)
Progression
Develop a new recording Extension for MediaWiki (RecordWizard)
1 - Take back and adapt the existing Lingua Libre's recording studio 4
DONE
2 - Change the interface to use OOjs-ui 7
DONE
3 - Turn it into a MediaWiki extension 14
85%
4 - Improve the recording studio with alternative recording methods 9
40%
5 - I18n 2
50%
6 - Develop a guided tour (using the GuidedTour extension) to help newcomers 5
/
Total 41 d
Migrate Lingua Libre on a wiki-based architecture
7 - Setup of a dedicated MediaWiki and Wikibase instance, with the newly-created recording extension 3
60%
8 - Use OAuth for the authentication 5
DONE
9 - Define the RDF Schema 4
DONE
10 - Create a dedicated MediaWiki skin 12
25%
11 - Develop specific on-wiki JS scripts to facilitate the navigation and the modification of items 18
40%
12 - Initialize the wiki with all the necessary basic wikibase properties and items 2
50%
13 - Import all our existing sound records in the new database 4
/
Total 48 d
Reuse on Wikimedia wikis
14 - Setting up OAuth to allow uploading sounds to Commons 10
90%
15 - Develop bot-tools to add the uploaded sounds to articles on wikis that asked for (currently the French Wiktionary) 15
/
16 - Contact other communities to extend the reuse of these sounds 4
/
Total 29 d
Data exploration and visualization
17 - Setup a SPARQL endpoint 10
90%
18 - Create turnkey SPARQL queries 7
80%
Total 17 d
Total project 135 d
60%

Midpoint outcomes edit

All the core and blocking elements of this second version of Lingua Libre are developped. They arn't in their final state, but they have already all the key features and are usable:

During the past moths, I've created several MediaWiki extesions and libraries, available at the moment on the following git repositories:

  • LinguaRecorder (a powerfull JS library to manage audio recording (intelligent cutting, saturation control, various export options,...));
  • RecordWizard (a Mediawiki extension used to record audio pronunciation, the core of Lingua Libre);
  • Upload2Commons (a MediaWiki extension adding an API module, to upload files stored localy on a wiki to Wikimedia Commons through OAuth);
  • QueryViz (a MediaWiki extension adding a <query> tag to display sparql queries results inside wiki pages).


They are all used and testable on the new MediaWiki instance, which will be LinguaLibre V2: https://v2.lingualibre.fr.

Furthermore, a SPARQL endpoint is available to query the records at https://sparql.0x010c.fr/bigdata/#query.


Finances edit

Funds have been spend according to plan so far. There is no major change to budget so far.

Due to personal reason, it may be hard for me to organise/attend to presentations and/or workshops in the second half of this grant period. So the 600€ of the budget line 20 - Presentations and workshops across France focused on local languages (Breton, Alemannic, Franco-Provençal) may not be needed (or not the complete amount); I will keep you informed as soon as I can.

Learning edit

What are the challenges edit

  • Working with undocumented (or badly documented) codes;
  • Even if it's outside of the grant itself, I've a bit underestimated the project management and all the paperwork that come with the project.

What is working well edit

Next steps and opportunities edit

  • The main focus now is on the UI/UX part, to make these features the easiest to use possible;
  • If all goes well, a public release should be published just before the Wikimedia Hackathon (arround the 16 May 2018);
  • After that happens, communication and outreach to the communities will gradually start again;
  • And begin to develop with them the bot tools to allow file usage on Wiktionaries/Wikipedias/Wikidata.