Research:GCI Wiki Study/2018/introduction

As one of the many editors of the English Wikipedia, I have a number of questions about wiki statistics, and I have turned them into a formal study. You can help me by answering these questions!

Instructions

Information

 
Sample graph: the average ratio of thanks received to edit count, by editor percentile.
  • Basic to intermediate SQL skills are needed to perform these tasks.
  • The goal is to have, for each question, an SQL query and its result formatted as a table. Finally, the data must be presented visually as a graph.
  • We will work on the English Wikipedia (whose database is enwiki_p), so it should be fairly simple for anyone interested to run these queries on other wikis.

Tools used

"Replicas" is the common name for the read-only SQL database servers that allow anybody to run queries against publicly available wiki data. For example, all page edits are available in the revision table.
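To give an idea of the kind of query involved, here is a minimal sketch that can be run locally without access to the replicas. It uses Python's built-in sqlite3 module and a tiny made-up table named revision. The column names (rev_id, rev_user_text, rev_timestamp) and the sample rows are assumptions for illustration only; the live enwiki_p schema has more columns and may differ, so check the schema documentation before adapting the query for Quarry.

```python
import sqlite3

# Toy stand-in for a replica database. The schema below is an assumption
# made for illustration; it is NOT the full enwiki_p revision table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE revision ("
    "rev_id INTEGER PRIMARY KEY, rev_user_text TEXT, rev_timestamp TEXT)"
)

# Made-up rows. Timestamps use the 14-character YYYYMMDDHHMMSS form,
# which sorts correctly when compared as text.
conn.executemany(
    "INSERT INTO revision VALUES (?, ?, ?)",
    [
        (1, "Alice", "20180105120000"),
        (2, "Bob", "20180211093000"),
        (3, "Alice", "20180320080000"),
        (4, "Alice", "20171231235959"),
    ],
)

# Count edits per editor since the start of 2018: a date filter,
# a grouping by user, and an ordering for a readable result table.
QUERY = """
SELECT rev_user_text AS editor, COUNT(*) AS edits
FROM revision
WHERE rev_timestamp >= '20180101000000'
GROUP BY rev_user_text
ORDER BY edits DESC, editor
"""

rows = conn.execute(QUERY).fetchall()
for editor, edits in rows:
    print(editor, edits)
```

The same SELECT ... WHERE ... GROUP BY shape is a reasonable starting point for many of the study questions, with the filter and grouping changed to match the question asked.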

Quarry is a simple web interface that can be used to execute queries on the replicas. The data is returned in tables and can be exported in various formats. Don't hesitate to use the links in the site header; they explain how to navigate the replicas.

General instructions

Here are step-by-step instructions. Please follow them carefully.

  1. Claim one of the available tasks on the GCI website.
  2. Try to understand the question: think about what you'll need to look at (revisions? log entries?) and which filters will be needed (by date? by user?).
  3. Create a Wikipedia/Wikimedia account if you don't have one yet.
  4. Log in to the Quarry website and fork this query sample (the Fork button is to the right of the title).
  5. Try to understand what this query sample does, then edit and run it until you get the expected result. That's the most important part of the work.
  6. Make sure that the data in the query result will let you answer the question, in words or with a graph.
  7. Once you have the data you want, download the exported file as CSV from Quarry and open it in your favorite spreadsheet app. You can use Excel, LibreOffice, or even Google Sheets.
  8. Create a graph that visually answers the question. Pay attention to details such as correct legends, a clear title, and attractive colors. You are free to create one that speaks to you.
  9. Finally, write a few sentences in English explaining what the table and the graph show, how to read them, and what needs to be known to understand them correctly.
  10. Export the graph in SVG or PNG format. Then upload it to Wikimedia Commons, the Wikimedia media file repository, with an appropriate title and description. Please add the category GCI Wiki Study 2018 charts when asked. If you can't export it properly, as a last resort you can take a screenshot showing the graph and then crop the file with appropriate software (Paint is sufficient).
  11. As the last step, add a new section to this page (button at the top right) containing the question, the SQL query, the table, the graph, and your description, if possible in this order.
    To render your SQL query properly, use the <source lang="sql">your query here</source> syntax.
    To insert the table, Quarry can export it in the right format for you. Copy and paste the Help:Wikicode export into the wiki page.
    To insert an image (the chart uploaded in the previous step), have a look at the documentation.
  12. Finished! Before submitting the task for review on the GCI site, please verify that you have followed all the instructions.
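For those who prefer to prepare the wiki markup offline, the formatting part of step 11 can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names wrap_query and csv_to_wikitable are hypothetical, the sample CSV is made up, and Quarry's own wikitable export remains the simpler route when it is available.

```python
import csv
import io


def wrap_query(query):
    """Wrap an SQL query in the <source> tag mentioned in step 11."""
    return '<source lang="sql">\n' + query.strip() + "\n</source>"


def csv_to_wikitable(csv_text):
    """Turn CSV text (header row first) into basic wikitable markup."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, body = rows[0], rows[1:]
    lines = ['{| class="wikitable"', "! " + " !! ".join(header)]
    for row in body:
        lines.append("|-")  # row separator
        lines.append("| " + " || ".join(row))
    lines.append("|}")
    return "\n".join(lines)


# Made-up sample data standing in for a CSV file downloaded from Quarry.
sample_csv = "editor,edits\nAlice,2\nBob,1\n"
sample_query = "SELECT 1;"

print(wrap_query(sample_query))
print(csv_to_wikitable(sample_csv))
```

Note that this sketch does not escape cell values containing the | character, which would break the table layout; such values would need to be handled before pasting the result into the page.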

Throughout this short process, don't hesitate to ask for help on the GCI site. You can also chat with me on IRC by contacting Framawiki on #wikimedia-techconnect. If I don't answer within a few minutes, it is because I'm away from my computer, so feel free to ask your question in the channel; other mentors will probably be there to help you.