Wikipedia Administrative Pages Analytics/Metrics
Metrics are measurements of aspects of a page. We grouped metrics into different dimensions. Thanks to them, we are able to compute some ratios which can show us when a page may require our attention.
The following are the metrics that are used in the Wikipedia Administrative Pages Analytics website dashboards. You can download the data at this URL: https://wapa.wmcloud.org/databases/wikipedia_administrative_pages_analytics_production.db
Metrics of page quality
editThese three dimensions represent primary aspects of quality: completeness, relevance, and popularity. They also represent goals a page aspires /
- completeness. the state or condition of the page having all the necessary or appropriate parts.
- relevance. the quality or state of the page being closely connected or appropriate.
- popularity. the state or condition of the page being liked, or consumed, by many people.
Each of these dimensions can be represented by one or more metrics.
Each of these dimensions can be represented by one or more metrics.
Completeness metrics:
Number of Bytes, Number of References, Number of Images, Number of Outlinks, Number of WD Properties, Number of WD Identifiers.
Relevance* metrics:
Number of Inlinks, Number of Interwiki, Number of Multilingual Sister Projects.
* Category relevance (only for categories): Number of Level from the top, Number of Categories contains, Number of Pages contains, Number of Pages Admin pages contains.
Popularity metrics:
Number of pageviews.
All these are valuable metrics from different viewpoints. We computed a Pearson correlation using all the metrics of the admin. pages from English Wikipedia. We saw that there was no correlation higher than 0.15. The only correlations showing higher values were those related to edition.
Metrics of Edition
editThe following dimensions represent other aspects by which a page is created and developed: participation, disagreement, engagement, inclusion, recency, regularity, and antiquity.
- activity. the condition in which the page is edited.
- disagreement. lack of consensus or approval.
- engagement. the capacity of drawing editors’ attention to participate.
- inclusion. the action or state of including editors in the edition of the page.
- recency. the quality or state of being recently edited.
- regularity. the state or quality of being regularly edited.
- creation. the moment in which the page was created.
These are secondary dimensions since they do not relate to an inherent aspect of a page, but to its relationship with the editing process and community. Likewise, each of these dimensions can be represented by one or more metrics.
Editing Activity metrics:
Number of Edits, Number of Edits last month, Number of Anonymous Edits, Number of Bot Edits, Active months, Editing days.
Editing Engagement metrics:
Number of Editors.
Editing Inclusion metrics:
Number of admin editors,
Number of Edits last month by newcomer 60d,
Number of Edits last month by newcomer 1y,
Number of Edits last month by newcomer 5y,
Number of Edits last month by admin, Number of Edits last month by anonymous, Median year first edit, Median editors Edits.
Editing Disagreement metrics:
Number of Discussions, Number of Reverts.
Editing Recency metrics:
Days last 50 edits, Days last 5 edits, Days last edit, Date of last discussion, Date of last edit.
Number of edits last month.
Editing Regularity metrics:
Percent active months, Percent active days, Max. Active Months Row, Max Inactive Months Row.
Creation metrics:
Date of creation, Language of the first timestamp, Total months.