Schema talk:CommonsCategoryTreeUse
Maintainer: | Dschwen |
---|---|
Please specify the team maintaining this schema. | |
Project: | Commons |
Status: | in development |
Please specify the purge schedule. |
This page holds a JSON schema that specifies a data model for EventLogging.
Question
edit- How are anonymous users using the commons category tree to find images, compared to logged in users. Is the category tree being used to discover images?
- The proposed schema should emit events on page view and on category link clicks. The event data should contain the log in status (logged in/not logged in), and the current namespace number.
Analysis
editThe following analysis on the dataset would be performed:
- Category page visitation frequency compared to image page visitation frequency for logged in and logged out users.
- How much relative "time" is each group spending the category namespace? This could indicate if categories are a significant path for the discovery of images (as opposed to direct jumps to image pages from internal/external search).
- Category link click rates in category and image namespaces.
- These metrics (again for each logged in and logged out users) would indicate if the category tree is actively browsed (rather than stumble upon).
- Category link clicks in the image namespace are an indicator for the effectiveness of categories to find similar content.
- Category link clicks in category namespace are an indicator for browsing the category tree to find specific content
Rationale
editMotivator for this study is finding out the significance of the category tree in content discovery on wikimedia commons. This directly impacts decisions for gadget default deployment, such as the FastCCI Gadget which would benefit anonymous users (if the category tree is a significant funnel for content discovery. The schema is designed to collect a minimum amount of data in a maximally anonymized way.
The data to be logged should be considered inexpensive (standard identifiers isAnaon and pageNS in the schema). I have no clue how the link click action will be logged, but determining the namespace from the link target should be rather trivial (using mw.title
for example).