Research:First messages to new users

Duration:  2011-07 — 2011-07
Open access: no URL provided
Open data: no URL provided

This page documents a research project in progress.
Information may be incomplete and change as the project progresses.
Please contact the project lead before formally citing or reusing results from this page.


Who is sending messages to new users, and what tools and scripts are they using? This sprint will quantify both, focusing in particular on:

  • First messages sent to new users


Using a live replicating copy of the English Wikipedia database, these first messages can be retrieved in a few basic steps:

  1. Get every distinct user_name from enwiki.user
  2. Join enwiki.user.user_name with the matching page titles to get all user talk pages
  3. Join enwiki.revision.rev_page with the IDs of those pages to get the first revision to each user talk page
  4. Parse rev_comment to determine tool and message type (requires a list of all tools, their names, and the automated edit summaries they leave)
  5. Parse rev_user and user_id to determine metadata
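
The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual query: it uses an in-memory SQLite database as a stand-in for the enwiki replica, and it assumes the standard MediaWiki layout in which user talk pages live in the page table under namespace 3 with underscores in place of spaces in titles. The TOOL_PATTERNS entries, the helper names (classify_tool, first_messages, build_demo_db), the "self_posted" flag as an example of metadata, and all demo rows are hypothetical; a real run would need the full list of tools and their edit summaries.

```python
import re
import sqlite3

USER_TALK_NS = 3  # MediaWiki namespace number for "User talk:"

# Illustrative placeholders only: tool name -> pattern expected in rev_comment.
# A real analysis needs a complete, verified list of tools and summaries.
TOOL_PATTERNS = {
    "Twinkle": re.compile(r"WP:TW\b"),
    "Huggle": re.compile(r"WP:HG\b"),
}

# Steps 1-3: users -> their user talk pages -> earliest revision of each page.
FIRST_MESSAGE_SQL = """
SELECT u.user_id, u.user_name, r.rev_user, r.rev_comment
FROM user AS u
JOIN page AS p
  ON p.page_namespace = ?
 AND p.page_title = REPLACE(u.user_name, ' ', '_')
JOIN revision AS r
  ON r.rev_id = (
       SELECT r2.rev_id FROM revision AS r2
       WHERE r2.rev_page = p.page_id
       ORDER BY r2.rev_timestamp, r2.rev_id
       LIMIT 1
     )
ORDER BY u.user_id
"""


def classify_tool(rev_comment):
    """Step 4: return the first tool whose pattern matches, else 'manual/unknown'."""
    for tool, pattern in TOOL_PATTERNS.items():
        if pattern.search(rev_comment or ""):
            return tool
    return "manual/unknown"


def first_messages(conn):
    """Yield one record per user whose talk page has at least one revision."""
    rows = conn.execute(FIRST_MESSAGE_SQL, (USER_TALK_NS,))
    for user_id, user_name, rev_user, rev_comment in rows:
        yield {
            "user_name": user_name,
            "tool": classify_tool(rev_comment),
            # Step 5 (one possible reading): did the user start their own talk page?
            "self_posted": rev_user == user_id,
        }


def build_demo_db():
    """Create a tiny made-up dataset with the three tables the steps mention."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE user (user_id INTEGER PRIMARY KEY, user_name TEXT);
        CREATE TABLE page (page_id INTEGER PRIMARY KEY,
                           page_namespace INTEGER, page_title TEXT);
        CREATE TABLE revision (rev_id INTEGER PRIMARY KEY, rev_page INTEGER,
                               rev_user INTEGER, rev_comment TEXT,
                               rev_timestamp TEXT);
        INSERT INTO user VALUES (1, 'Alice Example'), (2, 'Bob'), (3, 'Carol');
        INSERT INTO page VALUES (10, 3, 'Alice_Example'), (11, 3, 'Bob'),
                                (12, 0, 'Carol');
        INSERT INTO revision VALUES
          (100, 10, 9, 'Welcome! ([[WP:TW|TW]])', '20110701000000'),
          (101, 10, 1, 'reply', '20110702000000'),
          (102, 11, 2, 'notes to self', '20110703000000'),
          (103, 12, 9, 'article edit', '20110704000000');
    """)
    return conn


if __name__ == "__main__":
    for record in first_messages(build_demo_db()):
        print(record)
```

In the demo data, Carol has no page in the user talk namespace, so she produces no record; users who never received a message drop out of the inner joins in the same way.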

Results and discussion



Future work

Does the tool used and/or the message type predict retention? This is tricky to answer, because users of different tools are selecting recipients from different populations to begin with.