Research talk:Teahouse long term new editor retention/Work log/2015-11-25

Wednesday, November 25, 2015

Today, I'm working with instances of <!-- Template: ... --> found on talk pages. This comment is commonly left behind when a template is substituted onto a page, which helps in tracking which templates are being used. What I'd really like to do is build a list of those templates that correspond to warnings so that we can see the rise of warnings in Wikipedia.
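As a rough illustration of what "finding instances" of these comments could look like, here is a minimal base R sketch. The function name `extract_templates` is hypothetical and is not the code used for this analysis. The regular expression and the choice to lowercase names with a `template:` prefix are assumptions made to mirror the names in the tables in this log.

```r
# Hypothetical helper (not the extraction code used for this log): pull
# template names out of substitution-tracking comments such as
# "<!-- Template:uw-vandalism1 -->" in a talk page's wikitext.
extract_templates <- function(wikitext) {
  # Assumed marker shape: "<!--", optional spaces, "Template:", a name, "-->"
  pattern <- "<!--\\s*Template:\\s*([^>]*?)\\s*-->"
  hits <- regmatches(
    wikitext,
    gregexpr(pattern, wikitext, perl = TRUE, ignore.case = TRUE)
  )[[1]]
  # paste0() would turn an empty result into "template:", so return early
  if (length(hits) == 0) {
    return(character(0))
  }
  # Keep only the captured template name from each full match
  template_names <- sub(pattern, "\\1", hits, perl = TRUE, ignore.case = TRUE)
  # Normalize to the lowercase "template:<name>" form (an assumption)
  tolower(paste0("template:", template_names))
}

example_text <- paste(
  "Please stop.<!-- Template:uw-vandalism1 -->",
  "Welcome!<!-- Template:Welcome -->"
)
print(extract_templates(example_text))
```

Running this over every revision of every talk page is a much bigger job than this sketch suggests; it only shows the pattern matching step.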

First, let's look at the most common templates:

> month_templates[,list(postings = sum(postings)), by=template][order(postings, decreasing=T),][1:100]
                                     template postings
  1:                   template:uw-vandalism1  7598768
  2:                        template:unsigned  5232148
  3:              template:uw-cluebotwarning1  5176690
  4:                   template:uw-vandalism2  3554289
  5:                      template:uw-huggle1  3038861
  6:            template:db-csd-notice-custom  2976716
  7:                   template:uw-vandalism3  2467730
  8:                      template:unsignedip  2021922
  9:                      template:uw-huggle2  1772924
 10:                   template:uw-vandalism4  1729712
 11:                template:shared ip advice  1715998
 12:                         template:welcome  1537006
 13:                     template:unsigned ip  1358353
 14:              template:uw-cluebotwarning2  1351987
 15:                        template:orphaned  1338576
 16:                      template:uw-huggle3  1151319
 17:     template:di-orphaned fair use-notice   939837
 18:                         template:undated   921745
 19:                      template:uw-huggle4   869519
 20:                        template:welcomeg   868722
 21:                         template:no fair   831747
 22:                        template:uw-test1   795100
 23:                     template:prodwarning   790365
 24:                   template:first article   786359
 25:                      template:uw-delete1   776057
 26:                      template:ani-notice   770452
 27:              template:uw-cluebotwarning3   764645
 28:                     template:afc decline   693823
 29:            template:db-notability-notice   641948
 30:                          template:uw-3rr   639090
 31:                      template:afdwarning   608508
 32:                    template:welcome-anon   599767
 33:                             template:idw   599545
 34:                       template:uw-vblock   590827
 35: template:di-no fair use rationale-notice   589424
 36:                           template:test5   583995
 37:                   template:uw-unsourced1   580988
 38:                       template:uw-block1   564287
 39:                           template:smile   560905
 40:                      template:uw-delete2   549592
 41:        template:proposed deletion notify   532922
 42:                      template:dykproblem   531822
 43:              template:uw-cluebotwarning4   531352
 44:                             template:adw   498872
 45:                 template:uw-vandalism4im   489806
 46:                       template:tfdnotice   487919
 47:                         template:nn-warn   483352
 48:                        template:uw-block   480144
 49:             template:di-no source-notice   479116
 50:                    template:firstarticle   478585
 51:                      template:afd-notice   476222
 52:                     template:frs message   424695
 53:                        template:uw-spam1   421547
 54:                        template:uw-test2   413546
 55:                   template:db-bio-notice   413490
 56:                  template:db-spam-notice   411424
 57:                   template:updateddyknom   393143
 58:                        template:uw-tilde   382693
 59:  template:di-replaceable fair use-notice   370515
 60:               template:missing rationale   367146
 61:                             template:fdw   335226
 62:                   template:uw-unsourced2   328842
 63:     template:teahouse hostbot invitation   309877
 64:                      template:uw-delete3   306730
 65:                      template:cfd-notify   304796
 66:                       template:uw-block2   302805
 67:                      template:updateddyk   295040
 68:            template:di-no license-notice   285162
 69:             template:db-nocontext-notice   282275
 70:         template:di-no permission-notice   278691
 71:                      template:archivebox   278532
 72:      template:you can request undeletion   272489
 73:                  template:prodwarningblp   267398
 74:               template:db-copyvio-notice   261870
 75:                 template:reviewer-notice   261439
 76:                         template:idw-pui   259508
 77:                     template:welcomemenu   254433
 78:                  template:uw-editsummary   239236
 79:                           template:tilde   234289
 80:                          template:uw-coi   230890
 81:                  template:uw-huggletest1   225330
 82:                 template:archivebox ends   218960
 83:               template:archivebox begins   216279
 84:             template:db-vandalism-notice   209334
 85:                       template:uw-ablock   204101
 86:                       template:uw-error1   201245
 87:                   template:db-afc-notice   200222
 88:                    template:uw-copyright   197881
 89:                      template:drn-notice   197678
 90:                         template:fdw-puf   191015
 91:                template:uw-huggledelete1   190227
 92:                    template:image source   189983
 93:                        template:uw-spam0   184244
 94:                      template:mfdwarning   183830
 95:             template:db-nocontent-notice   181716
 96:                        template:uw-test3   178027
 97:                        template:uw-spam2   175015
 98:                        template:afc talk   171398
 99:         template:teahouse afc invitaiton   169603
100:                 template:blatantvandal-n   168648

OK. Looks like we have quite a few to review! First let's try to gather the obvious warnings.

> common_templates[regexpr("^template:uw-", template) != -1,]
                        template postings
  1:      template:uw-vandalism1  7598768
  2: template:uw-cluebotwarning1  5176690
  3:      template:uw-vandalism2  3554289
  4:         template:uw-huggle1  3038861
  5:      template:uw-vandalism3  2467730
 ---                                     
427:             template:uw-ew4      108
428:        template:uw-attempt4      105
429:   template:uw-notenglish-fr      102
430: template:uw-deletionpolicy1      101
431:      template:uw-tempabuse4      101

OK. Looks like there are a lot of those. I saw some 3rr, drm and "warning" templates in there too.

> common_templates[regexpr("^template:(uw-|3rr|drm|.*warning.*)", template) != -1,]
                        template postings
  1:      template:uw-vandalism1  7598768
  2: template:uw-cluebotwarning1  5176690
  3:      template:uw-vandalism2  3554289
  4:         template:uw-huggle1  3038861
  5:      template:uw-vandalism3  2467730
 ---                                     
467:       template:drmspeedy3-n      106
468:        template:uw-attempt4      105
469:   template:uw-notenglish-fr      102
470: template:uw-deletionpolicy1      101
471:      template:uw-tempabuse4      101

OK. Cool. Now let's group those together by month and make some plots!
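The grouping step could be sketched roughly like this in base R. This is not the code behind the plots: the toy `month_templates` data frame, its `month` column name and all of its numbers are made up for illustration. Only the regular expression is taken from the query above.

```r
# Toy stand-in for month_templates. The `month` column name and every value
# here are assumptions for illustration, not real data.
month_templates <- data.frame(
  month = c("2006-01", "2006-01", "2006-01", "2006-02", "2006-02"),
  template = c("template:uw-vandalism1", "template:welcome",
               "template:afdwarning", "template:uw-test1",
               "template:unsigned"),
  postings = c(10, 4, 3, 7, 5),
  stringsAsFactors = FALSE
)

# Same pattern as the query above: uw-*, 3rr*, drm* and anything with "warning"
warning_pattern <- "^template:(uw-|3rr|drm|.*warning.*)"
is_warning <- grepl(warning_pattern, month_templates$template)

# Sum warning postings within each month
monthly_warnings <- aggregate(
  postings ~ month,
  data = month_templates[is_warning, ],
  FUN = sum
)
print(monthly_warnings)
```

From there, something like `plot(as.Date(paste0(monthly_warnings$month, "-01")), monthly_warnings$postings, type = "l")` would give a raw monthly count line of the kind described in the captions below.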


 
Warnings (templates). A raw count of warning template messages posted is plotted by month for the English Wikipedia.
 
Teahouse invites (templates). A raw count of teahouse invitation template messages posted is plotted by month for the English Wikipedia.

We can see the steep rise in warning template postings in 2006. It's interesting how the warning postings are periodic. The cycle seems to line up with the summer months -- when kids are not in school (and outside rather than vandalizing Wikipedia??). The teahouse template postings appear to begin in 2013, but I know that this is merely when the template began to be flagged, since I made the edit that added the <!-- ... --> to the template content in January of 2013 (see en:Special:Diff/532920543). --Halfak (WMF) (talk) 18:28, 25 November 2015 (UTC)