Research talk:Teahouse long term new editor retention/Work log/2015-11-25
Wednesday, November 25, 2015
editToday, I'm working with instances of <!-- Template: ... -->
found on talk pages. This is commonly used when a template is substituted onto the page to aid in tracking which templates are being used. What I'd really like to do is build list of those templates that correspond to warnings so that we can see the rise of warnings in Wikipedia.
First, let's look at the most common templates:
> month_templates[,list(postings = sum(postings)), by=template][order(postings, decreasing=T),][1:100] template postings 1: template:uw-vandalism1 7598768 2: template:unsigned 5232148 3: template:uw-cluebotwarning1 5176690 4: template:uw-vandalism2 3554289 5: template:uw-huggle1 3038861 6: template:db-csd-notice-custom 2976716 7: template:uw-vandalism3 2467730 8: template:unsignedip 2021922 9: template:uw-huggle2 1772924 10: template:uw-vandalism4 1729712 11: template:shared ip advice 1715998 12: template:welcome 1537006 13: template:unsigned ip 1358353 14: template:uw-cluebotwarning2 1351987 15: template:orphaned 1338576 16: template:uw-huggle3 1151319 17: template:di-orphaned fair use-notice 939837 18: template:undated 921745 19: template:uw-huggle4 869519 20: template:welcomeg 868722 21: template:no fair 831747 22: template:uw-test1 795100 23: template:prodwarning 790365 24: template:first article 786359 25: template:uw-delete1 776057 26: template:ani-notice 770452 27: template:uw-cluebotwarning3 764645 28: template:afc decline 693823 29: template:db-notability-notice 641948 30: template:uw-3rr 639090 31: template:afdwarning 608508 32: template:welcome-anon 599767 33: template:idw 599545 34: template:uw-vblock 590827 35: template:di-no fair use rationale-notice 589424 36: template:test5 583995 37: template:uw-unsourced1 580988 38: template:uw-block1 564287 39: template:smile 560905 40: template:uw-delete2 549592 41: template:proposed deletion notify 532922 42: template:dykproblem 531822 43: template:uw-cluebotwarning4 531352 44: template:adw 498872<nowiki><!-- 45: template:uw-vandalism4im 489806 46: template:tfdnotice 487919 47: template:nn-warn 483352 48: template:uw-block 480144 49: template:di-no source-notice 479116 50: template:firstarticle 478585 51: template:afd-notice 476222 52: template:frs message 424695 53: template:uw-spam1 421547 54: template:uw-test2 413546 55: template:db-bio-notice 413490 56: template:db-spam-notice 411424 57: template:updateddyknom 393143 58: template:uw-tilde 382693 59: template:di-replaceable fair use-notice 370515 60: template:missing rationale 367146 61: template:fdw 335226 62: template:uw-unsourced2 328842 63: template:teahouse hostbot invitation 309877 64: template:uw-delete3 306730 65: template:cfd-notify 304796 66: template:uw-block2 302805 67: template:updateddyk 295040 68: template:di-no license-notice 285162 69: template:db-nocontext-notice 282275 70: template:di-no permission-notice 278691 71: template:archivebox 278532 72: template:you can request undeletion 272489 73: template:prodwarningblp 267398 74: template:db-copyvio-notice 261870 75: template:reviewer-notice 261439 76: template:idw-pui 259508 77: template:welcomemenu 254433 78: template:uw-editsummary 239236 79: template:tilde 234289 80: template:uw-coi 230890 81: template:uw-huggletest1 225330 82: template:archivebox ends 218960 83: template:archivebox begins 216279 84: template:db-vandalism-notice 209334 85: template:uw-ablock 204101 86: template:uw-error1 201245 87: template:db-afc-notice 200222 88: template:uw-copyright 197881 89: template:drn-notice 197678 90: template:fdw-puf 191015 91: template:uw-huggledelete1 190227 92: template:image source 189983 93: template:uw-spam0 184244 94: template:mfdwarning 183830 95: template:db-nocontent-notice 181716 96: template:uw-test3 178027 97: template:uw-spam2 175015 98: template:afc talk 171398 99: template:teahouse afc invitaiton 169603 100: template:blatantvandal-n 168648
OK. Looks like we have quite a few to review! First let's try to gather the obvious warnings.
> common_templates[regexpr("^template:uw-", template) != -1,] template postings 1: template:uw-vandalism1 7598768 2: template:uw-cluebotwarning1 5176690 3: template:uw-vandalism2 3554289 4: template:uw-huggle1 3038861 5: template:uw-vandalism3 2467730 --- 427: template:uw-ew4 108 428: template:uw-attempt4 105 429: template:uw-notenglish-fr 102 430: template:uw-deletionpolicy1 101 431: template:uw-tempabuse4 101
OK. Looks like there's a lot of those. I saw some 3rr and "warning" stuff in there too.
> common_templates[regexpr("^template:(uw-|3rr|drm|.*warning.*)", template) != -1,] template postings 1: template:uw-vandalism1 7598768 2: template:uw-cluebotwarning1 5176690 3: template:uw-vandalism2 3554289 4: template:uw-huggle1 3038861 5: template:uw-vandalism3 2467730 --- 467: template:drmspeedy3-n 106 468: template:uw-attempt4 105 469: template:uw-notenglish-fr 102 470: template:uw-deletionpolicy1 101 471: template:uw-tempabuse4 101
OK. Cool. Now lets group those together by month and make some plots!
We can see the steep rise in warning template postings in 2006. It's interesting how the warning postings are periodic. The values seem to conform to the summer months -- when kids are not in school (and outside rather than vandalizing wikipedia??). The teahouse template postings appear to begin in 2013, but I know that this is merely when the template began to be flagged since I made the edit that added the <!-- ... -->
to the template content in January of 2013 (see en:Special:Diff/532920543). --Halfak (WMF) (talk) 18:28, 25 November 2015 (UTC)