Research:Developing Metrics for Content Gaps (Knowledge Gaps Taxonomy)/Literature

The following research papers were reviewed during the course of this project. They are organized by the gap they relate to: gender, sexual orientation, geography, cultural context content, and time.

Gender Gap

  • Reagle, J., & Rhue, L. (2011). Gender bias in Wikipedia and Britannica. International Journal of Communication, 5, 21.
  • Gloor, P. A., Marcos, J., de Boer, P. M., Fuehres, H., Lo, W., & Nemoto, K. (2015). Cultural anthropology through the lens of Wikipedia: Historical leader networks, gender bias, and news-based sentiment. arXiv preprint arXiv:1508.00055.
  • Graells-Garrido, E., Lalmas, M., & Menczer, F. (2015, August). First women, second sex: Gender bias in Wikipedia. In Proceedings of the 26th ACM Conference on Hypertext & Social Media (pp. 165-174).
  • Klein, M., & Konieczny, P. (2015). Gender gap through time and space: A journey through Wikipedia biographies and the "WIGI" index. arXiv preprint arXiv:1502.03086.
  • Klein, M., & Konieczny, P. (2015, August). Wikipedia in the world of global gender inequality indices: What the biography gender gap is measuring. In Proceedings of the 11th International Symposium on Open Collaboration (pp. 1-2).
  • Wagner, C., Garcia, D., Jadidi, M., & Strohmaier, M. (2015, April). It's a man's Wikipedia? Assessing gender inequality in an online encyclopedia. In Proceedings of the International AAAI Conference on Web and Social Media (Vol. 9, No. 1).
  • Matias, J. N., Diehl, S., & Zuckerman, E. (2015, April). Passing on: Reader-sourcing gender diversity in Wikipedia. In Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems (pp. 1073-1078).
  • Wagner, C., Graells-Garrido, E., Garcia, D., & Menczer, F. (2016). Women through the glass ceiling: gender asymmetries in Wikipedia. EPJ Data Science, 5, 1-24.
  • Young, A., Wigdor, A. D., & Kane, G. (2016). It’s not what you think: Gender bias in information about Fortune 1000 CEOs on Wikipedia.
  • Moisset, I. (2017). Cien arquitectas en Wikipedia. Dearq, (20), 20–27. http://doi.org/10.18389/dearq20.2017.02
  • Zagovora, O., Flöck, F., & Wagner, C. (2017, June). "(Weitergeleitet von Journalistin)" The Gendered Presentation of Professions on Wikipedia. In Proceedings of the 2017 ACM on Web Science Conference (pp. 83-92).
  • White, A. (2018). The history of women in engineering on Wikipedia. Science Museum Group Journal, 10(10).
  • Adams, J., Brückner, H., & Naslund, C. (2019). Who Counts as a Notable Sociologist on Wikipedia? Gender, Race, and the “Professor Test.” Socius: Sociological Research for a Dynamic World, 5(5), 237802311882394–14. http://doi.org/10.1177/2378023118823946
  • Weijand, S. (2019). Automated Gender Classification in Wikipedia Biographies, 1–60.
  • Marinina, A. (2019). Overrepresentation of the Underrepresented: Gender Bias in Wikipedia, 1–37.
  • Schmahl, K. G., Viering, T. J., et al. (2020). Is Wikipedia succeeding in reducing gender bias? Assessing changes in gender bias in Wikipedia using word embeddings. Aclweb.org. http://doi.org/10.18653/v1/P17
  • Beytía, P., & Wagner, C. (2020). Visibility Layers: A Framework for Facing the Complexity of the Gender Gap in Wikipedia Content.
  • Pradel, F. (2020). Biased Representation of Politicians in Google and Wikipedia Search? The Joint Effect of Party Identity, Gender Identity and Elections. Political Communication, 00(00), 1–32. http://doi.org/10.1080/10584609.2020.1793846
  • Worku, Z., Bipat, T., McDonald, D. W., & Zachry, M. (2020). Exploring Systematic Bias through Article Deletions on Wikipedia from a Behavioral Perspective (pp. 1–22). Presented at the OpenSym 2020: 16th International Symposium on Open Collaboration, New York, NY, USA: ACM. http://doi.org/10.1145/3412569.3412573
  • Young, A. G., Wigdor, A. D., & Kane, G. C. (2020). The Gender Bias Tug-of-War in a Co-creation Community: Core-Periphery Tension on Wikipedia. Journal of Management Information Systems, 37(4), 1047–1072. http://doi.org/10.1080/07421222.2020.1831773
  • Langrock, I., & González-Bailón, S. (2020). The Gender Divide in Wikipedia, 1–36.
  • Field, A., Park, C. Y., & Tsvetkov, Y. (2020, December 31). Controlled Analyses of Social Biases in Wikipedia Bios. arXiv.org.

Sexual Orientation Gap

  • Wexelbaum, R. S., Herzog, K., & Rasberry, L. (2015). Queering Wikipedia.
  • Warncke-Wang, M., Ranjan, V., Terveen, L., & Hecht, B. (2015, April). Misalignment between supply and demand of quality content in peer production communities. In Proceedings of the International AAAI Conference on Web and Social Media (Vol. 9, No. 1).
  • Roued-Cunliffe, H., & Copeland, A. (2017). Forgotten history on Wikipedia. Participatory Heritage, 67-76.
  • Nagy, S., & Borgos, A. (2017). The efforts and plans of Hungarian LGBTQ archives.
  • Damas, C. A., & Mochetti, K. (2019, February). An analysis of homophobia on vandalism at Wikipedia. In 2019 Research on Equity and Sustained Participation in Engineering, Computing, and Technology (RESPECT) (pp. 1-2). IEEE.
  • Wexelbaum, R. (2019). Coming Out of the Closet: Librarian Advocacy to Advance LGBTQ+ Wikipedia Engagement. In LGBTQ+ Librarianship in the 21st Century: Emerging Directions of Advocacy and Community Engagement in Diverse Information Environments. Emerald Publishing Limited.
  • Park, C. Y., Yan, X., Field, A., & Tsvetkov, Y. (2020). Multilingual Contextual Affective Analysis of LGBT People Portrayals in Wikipedia. arXiv preprint arXiv:2010.10820.

Geography Gap

  • Lim, E. P., Wang, Z., Sadeli, D., Li, Y., Chang, C. H., Chatterjea, K., ... & Sun, A. (2006, November). Integration of Wikipedia and a geography digital library. In International Conference on Asian Digital Libraries (pp. 449-458). Springer, Berlin, Heidelberg.
  • Hecht B, Gergle D (2010) The tower of Babel meets web 2.0: user-generated content and its applications in a multilingual context. ACM, New York, New York, USA, pp 291–300
  • Ngo, Q. H., Doan, S., & Winiwarter, W. (2012). Using Wikipedia for extracting hierarchy and building geo‐ontology. International journal of Web information systems.
  • Hecht, B., & Stephens, M. (2014, May). A tale of cities: Urban biases in volunteered geographic information. In Proceedings of the International AAAI Conference on Web and Social Media (Vol. 8, No. 1).
  • Graham, M., Hogan, B., Straumann, R. K., & Medhat, A. (2014). Uneven geographies of user-generated information: Patterns of increasing informational poverty. Annals of the Association of American Geographers, 104(4), 746-764.
  • Sen, S. W., Ford, H., Musicant, D. R., Graham, M., Keyes, O. S., & Hecht, B. (2015, April). Barriers to the localness of volunteered geographic information. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (pp. 197-206).
  • Graham, M., Straumann, R. K., & Hogan, B. (2015). Digital Divisions of Labor and Informational Magnetism: Mapping Participation in Wikipedia. Annals of the Association of American Geographers, 105(6), 1158–1178. http://doi.org/10.1080/00045608.2015.1072791
  • Johnson, I., Lin, Y., Li, T. J.-J., Hall, A., & Halfaker, A. (2016). Not at Home on the Range: Peer Production and the Urban/Rural Divide. http://doi.org/10.1145/2858036.2858123
  • Sato, S., Yonezawa, T., Nakazawa, J., Kawasaki, S., Ohta, K., Inamura, H., & Tokuda, H. (2016). City Happenings into Wikipedia Category - Classifying Urban Events by Combining Analyses of Location-based Social Networks and Wikipedia. Urb-IoT, 47–52. http://doi.org/10.1145/2962735.2962740
  • Samoilenko A, Karimi F, Edler D, Kunegis J, Strohmaier M (2016) Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity. EPJ Data Sci 5:171–21. DOI 10.1140/epjds/s13688-016-0070-8
  • Ojanperä, S., Graham, M., Straumann, R. K., & Zook, M. (2017). Engagement in the Knowledge Economy: Regional Patterns of Content Creation with a Focus on Sub-Saharan Africa, 1–19.
  • Stephany, F., & Braesemann, F. (2017). An Exploration of Wikipedia Data as a Measure of Regional Knowledge Distribution, 1–9.
  • Sheehan, E., Meng, C., Tan, M., Uzkent, B., Jean, N., Lobell, D. B., et al. (2019). Predicting Economic Development using Geolocated Wikipedia Articles. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2698–2706.
  • Graham, M., De Sabbata, S., Straumann, R., & Ojanperä, S. (2019). Uneven Digital Geographies... and Why They Matter, 1–8.
  • Dittus, M., & Graham, M. (2019). Mapping Wikipedia’s Geolinguistic Contours. Digital Culture & Society, 5(1), 147–164. http://doi.org/10.14361/dcs-2019-0109
  • Beytía, P. (2020). The Positioning Matters (pp. 1–4).
  • Osborne, C., Graham, M., & Dittus, M. (2021). Edit Wars in a Contested Digital City: Mapping Wikipedia’s Uneven Augmentations of Berlin. The Professional Geographer, 73(1), 85–95. http://doi.org/10.1080/00330124.2020.1800493

Cultural Context Content Gap

  • Hecht, B., & Gergle, D. (2009, June). Measuring self-focus bias in community-maintained knowledge repositories. In Proceedings of the fourth international conference on communities and technologies (pp. 11-20).
  • Liao, H. T., & Petzold, T. (2010). Analysing Geo-linguistic Dynamics of the World Wide Web: The Use of Cartograms and Network Analysis to Understand Linguistic Development in Wikipedia.
  • Bao P, Hecht B, Carton S, Quaderi M, Horn MS, Gergle D (2012) Omnipedia: bridging the Wikipedia language gap. CHI 1075–1084. DOI 10.1145/2207676.2208553
  • Yasseri, T., Spoerri, A., Graham, M., & Kertész, J. (2014). The most controversial topics in Wikipedia: a multilingual and geographical analysis. In Global Wikipedia: international and cross-cultural issues in online collaboration.
  • Ronen, S., Gonçalves, B., Hu, K. Z., Vespignani, A., Pinker, S., & Hidalgo, C. A. (2014). Links that speak: The global language network and its association with global fame. Proceedings of the National Academy of Sciences, 111(52), E5616-E5622.
  • Karimi, F., Bohlin, L., Samoilenko, A., Rosvall, M., & Lancichinetti, A. (2015). Mapping bilateral information interests using the activity of Wikipedia editors. Palgrave Communications, 1(1), 1-7.
  • Mushiba, M., Gallert, P., & Winschiers-Theophilus, H. (2016). On Persuading an OvaHerero Community to Join the Wikipedia Community. CaTaC. http://doi.org/10.1007/978-3-319-50109
  • Samoilenko, A., Karimi, F., Edler, D., Kunegis, J., & Strohmaier, M. (2016). Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity. EPJ Data Science, 5(1), 171–21. http://doi.org/10.1140/epjds/s13688-016-0070-8
  • Dittus M, Graham M (2019) Mapping Wikipedia's Geolinguistic Contours. Digital Culture & Society 5:147–164. DOI 10.14361/dcs-2019-0109
  • Oeberst, A., Beck, I., Matschke, C., Ihme, T. A., & Cress, U. (2019). Collectively biased representations of the past: Ingroup Bias in Wikipedia articles about intergroup conflicts. British Journal of Social Psychology, 59(4), 791–818. http://doi.org/10.1111/bjso.12356
  • Roued-Cunliffe H (2017) Forgotten history on Wikipedia. In Participatory heritage. Facet Publishing, London.
  • Hinnosaar, M., Hinnosaar, T., Kummer, M. E., & Slivko, O. (2019). Wikipedia matters. Available at SSRN 3046400.
  • Speaks, H., Falise, A., Grosgebauer, K., Duncan, D., & Carrico, A. (2019). Racial Disparities in Mortality Among American Film Celebrities: A Wikipedia-Based Retrospective Cohort Study. Interactive Journal of Medical Research, 8(4), e13871–8. http://doi.org/10.2196/13871
  • Adams, J., Brückner, H., & Naslund, C. (2019). Who Counts as a Notable Sociologist on Wikipedia? Gender, Race, and the “Professor Test.” Socius: Sociological Research for a Dynamic World, 5(5), 237802311882394–14. http://doi.org/10.1177/2378023118823946
  • Miz, V., Hanna, J., Aspert, N., Ricaud, B., & Vandergheynst, P. (2020). What is Trending on Wikipedia? Capturing Trends and Language Biases Across Wikipedia Editions (pp. 794–801). Presented at the WWW '20: The Web Conference 2020, New York, NY, USA: ACM. http://doi.org/10.1145/3366424.3383567
  • Sunarsih. (2020). Representation of Indonesia in Wikipedia, 1–12.
  • Krause, A., & Cohen, S. (2020). Deriving Geolocations in Wikipedia (Vol. 6, pp. 3293–3296). Presented at the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, New York, NY, USA: ACM. http://doi.org/10.1145/3340531.3417459
  • Ezell, J. M. (2021). Empathy plasticity: decolonizing and reorganizing Wikipedia and other online spaces to address racial equity. Ethnic and Racial Studies, 0(0), 1–13. http://doi.org/10.1080/01419870.2020.1851383
  • Matias, J. N., Devouard, F., Kamin, J., Klein, M., & Pennington, E. (2021). Broadening African Self-Representation on Wikipedia: A Field Experiment, 1–4.
  • Bjork-James C (2021) New maps for an inclusive Wikipedia: decolonial scholarship and strategies to counter systemic bias. New Review of Hypermedia and Multimedia 10:1–22. DOI 10.1080/13614568.2020.1865463

Time Gap

  • Sipoš, R., Bhole, A., Fortuna, B., Grobelnik, M., & Mladenić, D. (2009, May). HistoryViz–Visualizing Events and Relations Extracted from Wikipedia. In European Semantic Web Conference (pp. 903-907). Springer, Berlin, Heidelberg.
  • Ciglan, M., & Nørvåg, K. (2010). Wikipop: personalized event detection system based on Wikipedia page view statistics. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management (pp. 1931–1932).
  • Chasin, R., Woodward, D., Witmer, J., & Kalita, J. (2014). Extracting and displaying temporal and geospatial entities from articles on historical events. The Computer Journal, 57(3), 403-426.
  • Samoilenko, A., Lemmerich, F., Weller, K., Zens, M., & Strohmaier, M. (2017). Analysing timelines of national histories across Wikipedia editions: a comparative computational approach. arXiv preprint arXiv:1705.08816.
  • Samoilenko, A., Lemmerich, F., Zens, M., Jadidi, M., Génois, M., & Strohmaier, M. (2018). (Don’t) Mention the War: A Comparison of Wikipedia and Britannica Articles on National Histories. Proceedings of the 2018 World Wide Web Conference, 843–852.
  • Piccardi, T., Redi, M., Colavizza, G., & West, R. (2020, April). Quantifying engagement with citations on Wikipedia. In Proceedings of The Web Conference 2020 (pp. 2365-2376).
  • Tizzoni, M., Panisson, A., Paolotti, D., & Cattuto, C. (2020). The impact of news exposure on collective attention in the United States during the 2016 Zika epidemic. PLoS Computational Biology, 16(3), e1007633.