Abstract
This paper studies the quality of teams of Wikipedia authors using a statistical approach. We report the preparation of a dataset containing numerous behavioural and structural attributes, its subsequent analysis, and its use to predict team quality. We performed an exploratory analysis using partial regression to remove the influence of attributes not related to the team itself. The analysis confirmed that discussions between team members are the key factor significantly influencing an article’s quality. In the second part of the paper, machine learning models are used successfully to predict good articles based on features of the teams that created them.
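As an illustration of the partial regression idea mentioned in the abstract, the following is a minimal sketch on synthetic data, not the authors' code or dataset. The variable names (`article_age`, `discussion`, `quality`) and the coefficients are hypothetical and chosen only for the example. Both the team attribute and the quality score are regressed on a non-team attribute, and the residuals are then correlated, so that the remaining association no longer reflects the controlled attribute.

```python
import numpy as np


def residualize(target, controls):
    """Residuals of an ordinary least squares fit of target on controls plus an intercept."""
    design = np.column_stack([np.ones(len(target)), controls])
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    return target - design @ coef


def partial_correlation(x, y, controls):
    """Correlation between x and y after removing the linear influence of controls."""
    rx = residualize(x, controls)
    ry = residualize(y, controls)
    return float(np.corrcoef(rx, ry)[0, 1])


# Synthetic data (an assumption for illustration, not the paper's dataset).
rng = np.random.default_rng(0)
n = 500
article_age = rng.normal(size=n)                      # hypothetical non-team attribute
discussion = 0.8 * article_age + rng.normal(size=n)   # hypothetical team attribute
quality = 0.5 * discussion + 0.8 * article_age + rng.normal(size=n)

controls = article_age.reshape(-1, 1)
raw = float(np.corrcoef(discussion, quality)[0, 1])
partial = partial_correlation(discussion, quality, controls)

print(f"raw correlation:     {raw:.3f}")
print(f"partial correlation: {partial:.3f}")
```

In this synthetic setup the raw correlation is inflated by the shared dependence on `article_age`, and the partial correlation is lower because that dependence has been regressed out.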
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bukowski, L., Jankowski-Lorek, M., Jaroszewicz, S., Sydow, M. (2014). What Makes a Good Team of Wikipedia Editors? A Preliminary Statistical Analysis. In: Nadamoto, A., Jatowt, A., Wierzbicki, A., Leidner, J.L. (eds) Social Informatics. SocInfo 2013. Lecture Notes in Computer Science, vol 8359. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-55285-4_2
DOI: https://doi.org/10.1007/978-3-642-55285-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-55284-7
Online ISBN: 978-3-642-55285-4
eBook Packages: Computer Science, Computer Science (R0)