Ensuring the Integrity of Wikipedia: A Data Science Approach
In this paper, we present our research on ensuring the integrity of Wikipedia, the world's largest free encyclopedia. Because anyone can edit Wikipedia, many malicious users exploit this openness to make edits that compromise the content quality of its pages. Specifically, we present DePP, the state-of-the-art tool for detecting article pages that should be protected, which achieves an accuracy of 93%, and we introduce our research on identifying spam users. We show that we can distinguish spammers from benign users with 80.8% accuracy and a mean average precision of 0.88.
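For readers unfamiliar with the two metrics reported for spammer detection, the following minimal sketch shows how accuracy and mean average precision are commonly computed. It is not the paper's evaluation code: the function names, the toy labels and scores, the 0.5 decision threshold, and the choice to average over evaluation runs (for example, cross-validation folds) are all assumptions made for illustration.

```python
def accuracy(y_true, y_pred):
    """Fraction of users whose predicted label matches the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def average_precision(y_true, scores):
    """Average of precision@k taken at each rank k where a true spammer appears."""
    # Rank users from most to least suspicious according to the classifier score.
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(runs):
    """Mean of average precision over several runs (assumed here: one per fold)."""
    return sum(average_precision(y, s) for y, s in runs) / len(runs)


# Toy data only (hypothetical): 1 = spammer, 0 = benign user.
y_true = [1, 0, 1, 1, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.2]

# Assumed decision threshold of 0.5 to turn scores into hard labels.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

acc = accuracy(y_true, y_pred)
ap = average_precision(y_true, scores)
map_score = mean_average_precision([(y_true, scores), ([0, 1], [0.3, 0.6])])
```

Accuracy depends on the chosen threshold, whereas average precision depends only on how well the scores rank spammers above benign users, which is why the two figures are reported separately.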
Spezzano, Francesca. (2017). "Ensuring the Integrity of Wikipedia: A Data Science Approach". 25th Italian Symposium on Advanced Database Systems, SEBD 2017, 98-105.