Abstract
Several approaches have been proposed for the problem of identifying authoritative actors in online communities. However, the majority of existing methods suffer from one or more of the following limitations: (1) There is a lack of an automatic mechanism to formally discriminate between authoritative and nonauthoritative users. In fact, a common approach to authoritative user identification is to provide a ranked list of users expecting authorities to come first. A major problem of such an approach is the question of where to stop reading the ranked list of users. How many users should be chosen as authoritative? (2) Supervised learning approaches for authoritative user identification suffer from their dependency on the training data. The problem here is that labeled samples are more difficult, expensive, and time consuming to obtain than unlabeled ones. (3) Several approaches rely on some user parameters to estimate an authority score. Detection accuracy of authoritative users can be seriously affected if incorrect values are used. In this article, we propose a parameterless mixture model-based approach that is capable of addressing the three aforementioned issues in a single framework. In our approach, we first represent each user with a feature vector composed of information related to its social behavior and activity in an online community. Next, we propose a statistical framework, based on the multivariate beta mixtures, in order to model the estimated set of feature vectors. The probability density function is therefore estimated and the beta component that corresponds to the most authoritative users is identified. The suitability of the proposed approach is illustrated on real data extracted from the Stack Exchange question-answering network and Twitter.
Topics

No keywords indexed for this article. Browse by subject →

References
35
[2]
L. J. Bain and M. Engelhardt. 2000. Introduction to Probability and Mathematical Statistics (2nd ed.). Duxbury Press. L. J. Bain and M. Engelhardt. 2000. Introduction to Probability and Mathematical Statistics (2nd ed.). Duxbury Press.
[3]
J. C. Bezdek . 1981. Pattern Recognition with Fuzzy Objective Function Algorithms . Plenum , New York . J. C. Bezdek. 1981. Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York.
[12]
M. Cha , H. Haddadi , F. Benevenuto , and K. P. Gummadi . 2010. Measuring user influence in twitter: The million follower fallacy . In Proc. AAAI Int’l Conf. Weblogs and Social Media (ICWSM’10) . M. Cha, H. Haddadi, F. Benevenuto, and K. P. Gummadi. 2010. Measuring user influence in twitter: The million follower fallacy. In Proc. AAAI Int’l Conf. Weblogs and Social Media (ICWSM’10).
[14]
Maximum Likelihood from Incomplete Data Via the EM Algorithm

A. P. Dempster, N. M. Laird, D. B. Rubin

Journal of the Royal Statistical Society Series B:... 10.1111/j.2517-6161.1977.tb01600.x
[19]
S. Kumar F. Morstatter and H. Liu. 2013. Twitter Data Analytics. Springer New York NY. S. Kumar F. Morstatter and H. Liu. 2013. Twitter Data Analytics. Springer New York NY. 10.1007/978-1-4614-9372-3
[21]
A. Leavitt , E. Burchard , D. Fisher , and S. Gilbert . 2009 . The Influentials: New Approaches for Analyzing Influence on Twitter. A publication of the Web Ecology Project (2009). A. Leavitt, E. Burchard, D. Fisher, and S. Gilbert. 2009. The Influentials: New Approaches for Analyzing Influence on Twitter. A publication of the Web Ecology Project (2009).
[23]
Z. Ma and A. Leijon . 2009. Beta mixture models and the application to image classification . In Proc. 16th IEEE Int’l Conf. Image Processing (ICIP’09). 2045 -- 2048 . Z. Ma and A. Leijon. 2009. Beta mixture models and the application to image classification. In Proc. 16th IEEE Int’l Conf. Image Processing (ICIP’09). 2045--2048.
[26]
A. Pal , R. Farzan , J. A. Konstan , and R. E. Kraut . 2011. Early detection of potential experts in question answering communities . In Proc. 19th Int’l Conf. User Modeling, Adaption and Personalization (UMAP’11) - Lecture Notes in Computer Science 6787. 231--242. A. Pal, R. Farzan, J. A. Konstan, and R. E. Kraut. 2011. Early detection of potential experts in question answering communities. In Proc. 19th Int’l Conf. User Modeling, Adaption and Personalization (UMAP’11) - Lecture Notes in Computer Science 6787. 231--242.
[28]
F. Riahi , Z. Zolaktaf , M. Shafiei , and E. Milios . 2012. Influence and passivity in social media . In Proc. 21st ACM Int’l Conf. World Wide Web - Companion Volume (WWW’ 12 Companion). 791--798. F. Riahi, Z. Zolaktaf, M. Shafiei, and E. Milios. 2012. Influence and passivity in social media. In Proc. 21st ACM Int’l Conf. World Wide Web - Companion Volume (WWW’12 Companion). 791--798.
Cited By
30
Neural age screening on question answering communities

Mohan Timilsina, Alejandro Figueroa · 2023

Engineering Applications of Artific...
What identifies different age cohorts in Yahoo! Answers?

Alejandro Figueroa, Mohan Timilsina · 2021

Knowledge-Based Systems
Metrics
30
Citations
35
References
Details
Published
Apr 24, 2015
Vol/Issue
6(3)
Pages
1-23
License
View
Cite This Article
Mohamed Bouguessa, Lotfi Ben Romdhane (2015). Identifying Authorities in Online Communities. ACM Transactions on Intelligent Systems and Technology, 6(3), 1-23. https://doi.org/10.1145/2700481
Related

You May Also Like

LIBSVM

Chih-Chung Chang, Chih-Jen Lin · 2011

41,159 citations

A Survey on Evaluation of Large Language Models

Yupeng Chang, Xu Wang · 2024

2,144 citations

Trajectory Data Mining

Yu Zheng · 2015

1,330 citations

Urban Computing

Yu Zheng, Licia Capra · 2014

977 citations

A Survey of Unsupervised Deep Domain Adaptation

Garrett Wilson, Diane J. Cook · 2020

726 citations