Filtering users accounts for enhancing the results of social media mining tasks
Funding Sponsor
American University in Cairo
Author's Department
Computer Science & Engineering Department
Find in your Library
https://doi.org/10.1007/978-3-030-45691-7_36
Document Type
Research Article
Publication Title
Advances in Intelligent Systems and Computing
Publication Date
1-1-2020
doi
10.1007/978-3-030-45691-7_36
Abstract
© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2020. Filtering out the illegitimate Twitter accounts for online social media mining tasks reduces the noise and thus improves the quality of the outcomes of those tasks. Developing a supervised machine learning classifier requires a large annotated dataset. While building the annotation guidelines, the rules were found suitable to develop an unsupervised rule-based classifying program. However, despite its high accuracy, the performance of the rule-based program was not time efficient. So, we decided to use the unsupervised rule-based program to create a massive annotated dataset to build a supervised machine learning classifier, which was found to be fast and matched the unsupervised classifier performance with an F-Score of 92%. The impact of removing those illegitimate accounts on an influential users identification program developed by the authors, was investigated. There were slight improvements in the precision results but not statistically significant, which indicated that the influential user program didn’t identify erroneously spam accounts as influential.
First Page
391
Last Page
400
Recommended Citation
APA Citation
Shalaby, M.
&
Rafea, A.
(2020). Filtering users accounts for enhancing the results of social media mining tasks. Advances in Intelligent Systems and Computing, 1160 AISC, 391–400.
10.1007/978-3-030-45691-7_36
https://fount.aucegypt.edu/faculty_journal_articles/1542
MLA Citation
Shalaby, May, et al.
"Filtering users accounts for enhancing the results of social media mining tasks." Advances in Intelligent Systems and Computing, vol. 1160 AISC, 2020, pp. 391–400.
https://fount.aucegypt.edu/faculty_journal_articles/1542