Machine Learning And Data Science based Fake Account Detection
Abstract
Nowadays the usage of digital technology
have been increasing exponentially. At the same time
the rate of malicious users have been increasing.
Online social sites like Facebook and Twitter attract
millions of people globally. This interest in online
networking has opened to various issues including the
risk of exposing false data by creating fake accounts
resulting in the spread of malicious content. Fake
accounts are a popular way to forward spam, commit
fraud, and abuse through online social network. These
problems need to be tackled in order to give the user a
reliable online social network. In this paper, we are
using different ML algorithms like Support Vector
Machine (SVM), Logistic Regression (LR), Random
Forest (RF) and K-Nearest Neighbours (KNN). Along
with these algorithms we have used two different
normalization techniques such as Z-Score, and Min-
Max, to improve accuracy. We have implemented it to
detect fake Twitter accounts and bots. Our approach
achieved high accuracy and true positive rate for
Random Forest and KNN. Keywords: Data mining,
Classification, Logistic Regression, Support Vector
Machine, KNearest Neighbours, Random forest,
Normalization.
Downloads
References
Cao, X., David, MF., Theodore, H.: Detecting
Clusters of Fake Accounts in Online Social
Networks. In: 8th ACM Workshop on Artificial
Intelligence and Security, pp. 91-101 (2015).
2. Buket, E., Ozlem, A., Deniz, K., Cyhun, A.: Twitter
Fake Account Detection. In: IEEE 2nd International
Conference on Computer Science and Engineering,
pp. 388-392 (2017).
3. Naman, S., Tushar, S., Abha, T., Tanupriya, C.:
Detection of Fake Profile in Online Social Networks
Using Machine Learning. In: IEEE International
Conference on Advances in Computing and
Communicaton Engineering. pp. 231-234 (2018).
4. Sarah, K., Neamat, E., Hoda, M .O .M.: Detecting
Fake Accounts on Social Media. In: IEEE
Intenational Conference on Big Data. pp. 3672-3681
(2018).
5. Yeh-Cheng, C., Shyhtsun, F .W.: FakeBuster: A
Robust Fake Account Detection by Activiy Analysis.
In: IEEE 9th International Symposium on Parallel
Architectures, Algorithms and Programming. pp.
108-110 (2018).
6. Myo, MS., Nyein, NM.: Fake Accounts Detection
on Twitter using Blacklist. In: IEEE 17th
International Conference on Computer and
Information and Information Science. pp. 562-566
(2018).
6. Qiang, C., Michael, S., Xiaowei, Y., Tiago P.:
Aiding the Detection of Fake Accounts in Large
Scale Social Online Services. In: 9th USENIX
Conference on Networked Systems Design and
Implementation. pp. 1-14 (2012).
7. Mauro, C., Radha, P., Macro, S.: Fakebook :
Detecting Fake Profiles in Online Social Networks.
In: IEEE International Conference on Advances in
Social Networks Analysis and Mining. pp. 1071-
1078 (2012).
8. Kaur , R., and Singh, S.: A survey of data mining and
social network analysis based anomaly detection
techniques. In: Egyptian informatics journal. pp.199–
216 (2016).
9. Yazan, B., Dionysios, L., Georgos, S., Jorge, L.,
Jose, L., Matei, R., Konstatin, B., and Hassan, H.:
Integro : Leveraging victim prediction for robust fake