Fake Account Detection in Twitter using Long Short Term Memory and Convolutional Neural Network

Fake Account Detection in Twitter using Long Short Term Memory and Convolutional Neural Network

  IJETT-book-cover           
  
© 2024 by IJETT Journal
Volume-72 Issue-3
Year of Publication : 2024
Author : Louzar Oumaima, Ramdi Mariam, Baida Ouafae, Lyhyaoui Abdelouahid
DOI : 10.14445/22315381/IJETT-V72I3P112

How to Cite?

Louzar Oumaima, Ramdi Mariam, Baida Ouafae, Lyhyaoui Abdelouahid, "Fake Account Detection in Twitter using Long Short Term Memory and Convolutional Neural Network," International Journal of Engineering Trends and Technology, vol. 72, no. 3, pp. 116-126, 2024. Crossref, https://doi.org/10.14445/22315381/IJETT-V72I3P112

Abstract
With the growing influence of social media platforms, the identification and prevention of fake accounts has become a crucial challenge for maintaining the integrity of online interactions. The proliferation of Online Social Network (OSN) platforms has given rise to a significant increase in the number of fake accounts, leading to numerous detrimental effects on online communities. Many strategies have been suggested by various communities to deal with false accounts in OSN. Therefore, this paper proposes an innovative approach for detecting fake accounts on Twitter based on the content of tweets. It incorporated Long Short-Term Memory (LSTM) and Convolutional Neural Network (CNN). It conducted this research in several processes, including data collection, data preprocessing, data reduction by applied Correlation-based Feature Selection (CFS) and Principal Component Analysis (PCA), and data classification. The suggested method, the LSTM-CNN approach, is to cluster more than 2,000,000 accounts from the MIB dataset, and the experimental results show that the approach has the highest accuracy of 98.95% compared with other research.

Keywords
Fake account, Twitter, Online Social Network, LSTM, CNN.

References
[1] Prasanta Kumar Sahoo, and K. Lavanya, “Identification of Malicious Accounts in Facebook,” International Journal of Engineering and Advanced Technology (IJEAT), vol. 9, no. 1, pp. 2917-2921, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Priyanka Kondeti et al., “Fake Account Detection Using Machine Learning,” Evolutionary Computing and Mobile Sustainable Networks, Springer, vol. 53, pp. 791-802, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Xinyi Zhou, and Reza Zafarani, “A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities,” ACM Computing Surveys, vol. 53, no. 5, pp. 1-40, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Kai Shu, Suhang Wang, and Huan Liu, “Exploiting Tri-Relationship for Fake News Detection,” arXiv, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Vahit Çalişir, Disinformation, Post-Truth, and Naive Realism in COVID-19: Melting the Truth, Handbook of Research on Representing Health and Medicine in Modern Media, IGI Global, pp. 200-215, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Santosh Kumar Uppada et al., “Novel Approaches to Fake News and Fake Account Detection in OSNs: User Social Engagement and Visual Content Centric Model,” Social Network Analysis and Mining, vol. 12, no. 52, pp. 1-19, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Maria Grazia Vigliotti, and Chris Hankin, “Discovery of Anomalous Behaviour in Temporal Networks,” Social Networks, vol. 41, pp. 18-25, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Nasira Perveen et al., “Sentiment Based Twitter Spam Detection,” International Journal of Advanced Computer Science and Applications, vol. 7, no. 7, pp. 568-573, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Nadav Voloch, Nurit Gal-Oz, and Ehud Gudes, “A Trust-based Privacy Providing Model for Online Social Networks,” Online Social Networks and Media, vol. 24, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Suneet Joshi, and Deepak Singh Tomar, “Deep Neural Network-Based Relationship Identification Framework to Discriminate Fake Profile over Social Media,” International Journal of Advanced Computer Science and Applications, vol. 12, no. 3, pp. 599-611, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Vishal Sharma, Ilsun You, and Ravinder Kumar, “ISMA: Intelligent Sensing Model for Anomalies Detection in Cross Platform OSNs with a Case Study on IoT,” IEEE Access, vol. 5, pp. 3284 - 3301, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Khalid Binsaeed, Gianluca Stringhini, and Ahmed E. Youssef, “Detecting Spam in Twitter Microblogging Services: A Novel Machine Learning Approach Based on Domain Popularity,” International Journal of Advanced Computer Science and Applications, vol. 11, no. 11, pp. 11-22, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[13] B. Prabhu Kavin et al., “Machine Learning-Based Secure Data Acquisition for Fake Accounts Detection in Future Mobile Communication Networks,” Security Threats and Challenges in Future Mobile Communication Systems, vol. 2022, pp. 1-10, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[14] P. Sowmya, and Madhumita Chatterjee, “Detection of Fake and Clone Accounts in Twitter Using Classification and Distance Measure Algorithms,” 2020 International Conference on Communication and Signal Processing (ICCSP), Chennai, India, pp. 67-70, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Faouzia Benabbou, Hanane Boukhouima, and Nawal Sael, “Fake Accounts Detection System Based on Bidirectional Gated Recurrent Unit Neural Network,” International Journal of Electrical and Computer Engineering (IJECE), vol. 12, no. 3, pp. 3129-3127, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Stefano Cresci et al., “The Paradigm-Shift of Social Spambots: Evidence, Theories, and Tools for the Arms Race,” Proceedings of the 26th International Conference on World Wide Web Companion, pp. 963-972, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[17] MIB Datasets, Consiglio Nazionale Delle Ricerche (CNR). [Online]. Available: https://mib.projects.iit.cnr.it/dataset.html.
[18] Mark A. Hall, “Correlation-Based Feature Selection for Machine Learning,” Higher Degree Theses, University of Waikato, pp. 1-199, 1999.
[Google Scholar] [Publisher Link]
[19] Santiago Egea et al., “Intelligent IoT Traffic Classification Using Novel Search Strategy for Fast-Based-Correlation Feature Selection in Industrial Environment,” IEEE Internet of Things Journal, vol. 5, no. 3, pp. 1616-1624, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Atiqur Rehman et al., “Performance Analysis of PCA, Sparse PCA, Kernel PCA and Incremental PCA Algorithms for Heart Failure Prediction,” 2020 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), Istanbul, Turkey, pp. 1-5, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Weicong Kong et al., “Short-Term Residential Load Forecasting Based on LSTM Recurrent Neural Network,” IEEE Transactions on Smart Grid, vol. 10, no. 1, pp. 841-851, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Jiarui Zhang et al., “LSTM-CNN Hybrid Model for Text Classification,” 2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chongqing, China, pp. 1675-1680, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Tara N. Sainath et al., “Convolutional, Long Short-Term Memory, Fully Connected Deep Neural Networks,” 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, QLD, Australia, pp. 4580-4584, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Ahmad Homsi et al., “Detecting Twitter Fake Accounts Using Machine Learning and Data Reduction Techniques,” Proceedings of the 10th International Conference on Data Science, Technology and Applications, pp. 88-95, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Kusum Kumari Bharti, and Shivanjali Pandey, “Fake Account Detection in Twitter Using Logistic Regression with Particle Swarm Optimization,” Application of Soft Computing, vol. 25, pp. 11333-11345, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Sreekanth Madisetty, and Maunendra Sankar Desarkar, “A Neural Network-Based Ensemble Approach for Spam Detection in Twitter,” IEEE Transactions on Computational Social Systems, vol. 5, no. 4, pp. 973-984, 2018.
[CrossRef] [Google Scholar] [Publisher Link]