Heterogeneous Ensemble Structure based Universal Spam Profile Detection System for Social Media Networks
Vinod A. M1, Sathish G. C2

1Vinod A. M, School of Computing and Information Technology, REVA University, Bengaluru, Karnataka, India.
2Sathish G. C, Associate Professor, School of Computing and Information Technology, REVA University, Bengaluru, Karnataka, India.

Manuscript received on April 02, 2020. | Revised Manuscript received on April 21, 2020. | Manuscript published on May 30, 2020. | PP: 1028-1039 | Volume-9 Issue-1, May 2020. | Retrieval Number: A2179059120/2020©BEIESP | DOI: 10.35940/ijrte.A2179.059120
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: The exponential rise in internet technology and online social media networks have revitalized human-being to connect and socialize globally irrespective of geographical and any demographic boundaries. Additionally, it has revitalized business communities to reach target audiences through social media networks. However, as parallel adverse up-surge the ever-increasing presence of malicious users or spam has altered predominant intend of such social media network by propagating biased contents, malicious contents and fraud acts. Avoiding and neutralizing such malefic users on social media network has remained a critical challenge due to gigantically large size and user’s diversity such as Facebook, Twitter, and LinkedIn etc. Though exploiting certain user’s behavior and content types can help identifying malicious users, majority of the existing methods are limited due to confined parametric assessment, and inferior classification approaches. With intend to provide spam profile detection system in this paper a novel heterogeneous ensemble-based method is developed. The proposed model exploits user profile features, user’s activity features, location features and content features to perform spam user profile detection. To ensure optimality of computational significances, we applied multi-phased feature selection method employing Wilcoxon Rank Sum test, Significant Predictor test, and Pearson Correlation test, which assured retaining optimal feature sets for further classification. Subsequently, applying an array of machine learning methods, including Logistic regression, decision tree, Support Vector Machine variants with Linear, Polynomial and RBF kernels, Least Square SVM with linear, polynomial and RBF kernels, ANN with different kernels, etc we constituted a robust ensemble model for spam user profile classification. Simulations revealed that the proposed ensemble classification model achieves accuracy and F-score higher than 98%, which is the highest amongst major works done so far. It affirms suitability and robustness of the proposed model for real time spam profile detection and classification on social media platforms. 
Keywords:  Social Media Network, Spam User Profile Detection, Heterogeneous Ensemble Learning, multi-phased feature selection.
Scope of the Article: Social Networks