An Optimization of Feature Selection for Classification using Bat Algorithm
V. Yasaswini1, Santhi Baskaran2
1V. Yasaswini, Research Scholar, Computer Science and Engineering Department, Pondicherry Engineering College, Puducherry, India.
2Santhi Baskaran, Professor& Head, Information Technology Department, Pondicherry Engineering College, Puducherry, India.
Manuscript received on January 30, 2020. | Revised Manuscript received on February 11, 2021. | Manuscript published on March 30, 2021. | PP: 39-43 | Volume-9 Issue-6, March 2021. | Retrieval Number: 100.1/ijrte.F5331039621 | DOI: 10.35940/ijrte.F5331.039621
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Data mining is the action of searching the large existing database in order to get new and best information. It plays a major and vital role now-a-days in all sorts of fields like Medical, Engineering, Banking, Education and Fraud detection. In this paper Feature selection which is a part of Data mining is performed to do classification. The role of feature selection is in the context of deep learning and how it is related to feature engineering. Feature selection is a preprocessing technique which selects the appropriate features from the data set to get the accurate result and outcome for the classification. Nature-inspired Optimization algorithms like Ant colony, Firefly, Cuckoo Search and Harmony Search showed better performance by giving the best accuracy rate with less number of features selected and also fine f-Measure value is noted. These algorithms are used to perform classification that accurately predicts the target class for each case in the data set. We propose a technique to get the optimized feature selection to perform classification using Meta Heuristic algorithms. We applied new and recent advanced optimized algorithm named Bat algorithm on UCI datasets that showed comparatively equal results with best performed existing firefly but with less number of features selected. The work is implemented using JAVA and the Medical dataset (UCI) has been used. These datasets were chosen due to nominal class features. The number of attributes, instances and classes varies from chosen dataset to represent different combinations. Classification is done using J48 classifier in WEKA tool. We demonstrate the comparative results of the presently used algorithms with the existing algorithms thoroughly.
Keywords: Optimization, Meta-heuristic, Feature Extraction, Deep learning.