An Effective Data Classification Method for Medical Dataset in terms of Accuracy and Time
Sivakumar Venkataraman1, Subitha Sivakumar2
1Sivakumar Venkataraman, Botho University, Gaborone, Botswana, Africa.
2Subitha Sivakumar, Annai Fathima College of Arts and Science, Madurai, (Tamil Nadu), India.
Manuscript received on 23 March 2019 | Revised Manuscript received on 30 March 2019 | Manuscript published on 30 March 2019 | PP: 501-507 | Volume-7 Issue-6, March 2019 | Retrieval Number: F2467017519/19©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Data classification plays a major role in organizing the optimal features for the most effective and efficient use. Feature Selection technique is one of the foremost methods to select the optimal features from the dataset. The classification accuracy and the processing time required to build the model are the two main keys in obtaining the effective data classification by using the feature subset methods and ranking methods. The work was tested on seven real time dataset (Breast Cancer, Breast Tissue, Contact Lenses, Dermatology, Hypothyroid, Iris and Liver Disorders) obtained from UCI Data repository. The results obtained from CFS Subset Attribute Evaluator, Correlation Attribute Evaluator, Gain Ration Attribute Evaluator, Info Gain Attribute Evaluator, OneR Attribute Evaluator, Principal Components Attribute Evaluator, ReliefF Attribute Evaluator, Symmetrical Uncertainty Attribute Evaluator and Wrapper Subset Attribute Evaluator were compared. Classification algorithms like Navis Bayes, Bayes Net, Multilayered Perception, Sequential Minimum Optimization, K Nearest Neighbours, Decision Tree, OneR, J48 and Random Tree are used to analyze the classification accuracy and processing time. Comparison are done with the results obtained by using the ranking methods and the results obtained bynot using the ranking method, to find whether the ranking methods are important in obtaining the classification accuracy and processing time.
Keywords: Data Classification, Feature Subset Methods, Ranking Methods, Supervised learning algorithms, classification accuracy
Scope of the Article: Classification.