Optimization of the ANOVA Procedure for Support Vector Machines
Borislava Vrigazova1, Ivan Ivanov2
1Borislava Vrigazova*, Department of Statistics and Econometrics, Sofia University, Sofia, Bulgaria.
2Ivan Ivanov*, Department of Statistics and Econometrics, Sofia University, Sofia, Bulgaria.
Manuscript received on November 12, 2019. | Revised Manuscript received on November 25, 2019. | Manuscript published on 30 November, 2019. | PP: 5160-5165 | Volume-8 Issue-4, November 2019. | Retrieval Number: D7375118419/2019©BEIESP | DOI: 10.35940/ijrte.D7375.118419
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Feature selection is a powerful tool to identify the important characteristics of data for prediction. Feature selection, therefore, can be a tool for avoiding overfitting, improving prediction accuracy and reducing execution time. The applications of feature selection procedures are particularly important in Support vector machines, which is used for prediction in large datasets. The larger the dataset, the more computationally exhaustive and challenging it is to build a predictive model using the support vector classifier. This paper investigates how the feature selection approach based on the analysis of variance (ANOVA) can be optimized for Support Vector Machines (SVMs) to improve its execution time and accuracy. We introduce new conditions on the SVMs prior to running the ANOVA to optimize the performance of the support vector classifier. We also establish the bootstrap procedure as alternative to cross validation to perform model selection. We run our experiments using popular datasets and compare our results to existing modifications of SVMs with feature selection procedure. We propose a number of ANOVA-SVM modifications which are simple to perform, while at the same time, boost significantly the accuracy and computing time of the SVMs in comparison to existing methods like the Mixed Integer Linear Feature Selection approach.
Keywords: Support Vector Machines, ANOVA, Bootstrapping, PCA Transformation, Feature Selection.
Scope of the Article: Aggregation, Integration, and Transformation.