ISVS3CE: Incremental Support Vector Semi Supervised Subspace Clustering Ensemble and ENhanced Bat Algorithm (ENBA) for High Dimensional Data Clustering
D. Karthika1, K. Kalaiselvi2
1D.Karthika, Research Scholar, Department of Computer Science, VELS Institute of Science, Technology & Advanced Studies (Formerly VELS University), Chennai, India.
2Dr.K.Kalaiselvi, Professor& Head, Department of Computer Science, VELS Institute of Science, Technology & Advanced Studies (Formerly VELS University), Chennai, India.
Manuscript received on 13 March 2019 | Revised Manuscript received on 20 March 2019 | Manuscript published on 30 July 2019 | PP: 930-939 | Volume-8 Issue-2, July 2019 | Retrieval Number: B1724078219/19©BEIESP | DOI: 10.35940/ijrte.B1724.078219
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In the recent work, Incremental Soft Subspace Based Semi-Supervised Ensemble Clustering (IS4EC) framework was proposed which helps in detecting clusters in the dataset. IS4EC framework also increases the results of clustering by reducing the intra-cluster distance and increasing the inter-cluster distance with increased cluster quality. It cannot attain acceptable results while handling high dimensional data. However, decreasing the dimensional subspace becomes extremely difficult issue. In IS4EC framework, to choose the optimal ensemble members also extremely becomes challenging issue. In order to solve these issues of traditional cluster ensemble methods, first propose an Incremental Support vector Semi-Supervised Subspace Clustering Ensemble (ISVS3CE) framework which makes utilized of benefits of the random subspace algorithm and the Constraint Propagation (CP) algorithm. Here the centroid values were selected by using the Support Vector Machine (SVM) classifier. In the ISVS3CE framework, Incremental Ensemble Member Chosen (IEMC) process is performed by using the ENhanced Bat Algorithm (ENBA), and the normalized cut algorithm is introduced to perform high dimensional data clustering. The ISVS3CE framework is successful for solving high dimensional data issue, at the same time as the CP algorithm is valuable for incorporating the prior information. Results demonstrate that the proposed ISVS3CE framework performs well on datasets by means of very high dimensionality, and better than the traditional clustering ensemble methods.
Index Terms: Cluster Ensemble, Semi-Supervised Clustering, Random Subspace, Cancer Gene Expression Profile, Clustering Analysis, Support Vector Machine (SVM), and ENhanced Bat Algorithm (ENBA).
Scope of the Article: Clustering