Fuzzy Relational Scattered Distance Based Clustering Method for Sparsely Distributed High Dimensional Data Objects
R.Pushpalatha1, K. Meenakshi Sundaram2
1Dr.R.Pushpalatha, Assistant Professor, Department of Computer Science, Kongu Arts and Science College (Autonomous), Nanjanapuram, Erode, Tamil Nadu, India.
2Dr. K. Meenakshi Sundaram, Associate Professor, Department of Computer Science, Erode Arts and Science College (Autonomous), Erode, Tamil Nadu, India.
Manuscript received on January 05, 2020. | Revised Manuscript received on January 25, 2020. | Manuscript published on January 30, 2020. | PP: 4044-4049 | Volume-8 Issue-5, January 2020. | Retrieval Number: E6633018520/2020©BEIESP | DOI: 10.35940/ijrte.E6633.018520
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Clustering is one of the most significant ideas in data mining. It is an unsupervised learning model. Clustering technique in handling high dimensional data is more complex due to intrinsic sparsity nature of high dimensional data. Though, existing methods to reduce immaterial clusters were based on spectral clustering algorithm and graph-based learning algorithm, whose lack of sparsity and polynomial time complexity compromises their efficiency when applied to sparse high dimensional data. This paper concentrates to cluster the sparsely distributed high dimensional data objects. Fuzzy Relational Scattered Distance Based Clustering (FRSDBC) method is developed with three models such as Geometric Median Based Fuzzy model, Scattered Distance measure model, Grid based clustered sparse data representation model. Geometric Median Based Fuzzy model calculates the geometric median of similar sparse data and then the non similar sparse data objects to fitting the relational fuzziness across data points. It involves in the subspace reduction of data objects. Scattered Distance measure model is used to measure the distance between the inner and outer object. Grid based clustering is used to calculate the area of the cluster in FRSDBC method. The main idea of the FRSDBC method is to clustering data points over sparsely distributed data within limited processing time. The Clustering Time, Clustering Accuracy and Space Complexity of each method is analyzed. The result of the FRSDBC method is compared with other techniques, the results obtained are more accurate, easy to understand and the clustering time was substantially low in FRSDBC method. It is widely used in many practical applications such as weather forecast, share trading, medical data analysis and aerial data analysis.
Keywords: Data Mining, Geometric Median Based Fuzzy Concept, Scattered Distance Measurement, Graph-Based Learning.
Scope of the Article: Data Mining.