Performance Analysis of Data Mining Algorithms in the Prediction of Rain Fall
Yogesh Kumar Jakhar1, Nidhi Mishra2, Rakesh Poonia3
1Yogesh Kumar Jakhar, Ph. D. Scholar, Department of Computer Engineering, Poornima University, Jaipur (India).
2Dr. Nidhi Mishra, Associate Professor, Poornima University, Jaipur (India).
3Dr. Rakesh Poonia, Assistant Professor, Govt. Engineering College, Bikaner (India).
Manuscript received on 07 March 2019 | Revised Manuscript received on 14 March 2019 | Manuscript published on 30 July 2019 | PP: 558-563 | Volume-8 Issue-2, July 2019 | Retrieval Number: B1604078219/19©BEIESP | DOI: 10.35940/ijrte.B1604.078219
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Because weather is a random phenomenon, its prediction has always been a challenge for meteorologists all over the world. There are a number of approaches for predicting weather from collected atmospheric data. Rain forecasting is a complex and dynamic task. It has posed a challenge since ancient times because it is influenced by numerous parameters, such as temperature, wind speed and direction, rainfall, humidity, station-level pressure, mean sea-level pressure, dry bulb temperature, dew point temperature and vapour pressure. Various data mining techniques have been implemented for rain forecasting. Compared with conventional methods of predicting rainfall rate, methods that apply historical records and data mining technology show improved accuracy. Many researchers have done excellent work constructing forecasting models with data mining methods, but most of them test predictive accuracy at only one particular geographical area. In this paper, we analyze the performance of the k-NN, Random Forest, C5.0 and AdaBoost algorithms at different locations and compare them using precision, recall, F-measure and classification accuracy. Daily surface data for 3 stations, covering the period from 2005 to 2015, was collected from the India Meteorological Department (IMD), Pune. The k-NN algorithm achieves its best accuracy, 98.02 %, on the Jodhpur dataset compared with the other datasets, with a 90:10 ratio of training to testing records and K = 10. The highest accuracy, 99.270 %, is obtained by the AdaBoost algorithm.
Index Terms: AdaBoost, C5.0, k-NN, Rainfall Prediction, Random Forest, Weather Forecasting.
Scope of the Article: Data Mining
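Illustrative note: the abstract compares classifiers using precision, recall, F-measure and classification accuracy. The following is a minimal Python sketch of how these four measures can be computed for a binary rain / no-rain label. It is not taken from the paper: the function name `binary_metrics`, the 1 = rain / 0 = no rain encoding, and the sample labels are assumptions made purely for illustration.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int],
                   positive: int = 1) -> Dict[str, float]:
    """Compute precision, recall, F-measure and accuracy for binary labels.

    Hypothetical helper for illustration; `positive` marks the "rain" class.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    # Guard against division by zero when a class is never predicted or absent.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f_measure = (2 * precision * recall / (precision + recall)
                 if (precision + recall) else 0.0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0

    return {"precision": precision, "recall": recall,
            "f_measure": f_measure, "accuracy": accuracy}


# Made-up labels for ten days (1 = rain, 0 = no rain); not data from the paper.
observed = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
metrics = binary_metrics(observed, predicted)
```

Reporting precision and recall alongside accuracy matters when rainy days are rare, because a classifier that always predicts "no rain" can score high accuracy while never detecting rain.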