Visualizing the Clinical Data of Diabetes using Data Science and Machine Learning Algorithms
Vennapusa Vishnu Priya1, Abdul Gaffar.H2

1Vennapusa Vishnu Priya, SCOPE, Vellore Institute of Technology, Vellore, (Tamil Nadu), India.
2Abdul Gaffar H, SCOPE, Vellore Institute of Technology, Vellore, (Tamil Nadu), India.

Manuscript received on 23 March 2019 | Revised Manuscript received on 30 March 2019 | Manuscript published on 30 March 2019 | PP: 1960-1963 | Volume-7 Issue-6, March 2019 | Retrieval Number: F2788037619/19©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (

Abstract: In recent decades Machine learning and Data Science are providing best ways to analyze and solve various problems. In fact, those Machine Learning algorithms gives the best and optimized solutions. These methods are playing key role in providing efficient solutions for the health care problems like predicting the diseases in early stage, and even some automated systems run by Machine Learning Algorithms are prescribing medicines based on the patient’s symptoms. Diabetes is one among the chronic diseases from past years, which leads to the damage of patients eyes, nerves, heart and kidneys etc., In this project we are going to create a pipeline in which the data collected from the source is undergone through some preprocessing techniques and the Machine Learning Algorithms like SVM, KNN, Gradient Boasting, logistic regression and Random Forest are used to classify whether the patient is diabetic or not and the accuracy of these algorithms was measured by using some Evaluation methods like Train/Test Split. Finally, these data will be visualized by using Visualization Tools.
Keywords: SVM, KNN, Gradient Boasting, Logistic regression, Random Forest
Scope of the Article: Artificial Intelligence and Machine Learning