Diabetes is a kind of chronic disease, which refers to any illness that lasts for a longer period of time. It happens when the body’s blood sugar level is too high. Diabetes affects the elderly and obese in especially. Diabetic foot syndrome, heart attack, kidney failure, high blood pressure, and other health concerns can all be caused by diabetes. Machine learning is a vast field that learns from previous data and makes accurate predictions. Diabetes can be predicted early on, which can lead to better treatment. For this study, we used the Pima Indian Diabetes (PID) dataset from the UCI Machine Learning Repository. The dataset contains information about 768 patients and their corresponding nine unique attributes. We used five ML algorithms on the dataset to predict diabetes. The performance of all the five classifiers is compared using Accuracy score, Precision, Recall, F-measure Receiver Operating Curve (ROC) from each model and we observed that the Gradient Boosting techniques provide the Best Accuracy of 92.10% and ROC of 91.8%.

1.
World Health Organization: WHO & World Health Organization: WHO
. (
2023
).
Diabetes.
www.who.int. https://www.who.int/news-room/fact-sheets/detail/diabeteshttps://www.who.int/health-topics/diabetes
2.
Sharma
,
N. C.
(
2019b
, October 10).
Government survey found 11.8% prevalence of diabetes in India | Mint. Mint.
https://www.livemint.com/science/health/government-survey-found-11-8-prevalence-of-diabetes-in-india-11570702665713.html
3.
Diabetes - NIDDK
. (
2023
).
National Institute of Diabetes and Digestive and Kidney Diseases.
https://www.niddk.nih.gov/health-information/diabetes
4.
Martin
,
L.
(
2021
, July 29).
What to know about diabetes in India.
https://www.medicalnewstoday.com/articles/diabetes-in-india
5.
Chacko
,
B.
,
Chacko
,
B.
, &
Indiaspend
. (
2019
).
Indiaspend. Indiaspend.
https://www.indiaspend.com/1-in-2-indian-diabetics-unaware-of-their-condition-study/
6.
What is Machine Learning?
|
IBM.
(
2023
). https://www.ibm.com/in-en/cloud/learn/machine-learning
7.
Patil
,
P.
(
2022
, May 30).
What is Exploratory Data Analysis?
-
Towards Data Science. Medium.
https://towardsdatascience.com/exploratory-data-analysis-8fc1cb20fd15
8.
GeeksforGeeks
. (
2020
).
Interquartile Range to Detect Outliers in Data
.
GeeksforGeeks.
https://www.geeksforgeeks.org/interquartile-range-to-detect-outliers-in-data/
9.
Malato
,
G.
(
2022
, January 4).
Outlier identification using Interquartile Range
-
Towards Data Science. Medium.
https://towardsdatascience.com/outlier-identification-using-interquartile-range-74f5de12932a
10.
Alam
,
M.
(
2021
, December 16).
Anomaly detection with Local Outlier Factor (LOF
) -
Towards Data Science. Medium.
https://towardsdatascience.com/anomaly-detection-with-local-outlier-factor-lof-d91e41df10f2
11.
Ferhatmetin
. (
2021b
, December 27).
LOCAL OUTLIER FACTOR - Data Science Earth
-
Medium. Medium.
https://medium.com/datasciencearth/local-outlier-factor-7821b5651bc5
12.
Rosencrance
,
L.
(
2021b
).
feature engineering
.
Data Management.
https://searchdatamanagement.techtarget.com/definition/feature-engineering
13.
Bhandari
,
A.
(
2023
).
Feature Engineering: Scaling, Normalization, and Standardization (Updated 2023
).
Analytics Vidhya.
https://www.analyticsvidhya.com/blog/2020/04/feature-scaling-machine-learning-normalization-standardization/
14.
Machine Learning: What It i.s., Tutorial
,
Definition, Types - Javatpoint.
(
2022
). www.javatpoint.com. https://www.javatpoint.com/machine-learning
16.
Bajaj
,
A.
(
2023
).
Performance Metrics in Machine Learning [Complete Guide]
.
neptune.ai.
https://neptune.ai/blog/performance-metrics-in-machine-learning-complete-guide
17.
Jobeda Jamal
Khanam
A comparison of machine learning algorithms for diabetes prediction
,
Elsevier
Open Access (
2021
)”;
18.
D. Jashwanth
Reddy
,
B.
Mounika
,
S.
Sindhu
, “
Predictive machine learning model for early detection and analysis of diabetes
,
Elsevier
(
2020
)”;
19.
Talha Mahboob
Alam
,
Muhammad Atif
Iqbal
,
Yasir
Ali
, “
A modelfor early prediction of diabetes
,
ELSEVIER
(
2019
)”;
20.
Hang
Lai
,
Huaxiong
Huang
,
Karim
Keshavjee
, “
Predictive models for diabetes mellitus using machine learning techniques, Springer Nature(2019
)”
21.
Amani
Yahyaoui
,
Akhtar
Jamil
, “
A Decision Support System for Diabetes Prediction Using Machine Learning and Deep Learning Techniques
,
IEEE
(
2019
)”; https://ieeexplore.ieee.org/document/8965556
22.
Muhammad Daniyal Baig
, “
Diabetes prediction using machine learning algorithms
,
ResearchGate
(
2020
)”;
23.
Atishay
Jain
,
Yashovardhan
Malhotra
,
M.
Karthikeyan
, “
Early Stage Detection of Diabetes Using Exploratory Machine Learning Algorithms
,
Annals of R.S.C.B
(
2021
)”; https://annalsofrscb.ro/index.php/journal/article/view/6489
24.
Vandana C.
Bavkar
,
Arundhati A.
Shinde
, “
Machine learning algorithms for Diabetes prediction and neural network method for blood glucose measurement
,
IJST
(
2021
)”; https://indjst.org/articles/machine-learning-algorithms-for-diabetes-prediction-and-neural-network-method-for-blood-glucose-measurement
25.
Himanshu
Gupta
,
Hirdesh
Varshney
,
Nikhil
Pachauri
,
Om Prakash
Verma
, “
Comparative performance analysis of quantum machine learning with deep learning for diabetes prediction
,
Springer
(
2021
)”;
26.
L. J.
Muhammad
,
Ebrahem A.
Algehyne
,
Sani Sharif
Usman
, “
Predictive Supervised Machine Learning Models for Diabetes Mellitus
,
Springer Nature
(
2020
)”; https://link.springer.com/article/10.1007/s42979-020-00250-8
27.
B.B. An
Dinh
,
Stacey
Miertschin
,
Amber
Young
and
Somya D.
Mohanty
, “
A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
,
Springer Nature
(
2019
)”;
28.
B.B. Huaping
Zhou
,
Raushan
Myrzashova
and
Rui
Zheng
, “
Diabetes prediction model based on an enhanced deep neural network
,
Springer Open Access
(
2020
)”;
29.
Huma
Naz
&
Sachin
Ahuja
, “
Deep learning approach for diabetes prediction using PIMA Indian dataset
,
Springer Nature
(
2020
)”;
30.
Minyechil
Alehegn
,
Rahul
Joshi
&
Dr. Preeti
Mulay
, “
Analysis and Prediction of Diabetes Mellitus using Machine Learning Algorithm
,
IJPAM
(
2018
)”; https://www.acadpubl.eu/jsi/2018-118-7-9/articles/9/87.pdf
31.
Dasari
Bhulakshmi
,
Glory
Gandhi
, “
The Prediction of Diabetes in Pima Indian women Mellitus Based on XGBOOST Ensemble Modeling using data science 2020
https://easychair.org/publications/preprint/gz9B
32.
Ahamed
,
B. S.
(
2021
, March 5).
LGBM Classifier based Technique for Predicting Type-2 Diabetes.
https://ejmcm.com/article_9403.html
33.
Roopesh Padmaraju
Alluri
, “
Diabetes Prediction Using Ensemble Techniques
https://www.ripublication.com/ijaer21/ijaerv16n5_12.pdf,
International Journal of Applied Engineering Research
ISSN 0973-4562 Volume
16
, Number
5
(
2021
) pp.
410
415
This content is only available via PDF.
You do not currently have access to this content.