Type 2 diabetes is a global disease issue and is one of leading causes of death. Current discovery indicates that this disease could be categorized into many sub-clusters, which is a step for precision medicine. In this paper, we aim to analyze and compare two approaches of data reduction, i.e. with and without principal component analysis (PCA) on the standardized and normalized data. Data preparation was first performed. The model was then developed and validated by plotting Elbow method and silhouette width graph. Normalized data with principal component (PC) of 2 gives the best clustering visualization, the lowest within cluster sum of squared (WCSS) score (195.41) and highest Silhouette score (0.3491) compared to using both standardized data and standardized data (PC = 2) with 23518.82 (WCSS score) and 0.1976 (Silhouette score). We concluded that by integrating PCA with k-means clustering, the score value of WCSS shown to be lower while higher value recorded for Silhouette score.
Skip Nav Destination
,
,
,
,
,
Article navigation
7 April 2025
SUSTAINABLE AND INTEGRATED ENGINEERING INTERNATIONAL CONFERENCE: SIE2022
12–13 December 2022
Langkawi, Malaysia
Research Article|
April 07 2025
Integration of principal component analysis and K-means clustering for type 2 diabetes sub-clustering model Available to Purchase
Nashuha Omar;
Nashuha Omar
1
Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
Search for other works by this author on:
Asnida Abdul Wahab;
Asnida Abdul Wahab
a)
1
Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
2
Medical Device and Technology Center (MEDITEC), Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
a)Corresponding author: [email protected]
Search for other works by this author on:
Eko Supriyanto;
Eko Supriyanto
1
Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
Search for other works by this author on:
Rania Hussein Al-Ashwal;
Rania Hussein Al-Ashwal
1
Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
Search for other works by this author on:
Muhammad Hanif Ramlee;
Muhammad Hanif Ramlee
1
Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
Search for other works by this author on:
Gan Hong Seng
Gan Hong Seng
3
Department of Data Science, Universiti Malaysia Kelantan
, 16100 UMK City Campus Pengkalan Chepa, Kelantan Malaysia
Search for other works by this author on:
Nashuha Omar
1
Asnida Abdul Wahab
1,2,a)
Eko Supriyanto
1
Rania Hussein Al-Ashwal
1
Muhammad Hanif Ramlee
1
Gan Hong Seng
3
1
Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
2
Medical Device and Technology Center (MEDITEC), Universiti Teknologi Malaysia (UTM)
, Skudai, 81310 Malaysia
3
Department of Data Science, Universiti Malaysia Kelantan
, 16100 UMK City Campus Pengkalan Chepa, Kelantan Malaysia
a)Corresponding author: [email protected]
AIP Conf. Proc. 3056, 070001 (2025)
Citation
Nashuha Omar, Asnida Abdul Wahab, Eko Supriyanto, Rania Hussein Al-Ashwal, Muhammad Hanif Ramlee, Gan Hong Seng; Integration of principal component analysis and K-means clustering for type 2 diabetes sub-clustering model. AIP Conf. Proc. 7 April 2025; 3056 (1): 070001. https://doi.org/10.1063/5.0209965
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
9
Views
Citing articles via
The implementation of reflective assessment using Gibbs’ reflective cycle in assessing students’ writing skill
Lala Nurlatifah, Pupung Purnawarman, et al.
Effect of coupling agent type on the self-cleaning and anti-reflective behaviour of advance nanocoating for PV panels application
Taha Tareq Mohammed, Hadia Kadhim Judran, et al.
Design of a 100 MW solar power plant on wetland in Bangladesh
Apu Kowsar, Sumon Chandra Debnath, et al.
Related Content
K-Means cluster optimization for potentiality student grouping using elbow method
AIP Conf. Proc. (November 2022)
LRFMP model and K-means clustering for customer profiling in online retail market
AIP Conf. Proc. (June 2024)
Clustering of disaster prones areas in Java Island
AIP Conf. Proc. (November 2024)
Unsupervised machine learning to analyze corneal tissue surfaces
APL Mach. Learn. (November 2023)
Seeking the truth beyond the data. An unsupervised machine learning approach
AIP Conf. Proc. (August 2023)