The Ministry of Villages PDTT in collaboration with the National Development Planning Agency and the Central Statistics Agency issued Village Potential data in 2015 (Podes 2015) consisting of 74093 villages and having 42 indicators/attributes with podes status used as labels, but in the historical data there is a problem with the dataset owned by Podes 2015 identified as having data with class data to unbalanced an. According to data problems such as class imbalances can affect the performance of the algorithm to be poor, this is because if the minority data class is smaller or lower than the majority data class, the prediction results will be more inclined to the majority data class. The gradient boosted decision tree method has the advantage of good performance in handling classifications that combine simple parameter functions with ’bad’ results (high prediction errors) to create highly accurate predictions. However, the gradient boosted decision tree algorithm has a disadvantage that it cannot be applied to the problem of regression from a small distribution of data, therefore it takes large data to use algorithms, where complex interactions will be modelled simply. To solve the problem can be done with a method that can balance the class and improve accuracy. Adaboost is one of the boosting methods that is able to balance classes by giving weight to the level of classification errors that can change the distribution of data while SMOTE is a well-known method applied in order to deal with class imbalances. This technique synthesizes a new sample of minority classes to balance the dataset by creating a new instance of the minority class with the formation of a consolidation convex of adjacent instances. Experiments were carried out by applying the adaboost method to the gradient boosted decision tree (GBDT) to get optimal results and a good level of accuracy. The experimental results obtained from the proposed method are the SMOTE technique on the gradient boosted decision tree with ada boost for accuracy of 8 8.91%, classification error of 11.09% compared to the naïve bayes algorithm where only get accuracy 40.16%, classification error 59.84. On the measurement of Classification Error. Finally, in kappa measurements, it can be concluded in determining the status of villages using the smote technique method for gradient boosted decision trees and adaboost proven to be able to solve class imbalance problems and increase high accuracy and can reduce the classification error rate.
Skip Nav Destination
Article navigation
23 July 2024
PANCASAKTI INTERNATIONAL CONFERENCES ENGINEERING AND COMPUTER SCIENCE 2022
20–21 July 2022
Tegal, Indonesia
Research Article|
July 23 2024
SMOTE technique in comparison of gradient boosted trees with Naive Bayes for adaboost implementation in imbalance problem solving class on village status prediction Available to Purchase
Hasbi Firmansyah;
Hasbi Firmansyah
a)
Faculty of Engineering and Computer Science, Universitas Pancasakti Tegal
, Tegal, Indonesia
a)Corresponding author: [email protected]
Search for other works by this author on:
Eko Budiraharjo;
Eko Budiraharjo
Faculty of Engineering and Computer Science, Universitas Pancasakti Tegal
, Tegal, Indonesia
Search for other works by this author on:
Ali Sofyan
Ali Sofyan
Faculty of Engineering and Computer Science, Universitas Pancasakti Tegal
, Tegal, Indonesia
Search for other works by this author on:
Hasbi Firmansyah
a)
Faculty of Engineering and Computer Science, Universitas Pancasakti Tegal
, Tegal, Indonesia
Eko Budiraharjo
Faculty of Engineering and Computer Science, Universitas Pancasakti Tegal
, Tegal, Indonesia
Ali Sofyan
Faculty of Engineering and Computer Science, Universitas Pancasakti Tegal
, Tegal, Indonesia
a)Corresponding author: [email protected]
AIP Conf. Proc. 2952, 060020 (2024)
Citation
Hasbi Firmansyah, Eko Budiraharjo, Ali Sofyan; SMOTE technique in comparison of gradient boosted trees with Naive Bayes for adaboost implementation in imbalance problem solving class on village status prediction. AIP Conf. Proc. 23 July 2024; 2952 (1): 060020. https://doi.org/10.1063/5.0211898
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
26
Views
Citing articles via
The implementation of reflective assessment using Gibbs’ reflective cycle in assessing students’ writing skill
Lala Nurlatifah, Pupung Purnawarman, et al.
Effect of coupling agent type on the self-cleaning and anti-reflective behaviour of advance nanocoating for PV panels application
Taha Tareq Mohammed, Hadia Kadhim Judran, et al.
Design of a 100 MW solar power plant on wetland in Bangladesh
Apu Kowsar, Sumon Chandra Debnath, et al.
Related Content
Enhancing MBTI personality prediction: Integrating hybrid machine learning models with SMOTE for balanced data in predictive analytics
AIP Conf. Proc. (January 2025)
Evaluation of ensemble method for multiclass classification on unbalanced data
AIP Conf. Proc. (December 2022)
Improving the performance of cardiac abnormality detection from PCG signal
AIP Conf. Proc. (March 2016)
An empirical studies on online gender-based violence: Classification analysis utilizing XGBOOST
AIP Conf. Proc. (January 2025)
Improving stunting prediction in children: Evaluating ensemble algorithms with SMOTE and feature selection
AIP Conf. Proc. (January 2025)