Winsorized tree is a modified tree-based classifier that is able to investigate and to handle all outliers in all nodes along the process of constructing the tree. It overcomes the tedious process of constructing a classical tree where the splitting of branches and pruning go concurrently so that the constructed tree would not grow bushy. This mechanism is controlled by the proposed algorithm. In winsorized tree, data are screened for identifying outlier. If outlier is detected, the value is neutralized using winsorize approach. Both outlier identification and value neutralization are executed recursively in every node until predetermined stopping criterion is met. The aim of this paper is to search for significant stopping criterion to stop the tree from further splitting before overfitting. The result obtained from the conducted experiment on pima indian dataset proved that the node could produce the final successor nodes (leaves) when it has achieved the range of 70% in information gain.
Skip Nav Destination
Article navigation
22 November 2017
PROCEEDINGS OF THE 13TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA2017)
4–7 December 2017
Kedah, Malaysia
Research Article|
November 22 2017
The stopping rules for winsorized tree
Chee Keong Ch’ng;
Chee Keong Ch’ng
a)
1
School of Quantitative Sciences, College of Arts and Sciences, Universiti Utara Malaysia
, 06010 UUM Sintok, Kedah, Malaysia
Search for other works by this author on:
Nor Idayu Mahat
Nor Idayu Mahat
b)
2
School of Quantitative Sciences, College of Arts and Sciences, Universiti Utara Malaysia
, 06010 UUM Sintok, Kedah, Malaysia
Search for other works by this author on:
a)
Corresponding author: [email protected]
AIP Conf. Proc. 1905, 050014 (2017)
Citation
Chee Keong Ch’ng, Nor Idayu Mahat; The stopping rules for winsorized tree. AIP Conf. Proc. 22 November 2017; 1905 (1): 050014. https://doi.org/10.1063/1.5012233
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
41
Views
Citing articles via
Inkjet- and flextrail-printing of silicon polymer-based inks for local passivating contacts
Zohreh Kiaee, Andreas Lösel, et al.
Effect of coupling agent type on the self-cleaning and anti-reflective behaviour of advance nanocoating for PV panels application
Taha Tareq Mohammed, Hadia Kadhim Judran, et al.
Students’ mathematical conceptual understanding: What happens to proficient students?
Dian Putri Novita Ningrum, Budi Usodo, et al.
Related Content
Treatment on outliers in UBJ-SARIMA models for forecasting dengue cases on age groups not eligible for vaccination in Baguio City, Philippines
AIP Conference Proceedings (November 2017)
A framework of mixed variables classification in the presence of outliers: A robust location model
AIP Conference Proceedings (November 2017)
Robust linear discriminant analysis with distance based estimators
AIP Conference Proceedings (November 2017)
Classification predictive models of running- and cycling-induced fatigue
AIP Conf. Proc. (August 2019)
A hybrid ARIMA and neural network model applied to forecast catch volumes of Selar crumenophthalmus
AIP Conference Proceedings (November 2017)