Calibration is often overlooked in machine-learning problem-solving approaches, even in situations where an accurate estimation of predicted probabilities, and not only a discrimination between classes, is critical for decision-making. One of the reasons is the lack of readily available open-source software packages which can easily calculate calibration metrics. In order to provide one such tool, we have developed a custom modification of the Weka data mining software, which implements the calculation of Hosmer-Lemeshow groups of risk and the Pearson chi-square statistic comparison between estimated and observed frequencies for binary problems. We provide calibration performance estimations with Logistic regression (LR), BayesNet, Naïve Bayes, artificial neural network (ANN), support vector machine (SVM), k-nearest neighbors (KNN), decision trees and Repeated Incremental Pruning to Produce Error Reduction (RIPPER) models with six different datasets. Our experiments show that SVMs with RBF kernels exhibit the best results in terms of calibration, while decision trees, RIPPER and KNN are highly unlikely to produce well-calibrated models.
Skip Nav Destination
Article navigation
9 February 2015
INTERNATIONAL CONFERENCE ON INTEGRATED INFORMATION (IC-ININFO 2014): Proceedings of the 4th International Conference on Integrated Information
5–8 September 2014
Madrid, Spain
Research Article|
February 09 2015
Calculating classifier calibration performance with a custom modification of Weka
Alexander Zlotnik;
Alexander Zlotnik
Department of Electronic Engineering, Technical University of Madrid, ETSI Telecomunicación, Ciudad Universitaria, 28040 Madrid, Spain and Ramón y Cajal University Hospital, C/ de Colmenar Viejo, km 9,100 28031 Madrid,
Spain
Search for other works by this author on:
Ascensión Gallardo-Antolín;
Ascensión Gallardo-Antolín
Department of Signal Theory and Communications, Carlos III University, C/ Madrid, 126, 28903 Getafe,
Spain
Search for other works by this author on:
Juan Manuel Montero Martínez
Juan Manuel Montero Martínez
Department of Electronic Engineering, Technical University of Madrid, ETSI Telecomunicación, Ciudad Universitaria, 28040 Madrid,
Spain
Search for other works by this author on:
AIP Conf. Proc. 1644, 128–132 (2015)
Citation
Alexander Zlotnik, Ascensión Gallardo-Antolín, Juan Manuel Montero Martínez; Calculating classifier calibration performance with a custom modification of Weka. AIP Conf. Proc. 9 February 2015; 1644 (1): 128–132. https://doi.org/10.1063/1.4907827
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
9
Views
Citing articles via
Design of a 100 MW solar power plant on wetland in Bangladesh
Apu Kowsar, Sumon Chandra Debnath, et al.
Social mediated crisis communication model: A solution for social media crisis?
S. N. A. Hamid, N. Ahmad, et al.
The effect of a balanced diet on improving the quality of life in malignant neoplasms
Yu. N. Melikova, A. S. Kuryndina, et al.
Related Content
Diabetes and heart disease prediction using machine learning classifiers based on Weka, python
AIP Conf. Proc. (December 2023)
Identification and evaluation of machine learning classification algorithm to predict the efficacy of gRNA in CRISPR/Cas9 genome editing system using WEKA
AIP Conf. Proc. (December 2023)
Evaluates the performance of the ensemble image filters with classifiers on image data set using WEKA
AIP Conf. Proc. (July 2023)
Improve the classifiers efficiency by handling missing values in diabetes dataset using WEKA filters
AIP Conference Proceedings (July 2021)
An exploration of teaching-learning using data mining classification algorithms in higher education
AIP Conf. Proc. (August 2023)