The question classification phase is considered as one of the most significant phases in a Question Answering System to help the system find or construct an accurate answer which results in an improvement of the quality of question answering systems. In this work, we proposed a question classification into a 2-layer taxonomy called Coarse-Fine taxonomy. This is the first work for Indonesian question classification into Coarse-Fine taxonomy. We employed a feature selection and machine learning classification using Support Vector Machine algorithm. In the feature selection, we found that Unigram+TFIDF+Word Shape is the best combination that reached the highest accuracy with 92.9% in Coarse category. On the other hand, the combination of Unigram+TFIDF+WH word features is the best combination for Fine category with 79.3% accuracy.
Skip Nav Destination
Article navigation
14 February 2023
PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SCIENCE AND TECHNOLOGY
7–8 September 2021
Yogyakarta, Indonesia
Research Article|
February 14 2023
Indonesian question classification using feature extraction and selection approach on coarse and fine taxonomy
Irfandy Thalib;
Irfandy Thalib
a)
1
Department of Electrical Engineering and Information Technology, Universitas Gadjah Mada
, Yogyakarta, Indonesia
a)Corresponding author: irfandythalib@mail.ugm.ac.id
Search for other works by this author on:
Widyawan;
Widyawan
b)
1
Department of Electrical Engineering and Information Technology, Universitas Gadjah Mada
, Yogyakarta, Indonesia
Search for other works by this author on:
Indah Soesanti
Indah Soesanti
c)
1
Department of Electrical Engineering and Information Technology, Universitas Gadjah Mada
, Yogyakarta, Indonesia
Search for other works by this author on:
AIP Conf. Proc. 2654, 020007 (2023)
Citation
Irfandy Thalib, Widyawan, Indah Soesanti; Indonesian question classification using feature extraction and selection approach on coarse and fine taxonomy. AIP Conf. Proc. 14 February 2023; 2654 (1): 020007. https://doi.org/10.1063/5.0114188
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Sign in via your Institution
Sign in via your InstitutionPay-Per-View Access
$40.00
28
Views
Citing articles via
Related Content
Improved VSM for Incremental Text Classification
AIP Conference Proceedings (November 2008)
Clustering on ‘Z’ e-learning application’s reviews using self organizing maps and K-means methods
AIP Conference Proceedings (October 2022)
Document‐level optimization in speech recognition
J Acoust Soc Am (November 2006)
The effects of pre-processing strategies in sentiment analysis of online movie reviews
AIP Conference Proceedings (October 2017)
Modified relevance frequency feature weighting for email spam classification
AIP Conference Proceedings (July 2016)