"Kata terikat" represent a commonly utilized word class in Indonesian journalistic articles, yet their usage is often erroneous. "Kata terikat" are categorized into three types based on their division: words that are connected, separated by a space, and divided by a hyphen (-). This presents an opportunity for the automation of detection and error checking of "kata terikat" usage. The Rabin-Karp algorithm is employed for the detection of "kata terikat" due to their varied patterns, and the Random Forest algorithm is applied for the classification and correction of incorrectly used "kata terikat". The dataset used for this research is accommodated by Tribun News in form of nearly 1000 samples of journalistic article. The research conducted reveals that the "kata terikat" correction model achieved an accuracy of 86.24%. Three rounds of testing were carried out using 10, 20, and 40 journalistic articles from the Tribun News dataset, yielding accuracies of 85.71%, 91.67%, and 86.67%, respectively.
8 October 2024
ETLTC2024 INTERNATIONAL CONFERENCE SERIES ON ICT, ENTERTAINMENT TECHNOLOGIES, AND INTELLIGENT INFORMATION MANAGEMENT IN EDUCATION AND INDUSTRY
23–26 January 2024
Aizuwakamatsu, Japan
Research Article
Development of “kata terikat” detection and writing errors correction using Rabin-Karp and random forest algorithm
Vallencius Gavriel Alfredo Siswanto a)
Faculty of Engineering and Informatics, Universitas Multimedia Nusantara, Tangerang, Indonesia
Marlinda Vasty Overbeek b)
Faculty of Engineering and Informatics, Universitas Multimedia Nusantara, Tangerang, Indonesia
b) Corresponding author: [email protected]
AIP Conf. Proc. 3220, 040004 (2024)
Citation
Vallencius Gavriel Alfredo Siswanto, Marlinda Vasty Overbeek; Development of “kata terikat” detection and writing errors correction using Rabin-Karp and random forest algorithm. AIP Conf. Proc. 8 October 2024; 3220 (1): 040004. https://doi.org/10.1063/5.0235496