Due to the COVID-19 pandemic, education institutions have to rely on e-learning tools, including in programming courses. Automatic graders can be used to speed up the process of evaluating the correctness. Unfortunately, answers for coding exercises can be easily plagiarized. Manual grading of all student submissions may notbe feasible. Therefore, a system that can help detecting similar codes is needed. The detection can be done by grouping similar source codes based on their structure. This method is used in previous research by using automatic K-means iterations algorithm. That algorithm, although produced decent clusters, had a long execution time. The purpose of this research is to improve the time efficiency and clusters result quality by using bisecting K-means algorithm. The results showed a significant improvement in execution time from 11.68 seconds to 6.64 seconds. Bisecting K-means also produced fewer clusters with slightly better Rand Index than K-means iterations. We also conduct experiments using 2-grams to 6-grams and confirm that 4-grams result in the best performance.
Skip Nav Destination
Article navigation
15 December 2023
2ND INTERNATIONAL CONFERENCE ON APPLIED SCIENCES, TECHNOLOGY, ENGINEERING AND MATHEMATICS (ICASTEM 2021)
2–3 November 2021
Tangerang, Indonesia
Research Article|
December 15 2023
C source code clustering using bisecting K-means Available to Purchase
Ahmad Ridha;
Ahmad Ridha
a)
Department of Computer Science, IPB University
, Bogor, Indonesia
a)Corresponding author: [email protected]
Search for other works by this author on:
Arizal Notyasa
Arizal Notyasa
b)
Department of Computer Science, IPB University
, Bogor, Indonesia
Search for other works by this author on:
Ahmad Ridha
a)
Department of Computer Science, IPB University
, Bogor, Indonesia
Arizal Notyasa
b)
Department of Computer Science, IPB University
, Bogor, Indonesia
a)Corresponding author: [email protected]
AIP Conf. Proc. 2877, 020003 (2023)
Citation
Ahmad Ridha, Arizal Notyasa; C source code clustering using bisecting K-means. AIP Conf. Proc. 15 December 2023; 2877 (1): 020003. https://doi.org/10.1063/5.0177501
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
22
Views
Citing articles via
Effect of coupling agent type on the self-cleaning and anti-reflective behaviour of advance nanocoating for PV panels application
Taha Tareq Mohammed, Hadia Kadhim Judran, et al.
Design of a 100 MW solar power plant on wetland in Bangladesh
Apu Kowsar, Sumon Chandra Debnath, et al.
With synthetic data towards part recognition generalized beyond the training instances
Paul Koch, Marian Schlüter, et al.
Related Content
Learning experience in inquiry-based physics e-book integrated with traditional games: Feasibility and student response
AIP Conf. Proc. (April 2024)
Computational based GeoGebra applied in numerical methods
AIP Conf. Proc. (January 2024)
Sentiment analysis regarding the impact of Covid-19 on education in Indonesia with the naïve bayes classifier
AIP Conf. Proc. (March 2024)
Self regulated learning skills of pre-service biology teachers in animal physiology: A preliminary analysis
AIP Conf. Proc. (December 2022)
The validity of noticing questionnaire based on the realistic mathematics education principles
AIP Conf. Proc. (April 2021)