In recent years, information has increasingly been submitted in the form of electronic documents. Checking the similarity of their contents remains necessary to improve the quality of information and to counter plagiarism. At the Faculty of Engineering and Information Systems, Yapis Papua University, the existing process for checking document content similarity, and the results it produces, have not been optimal. This research aimed to make that process more effective and efficient by building a document similarity detection system that applies the Rabin-Karp algorithm together with stemming to determine the percentage similarity of electronic document contents. The system was developed using the waterfall development model, the fishbone analysis model, and UML. The preprocessing stages applied before the Rabin-Karp algorithm are case folding, tokenizing, filtering, and stemming. The result is a web-based system that checked the test documents, namely ten Information System Student Journals at Yapis Papua University, in a total time of 33.896 seconds and produced optimal percentages for journals whose text content is shorter than 14,500 mixed characters, not counting spaces.
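The pipeline named in the abstract (case folding, tokenizing, filtering, stemming, then Rabin-Karp matching) can be illustrated with a minimal sketch. This is not the authors' implementation: the stop-word list, the toy suffix-stripping stemmer, the k-gram length of 5, the hash base and modulus, and the use of the Dice coefficient to turn matching k-gram hashes into a percentage are all assumptions made for illustration.

```python
import re

# Hypothetical stop-word list and suffix rules, for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "to", "in"}
SUFFIXES = ("ing", "ed", "s")


def preprocess(text):
    """Case folding, tokenizing, filtering, and (toy) stemming."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())      # case fold + tokenize
    tokens = [t for t in tokens if t not in STOP_WORDS]  # filtering
    stemmed = []
    for t in tokens:
        for suf in SUFFIXES:
            # Strip at most one suffix, keeping a stem of at least 3 characters.
            if t.endswith(suf) and len(t) - len(suf) >= 3:
                t = t[: -len(suf)]
                break
        stemmed.append(t)
    return "".join(stemmed)  # characters without spaces


def kgram_hashes(s, k=5, base=256, mod=1_000_003):
    """Return the set of Rabin-Karp rolling hashes of every k-gram in s."""
    if len(s) < k:
        return set()
    high = pow(base, k - 1, mod)  # weight of the leading character
    h = 0
    for ch in s[:k]:
        h = (h * base + ord(ch)) % mod
    hashes = {h}
    for i in range(k, len(s)):
        h = (h - ord(s[i - k]) * high) % mod  # drop the leading character
        h = (h * base + ord(s[i])) % mod      # append the next character
        hashes.add(h)
    return hashes


def similarity_percent(doc_a, doc_b, k=5):
    """Dice coefficient over k-gram hash sets, as a percentage (assumed metric)."""
    a = kgram_hashes(preprocess(doc_a), k)
    b = kgram_hashes(preprocess(doc_b), k)
    if not a and not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b)) * 100
```

Because the sketch compares hash values only, a rare hash collision could count two different k-grams as a match; a stricter variant would confirm each hash hit by comparing the underlying substrings. For Indonesian text, the toy stemmer would need to be replaced by a language-appropriate one.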
