Advances in technology have made document categorization increasingly important, driven by the growing number of documents themselves. Managing documents by categorizing them is an application of Information Retrieval, because the process involves text mining. Categorization can be performed with both the Fuzzy C-Means (FCM) and K-Nearest Neighbors (KNN) methods. This experiment combines the two methods, with the aim of improving document categorization performance. First, FCM is used to cluster the training documents. Second, KNN is used to categorize the testing documents and produce the categorization output. In the experiment, 14 of the 20 testing documents were retrieved relevantly to their category, while the remaining 6 were retrieved irrelevantly. The system evaluation shows that both precision and recall are 0.7.
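The two-stage idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy 2-D vectors stand in for document feature vectors (for example term weights), and the function names `fuzzy_c_means` and `knn_label`, the fuzzifier `m=2.0`, the iteration count, and the choice to label each training document by its highest-membership cluster are all assumptions made for illustration.

```python
import math
import random
from collections import Counter


def fuzzy_c_means(points, c, m=2.0, iters=100, seed=0):
    """Cluster equal-length vectors; return (centers, membership matrix)."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial memberships, each row normalized to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    for _ in range(iters):
        # Centers are means of the points weighted by membership ** m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            w_sum = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / w_sum
                 for d in range(dim)]
            )
        # Memberships are proportional to distance ** (-2 / (m - 1)).
        for i in range(n):
            dist = [max(math.dist(points[i], centers[j]), 1e-12)
                    for j in range(c)]
            inv = [d ** (-2.0 / (m - 1.0)) for d in dist]
            inv_sum = sum(inv)
            u[i] = [v / inv_sum for v in inv]
    return centers, u


def knn_label(query, points, labels, k=3):
    """Return the majority label among the k training points nearest to query."""
    nearest = sorted(range(len(points)),
                     key=lambda i: math.dist(query, points[i]))[:k]
    votes = Counter(labels[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Toy "training documents": two well-separated groups of 2-D vectors.
train = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
         [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]]

# Stage 1 (assumed reading of the abstract): FCM clusters the training set,
# and each training vector takes the cluster with its highest membership.
centers, u = fuzzy_c_means(train, c=2)
train_labels = [max(range(2), key=lambda j: row[j]) for row in u]

# Stage 2: KNN assigns a testing vector to a category using those labels.
predicted = knn_label([0.5, 0.5], train, train_labels, k=3)
```

Here `predicted` should match the cluster of the first three training vectors. As a point of arithmetic, 14 correct out of 20 testing documents is 14 / 20 = 0.7, which is consistent with the reported precision and recall, although the abstract does not state exactly how those measures were computed.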
