Sistem Stemming Otomatis untuk Kata dalam Bahasa Indonesia

Main Authors: Mandala, Rila, Koryanti, Erry, Munir, Rinaldi, Harlili, Harlili
Format: Article info application/pdf eJournal
Bahasa: eng
Terbitan: Jurusan Teknik Informatika, Fakultas Teknologi Industri, Universitas Islam Indonesia , 2009
Online Access: http://journal.uii.ac.id/index.php/Snati/article/view/1827
http://journal.uii.ac.id/index.php/Snati/article/view/1827/1607
Daftar Isi:
  • Stemming is a process to restore words to its base form, by stripping each word fromits derivational and affixes. A stemming process has an important role for machinetranslationand other computational lingustics area. In Malaysian there is a stemmingalgorithm that has been developed and tested for application in information retrieval which isknown as Othman algorithm. There are several differences of Bahasa Indonesia’smorphology and Malay’s morphology, so The Othman algorithm can not be applied directlyin bahasa Indonesia. Furthermore, the accuracy of Othman algorithm also is not good. Thispaper proposes some modifications from Othman algorithm. The modifications includes,various stemming procedures, rule of affixes, and dictionary of root words. Experiments showthat Our modification method has a better accuracy in stemming Bahasa Indonesia’s words.Keywords: stemming, word-lemmatization, affix-stripping