AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH

Main Author: Prihantoro, Prihantoro
Format: Article info application/pdf eJournal
Bahasa: eng
Terbitan: Faculty of Cultural Sciences , 2016
Subjects:
Online Access: https://journal.ugm.ac.id/jurnal-humaniora/article/view/8709
https://journal.ugm.ac.id/jurnal-humaniora/article/view/8709/6633
Daftar Isi:
  • The research problems in this research are 1) how lexicogrammar takes role in determining polarity of F-Word1 and 2) how to formalize it for corpus processing. The data is obtained from the Contemporary American English Corpus (COCA). In this corpus, F-word is proven to be highest in frequency as compared to its distribution across corpora. Corpus methodology is applied by sending queries to retrieve F-Words to COCA interface. Tokens combination surrounding F-words resulted in the phrase and clause unit accompanying F-words, which are significant cues to determine F-word polarity. The polarity is later proven to be not necessarily negative. I also designed a computational resource to allow the retrieval of F-words offline so that users might apply it to any digital text collections.