Categorización automática de documentos en español: algunos resultados experimentales

Main Authors: G.-Figuerola, Carlos, Zazo, Ángel F., Alonso-Berrocal, José-Luis
Other Authors: Brisaboa, Nieves R.
Format: Proceeding NonPeerReviewed application/pdf
Language: es
Published: 2000
Online Access: http://eprints.rclis.org/14009/1/figuerola2000retrieval.pdf
http://eprints.rclis.org/14009/
Contents:
  • Automatic categorization can be viewed as a learning process in which a program captures the characteristics that distinguish each category or class from the others, i.e. those that a document must have in order to belong to that category. Few experiments of this kind have yet been carried out with documents in Spanish. This paper shows how pattern vectors that collect the characteristics of different classes or categories of documents can be built using techniques based on those applied in query expansion by relevance feedback. It also describes an experiment in which these techniques are applied to categorize a collection of press releases in Spanish. The results are, overall, comparable to or even better than those obtained in similar experiments; for some categories, the results improve.
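
The idea of a per-category pattern vector can be illustrated with a minimal sketch. It assumes a Rocchio-style formulation (a common relevance-feedback technique), which the record above does not name explicitly. The function names, the `beta` and `gamma` weights, and the four toy training sentences are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def tf_vector(text):
    """Length-normalised term-frequency vector for one document."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {t: c / norm for t, c in counts.items()} if norm else {}


def build_pattern_vectors(labelled_docs, beta=1.0, gamma=0.25):
    """Assumed Rocchio-style pattern vector per category:
    beta * mean(in-category docs) - gamma * mean(other docs),
    keeping only terms whose final weight is positive."""
    vectors = [(label, tf_vector(text)) for text, label in labelled_docs]
    patterns = {}
    for cat in {label for label, _ in vectors}:
        pos = [v for label, v in vectors if label == cat]
        neg = [v for label, v in vectors if label != cat]
        weights = defaultdict(float)
        for v in pos:
            for term, w in v.items():
                weights[term] += beta * w / len(pos)
        for v in neg:
            for term, w in v.items():
                weights[term] -= gamma * w / len(neg)
        patterns[cat] = {t: w for t, w in weights.items() if w > 0}
    return patterns


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def categorize(text, patterns):
    """Assign the category whose pattern vector is most similar to the text."""
    doc = tf_vector(text)
    return max(patterns, key=lambda cat: cosine(doc, patterns[cat]))


# Hypothetical toy training set (not data from the paper).
TRAIN = [
    ("el equipo ganó el partido de fútbol", "deportes"),
    ("el delantero marcó un gol en el partido", "deportes"),
    ("el banco subió los tipos de interés", "economía"),
    ("la bolsa cerró con pérdidas en el mercado", "economía"),
]

patterns = build_pattern_vectors(TRAIN)
print(categorize("gol en el último minuto del partido", patterns))
```

A real system would add stop-word removal, stemming, and a weighting scheme such as tf-idf; the sketch omits these to keep the pattern-vector step visible.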