Mejoras en la recuperación web combinando campos [Improvements in web retrieval by combining fields]
Main Authors: | G.-Figuerola, Carlos, Alonso-Berrocal, José-Luis, Zazo, Ángel F. |
---|---|
Other Authors: | Ruiz, R., Álvarez, J. L., Arjona, J. L., Corchuelo, R. |
Format: | Journal PeerReviewed application/pdf |
Language: | es |
Published: | 2009 |
Online Access: | http://eprints.rclis.org/3909/1/zoco-09-figuerola-recuperacion.pdf http://eprints.rclis.org/3909/ |
Table of Contents:
- This article describes some of the activities of the REINA research group in Web information retrieval. These activities have focused on testing the retrieval capacity that can be expected from the diverse informative elements present in web pages, besides the text that the user normally sees in the browser. Our aim was to try to improve performance by mixing or combining these elements. Terms from diverse elements can be combined in one single index by operating on the frequency of the terms in the vector space model, when a TFxIDF scheme is used. The BODY field is obviously the most powerful, but the text of the ANCHORs of the backlinks that the pages receive adds a considerable improvement in retrieval performance. The content of the META tags, by contrast, contributes little to the improvement in retrieval performance.
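
One way to picture the single-index combination described in the abstract is the minimal Python sketch below. It is an illustration under stated assumptions, not the authors' implementation: the field weights, the whitespace tokenizer, the logarithmic IDF, and the names `FIELD_WEIGHTS`, `combined_tf`, `build_index`, and `search` are all hypothetical, since the record does not give the actual values or procedure. Each field's term counts are scaled by a per-field factor and summed into one term frequency before TFxIDF weighting is applied.

```python
import math
from collections import Counter

# Hypothetical per-field multipliers (assumption: the record gives no values).
FIELD_WEIGHTS = {"body": 1.0, "anchor": 2.0, "meta": 0.5}


def combined_tf(doc):
    """Merge the term counts of every field of one page into a single weighted TF."""
    tf = Counter()
    for field, text in doc.items():
        weight = FIELD_WEIGHTS.get(field, 0.0)
        if weight == 0.0:
            continue  # unknown or disabled fields contribute nothing
        for term in text.lower().split():
            tf[term] += weight
    return tf


def build_index(docs):
    """Return one TFxIDF vector (dict term -> weight) per page."""
    tfs = [combined_tf(doc) for doc in docs]
    # Document frequency: number of pages in which a term occurs in any field.
    df = Counter(term for tf in tfs for term in tf)
    n = len(docs)
    return [{t: f * math.log(n / df[t]) for t, f in tf.items()} for tf in tfs]


def search(query, index):
    """Rank pages by the summed TFxIDF weight of the query terms (binary query vector)."""
    terms = query.lower().split()
    scores = [(i, sum(vec.get(t, 0.0) for t in terms)) for i, vec in enumerate(index)]
    return sorted(scores, key=lambda pair: -pair[1])


# Toy pages: "anchor" holds the text of backlinks pointing to the page.
docs = [
    {"body": "retrieval of web pages", "anchor": "", "meta": ""},
    {"body": "group home page", "anchor": "retrieval", "meta": ""},
    {"body": "weather report", "anchor": "", "meta": "news"},
]
index = build_index(docs)
ranking = search("retrieval", index)
```

With these assumed weights, a term found in the anchor text of a backlink counts twice as much as the same term in the BODY, so the second toy page ranks first for the query `retrieval`. Changing `FIELD_WEIGHTS` is the only step needed to try a different mix of fields.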