Publication:
Analyzing information retrieval methods to recover broken web links

dc.contributor.author: Martínez Romo, Juan
dc.contributor.author: Araujo Serna, M. Lourdes
dc.date.accessioned: 2024-05-21T13:03:34Z
dc.date.available: 2024-05-21T13:03:34Z
dc.date.issued: 2011-06-19
dc.description.abstract: In this work we compare different techniques for automatically finding candidate web pages to substitute for broken links. We extract information from the anchor text, the content of the page containing the link, and the cached page in some digital library. The selected information is processed and submitted to a search engine. We have compared different information retrieval methods both for the selection of terms used to construct the queries submitted to the search engine and for the ranking of the candidate pages that it returns, in order to help the user find the best replacement. In particular, we have used term frequencies and a language model approach for the selection of terms, and co-occurrence measures and a language model approach for ranking the final results. To test the different methods, we have also defined a methodology that does not require user judgments, which increases the objectivity of the results. [en]
dc.description.version: published version
dc.identifier.uri: https://hdl.handle.net/20.500.14468/19989
dc.language.iso: en
dc.relation.center: E.T.S. de Ingeniería Informática
dc.relation.department: Lenguajes y Sistemas Informáticos
dc.rights: info:eu-repo/semantics/openAccess
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0
dc.title: Analyzing information retrieval methods to recover broken web links [es]
dc.type: conference proceedings [en]
dc.type: actas de congreso [es]
dspace.entity.type: Publication
relation.isAuthorOfPublication: 91b7e317-2a30-494f-98e9-3a0e026747b1
relation.isAuthorOfPublication: 77c4023e-4374-442a-9dfb-b9d4b609c31e
relation.isAuthorOfPublication.latestForDiscovery: 91b7e317-2a30-494f-98e9-3a0e026747b1
Files
Original bundle
Name: Documento.pdf
Size: 352.98 KB
Format: Adobe Portable Document Format