Publicación: A data driven approach for person name disambiguation in web search results
dc.contributor.author | Víctor Fresno, Víctor | |
dc.contributor.author | Montalvo, Soto | |
dc.contributor.author | Delgado Muñoz, Agustín Daniel | |
dc.contributor.author | Martínez Unanue, Raquel | |
dc.date.accessioned | 2024-05-21T13:03:29Z | |
dc.date.available | 2024-05-21T13:03:29Z | |
dc.date.issued | 2014-08-23 | |
dc.description.abstract | This paper presents an unsupervised approach for the task of clustering the results of a search engine when the query is a person name shared by different individuals. We propose an algorithm that calculates the number of clusters and establishes the groups of web pages according to the different individuals without the need to any training data or predefined thresholds, as the successful state of the art systems do. In addition, most of those systems do not deal with social media web pages and their performance could fail in a real scenario. In this paper we also propose a heuristic method for the treatment of social networking profiles. Our approach is compared with four gold standard collections for this task obtaining really competitive results, comparable to those obtained by some approaches with supervision. | en |
dc.description.version | versión final | |
dc.identifier.uri | https://hdl.handle.net/20.500.14468/19983 | |
dc.language.iso | en | |
dc.relation.center | E.T.S. de Ingeniería Informática | |
dc.relation.department | Lenguajes y Sistemas Informáticos | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0 | |
dc.title | A data driven approach for person name disambiguation in web search results | es |
dc.type | conference proceedings | en |
dc.type | actas de congreso | es |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | 387ad233-199b-4a04-98cb-488b89724355 | |
relation.isAuthorOfPublication | 085ba044-ea75-4751-ab01-512f39c160a7 | |
relation.isAuthorOfPublication.latestForDiscovery | 387ad233-199b-4a04-98cb-488b89724355 |
Archivos
Bloque original
1 - 1 de 1