Publication:
WHAD : Wikipedia historical attributes data. Historical structured data extraction and vandalism detection from the Wikipedia edit history

dc.contributor.author: Alfonseca, Enrique
dc.contributor.author: Garrido, Guillermo
dc.contributor.author: Delort, Jean Yves
dc.contributor.author: Peñas Padilla, Anselmo
dc.date.accessioned: 2024-05-21T13:03:26Z
dc.date.available: 2024-05-21T13:03:26Z
dc.date.issued: 2013-05-28
dc.description.abstract: This paper describes the generation of temporally anchored infobox attribute data from the Wikipedia history of revisions. By mining (attribute, value) pairs from the revision history of the English Wikipedia we are able to collect a comprehensive knowledge base that contains data on how attributes change over time. When dealing with the Wikipedia edit history, vandalic and erroneous edits are a concern for data quality. We present a study of vandalism identification in Wikipedia edits that uses only features from the infoboxes, and show that we can obtain, on this dataset, an accuracy comparable to a state-of-the-art vandalism identification method that is based on the whole article. Finally, we discuss different characteristics of the extracted dataset, which we make available for further study. [es]
dc.description.version: published version
dc.identifier.doi: http://doi.org/10.1007/s10579-013-9232-5
dc.identifier.issn: 1574-020X (print version); 1574-0218 (electronic version)
dc.identifier.uri: https://hdl.handle.net/20.500.14468/19979
dc.language.iso: en
dc.publisher: Springer Verlag (Germany)
dc.relation.center: E.T.S. de Ingeniería Informática
dc.relation.department: Lenguajes y Sistemas Informáticos
dc.rights: info:eu-repo/semantics/openAccess
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/4.0/deed.es
dc.subject.keywords: Wikipedia
dc.subject.keywords: Infobox
dc.subject.keywords: attributes
dc.subject.keywords: temporal data
dc.title: WHAD : Wikipedia historical attributes data. Historical structured data extraction and vandalism detection from the Wikipedia edit history [es]
dc.type: conference proceedings [en]
dspace.entity.type: Publication
relation.isAuthorOfPublication: 1e1b14bc-1284-4aef-908c-bccf31bd055e
relation.isAuthorOfPublication.latestForDiscovery: 1e1b14bc-1284-4aef-908c-bccf31bd055e
Files
Original bundle
Name: Documento.pdf
Size: 908.77 KB
Format: Adobe Portable Document Format