Publicación:
Unsupervised discovery of domain-specific knowledge from text

dc.contributor.authorHovy, Dirk
dc.contributor.authorZhang, Chunliang
dc.contributor.authorHovy, Eduard H.
dc.contributor.authorPeñas Padilla, Anselmo
dc.date.accessioned2024-05-21T13:03:33Z
dc.date.available2024-05-21T13:03:33Z
dc.date.issued2011-06-19
dc.description.abstractLearning by Reading (LbR) aims at enabling machines to acquire knowledge from and reason about textual input. This requires knowledge about the domain structure (such as entities, classes, and actions) in order to do inference. We present a method to infer this implicit knowledge from unlabeled text. Unlike previous approaches, we use automatically extracted classes with a probability distribution over entities to allow for context-sensitive labeling. From a corpus of 1.4m sentences, we learn about 250k simple propositions about American football in the form of predicateargument structures like “quarterbacks throw passes to receivers”. Using several statistical measures, we show that our model is able to generalize and explain the data statistically significantly better than various baseline approaches. Human subjects judged up to 96.6% of the resulting propositions to be sensible. The classes and probabilistic model can be used in textual enrichment to improve the performance of LbR end-to-end systems.es
dc.description.versionversión publicada
dc.identifier.urihttps://hdl.handle.net/20.500.14468/19987
dc.language.isoen
dc.relation.centerE.T.S. de Ingeniería Informática
dc.relation.departmentLenguajes y Sistemas Informáticos
dc.rightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/deed.es
dc.titleUnsupervised discovery of domain-specific knowledge from textes
dc.typeconference proceedingsen
dc.typeactas de congresoes
dspace.entity.typePublication
relation.isAuthorOfPublication1e1b14bc-1284-4aef-908c-bccf31bd055e
relation.isAuthorOfPublication.latestForDiscovery1e1b14bc-1284-4aef-908c-bccf31bd055e
Archivos
Bloque original
Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
Documento.pdf
Tamaño:
408.56 KB
Formato:
Adobe Portable Document Format