Date
2021-02-27
Access rights
info:eu-repo/semantics/openAccess
Publisher
Springer Nature
Abstract
Medieval documents are a rich source of historical data, and performing named-entity recognition (NER) on them can yield valuable historical evidence. However, traditional NER categories and schemes are usually designed with modern documents in mind (e.g., journalistic text), and general-domain NER annotation schemes fail to capture the nature of medieval entities. In this paper we explore the challenges of performing named-entity annotation on a corpus of Spanish medieval documents: we discuss the mismatches that arise when applying traditional NER categories to this corpus, and we propose a novel humanist-friendly, TEI-compliant annotation scheme and guidelines intended to capture the particular nature of medieval entities.
Description
This is an Accepted Manuscript of an article published by Springer Nature in "Language Resources and Evaluation, 55(2), 525-549", available at: https://doi.org/10.1007/s10579-020-09516-2
Keywords
Named-entity annotation, Annotation scheme, Historical NER, Medieval named entities, Medieval Spanish corpus
Citation
Álvarez-Mellado, E., Díez-Platas, M. L., Ruiz-Fabo, P., Bermúdez, H., Ros, S., & González-Blanco, E. (2021). TEI-friendly annotation scheme for medieval named entities: a case on a Spanish medieval corpus. Language Resources and Evaluation, 55(2), 525-549. https://doi.org/10.1007/S10579-020-09516-2
Center
E.T.S. de Ingeniería Informática
Department
Sistemas de Comunicación y Control