Publicación:
Medieval Spanish (12th–15th centuries) named entity recognition and attribute annotation system based on contextual information

dc.contributor.authorDíez Platas, Mª Luisa
dc.contributor.authorRos Muñoz, Salvador
dc.contributor.authorGonzález-Blanco, Elena
dc.contributor.authorRuiz Fabo, Pablo
dc.contributor.authorÁlvarez Mellado, Elena
dc.date.accessioned2025-07-10T15:26:28Z
dc.date.available2025-07-10T15:26:28Z
dc.date.issued2021-09-10
dc.descriptionThe registered version of this article, first published in “Journal of the Association for Information Science and Technology, 72(2), 224-238, 2021", is available online at the publisher's website: WILEY, https://doi.org/10.1002/ASI.24399 La versión registrada de este artículo, publicado por primera vez en “Journal of the Association for Information Science and Technology, 72(2), 224-238, 2021", está disponible en línea en el sitio web del editor: WILEY, https://doi.org/10.1002/ASI.24399
dc.description.abstractThe recognition of named entities in Spanish medieval texts presents great complexity, involving specific challenges: First, the complex morphosyntactic characteristics in proper-noun use in medieval texts. Second, the lack of strict orthographic standards. Finally, diachronic and geographical variations in Spanish from the 12th to 15th century. In this period, named entities usually appear as complex text structure. For example, it was frequent to add nicknames and information about the persons role in society and geographic origin. To tackle this complexity, named entity recognition and classification system has been implemented. The system uses contextual cues based on semantics to detect entities and assign a type. Given the occurrence of entities with attached attributes, entity contexts are also parsed to determine entity-type-specific dependencies for these attributes. Moreover, it uses a variant generator to handle the diachronic evolution of Spanish medieval terms from a phonetic and morphosyntactic viewpoint. The tool iteratively enriches its proper lexica, dictionaries, and gazetteers. The system was evaluated on a corpus of over 3,000 manually annotated entities of different types and periods, obtaining F1 scores between 0.74 and 0.87. Attribute annotation was evaluated for a person and role name attributes with an overall F1 of 0.75.en
dc.description.versionversión publicada
dc.identifier.citationDíez Platas, M. L., Ros Muñoz, S., González-Blanco, E., Ruiz Fabo, P., & Álvarez Mellado, E. (2021). Medieval Spanish (12th–15th centuries) named entity recognition and attribute annotation system based on contextual information. Journal of the Association for Information Science and Technology, 72(2), 224-238. https://doi.org/10.1002/ASI.24399
dc.identifier.doihttps://doi.org/10.1002/asi.24399
dc.identifier.issn2330-1635, eISSN 2330-1643
dc.identifier.urihttps://hdl.handle.net/20.500.14468/29390
dc.journal.issue2
dc.journal.titleJournal of the Association for Information Science and Technology, JASIST
dc.journal.volume72
dc.language.isoen
dc.page.final238
dc.page.initial224
dc.publisherWILEY
dc.relation.centerE.T.S. de Ingeniería Informática
dc.relation.departmentSistemas de Comunicación y Control
dc.rightsinfo:eu-repo/semantics/openAccess
dc.rights.uriAtribución-CompartirIgual 4.0 Internacional
dc.subject1203.17 Informática
dc.subject57 Lingüística
dc.titleMedieval Spanish (12th–15th centuries) named entity recognition and attribute annotation system based on contextual informationen
dc.typeartículoes
dc.typejournal articleen
dspace.entity.typePublication
relation.isAuthorOfPublicationd25ad74f-42fc-47ac-911d-1e5515319a58
relation.isAuthorOfPublicationc5e8ac29-961d-427f-992d-53e5c72e5088
relation.isAuthorOfPublication.latestForDiscoveryd25ad74f-42fc-47ac-911d-1e5515319a58
Archivos
Bloque original
Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
ROS_SALVADOR_MEDIEVAL_SALVADOR ROS MUNOZ.pdf
Tamaño:
1.73 MB
Formato:
Adobe Portable Document Format
Bloque de licencias
Mostrando 1 - 1 de 1
No hay miniatura disponible
Nombre:
license.txt
Tamaño:
3.62 KB
Formato:
Item-specific license agreed to upon submission
Descripción: