Persona:
Díaz Paredes, Aitor

Cargando...
Foto de perfil
Dirección de correo electrónico
ORCID
0000-0002-1779-122X
Fecha de nacimiento
Proyectos de investigación
Unidades organizativas
Puesto de trabajo
Apellidos
Díaz Paredes
Nombre de pila
Aitor
Nombre

Resultados de la búsqueda

Mostrando 1 - 3 de 3
  • Publicación
    PoetryLab as Infrastructure for the Analysis of Spanish Poetry
    (Linköping University Electronic Press, 2021-06-22) Rosa, Javier de la; Pérez Pozo, Álvaro; Hernández Lorenzo, Laura; Díaz Paredes, Aitor; Ros Muñoz, Salvador; González Blanco, Elena
    The development of the network of ontologies of the ERC POSTDATA Project brought to light some deficiencies in terms of completeness in the currently available European poetry corpora. To tackle the issue in the realm of the Spanish poetic tradition, our approach consisted in designing a set of tools that any scholar could use to automatically enrich the analysis of Spanish poetry. The effort crystallized in the PoetryLab, an extensible open source toolkit for syllabification, scansion, enjambment detection, rhyme detection, stanza identification, and historical named entity recognition for Spanish poetry. We designed the system to be interoperable, compliant with the project ontologies, easy to use by tech-savvy and non-expert researchers, and requiring minimal maintenance and setup. Furthermore, we propose the integration of the PoetryLab as a core functionality in the tool catalog of CLARIN for Spanish poetry.
  • Publicación
    Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models
    (Springer, 2023) Rosa, Javier de la; Pérez Pozo, Álvaro; Sisto, Mirella De; Hernández Lorenzo, Laura; Díaz Paredes, Aitor; Ros Muñoz, Salvador; González Blanco, Elena
    The splitting of words into stressed and unstressed syllables is the foundation for the scansion of poetry, a process that aims at determining the metrical pattern of a line of verse within a poem. Intricate language rules and their exceptions, as well as poetic licenses exerted by the authors, make calculating these patterns a nontrivial task. Some rhetorical devices shrink the metrical length, while others might extend it. This opens the door for interpretation and further complicates the creation of automated scansion algorithms useful for automatically analyzing corpora on a distant reading fashion. In this paper, we compare the automated metrical pattern identification systems available for Spanish, English, and German, against fine-tuned monolingual and multilingual language models trained on the same task. Despite being initially conceived as models suitable for semantic tasks, our results suggest that transformers-based models retain enough structural information to perform reasonably well for Spanish on a monolingual setting, and outperforms both for English and German when using a model trained on the three languages, showing evidence of the benefits of cross-lingual transfer between the languages.
  • Publicación
    Exploring Spanish contemporary song lyrics through Digital Humanities methods: Some thematic and structural properties
    (Oxford Academic, 2021-11-08) Hernández Lorenzo, Laura; Díaz Paredes, Aitor; Pérez Pozo, Álvaro; Ros Muñoz, Salvador; González-Blanco, Elena; Oxford Academic
    In this article, we present a quantitative study with Digital Humanities methods on an extensive corpus of Spanish contemporary song lyrics, a type of text related to poetry. On the one hand, poetry and songs not only have been connected since their origins, but they share some characteristics, such as the division in lines or the use of rhymes. On the other hand, Digital Humanities quantitative approaches have already been applied to poetry, but we still lack a study in the same fashion for lyrics. Taking advantage of the advances in automatic scansion and syllabification, rhyme detection, or Topic Modeling technologies, the present study analyzes Spanish contemporary song lyrics’ main thematic and structural properties, comparing them with those used in poetic texts. Our results offered new insights into the characteristics of the analyzed texts and their connections to poetic ones.