Publication: Generative Adversarial Networks for text-to-face synthesis & generation: A quantitative–qualitative analysis of Natural Language Processing encoders for Spanish
dc.contributor.author | Yauri Lozano, Eduardo | |
dc.contributor.author | Orozco Barbosa, Luis | |
dc.contributor.author | García Castro, Raúl | |
dc.contributor.author | Castillo Cara, José Manuel | |
dc.date.accessioned | 2024-05-20T11:38:08Z | |
dc.date.available | 2024-05-20T11:38:08Z | |
dc.date.issued | 2024-01 | |
dc.description.abstract | In recent years, the development of Natural Language Processing (NLP) text-to-face encoders and Generative Adversarial Networks (GANs) has enabled the synthesis and generation of facial images from textual descriptions. However, most encoders have been developed for the English language. This work presents the first study of three text-to-face encoders, namely, the RoBERTa pre-trained model and the Sent2Vec and RoBERTa models, trained with the CelebA dataset in Spanish. It then introduces customised and fine-tuned conditional Deep Convolutional Generative Adversarial Networks (cDCGANs) trained with the CelebA dataset for text-to-face generation in Spanish. To validate the results obtained, a qualitative evaluation was carried out with a visual analysis and a quantitative evaluation based on the IS, FID and LPIPS metrics. Our findings show promising results with respect to the literature, improving the numerical metrics of FID and LPIPS by 5% and 37%, respectively. Our results also show, through a quantitative–qualitative comparison of the cDCGAN training epochs, that the IS metric is not a reliable objective metric to be considered in the evaluation of similar works. | en |
dc.description.version | published version | |
dc.identifier.doi | https://doi.org/10.1016/j.ipm.2024.103667 | |
dc.identifier.issn | 0306-4573 eISSN 1873-5371 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14468/12342 | |
dc.journal.issue | 3 | |
dc.journal.title | Information Processing and Management | |
dc.journal.volume | 61 | |
dc.language.iso | es | |
dc.publisher | Elsevier | |
dc.relation.center | E.T.S. de Ingeniería Informática | |
dc.relation.department | Informática y Automática | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/deed.es | |
dc.subject.keywords | Image synthesis | |
dc.subject.keywords | CelebA dataset | |
dc.subject.keywords | RoBERTa transformer | |
dc.subject.keywords | Spanish | |
dc.subject.keywords | cDCGAN | |
dc.subject.keywords | Text-to-face generation | |
dc.subject.keywords | Text-to-image synthesis | |
dc.title | Generative Adversarial Networks for text-to-face synthesis & generation: A quantitative–qualitative analysis of Natural Language Processing encoders for Spanish | es |
dc.type | journal article | en |
dc.type | artículo | es |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | c0e39bd2-c0d8-4743-953d-488baf6b977e | |
relation.isAuthorOfPublication.latestForDiscovery | c0e39bd2-c0d8-4743-953d-488baf6b977e |
Files
Original bundle
- Name:
- Castillo_Cara_Jose_Manuel_GANs.pdf
- Size:
- 1.74 MB
- Format:
- Adobe Portable Document Format