Publicación:
Speak2Subs: Evaluating State-of-the-Art Speech Recognition Models and Compliant Subtitle Generation

dc.contributor.authorFresneda García, Julio
dc.date.accessioned2024-06-11T16:19:53Z
dc.date.available2024-06-11T16:19:53Z
dc.date.issued2024-02
dc.description.abstractWith recent advances in largue language models, the evolution of speech-to-text tasks has been exponential. While state-of-the-art automatic speech recognition (ASR) models have taken a big step in speech transcription, creating quality subtitles still requires human intervention. This project has two main aspects: evaluating cutting-edge ASR models for speech-to-text, and developing a package that uses this ASR models to generate high-quality and compliant subtitles. ASR models do not inherently provide results suitable for subtitles. Therefore, one of the primary objectives of this package is to utilize and enhance the output generated by ASR models to create subtitles of a quality that requires minimal human modification. This enhancement is necessary because ASR models alone are incapable of producing subtitles that meet the required standards of quality. Speak2Subs has achieved this goal, being a tool that produces high-quality subtitles with minimal human interaction.en
dc.description.versionversión final
dc.identifier.urihttps://hdl.handle.net/20.500.14468/22596
dc.language.isoen
dc.publisherUniversidad Nacional de Educación a Distancia (España). Escuela Técnica Superior de Ingeniería Informática.
dc.relation.centerE.T.S. de Ingeniería Informática
dc.relation.departmentInteligencia Artificial
dc.rightsAtribución-NoComercial-SinDerivadas 4.0 Internacional
dc.rightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0
dc.subject.keywordsASR
dc.subject.keywordsLLM
dc.subject.keywordsSpeech-To-Text
dc.subject.keywordsSubtitle
dc.titleSpeak2Subs: Evaluating State-of-the-Art Speech Recognition Models and Compliant Subtitle Generationes
dc.typetesis de maestríaes
dc.typemaster thesisen
dspace.entity.typePublication
Archivos
Bloque original
Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
Fresneda_Garcia_Julio_Antonio_TFM.pdf
Tamaño:
2.85 MB
Formato:
Adobe Portable Document Format