Publication: Sign Language Segmentation Using a Transformer-based Approach
Date
2022-09-01
Access rights
info:eu-repo/semantics/openAccess
Publisher
Universidad Nacional de Educación a Distancia (España). Escuela Técnica Superior de Ingeniería Informática. Departamento de Inteligencia Artificial
Abstract
Continuous Sign Language Recognition (CSLR), the task of predicting the meaning of the signs in sign language sentences, is one of the current challenges in translation between sign and spoken languages, and progress on it would benefit people with hearing impairment. An important limitation of this research field is the lack of annotated datasets. Sign Segmentation approaches could mitigate this limitation by automating the costly task of manually annotating the beginning and end of each sign. The goal of this paper is to study the performance of an architecture that combines features extracted with an I3D CNN with ASFormer, a transformer-based model created specifically for the Action Segmentation task. In our approach, instead of separating actions in motion sequences, ASFormer separates signs in signed speech. Several ablation studies are performed, and they show that ASFormer is suitable for segmenting signs, with performance close to that of state-of-the-art models, which supports the promise of attention-based approaches in this field.
Center
Faculties and schools::E.T.S. de Ingeniería Informática
Department
Inteligencia Artificial