Hand pose estimation from depth data with Convolutional Neural Networks

Vinagre Ruiz, Manuel. (2017). Hand pose estimation from depth data with Convolutional Neural Networks Master Thesis, Universidad Nacional de Educación a Distancia (España). Escuela Técnica Superior de Ingeniería Informática. Departamento de Inteligencia Artificial

Ficheros (Some files may be inaccessible until you login with your e-spacio credentials)
Nombre Descripción Tipo MIME Size
Vinagre_Ruiz_Manuel_TFM.pdf Vinagre_Ruiz_Manuel_TFM.pdf application/pdf 1.35MB

Título Hand pose estimation from depth data with Convolutional Neural Networks
Autor(es) Vinagre Ruiz, Manuel
Abstract The estimation of hand position and orientation, pose, is of special interest in many applications related to Human Robot Interaction, such as human activity recognition, sign language interpretation, or as a human computer interface in virtual reality systems, advanced entertainment games, gesture-driven interfaces, and in teleoperated or in autonomous robotic systems. This project focusses on the problem of hand pose estimation using convolutional neural networks (CNN) from depth data. Recently, dierent CNN architectures have been proposed in order to nd an ecent and reliable methodolgy to resolve the complexity that involves the variablity in apperance of a hand, with its gestures, changes of orientation, occlusions and so. The use of CNN opens new opportunities for improvements in this research by providing the capability of learning from many samples. This work pretends to advance a step further on the hand pose estimation problem. With this aim, the hand pose estimation using CNN by modifying the output data with a new representation of the hand pose is proposed. An Euclidean Matrix Distance (EDM) is proposed as a hand pose representation. This representation encodes structural information of the hand pose and captures local correlations and dependecies between some hand keypoints. To evaluate the performance of the proposed method, dierent CNN architectures using EDM representation are explored and compared with the hand pose representation dened by the position of hand keypoints in the 3D Cartesian coordinate system. Experimental results show that using EDM representation as target layer in the convolutional network increases the performance of all the proposed architectures in terms of mean square error, both in training and testing sets. As a conclusion, this work shows that EDM representations help to reduce some ambiguities of current hand pose estimation methods using a CNN, by incorporating structural information of the hand and capturing keypoint/joint correlations. This research also gives some insights to investigate in future advances in hand pose estimation using CNN models and will help to explore new strategies for this problem.
Notas adicionales Trabajo de Fin de Máster. Máster Universitario en I.A. Avanzada: Fundamentos, Métodos y Aplicaciones. UNED
Materia(s) Ingeniería Informática
Editor(es) Universidad Nacional de Educación a Distancia (España). Escuela Técnica Superior de Ingeniería Informática. Departamento de Inteligencia Artificial
Director/Tutor Rincón Zamorano, Mariano
Fecha 2017-09-02
Formato application/pdf
Identificador bibliuned:master-ETSInformatica-IAA-Mvinagre
http://e-spacio.uned.es/fez/view/bibliuned:master-ETSInformatica-IAA-Mvinagre
Idioma eng
Versión de la publicación acceptedVersion
Nivel de acceso y licencia http://creativecommons.org/licenses/by-nc-nd/4.0
info:eu-repo/semantics/openAccess
Tipo de recurso master Thesis
Tipo de acceso Acceso abierto

 
Versiones
Versión Tipo de filtro
Contador de citas: Google Scholar Search Google Scholar
Estadísticas de acceso: 36 Visitas, 5 Descargas  -  Estadísticas en detalle
Creado: Thu, 12 Dec 2019, 19:02:49 CET