Publicación: Hand pose estimation from depth data with Convolutional Neural Networks
Cargando...
Fecha
2017-09-02
Autores
Editor/a
Director/a
Tutor/a
Coordinador/a
Prologuista
Revisor/a
Ilustrador/a
Derechos de acceso
Atribución-NoComercial-SinDerivadas 4.0 Internacional
info:eu-repo/semantics/openAccess
info:eu-repo/semantics/openAccess
Título de la revista
ISSN de la revista
Título del volumen
Editor
Universidad Nacional de Educación a Distancia (España). Escuela Técnica Superior de Ingeniería Informática. Departamento de Inteligencia Artificial
Resumen
The estimation of hand position and orientation, pose, is of special interest in many applications related to Human Robot Interaction, such as human activity recognition, sign language interpretation, or as a human computer interface in virtual reality systems, advanced entertainment games, gesture-driven interfaces, and in teleoperated or in autonomous robotic systems. This project focusses on the problem of hand pose estimation using convolutional neural networks (CNN) from depth data. Recently, dierent CNN architectures have been proposed in order to nd an ecent and reliable methodolgy to resolve the complexity that involves the variablity in apperance of a hand, with its gestures, changes of orientation, occlusions and so. The use of CNN opens new opportunities for improvements in this research by providing the capability of learning from many samples. This work pretends to advance a step further on the hand pose estimation problem. With this aim, the hand pose estimation using CNN by modifying the output data with a new representation of the hand pose is proposed. An Euclidean Matrix Distance (EDM) is proposed as a hand pose representation. This representation encodes structural information of the hand pose and captures local correlations and dependecies between some hand keypoints. To evaluate the performance of the proposed method, dierent CNN architectures using EDM representation are explored and compared with the hand pose representation dened by the position of hand keypoints in the 3D Cartesian coordinate system. Experimental results show that using EDM representation as target layer in the convolutional network increases the performance of all the proposed architectures in terms of mean square error, both in training and testing sets. As a conclusion, this work shows that EDM representations help to reduce some ambiguities of current hand pose estimation methods using a CNN, by incorporating structural information of the hand and capturing keypoint/joint correlations. This research also gives some insights to investigate in future advances in hand pose estimation using CNN models and will help to explore new strategies for this problem.
Descripción
Categorías UNESCO
Palabras clave
Citación
Centro
Facultades y escuelas::E.T.S. de Ingeniería Informática
Departamento
Inteligencia Artificial