Hand3D

Hand3D: Hand Pose Estimation using 3D Neural Network

Xiaoming Deng¹ Shuo Yang¹ Yinda Zhang² Ping Tan³ Liang Chang⁴ Hongan Wang¹

¹ Institute of Software CAS ² Princeton University ³ Simon Fraser University ⁴ Beijing Normal University

Abstract

We propose a novel 3D neural network architecture for 3D hand pose estimation from a single depth image. Different from previous works that mostly run on 2D depth image domain and require intermediate or post process to bring in the supervision from 3D space, we convert the depth map to a 3D volumetric representation, and feed it into a 3D convolutional neural network(CNN) to directly produce the pose in 3D requiring no further process. Our system does not require the ground truth reference point for initialization, and our network architecture naturally integrates both local feature and global context in 3D space. To increase the coverage of the hand pose space of the training data, we render synthetic depth image by transferring hand pose from existing real image datasets. We evaluation our algorithm on two public benchmarks and achieve the state-of-the-art performance. The synthetic hand pose dataset will be available.

Video

Paper

X. Deng*, S. Yang*, Y. Zhang*, P. Tan, L. Chang, H. Wang.
"Hand3D: Hand Pose Estimation using 3D Neural Network"
arXiv:1704.04922 [cs.CV] (6 April 2017).

Download

The synthetic hand pose dataset will be available.