Two-Layer Motion Semantic Recognition by Fusing the Restricted Boltzmann Machine Based Generative Model and Discriminative Model
Zhou Bing1,2), Peng Shujuan1,2)*, and Liu Xin1,2)
1) (College of Computer Science and Technology, Huaqiao University, Xiamen 361021)2) (Key Lab of Pattern Recognition and Computer Vision, Xiamen City, Xiamen 361021)
The semantic gap problem between the low-level features and high-level semantics often exists within the motion capture data. To tackle this problem, we refer to the deep learning theory and propose a two-layer motion recognition approach by fusing the Restricted Boltzmann Machine (RBM) based generative model and discriminative model, in which the generative layer is utilized for feature representation and the discriminative layer is selected for semantic discrimination. Within the proposed approach, we first utilize the autoregressive model to establish an one-way three-factored conditional RBM, whereby the spatiotemporal features of the captured motions can be well obtained. Then, these features are coupled with their corresponding labels and selected as the visible input of the RBM based discriminative model. Finally, by adding a voting space, the motion semantics can be efficiently recognized via this two layer fused model. The experimental results have shown that our proposed approach is able to recognize different kinds of motion poses, featuring robustness and expandability to the motion capture data. It is expected that the proposed approach would be well utilized for motion capture data reusing in a practical way.
motion capture data; spatiotemporal feature; deep learning; Restricted Boltzmann Machine; discriminative model