高级检索
李庆辉, 李艾华, 崔智高, 苏延召. 采用时空共生特征与改进VLAD的行为识别[J]. 计算机辅助设计与图形学学报, 2018, 30(10): 1910-1916. DOI: 10.3724/SP.J.1089.2018.16970
引用本文: 李庆辉, 李艾华, 崔智高, 苏延召. 采用时空共生特征与改进VLAD的行为识别[J]. 计算机辅助设计与图形学学报, 2018, 30(10): 1910-1916. DOI: 10.3724/SP.J.1089.2018.16970
Li Qinghui, Li Aihua, Cui Zhigao, Su Yanzhao. Action Recognition Using Spatio-temporal Co-occurrence Features and Improved VLAD[J]. Journal of Computer-Aided Design & Computer Graphics, 2018, 30(10): 1910-1916. DOI: 10.3724/SP.J.1089.2018.16970
Citation: Li Qinghui, Li Aihua, Cui Zhigao, Su Yanzhao. Action Recognition Using Spatio-temporal Co-occurrence Features and Improved VLAD[J]. Journal of Computer-Aided Design & Computer Graphics, 2018, 30(10): 1910-1916. DOI: 10.3724/SP.J.1089.2018.16970

采用时空共生特征与改进VLAD的行为识别

Action Recognition Using Spatio-temporal Co-occurrence Features and Improved VLAD

  • 摘要: 特征提取与编码是决定行为识别系统性能的关键步骤.为实现对视频行为的准确描述,提出一种采用时空共生特征与改进VLAD编码的人体行为识别算法.在特征提取环节,采用扩展的人体矩形框对密集采样特征点的进行筛选,在光流场中跟踪筛选后的特征点得到密集轨迹,并在以密集轨迹为中心的时空体内提取时空共生特征;在特征编码环节,将每个特征向量分配到近邻多个单词,以这些单词为基向量在最小平方误差的准则下线性组合逼近对应特征向量,得到的组合系数作为隶属度,最后以隶属度为权值在多个单词上计算VLAD.在KTH,YouTube和HMDB51数据集上进行实验的结果表明,该算法具备较高的识别准确度,适用于复杂场景中的人体行为识别.

     

    Abstract: In order to obtain an accurate description of human action in videos,a novel human action recognition algorithm using spatio-temporal co-occurrence features and improved VLAD is proposed.In the feature extraction step,the traditional dense interest points in the videos are refined by the expanded human rectangular box.Dense trajectories are obtained by tracking the refined points using optical flow field.And spatio-temporal co-occurrence features are extracted in the spatio-temporal volume along the trajectory.In the feature encoding step,each feature vector is assigned to multiple visual words from the codebook,which brings the complementary shape information within the encoding process.Then a linear combination of these basis to reconstruct each descriptor is learned under the least square error criterion.The linear combination coefficients are taken as the weights of VLAD to accumulate the residuals.Experimental results show that the proposed algorithm achieves the state-of-the-art result on KTH,YouTube and HMDB51 datasets.

     

/

返回文章
返回