采用时空共生特征与改进VLAD的行为识别

李庆辉; 李艾华; 崔智高; 苏延召

doi:10.3724/SP.J.1089.2018.16970

采用时空共生特征与改进VLAD的行为识别

Action Recognition Using Spatio-temporal Co-occurrence Features and Improved VLAD

摘要

摘要: 特征提取与编码是决定行为识别系统性能的关键步骤.为实现对视频行为的准确描述,提出一种采用时空共生特征与改进VLAD编码的人体行为识别算法.在特征提取环节,采用扩展的人体矩形框对密集采样特征点的进行筛选,在光流场中跟踪筛选后的特征点得到密集轨迹,并在以密集轨迹为中心的时空体内提取时空共生特征;在特征编码环节,将每个特征向量分配到近邻多个单词,以这些单词为基向量在最小平方误差的准则下线性组合逼近对应特征向量,得到的组合系数作为隶属度,最后以隶属度为权值在多个单词上计算VLAD.在KTH,YouTube和HMDB51数据集上进行实验的结果表明,该算法具备较高的识别准确度,适用于复杂场景中的人体行为识别.

Abstract: In order to obtain an accurate description of human action in videos,a novel human action recognition algorithm using spatio-temporal co-occurrence features and improved VLAD is proposed.In the feature extraction step,the traditional dense interest points in the videos are refined by the expanded human rectangular box.Dense trajectories are obtained by tracking the refined points using optical flow field.And spatio-temporal co-occurrence features are extracted in the spatio-temporal volume along the trajectory.In the feature encoding step,each feature vector is assigned to multiple visual words from the codebook,which brings the complementary shape information within the encoding process.Then a linear combination of these basis to reconstruct each descriptor is learned under the least square error criterion.The linear combination coefficients are taken as the weights of VLAD to accumulate the residuals.Experimental results show that the proposed algorithm achieves the state-of-the-art result on KTH,YouTube and HMDB51 datasets.

HTML全文

参考文献(0)

施引文献

资源附件(0)