高级检索
林梦翔, 林志玮, 黄秀萍, 洪思弟. 融合全局与随机局部特征的鸟类姿态识别模型[J]. 计算机辅助设计与图形学学报, 2022, 34(4): 581-591. DOI: 10.3724/SP.J.1089.2022.18953
引用本文: 林梦翔, 林志玮, 黄秀萍, 洪思弟. 融合全局与随机局部特征的鸟类姿态识别模型[J]. 计算机辅助设计与图形学学报, 2022, 34(4): 581-591. DOI: 10.3724/SP.J.1089.2022.18953
Lin Mengxiang, Lin Zhiwei, Huang Xiuping, Hong Sidi. Bird Postures Recognition Model Fusing Global and Random Local Features[J]. Journal of Computer-Aided Design & Computer Graphics, 2022, 34(4): 581-591. DOI: 10.3724/SP.J.1089.2022.18953
Citation: Lin Mengxiang, Lin Zhiwei, Huang Xiuping, Hong Sidi. Bird Postures Recognition Model Fusing Global and Random Local Features[J]. Journal of Computer-Aided Design & Computer Graphics, 2022, 34(4): 581-591. DOI: 10.3724/SP.J.1089.2022.18953

融合全局与随机局部特征的鸟类姿态识别模型

Bird Postures Recognition Model Fusing Global and Random Local Features

  • 摘要: 为了高效地进行鸟类姿态分类,提出一种基于全局与随机局部特征融合的鸟类姿态识别模型.首先利用融合多分辨率的网络提取鸟类姿态全局特征;然后于网络中浅层与深层的高分辨率特征引入随机定位模块,即根据随机抽取的特征图求取最大值位置,形成包围盒裁剪原图;再将裁剪的局部图片送入子分类网络提取鸟类姿态局部特征;最后将全局和随机局部特征进行融合,并采用融合全局损失和局部损失的多损失策略进行网络调整,构建一种融合全局与随机局部特征的鸟类姿态识别模型.对CUB200-2011中存在完整单种姿态的鸟类图片进行整理汇总得到包含蹲伏、飞翔、游水和站立4种姿态的鸟类姿态数据集,基于该数据集进行实验的结果表明,所提模型的分类精度优于主流卷积神经网络框架,达到96.1%;对随机定位模块及其内部是否随机、分组情况和多损失策略等进行消融实验的结果表明,引入随机定位模块和多损失策略能够提高识别正确率,证明了随机定位模块和多损失策略的有效性.

     

    Abstract: In order to efficiently classify bird postures,a bird postures recognition model based on global and random local features fusion is proposed.Firstly,the global features of bird postures are extracted by using a multi-resolution fusion network.Then,a random localization module is used in the network’s shallow and deep,high-resolution features.It obtains the positions of the maximum value according to the randomly extracted feature maps,which is used to form a bounding box to cut the original image.In addition,the cropped local images are taken as input for the sub-classification network to extract the local features of bird postures.Subsequently,the global and random local features are fused,and the multi-loss strategy with the global and local losses is adopted to adjust the network.Finally,the bird postures recognition model with fusing global and random local features is constructed.The bird images with the single intact posture in the CUB200-2011 are collected and summarized to obtain the bird postures dataset,including crouching,flying,swimming,and standing postures.Experimental results based on the dataset show that the recognition accuracy of the model is better than that of the popular convolutional neural networks and achieves 96.1%.The results of ablation experiments on the random localization module and its internal randomness,grouping situation,and multi-loss strategy show that the random localization module and the multi-loss strategy can improve the recognition accuracy,which proves the effectiveness of the random localization module and the multi-loss strategy.

     

/

返回文章
返回