结合深度特征与多标记分类的图像语义标注

李志欣; 郑永哲; 张灿龙; 史忠植

doi:10.3724/SP.J.1089.2018.16235

结合深度特征与多标记分类的图像语义标注

Combining Deep Feature and Multi-label Classification for Semantic Image Annotation

摘要

摘要: 为了缩减不同模态数据间的语义鸿沟,提出一种结合深度卷积神经网络和集成分类器链的多标记图像语义标注方法.该方法主要由生成式特征学习和判别式语义学习2个阶段构成.首先利用深度卷积神经网络学习图像的高层视觉特征;然后基于获取的视觉特征与图像的语义标记集训练集成分类器链,并学习视觉特征包含的语义信息;最后利用训练得到的模型对未知图像进行自动语义标注.在Corel5K和PASCAL VOC 2012图像数据集上的实验结果表明,与一些当前国际先进水平的方法相比,文中方法的鲁棒性更强,标注结果更精确.

Abstract: To bridge the semantic gap between data of difference modalities,a multi-label semantic image annotation approach combining deep CNN(convolutional neural network)and ECC(ensemble of classifier chains)is proposed.The approach is composed of two stages,i.e.generative feature learning and discriminative semantic learning.Deep CNN is utilized to extract high level visual features of images in the first step.Then ECC is trained based on the obtained visual features and image semantic label set.Meanwhile,semantic information in visual features is learned.Eventually,automatic annotation on unseen images can be done by the trained model.Experimental results on Corel5K and PASCAL VOC 2012 image datasets show that the proposed approach is more robust and more accurate than some state-of-the-art approaches.

HTML全文

参考文献(0)

施引文献

资源附件(0)