高级检索
张浩, 李海鹏, 彭国琴, 柳雁安, 徐丹. 多层次特征融合表征的图像情感识别[J]. 计算机辅助设计与图形学学报, 2023, 35(10): 1566-1576. DOI: 10.3724/SP.J.1089.2023.19742
引用本文: 张浩, 李海鹏, 彭国琴, 柳雁安, 徐丹. 多层次特征融合表征的图像情感识别[J]. 计算机辅助设计与图形学学报, 2023, 35(10): 1566-1576. DOI: 10.3724/SP.J.1089.2023.19742
Zhang Hao, Li Haipeng, Peng Guoqin, Liu Yan'an, Xu Dan. Image Emotion Recognition via Fusion Multi-Level Representations[J]. Journal of Computer-Aided Design & Computer Graphics, 2023, 35(10): 1566-1576. DOI: 10.3724/SP.J.1089.2023.19742
Citation: Zhang Hao, Li Haipeng, Peng Guoqin, Liu Yan'an, Xu Dan. Image Emotion Recognition via Fusion Multi-Level Representations[J]. Journal of Computer-Aided Design & Computer Graphics, 2023, 35(10): 1566-1576. DOI: 10.3724/SP.J.1089.2023.19742

多层次特征融合表征的图像情感识别

Image Emotion Recognition via Fusion Multi-Level Representations

  • 摘要: 可视化的结果证明层次化的网络结构依赖于深层语义信息, 而忽略了对于唤起情感至关重要的浅层视觉细节. 针对当前图像情感分析领域中的方法多单一聚焦于低级视觉特征或高级语义表示上, 未能综合考虑不同层次特征的问题, 提出一种多层次混合表征模型. 首先提出一种浅层视觉特征提取器, 其被嵌入到主干网络用于提取浅层视觉细节; 然后主干的深层语义特征和分支的浅层视觉细节表示被融合层聚合成混合表征, 用于图像情感识别; 最后引入一种损失函数来优化模型, 解决图像情感数据集中数据样本不平衡的问题. 在 FI, Emotion6等 6个情感数据集(包括普通图像和艺术图像)上的实验结果表明, 与当前模型相比, 所提模型的情感识别准确率得到超过 1.5%的性能提升.

     

    Abstract: Visualizations prove that the hierarchical structure mainly relies on deep semantic information whereas ignored shallow visual details are crucial for emotional evocation. A multi-level hybrid representation model is proposed to address the problem that most current methods in image sentiment analysis focus on low-level visual features or high-level semantics but fail to comprehensively consider features on different levels. Therefore, we propose a shallow visual feature extractor embedded into the backbone to obtain shallow visual information. The deep semantics of the backbone and the shallow visual detail representations of the branches are aggregated into hybrid representations through a fusion layer for emotion recognition. In addition, a loss function is introduced to optimize the model in the case of imbalanced samples in image emotion datasets. Experiments on 6 image emotion datasets (including ordinary and art images) such as FI and Emotion6, show that the emotion recognition accuracy has been improved by more than 1.5%.

     

/

返回文章
返回