高级检索
薛峰, 张涛, 李书杰. 基于单词等级和关联性语义多模态主题模型的社会事件分类[J]. 计算机辅助设计与图形学学报, 2022, 34(10): 1477-1488. DOI: 10.3724/SP.J.1089.2022.19746
引用本文: 薛峰, 张涛, 李书杰. 基于单词等级和关联性语义多模态主题模型的社会事件分类[J]. 计算机辅助设计与图形学学报, 2022, 34(10): 1477-1488. DOI: 10.3724/SP.J.1089.2022.19746
Xue Feng, Zhang Tao, Li Shujie. Multi-Modal Topic Model Based on Word Rank and Relevance Semantic for Social Events Classification[J]. Journal of Computer-Aided Design & Computer Graphics, 2022, 34(10): 1477-1488. DOI: 10.3724/SP.J.1089.2022.19746
Citation: Xue Feng, Zhang Tao, Li Shujie. Multi-Modal Topic Model Based on Word Rank and Relevance Semantic for Social Events Classification[J]. Journal of Computer-Aided Design & Computer Graphics, 2022, 34(10): 1477-1488. DOI: 10.3724/SP.J.1089.2022.19746

基于单词等级和关联性语义多模态主题模型的社会事件分类

Multi-Modal Topic Model Based on Word Rank and Relevance Semantic for Social Events Classification

  • 摘要: 多媒体社会事件分类问题是多媒体研究领域中的热点问题.现有基于有监督主题模型的社会事件分类方法,未充分利用语料库(文本、视觉等模态)的内部语义信息,模型分类性能有待进一步提升.针对此问题,提出了一种融合单词等级和单词文档关联性语义的多模态监督主题模型(multi-modal supervised topic model based on word rank and relevance semantic weighting,DPRF-MMSTM),利用依存句法分析结果来划分文本模态单词对文档表征的贡献等级,挖掘出文本单词的等级语义;同时,考虑多模态单词的关联文档频数信息,用于单词文档关联性语义的提取;将2种语义融合到多模态单词的采样过程,实现基于有监督主题模型的社会事件分类.在多模态和单模态数据集上的对比实验表明,对比现有方法,DPRF-MMSTM模型在社会事件分类精度上分别提高了1.200%,1.630%,在主题一致性上分别提高了38.0%,8.5%.

     

    Abstract: Social event classification of multimedia is a hot issue in the field of multimedia research.The existing social event classification methods based on supervised topic model fail to make full use of the internal semantic information(text,vision,etc.)in the corpus,and the classification performance of the model can be further improved.To solve this problem,a multi-modal supervised topic model(multi-modal supervised topic model based on word rank and relevance semantic weighting,DPRF-MMSTM)is proposed,which integrates word rank semantics and word document relevance semantics.According to the results of dependency parsing,the contribution of text modal words to document representation can be divided,and the hierarchical semantics of text words can be mined.In addition,the relevance frequency of multi-modal words is considered to extract the relevance semantics of word documents.The two semantics are integrated into the sampling process of multi-modal words to achieve the classification of social events based on supervised topic model.Compared with the existing models,the comparative experiments on multi-modal and text modal datasets show that the DPRF-MMSTM model proposed in this paper improves the classification accuracy of social events by 1.200%and 1.630%respectively,and the topic consistency is increased by 38.0%and 8.5%respectively.

     

/

返回文章
返回