高级检索
汤斯亮, 程璐, 邵健, 吴飞, 鲁伟明. 基于概率主题建模的新闻文本可视化综述[J]. 计算机辅助设计与图形学学报, 2015, 27(5): 771-782.
引用本文: 汤斯亮, 程璐, 邵健, 吴飞, 鲁伟明. 基于概率主题建模的新闻文本可视化综述[J]. 计算机辅助设计与图形学学报, 2015, 27(5): 771-782.
Tang Siliang, Cheng Lu, Shao Jian, Wu Fei, Lu Weiming. A Survey on News Text Visualization via Probabilistic Topic Modeling[J]. Journal of Computer-Aided Design & Computer Graphics, 2015, 27(5): 771-782.
Citation: Tang Siliang, Cheng Lu, Shao Jian, Wu Fei, Lu Weiming. A Survey on News Text Visualization via Probabilistic Topic Modeling[J]. Journal of Computer-Aided Design & Computer Graphics, 2015, 27(5): 771-782.

基于概率主题建模的新闻文本可视化综述

A Survey on News Text Visualization via Probabilistic Topic Modeling

  • 摘要: 伴随着信息技术的发展,传统纸质新闻逐渐向新媒体新闻转变.与此同时,近年来数据挖掘和自然语言处理等技术得到了极大的发展,使得对新闻所蕴含丰富语义和主题进行深度挖掘成为可能.然而,信息的超载使得主题可视化成为一个新的挑战,即如何以更好的方式来呈现海量互联网文本所蕴含的主题.隐形语义分析(LDA)是近年来兴起的主题建模方法,被当前学术界认为是主流的主题建模技术.文中首先介绍以LDA为主的文本概率主题建模技术及其发展,讨论了新闻主题建模特点;随后概括对比新闻主题可视化的若干方法,并对其进行分类,分析不同方法的适用性和局限性;最后对新闻主题可视化进行总结和展望.

     

    Abstract: With the development of information technology, traditional media is making transformation to the "New Media", which is based on the Internet. Meanwhile, data mining and natural language processing have been developed greatly these years. These technologies are utilized to uncover the semantics and topics in news articles. In addition, the overflow of information motivates the development of new visualization methods. Overall, these methods have great potential of making people better understand news. LDA is a promising topic modeling method, which is adopted by lots of researchers. In this paper, we first introduced the text probabilistic topic modeling technologies based on LDA and discussed the unique characteristics of topic modeling for news. Then, we summarized different visualization methods of topic modeling for news. We also analyzed and compared advantages and shortcomings of these methods. At last, we discussed some future directions of topic modeling for news.

     

/

返回文章
返回