期刊文献+

面向舆情分析的短文本频繁模式聚类算法 被引量:7

Short Text Frequent Pattern Clustering Algorithm for Public Opinion Analysis
在线阅读 下载PDF
导出
摘要 基于短文本的舆情分析是当前信息挖掘与情感分析领域的研究重点,针对网络环境中大量的短文本信息的鲜明特点,本文突破了传统基于词的分类方法,提出一种基于后缀数组频繁模式发现的聚类算法,利用后缀数组频繁模式精确去重算法得到关键词库,结合局部性原理对位置点聚类之后作有意义字串挖掘,进而进行文本舆情分析,以便及时动态了解网络群体的情感方向以及社会舆情热点。 The analysis of public opinion based on short text is the focus of the field of information mining and sensation analysis.Different from the traditional classification method based on words,a clustering algorithm,which based on suffix arrays is proposed.By removeing repetitive string accurately,meaningful strings are obtained after the clustering analysis of repeat string alterations in accordance with the principle of position.Public opinion toward these meaningful strings are analyzed and the dynamic emotional direction and social public opinion of network groups are discovered.
作者 刘建波 杨峰
出处 《北京电子科技学院学报》 2010年第4期6-11,共6页 Journal of Beijing Electronic Science And Technology Institute
关键词 短文本 舆情分析 后缀数组 频繁模式 聚类 short text public opinion analysis suffix arrays frequent pattern clustering
  • 相关文献

参考文献9

二级参考文献65

共引文献356

同被引文献35

  • 1Teutle, "Twitter: Network Properties Analysis", 978-1-4244- 5353-5/2010 IEEE.
  • 2Meeder, Karrer, et al., "We Know Who You Followed Last Sumer: Inferring Social Link CreationTimes In Twitter", In Prec. W~. (Hyderabad, India, March 28 April I, 2011).
  • 3Sriram, Fuhry, et al., "Short text classification in twit- ter to improve information filtering", In Proc. SIGIR. (Geneva, Switzerland, July 19 23, 2010).
  • 4Phinecos.基于朴素贝叶斯分类器的文本分类算法(上).[EB/0L].(2008-10-21)[2012-04-26].http://www.cnblogs.com/phinecos/archive/2008/10/21/1315948.html.
  • 5Phinecos.基于朴素贝叶斯分类器的文本分类算法(下).[EB/OL].(2008-10-21)[2012-04-26].http://www.cnblogs.com/phinecos/archive/2008/10/21/1316044.html.
  • 6KIM S M, HOVY E. Determining the sentiment of opinions [ C ]// Proc of the 20th International Conference on Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2004: 1367-1373.
  • 7NASUKAWA T, YI J. Sentiment analysis: capturing favorability using natural language processing[ C ]//Proc of the 2nd International Conference on Knowledge Capture. New York: ACM Press,2003:70-77.
  • 8WILSON T, HOFFMANN P, SOMASUNDARAN S, et al. Opinion- Finder: a system for subjectivity analysis[ C ]//Proc of HLT/EMNLP on Interactive Demonstrations. Stroudsburg: Association for Computational Linguistics ,2005:34 - 35.
  • 9HU Min-qing, LiU Bing. Mining and summarizing customer reviews [ C]//Proc of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM Press,2004: 168-177.
  • 10陈中干.现代汉语复句研究[M].北京:语文出版社,1998.

引证文献7

二级引证文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部