摘要
情感词典作为判断词语和文本情感倾向的重要工具,其自动构建方法已成为情感分析和观点挖掘领域的一项重要研究内容.本文整理了现有的中、英文情感词典资源,同时分别从知识库、语料库、以及两者结合的角度,归纳现有英文和中文情感词典的构建方法,分析了各种方法的优缺点,并总结了情感词典构建中的若干难点问题.之后,我们回顾了情感词典性能评估方法及相关评测竞赛.最后总结了情感词典构建任务的发展前景以及一些亟需解决的问题.
Sentiment lexicon is an important tool of identifying the sentiment polarity of words and texts. How to automatically construct sentiment lexicons has become a research topic in the field of sentiment analysis and opinion mining. We review the existing sentiment lexicon construction methods, for both English and Chinese languages, from the perspectives of lexicons, corpus, and the combination of the two. We analyze the advantages and disadvantages of each method and point out some special problems in sentiment lexicon construction. We furthermore summarize the evaluation methods and review several competitions related to sentiment lexicon construction. Finally, we discuss the prospect of sentiment lexicon construction, and present some problems that remain to be solved.
出处
《自动化学报》
EI
CSCD
北大核心
2016年第4期495-511,共17页
Acta Automatica Sinica
基金
国家自然科学基金(61305090)
软件新技术国家重点实验室开放基金
江苏省自然科学基金(BK2012396)资助~~
关键词
自然语言处理
情感分析
观点挖掘
情感词典
词典构建
Natural language processing
sentiment analysis
opinion mining
sentiment lexicon
lexicon construction