Domain-Adaptive Sentiment Classification Based on Centroid Transfer (cited by: 4)

ADAPTIVE DOMAIN SENTIMENT CLASSIFICATION BASED ON CENTROID TRANSFER
Abstract: Supervised learning methods perform poorly on cross-domain sentiment analysis of text. To address this problem, we propose an inter-domain adaptive sentiment classification method based on centroid transfer. The method uses the labelled documents of the source domain to classify a large number of unlabelled documents in the target domain and adds a subset of the high-confidence documents to the training set. At the same time, it removes source-domain documents that lie far from the centroid of the target-domain test set. Iterating these two steps gradually narrows the centroid distance between the two domains and so reduces the difference between them. Experimental results show that the method improves the precision of cross-domain sentiment orientation analysis.
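The iterative procedure described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which classifier, confidence measure, or distance metric the authors use, nor how many documents are added or removed per round, so everything below is an assumption made for illustration: a nearest-class-centroid classifier, Euclidean distance on dense feature vectors, a confidence score equal to the gap between the two closest class centroids, and the hypothetical parameters `n_iter`, `add_k`, and `drop_k`. It is not the authors' implementation.

```python
import numpy as np


def nearest_centroid_predict(X_train, y_train, X):
    """Nearest-class-centroid rule (an assumed stand-in for the paper's classifier).

    Returns predicted labels and a confidence per row: the gap between the
    distances to the two closest class centroids. Needs at least two classes.
    """
    classes = np.unique(y_train)
    centroids = np.stack([X_train[y_train == c].mean(axis=0) for c in classes])
    # dists[i, j] = Euclidean distance from sample i to the centroid of class j
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    ordered = np.sort(dists, axis=1)
    confidence = ordered[:, 1] - ordered[:, 0]
    return classes[dists.argmin(axis=1)], confidence


def centroid_transfer(X_src, y_src, X_tgt, n_iter=5, add_k=10, drop_k=10):
    """Iteratively move the training set towards the target domain.

    n_iter, add_k and drop_k are hypothetical parameters, not from the paper.
    Returns the final target predictions and the distance between the
    training-set centroid and the target centroid before each round and at the end.
    """
    tgt_centroid = X_tgt.mean(axis=0)
    src_keep = np.ones(len(X_src), dtype=bool)        # source docs still used for training
    tgt_added = np.zeros(len(X_tgt), dtype=bool)      # target docs moved into training
    pseudo = np.zeros(len(X_tgt), dtype=y_src.dtype)  # predicted labels of added docs
    history = []

    def training_set():
        X_train = np.vstack([X_src[src_keep], X_tgt[tgt_added]])
        y_train = np.concatenate([y_src[src_keep], pseudo[tgt_added]])
        return X_train, y_train

    for _ in range(n_iter):
        X_train, y_train = training_set()
        history.append(float(np.linalg.norm(X_train.mean(axis=0) - tgt_centroid)))
        pred, conf = nearest_centroid_predict(X_train, y_train, X_tgt)

        # Step 1: add the most confidently classified target docs not yet added.
        candidates = np.flatnonzero(~tgt_added)
        chosen = candidates[np.argsort(-conf[candidates])[:add_k]]
        tgt_added[chosen] = True
        pseudo[chosen] = pred[chosen]

        # Step 2: drop the remaining source docs farthest from the target centroid,
        # but never remove the last training example of a class.
        remaining = np.flatnonzero(src_keep)
        dist_to_tgt = np.linalg.norm(X_src[remaining] - tgt_centroid, axis=1)
        for i in remaining[np.argsort(-dist_to_tgt)[:drop_k]]:
            _, y_train = training_set()
            if (y_train == y_src[i]).sum() > 1:
                src_keep[i] = False

    X_train, y_train = training_set()
    history.append(float(np.linalg.norm(X_train.mean(axis=0) - tgt_centroid)))
    pred, _ = nearest_centroid_predict(X_train, y_train, X_tgt)
    return pred, history
```

Calling `centroid_transfer(X_src, y_src, X_tgt)` with NumPy arrays of document features and integer labels returns the target predictions together with the centroid-distance history, which makes it easy to check whether the gap between the domains actually shrinks over the rounds.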
Source: Computer Applications and Software (《计算机应用与软件》), CSCD, 2011, No. 12, pp. 26-28, 74 (4 pages)
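Funding: National Natural Science Foundation of China (90920004, 60970056, 60873150); Natural Science Foundation of Jiangsu Province (BK2008160); Major Basic Research Project of Natural Science for Jiangsu Higher Education Institutions (08KJA520002)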
Keywords: domain adaptation; sentiment analysis; centroid transfer; opinion classification