期刊文献+

基于潜在狄利特雷分布模型的网络评论产品特征抽取方法 被引量:21

Product features extraction of online reviews based on LDA model
在线阅读 下载PDF
导出
摘要 针对网络评论挖掘中的产品特征抽取准确度不高、人工参与较多和难以处理口语化表述等问题,提出一种基于潜在狄利特雷分布模型的产品特征抽取方法。该方法首先应用中文分词工具对网络评论信息进行分词和词性标注,得到最初的产品特征名词集合;然后采用潜在狄利特雷分布文本训练模型筛选出候选产品特征词集合,进而通过同义词词林拓展和过滤规则得到最终的产品特征集合。以京东网上的相机和手机评论数据为例,通过实验对比分析验证了所提方法的有效性。 Aiming at the problems that low accuracy of product feature extraction, much human participation and dif ficult to handle the colloquial expression, a new product feature extraction method was proposed based on Latent Dirichlet Allocation (LDA). The online product reviews were parsed and labeled by using Chinese lexical analysis tool to generate the initial nouns set of product feature. The set of candidate product feature words was selected by LDA text training model, and the final product feature set was obtained through synonym lexicon expansion and fea ture filtering rules. The evaluate data of camera and mobile phone from JD. com was taken as the example to verify the effectiveness of the proposed method.
出处 《计算机集成制造系统》 EI CSCD 北大核心 2014年第1期96-103,共8页 Computer Integrated Manufacturing Systems
基金 国家自然科学基金资助项目(71128003,70972006,71102111) 新世纪优秀人才支持计划资助项目(NCET-11-0792)~~
关键词 网络评论 产品特征抽取 潜在狄利特雷分布 数据挖掘 online reviews product feature extraction Latent Dirichlet allocation data mining
  • 相关文献

参考文献27

  • 1IRESEARCH.2012-2013年中国网络购物行业年度监测报告简版[EB/OL].[2013-09-01].http://www.iresearch.com.cn/Report/2007.html.
  • 2SOTIRIADIS M D,VAN ZYL C.Electronic word-of-mouth and online reviews in tourism services:the use of twitter by tourists[J].Electronic Commerce Research,2013,13 (1):103-124.
  • 3李实,叶强,李一军,RobLaw.中文网络客户评论的产品特征挖掘方法研究[J].管理科学学报,2009,12(2):142-152. 被引量:131
  • 4BALAHUR A,HERMIDA J M,MONTOYO A.Detecting implicit expressions of emotion in text:a comparative analysis[J].Decision Support Systems,2012,53(4):742-753.
  • 5PINHEIRO R H W,CAVALCANTI G D C,CORREA R F,et al.A global-ranking local feature selection method for text categorization[J].Expert Systems with Applications,2012,39(17):12851-12857.
  • 6DING Xiaowen,LIU Bing,YU P S.A holistic lexicon-based approach to opinion mining[C] //Proceedings of the International Conference on Web Search and Web Data Mining.New York,N.Y.,USA:ACM,2008:231-240.
  • 7MONTOYO A,MARTINEZ-BARCO P,BALAHUR A.Subjectivity and sentiment analysis:an overview of the current state of the area and envisaged developments[J].Decision Support Systems,2012,53(4):675-679.
  • 8郗亚辉,张明,袁方,王煜.产品评论挖掘研究综述[J].山东大学学报(理学版),2011,46(5):16-23. 被引量:15
  • 9CARENINI G,CHEUNG J C K,PAULS A.Multi-document summarization of evaluative text[J].Computational Intelligence,DOI:10.1111/j.1467-8640.2012.00417.x.
  • 10BLAIR-GOLDENSOHN S,HANNAN K,MCDONALD R,et al.Building a sentiment summarizer for local service reviews[C] //Proceedings of International Workshop on NLP Challenges in the Information Explosion Era.New York,N.Y.,USA:ACM,2008.

二级参考文献57

共引文献193

同被引文献224

  • 1张振刚,罗泰晔.基于在线评论数据挖掘和Kano模型的产品需求分析[J].管理评论,2022,34(11):109-117. 被引量:40
  • 2李永锋,周俊,朱丽萍.基于田口质量观的老年人电子产品用户体验评价研究[J].机械设计,2020,37(2):131-137. 被引量:10
  • 3郭伟,胡明艳.基于Web源的客户需求获取及分析方法[J].计算机集成制造系统,2004,10(9):1165-1170. 被引量:15
  • 4徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:124
  • 5KIM S M, HOVY E. Determining the sentiment of opinions[C]//Proceedings of the 20th International Conference on Computational Linguistics (COLING).Morristown:Association for Computational Linguistics, 2004:1367-1373.
  • 6KAMAL A, ABULAISH M, ANWAR T. Mining feature-opinion pairs and their reliability scores from web opinion sources [C]//Proceedings of International Conference on Web Intelligence, Mining and Semantics(WIMS'2012). [S.l.]:[s.n.], 2012.
  • 7WILSON T, WIEBE J, HOFFMANN P. Recognizing contextual polarity in phrase-level sentiment analysis [C]//Proceedings of Human Language Technologies Conference/Conference on Empirical Methods in Natural Language(HLT/EMNLP 2005). Vancouver, BC, Canada: [s.n.], 2005:347-354.
  • 8PANG Bo, LEE L. Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales [C]//Proceedings of the 43rd Annual Meeting of Association for Computational Linguistics. Somerset: ACL, 2005:115-124.
  • 9TAN Songbo, ZHANG Jin. An empirical study of sentiment analysis for Chinese documents [J]. Expert Systems with Applications, 2008, 34(4):2622-2629.
  • 10APTE C. Automated learning of decision rules for text categorization [J]. ACM transactions on information systems, 1994, 12: 233-251.

引证文献21

二级引证文献177

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部