期刊文献+

LSA Based Classification of Advertising Spam Reviews

LSA Based Classification of Advertising Spam Reviews
在线阅读 下载PDF
导出
摘要 In this study, methods to classify advertising reviews from shopping mall reviews are suggested. Advertising reviews are mostly written by companies and contain advertising contents. There are a few studies regarding the classification of opinion spam documents, which is very rare in foreign studies; however, there are no studies that classify advertising reviews from Korean reviews. In this study, the Naive Bayes Classifier was used to classify review documents and the POS (Part-of-Speech)-Tagging and bigram methods were used to extract specific words. The frequency calculation methods for the probability value of specific words were: (1) The general number of appearances of words (2) the frequency calculation of specific words through the suggested Latent Semantic Analysis (LSA), and by recalculating the result from (1) in (2), the performances of each method were compared. As a result, the methods from (2) showed 88.43% accuracy which is 8.89% higher than 79.54% which was the previous result from using the POS-Tagging + Bigram method. Therefore, it was proved that the method suggested in this study is effective at classifying or extracting advertising reviews from Korean product review documents.
出处 《Computer Technology and Application》 2011年第12期998-1006,共9页 计算机技术与应用(英文版)
关键词 Opinion review spam review advertising review latent semantic analysis (LSA). 朴素贝叶斯分类器 广告内容 垃圾文件 LSA 评论 频率计算 潜在语义分析 POS机
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部