期刊文献+

微博用户分类的特征词权重优化及推荐策略 被引量:1

The Strategy of Feature Weight Optimizing and Feature Words Recommendation based on the Classificationof Weibo Users
在线阅读 下载PDF
导出
摘要 文章提出了与时间因素相结合的特征词权重更新方式以及关联分析与TF-IDF相结合的特征词推荐策略,通过实验数据证明可以及时更新特征词权重、大幅提高特定话题相关特征词推荐的有效性,从而提高对用户分类的准确度。文章提出的两个策略优于传统策略,通过实验数据证明有效可行。 The solutions presented in this article isthat update the weight with time factor and the new feature words recommendation strategy combine with TF-IDF. The data bear out that it can update the weight timely and improve the microblogusers classification,and the high degreeofaccurancy in feature words recommendation.The two strategies presented in this paper do well in the actual word, and the data bear out that two strategies is feasible.
机构地区 国防科技大学
出处 《信息网络安全》 2012年第8期136-139,共4页 Netinfo Security
基金 国家863计划项目[2010AA012505 2011AA010702 2012AA01A401 2012AA01A402] 自然科学基金课题[60933005] 科技支撑计划课题[2012BAH38B04] 国家242信息安全计划课题[2011A010]
关键词 特定话题 TF—IDF关联分析 分类算法 特征权重 given theme TF-IDF correlation analysis classification feature weight
  • 相关文献

参考文献10

  • 1中国互联网信息中心(CNNIC)第23次中国互联网络发展状况统计报告[DB/OL]2009.lhttp://www.cnnic.netcn/uploadfiles/pdf/2009/1/13/92458pdf.
  • 2周运清,苏娜.网络行为与社会控制[J].情报杂志,1999,18(3):11-13. 被引量:17
  • 3张玉芳,彭时名,吕佳.基于文本分类TFIDF方法的改进与应用[J].计算机工程,2006,32(19):76-78. 被引量:121
  • 4台德艺,王俊.文本分类特征权重改进算法[J].计算机工程,2010,36(9):197-199. 被引量:26
  • 5HumbefioT.MarquesNt.CharaeterizingBroadbandUserBehaviorNRBC' 04[C] 2004.
  • 6Ron Hutehins Usage Characteristics of Dial-in lnternet Users:A National Study[C] 2002.
  • 7Alfredo Alba, VarunBhagwan, Tyrone Grandison. Accessing the deep web: when good ideas go bad[C]. Companion to the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications. 2009: 815-818.
  • 8KuaiXu, Zhi-LiZhang, SuPratikBattaeharrya, Profiling Internet BaekboneTrattie:Behavior Models and Applications[C]. In:ACMSigcomm 2005.PhiladelPhia, PA.Angust2005.
  • 9RamineTinati, Identifying User Types Within Twitter[DB/OL].
  • 10百度百科.Apfiofi算法[EB/OL]http://baike.baidu.com.

二级参考文献19

  • 1唐焕玲,孙建涛,陆玉昌.文本分类中结合评估函数的TEF-WA权值调整技术[J].计算机研究与发展,2005,42(1):47-53. 被引量:26
  • 2Rocchio J.The SMART Retrieval System:Experiments in Automatic Document Processing[M].Englewood Cliffs,USA:Prentice-Hall,1971.
  • 3Salton G,Buckley C.Term Weighting Approaches in Automatic Text Retrieval[J].Information Processing and Management,1988,24(5):513-523.
  • 4Salton G.Developments in Automatic Text Retrieval[J].Science,1991,253(5023):974-979.
  • 5Sebastiani F.Machine Learning in Automated Text Categoriza-tion[J].ACM Computing Surveys,2002,34(1):1-47.
  • 6Shankar S,Karypis G.A Feature Weight Adjustment Algorithm for Document Categorization[C]//Proc.of KDD'00.New York,USA:ACM Press,2000.
  • 7Forman G.BNS Feature Scaling:An Improved Representation over TF-IDF for SVM Text Classification[C]//Proc.of the 12th ACM Conference on Information and Knowledge Management.Napa Valley,CA,USA:ACM Press,2008:26-30.
  • 8Zhang Yuntao,Gong Ling,Wang Yongcheng.An Improved TF-IDF Approach for Text Classification[J].Journal of Zhejiang University,2005,6A(1):49-55.
  • 9梁久祯,兰东俊.基于先验知识的网页特征压缩与线性分类器设计[C].第十二届全国神经计算学术大会讨论文集.北京:人民邮电出版社,2002:494-501.
  • 10Rudolph G.Convergence Properties of Canonical Genetic Algorithms[J].IEEE Trans.on Neural Networks,1994,5(1):96-101.

共引文献158

同被引文献17

  • 1艾瑞网.五大数据解读微博Q1财报[EB/OL].2015-5-18.http://web2.iresearch.cn/media/20150518/249980.shtml.
  • 2Oulasvirta A,Lehtonen E, Kurvinen E,Raento M. Making the ordinary visible in microblogs[J]. Personal & Ubiquitous Computing, 2010,14(3): 237-249.
  • 3Takhteyev Y, Gruzd A,Wellman B. Geography of Twitter networks[J]. Social Networks, 2012,34(1): 73-81.
  • 4Levinson M H. The tipping point: how little things can make a big difference[J]. Business Economics,2007,25(6): 580-580.
  • 5Java A,Song X,Finin T,Tseng B. Why we twitter: understanding microblogging usage and communities[C], proceedings of the Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis: 56-65.
  • 6Naaman M, Boase J, Lai C H. Is it really about me: message content in social awareness streams[J]. Proc Cscw, 2010,189-192.
  • 7Suh B, Hong L,Pirolli P, Chi E H. Want to be retweeted large scale analytics on factors impacting retweet in twitter network [C] proceedings of the social computing (SocialCom). 2010 IEEE Second International Conference on: 177-184.
  • 8Ren D,Zhang X,Wang Z,Li J,Yuan X. Weibo Events: A crowd sourcing weibo visual analytic system[C]. Proceedings of the 2014 IEEE Pacific Visualization Symposium (PacificVis): 330-334.
  • 9刘乙坐,黄奇杰.传播学视野下的微博基本分类初探[J].中国科技信息,2011(5):148-150. 被引量:6
  • 10靖鸣,王瑞.微博暴力的成因及其应对之策[J].新闻与写作,2012(2):31-34. 被引量:9

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部