期刊文献+

基于近邻用户和近邻项目的协同过滤改进算法

Incremental collaborative filtering algorithm based on N-nearest users and N-nearest items
在线阅读 下载PDF
导出
摘要 协同过滤算法研究正面临两大挑战:一是提高推荐系统的质量,尤其是高维稀疏数据系统的推荐质量;二是提高算法的可伸缩性。为了解决该问题,笔者提出了一个基于用户近邻和项目近邻的协同过滤改进算法。为了提高系统在线推荐性能,该算法分2步:1)线下的相似度计算和近邻计算;2)在线预测。通过对N个用户近邻和N个项目近邻的有效结合,该算法在线计算的空间复杂度为O(N)且具有较好的可伸缩性。实验表明,与经典的Pearson协同过滤算法相比,该算法不仅提高了推荐性能,而且也适用于高维稀疏数据系统。 Researchers of Collaborative Filtering(CF) algorithms are facing challenges with improving the quality of recommendations for users with sparse data and improving the scalability of the CF algorithms.To address these issues,an incremental CF algorithm based on N-nearest users and N-nearest items is proposed in this paper.To improve the online recommendation performance,the algorithm is divided into two steps: 1) The offline computation of similarities and neighbors;2) The online prediction.By using N-nearest users and N-nearest items in the prediction generation,the algorithm requires an O(N) space for the online prediction computation and at the same time gets improvement of scalability.Experiments suggest that the incremental algorithm provides better quality than the best available classic Pearson correlation-based CF algorithms when the data set is sparse.
作者 张阳 申华
出处 《沈阳师范大学学报(自然科学版)》 CAS 2012年第3期382-385,共4页 Journal of Shenyang Normal University:Natural Science Edition
基金 国家自然科学基金资助项目(11172060)
关键词 协同过滤 推荐系统 稀疏数据 collaborative filtering recommender systems data sparsity
  • 相关文献

参考文献12

二级参考文献70

  • 1朱颢东,钟勇,赵向辉.一种优化初始中心点的K-Means文本聚类算法[J].郑州大学学报(理学版),2009,41(2):29-32. 被引量:13
  • 2刘罗曼,张颖南.因子分析和聚类分析的一个简单应用[J].沈阳师范大学学报(自然科学版),2005,23(2):130-132. 被引量:8
  • 3Xu HL,Wu X,Li XD,Yan BP.Comparison study of Internet recommendation system.Journal of Software,2009,20(2):350-362 (in Chinese with English abstract).http://www.jos.org.cn/1000-9825/3388.htm[doi:10.3724/SP.J.1001.2009.03388].
  • 4Marlin B.Collaborative Filtering:A machine learning perspective[MS.Thesis].Toronto:University of Toronto,2004.
  • 5Hofmann T.Latent semantic models for collaborative filtering.ACM Trans.on Information System,2004,22(1):89-115.[doi:10.1145/963770.963774].
  • 6Blei DM,Ng AY,Jordan MI.Latent Dirichlet allocation.Journal of Machine Learning Research,2003,3(3):993-1022.[doi:10.1162/ jmlr.2003.3.4-5.993].
  • 7Netflix update:Try this at home.2006.http://sifter.org/~simon/journal/20061211.html.
  • 8Zhang S,Wang WH,Ford J,Makedon F.Learning from incomplete ratings using non-negative matrix factorization.In:Ghosh J,ed.Proc.of the 6th SIAM Conf.on Data Mining.Bethesda:SIAM,2006.549-553.
  • 9Cheng YZ,Church GM.Biclustering of expression data.In:Bourne PE,ed.Proc.of the 8th Int'l Conf.on Intelligent Systems for Molecular Biology.La Jolla:AAAI Press,2000.93-103.[doi:10.1016/j.ipm.2008.12.004].
  • 10Cheng G,Wang F,Zhang CS.Collaborative filtering using orthogonal nonnegative matrix tri-factorization.Information Processing & Management,2009,45(3):368-379.

共引文献279

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部