摘要
基于Hadoop开源分布式计算框架和Mahout协同过滤推荐引擎技术构建图书推荐引擎系统,并利用云模型和Pearson系数对传统协同过滤推荐算法进行改进,改善传统单机推荐算法在高维稀疏矩阵上进行运算所导致的系统性能不佳及推荐结果不准确的问题。利用实验对分布式推荐平台的整体性能及改善后的协同过滤推荐算法进行测试评估,发现当虚拟机节点不断增加时,协同过滤推荐引擎的计算时间不断减少,这表明推荐引擎系统的总体性能较传统单机推荐引擎得到提升;利用MAE分别对原始协同过滤推荐效果和改进后的推荐算法进行测评,发现改进后的推荐引擎算法的推荐准确率较改进前提高13.1%。
Firstly, this paper builds a book recommendation engine system based on the Hadoop open source distributed computing framework and mahout collaborative filtering recommendation engine technology. Then it takes advantage of the cloud model and Pearson coefficient to improve the traditional collaborative filtering recommendation algorithm, aml resolves the problems of poor system performance and recommendation results inaccurate of traditional stand -alone recommendation algorithm in high-dimensional sparse matrix operations. Thirdly, it experiments and evaluates the overall performance of the distributed recommendation platform and the improved collaborative filtering algorithm. It finds that: ( 1 ) when the virtual machine nodes are increasing, the computation time of collaborative filtering recommendation engine is declining in the experimental tests, which shows that the overall performance of the system has been improved. (2) it improves the mahout original collaborative filtering recommendation engine with the Pearson coefficient and evaluates the recommended effect with MAE indices of the original collaborative filtering recommendation algorithm, which finds the recommendation accuracy rate increases 13.1% and the subjectivity differences of user ratings have great impact on the recommendation accuracy.
出处
《图书情报工作》
CSSCI
北大核心
2013年第18期116-121,共6页
Library and Information Service