摘要
在分析Hadoop框架与TF-IDF算法的基础上,给出了TF-IDF算法在Hadoop分布式框架下的具体实现。实验表明,在处理大数据量时,与传统方法相比,新方法的效率更高。
In this paper, carefully analyzed the Hadoop framework and TF-IDF algorithm, give the TF-IDF algorithm based on the Hadoop framework. Experiments show that in the case of massive data computing, the new method applying Hadoop framework is more efficient than the traditional methods.
出处
《微型机与应用》
2012年第7期14-16,共3页
Microcomputer & Its Applications