Abstract
With the development of networking and data mining technologies, Web data mining has attracted considerable research attention. Taking the perspective of Web structure mining, and building on an analysis of the overall structure of the Web's directed graph, navigation pages, target pages, and network functions, this paper studies structure mining algorithms. To address problems such as the multi-topic nature of Hub pages, irrelevant pages, and irrelevant links, it proposes an improved version of the HITS algorithm.
Source
《计算机工程》
CAS
CSCD
PKU Core Journals (北大核心)
2005, No. B07, pp. 125-127 (3 pages)
Computer Engineering