A Novel Feature Selection Framework for Automatic Web Page Classification (cited by: 3)
Authors: J. Alamelu Mangai, V. Santhosh Kumar, S. Appavu alias Balamurugan. International Journal of Automation and Computing (EI-indexed), 2012, No. 4, pp. 442-448 (7 pages).
The number of Internet users and the number of web pages added to the World Wide Web increase dramatically every day. Web pages must therefore be classified into web directories automatically and efficiently, which helps search engines return relevant results quickly. Because web pages are represented by thousands of features, feature selection helps web page classifiers cope with this high dimensionality. This paper proposes a new feature selection method based on Ward’s minimum variance measure. The measure is first used to identify clusters of redundant features in a web page. In each cluster, the best representative features are retained and the others are eliminated. Removing such redundant features helps minimize resource utilization during classification. The proposed method is compared with other common feature selection methods. Experiments on a benchmark data set, WebKB, show that the proposed method performs better than most of the other feature selection methods in terms of reducing the number of features and the classifier modeling time.
Keywords: feature selection; web page classification; Ward’s minimum variance; information gain; WebKB
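The clustering idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `ward_cost` and `select_features`, the fixed target number of clusters, and the rule for picking a representative (the feature closest to its cluster centroid) are all assumptions made for illustration, since the abstract does not specify them. The sketch treats each feature as a point whose coordinates are its values across pages, merges the pair of clusters with the smallest Ward cost until the requested number of clusters remains, and keeps one feature per cluster.

```python
from typing import List, Sequence


def ward_cost(ca: Sequence[float], na: int, cb: Sequence[float], nb: int) -> float:
    """Increase in within-cluster sum of squares if two clusters were merged."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(ca, cb))
    return na * nb / (na + nb) * sq_dist


def select_features(rows: Sequence[Sequence[float]], n_clusters: int) -> List[int]:
    """Return sorted indices of retained features, one per feature cluster.

    rows: page-by-feature matrix (one row per web page).
    n_clusters: hypothetical parameter, the number of features to keep.
    """
    n_features = len(rows[0])
    # Each feature becomes a point: its column of values across all pages.
    columns = [[row[j] for row in rows] for j in range(n_features)]
    clusters = [[j] for j in range(n_features)]
    centroids = [list(col) for col in columns]

    # Agglomerative merging: always join the pair with the lowest Ward cost.
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                cost = ward_cost(centroids[a], len(clusters[a]),
                                 centroids[b], len(clusters[b]))
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        na, nb = len(clusters[a]), len(clusters[b])
        # The merged centroid is the size-weighted mean of the two centroids.
        centroids[a] = [(na * x + nb * y) / (na + nb)
                        for x, y in zip(centroids[a], centroids[b])]
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b], centroids[b]  # b > a, so index a stays valid

    kept = []
    for members, centroid in zip(clusters, centroids):
        # Assumption: the representative is the member closest to the centroid.
        rep = min(members, key=lambda j: sum(
            (x - y) ** 2 for x, y in zip(columns[j], centroid)))
        kept.append(rep)
    return sorted(kept)


# Toy data: features 0 and 1 are near-duplicates, as are features 2 and 3.
pages = [
    [1.0, 1.1, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.9],
    [1.0, 0.9, 0.0, 0.1],
    [0.0, 0.1, 1.0, 1.0],
]
kept = select_features(pages, n_clusters=2)
print(kept)  # one index from each redundant pair
```

The exhaustive pair search keeps the sketch short but is only practical for small feature sets. With thousands of features, a library routine for hierarchical clustering would be the more realistic choice.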