摘要
随着Internet的迅速发展,使得"数据丰富而信息贫乏"这对矛盾显得日益突出,数据挖掘技术正是应了这一需求而结合了机器学习、模式识别、统计学、人工智能、神经网络等多学科而出现的一项新技术,基于Web的数据挖掘是数据挖掘技术在网络信息处理中的应用。本文叙述了Web数据挖掘的概念、分类、技术等,重点讨论了基于XML语言的Web数据挖掘技术,解决了Internet上绝大多数非结构化甚至是无结构的、Web信息的组织结构性差而导致的Web数据挖掘困难的问题。
With the rapid development of internet, the phenomenon of "data is rich but information is poor" is become more and more evident, data mining technology that cater the demand and integrate the machine learning, pattern recognition, statistics, artificial intelligence, nerve network and so on that become a new technology, Web-based data mining is defined as the application of data mining technology on the network information processing. This paper describes the concept,classification and technology of web-based data mining, then discusses the web data mining based on XML,solved the web data mining problem which is caused by the non-structure of the much Internet data and the poor structure of the Web information.
出处
《科技广场》
2010年第1期73-75,共3页
Science Mosaic