摘要
Web资源含有大量的有用信息 ,但由于它们欠结构化 ,不能为传统的数据库型查询系统所利用。如何将这些信息抽取出来 ,转化成结构化信息 ,供其它信息集成系统所利用 ,成为该领域的研究热点。本文介绍了一个简单的 Web信息抽取模型 ,对于基于该模型的 wrapper归纳技术进行了探讨 ,并描述了一个
There is plenty of useful information in web resource.It can't be used by the traditional database query system because it is not well-structured.Recently considerable attention has been received on how to extract it from web resource and transfer it to structured information that can be used by other information integration systems.This paper presents a simple web information extraction model,discussed the technology of wrapper induction based on the model and describes automatic generation prototype system of wrapper.
出处
《情报科学》
CSSCI
北大核心
2002年第12期1282-1284,共3页
Information Science