摘要
本文介绍了搜索引擎的四个组成部分 :搜索器、索引器、检索器和用户接口 ,并分析其工作原理 ;给出搜索引擎中包含的关键技术算法 :分词技术、多路归并算法和大文件处理技术 ;最后结合当前最新的几种搜索引擎 ,探讨搜索引擎在多语言处理、专业化和有效性等方面的技术改进目标。
In this paper we introduce the four components of a search engine: spider, indexer, searcher and UI, and analyse its working mechanism. We also give the key algorithms included in the search engine: the word split algorithm, the merge sort algorithm and the large file processing technology. With several major search engines we point out how to improve in the aspects of multilingual processing, specialization and efficiency.
出处
《计算机工程与科学》
CSCD
2002年第4期18-20,共3页
Computer Engineering & Science