摘要
如何在海量的非结构文档内容中准确、快捷找到自己所需要的信息,是信息检索技术的研究重点。全文检索是现代信息检索技术一个非常重要的分支,是解决非结构化数据检索需求的重要技术手段。以已发布的各类通信业务管理规范的全文检索需求为切入点,设计并实现了适用于国家级气象信息化业务管理的非结构化文档全文检索系统。该系统基于Java技术,并采用Lucene技术框架,对业务规范信息进行了分析和重新数据组织,确保良好的检索时效与准确率。系统应用后能快速应对业务变化,在已有的大量的规定、规范、标准和公文函件中迅速、准确、全面地查找有关资料信息,帮助用户准确把握气象信息化发展脉络。
.To find the information needed accurately and quickly from massive unstructured document content has become one of the information retrieval technology research priorities recently. Full-text retrieval is one of the key technologies of the search engine, and a powerful tool for dealing with nonstructural data. This paper develops an example of unstruc- tured documents full-text retrieval system, to meet the demand of national meteorological information system regulations' text retrieval. The system is based on Java technology and uses Lucene framework, functioned in data analyzing and re-or- ganizing to improve the efficiency and accuracy as well. The system put into service shows the ability of retrieve data in a quick, accurate and comprehensive way, reflecting the precise changes of the existing database of regulations,policies, standards and documentations to help the user.
出处
《软件导刊》
2013年第10期100-102,共3页
Software Guide
基金
中国气象局气象关键技术集成与应用项目(2012)