摘要
文本挖掘技术是从海量文本信息中获取潜在有用知识的有效途径。传统的文本挖掘方法由于不能有效运用语义信息而难以达到更高的准确度。本体论为语义信息的合理表示和有效组织提供了理论支持和技术手段。介绍和分析了常识本体和领域本体以及基于这些本体的文本挖掘方法。
Text mining is an effective means of detecting potentially useful knowledge from large text database. However, conventional text mining technology cannot achieve high accuracy, because it cannot effectively make use of the semantic information of the text. Ontology provides theoretical basis and technical support for semantic information representation and organization. This paper introduces common ontology and domain ontology, and analyzes text mining technology based on these ontologies.
出处
《计算机应用》
CSCD
北大核心
2006年第9期2013-2015,共3页
journal of Computer Applications
基金
国家863计划资助项目(2004AA1120202003AA1152102003AA111020)
关键词
文本挖掘
本体论
常识本体
领域本体
text mining
ontology
common ontology
domain ontology