摘要
[目的/意义]超声检查是判断患者病情的重要依据,目前主要检查数据是以文本形式存在。本文提出一种基于超声检查数据的文本结构化和知识网络构建方法,为进一步挖掘临床知识奠定数据基础。[方法/过程]对自然语言处理技术在超声文本环境下的应用进行改进,包括分词处理、内容定位、结构化识别三个主要步骤,实现对超声文本的切分与标记,并且在此基础上建立其结构化知识网络。[结果/结论]真实数据测试结果显示,本文提出的面向超声检查文本的结构化方法具有较好的性能表现。该方法可以实现对批量超声文本结构化网络的自动构建,能够反映超声文本中结构化内容的层次关系与属性结构等潜在知识。
[ Purpose/significance ] Ultrasound examination is an important basis for diagnosis, but the major examination data is in the form of text. So, based these data, this paper studies a method that can automatically structure natural language texts and construct knowledge network, which lays the data foundation for further mining clinical knowledge hidden in EMR.[ Method/process ] This paper improved the application of natural language processing technology in ultrasonic, including three main steps: segmentation processing, content location and structured recognition, to realize the segmentation and labeling of ultrasonic text, and on this basis, the ultrasound examination knowledge network was established.[Result/ conclusion ] The test results of real data show that the method for structuring ultrasound texts proposed in this paper has better performance. This method can realize the automatic construction of knowledge network of batch ultrasound texts, and can reflect the potential knowledge of hierarchical relationship and attribute structure of structured content in ultrasonic text.
作者
尚小溥
许吴环
赵红梅
张润彤
朱燊
Shang Xiaopu;Xu Wuhuan;Zhao Hongmei;Zhang Runtong;Zhu Shen(Department of Information Management, School of Economic Management, Beijing Jiaotong University, Beijing 100044;Peking University People's Hospital, Beijing 100044)
出处
《图书情报工作》
CSSCI
北大核心
2019年第16期112-120,共9页
Library and Information Service
基金
国家自然科学基金项目“面向临床决策辅助的电子病历文本结构化方法与知识挖掘研究”(项目编号:61702033)
教育部人文社科项目“基于电子病历文本的临床知识挖掘研究”(项目编号:17YJC870015)研究成果之一
关键词
超声文本
自然语言处理
文本结构化
知识网络
ultrasonic text
natural language processing
text structuring
knowledge network