期刊文献+

嵌入意图识别的医疗健康问答文本语义分类模型 被引量:5

Semantic Classification Model for Healthcare Q&A Texts with Embedded Intent Recognition
原文传递
导出
摘要 【目的】基于意图识别构建更准确的语义相似度分类模型,为中文医疗健康查询服务提供更精准的答案匹配结果。【方法】融合BERT和卷积神经网络(CNN)构建意图识别模型,然后将其作为模型嵌入层构建嵌入意图识别的孪生BERT(ITBERT)语义分类模型。【结果】在CHIP-STS数据集上,相较于BERT和TextCNN单一模型,融合模型在意图识别的Top-1结果准确率分别提高了1.5和8.2个百分点,达到了73.6%;Top-3结果准确率分别提高了3.2和7.6个百分点,达到了91.2%,证明了融合模型对于意图识别效果的提升。对于语义相似度分类结果,相较于基准模型,ITBERT模型AUC值提高了0.015~0.087,证明了意图知识的嵌入对医疗语义相似度分类效果的提升。【局限】人工标注的意图信息存在一定的偏差,可能会影响语义相似度的分类结果。【结论】融合模型可以改善医疗健康查询服务中的意图识别效果,嵌入识别的意图知识可以提高语义相似度分类模型的准确率,有利于提供更准确的医疗健康自动问答服务。 [Objective]This study builds a more accurate semantic similarity classification model based on intent recognition to provide precise answer-matching results for Chinese medical and health Q&A services.[Methods]We integrated BERT and Convolutional Neural Networks(CNN)to construct an intent recognition model,which is used as an embedding layer to develop an intent-recognizing twin BERT(ITBERT)semantic classification model.[Results]On the CHIP-STS dataset,compared to single BERT and TextCNN models,the integrated model improved the Top-1 accuracy of intent recognition by 8.2%and 1.5%,reaching 73.6%.The Top-3 accuracy improved by 7.6%and 3.2%,reaching 91.2%,demonstrating the new model's enhanced intent recognition effectiveness.For semantic similarity classification,the ITBERT model improves the AUC value by 0.015 to 0.087 compared to benchmark models,proving that embedding intent knowledge improves the effectiveness of medical semantic similarity classification.[Limitations]Manually annotated intent information may contain biases and affect the classification results of semantic similarity.[Conclusions]The proposed model can improve intent recognition in medical and health Q&A services.Embedding intent knowledge enhances the accuracy of semantic similarity classification models,contributing to more precise automated Q&A services.
作者 谌文佳 杨琳 李金林 Chen Wenjia;Yang Lin;Li Jinlin(College of Management Science and Engineering,Beijing Information Science&Technology University,Beijing 100192,China;School of Management,Beijing Institute of Technology,Beijing 100081,China)
出处 《数据分析与知识发现》 北大核心 2025年第2期26-38,共13页 Data Analysis and Knowledge Discovery
基金 国家自然科学基金项目(项目编号:71972012) 北京信息科技大学基金项目(项目编号:2023XJJ21)的研究成果之一。
关键词 意图识别 医疗问答 语义相似度分类 深度学习 Intent Recognition Medical Question Answering Semantic Similarity Classification Deep Learning
  • 相关文献

参考文献20

二级参考文献141

共引文献247

同被引文献55

引证文献5

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部