

Conversation Generation Based on Variational Attention Knowledge Selection and Pre-trained Language Model
Abstract: Research on knowledge-grounded dialogue often suffers from the problem that external knowledge contains redundant, or even noisy, information irrelevant to the conversation topic, which degrades the performance of the dialogue system. Knowledge selection has become an important approach to solving this issue. However, existing work has not yet investigated in depth questions such as how to design a knowledge selector, how to exploit the selected knowledge, and which scenarios are suitable for knowledge-selection conversation methods. In this paper, we propose a new neural conversation method based on variational attention knowledge selection and a pre-trained language model. The method employs a knowledge selection algorithm based on a conditional variational autoencoder (CVAE) and a multi-layer attention mechanism to select the collection of textual knowledge most relevant to the current conversation; this algorithm effectively exploits the dialogue responses in the training data to improve the efficiency of knowledge selection. The model adopts the pre-trained language model Bart as its encoder-decoder architecture, incorporates the selected textual knowledge into the Bart model, and fine-tunes it during training. Experimental results show that, compared with representative existing methods, the proposed model generates more diverse and coherent dialogue responses with higher accuracy.
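The abstract describes a selector that samples a latent variable from a posterior conditioned on both context and response (available at training time), then attends over candidate knowledge. The following is a minimal NumPy sketch of that idea, not the paper's implementation: all dimensions, weight matrices, and encoded vectors are toy assumptions standing in for Bart encoder states.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

d = 8   # hidden size (toy)
K = 4   # number of candidate knowledge sentences

# Stand-ins for encoded dialogue context, gold response, and knowledge sentences
ctx = rng.normal(size=d)
resp = rng.normal(size=d)
knowledge = rng.normal(size=(K, d))

# Posterior network q(z | context, response) - usable at training time only,
# since it conditions on the gold response
W_post = rng.normal(size=(2 * d, 2 * d)) * 0.1
h = W_post.T @ np.concatenate([ctx, resp])
mu, logvar = h[:d], h[d:]

# Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I)
eps = rng.normal(size=d)
z = mu + np.exp(0.5 * logvar) * eps

# Attention over knowledge conditioned on the latent variable:
# dot-product scores between z and each knowledge vector
weights = softmax(knowledge @ z)

# Pick the highest-weighted knowledge sentence to feed into the decoder
selected = int(weights.argmax())
print(weights.round(3), selected)
```

At inference time the response is unavailable, so a prior network p(z | context) would replace the posterior, with the two distributions tied by a KL term during training, as is standard for CVAE-based models.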
Authors: Zhang Naizhou (张乃洲), Cao Wei (曹薇), Zhang Xiaojian (张啸剑), Li Shijun (李石君) (College of Computer and Information Engineering, Henan University of Economics and Law, Zhengzhou 450046; School of Computer Science, Wuhan University, Wuhan 430072)
Source: Journal of Computer Research and Development (《计算机研究与发展》, Peking University Core Journal), 2025, No. 8, pp. 1902-1917 (16 pages)
Funding: National Natural Science Foundation of China (62072156); Henan Provincial Natural Science Foundation for Distinguished Young Scholars (252300421061); Henan Provincial Science and Technology Research Project (242102210076)
Keywords: knowledge-grounded dialogue; knowledge selection; pre-trained language model; CVAE; attention mechanism; memory network