AdaptForever: Elastic and Mutual Learning for Continuous NLP Task Mastery
Authors: Ke Chen, Cheng Peng, Xinyang He, Jiakang Sun, Xu Liu, Xiaolin Qin, Yong Zhong. Computers, Materials & Continua, 2025, Issue 3, pp. 4003-4019 (17 pages).
In natural language processing (NLP), managing multiple downstream tasks through fine-tuning pre-trained models often requires maintaining separate task-specific models, leading to practical inefficiencies. To address this challenge, we introduce AdaptForever, a novel approach that enables continuous mastery of NLP tasks through the integration of elastic and mutual learning strategies with a stochastic expert mechanism. Our method freezes the pre-trained model weights while incorporating adapters enhanced with mutual learning capabilities, facilitating effective knowledge transfer from previous tasks to new ones. By combining Elastic Weight Consolidation (EWC) for knowledge preservation with specialized regularization terms, AdaptForever successfully maintains performance on earlier tasks while acquiring new capabilities. Experimental results demonstrate that AdaptForever achieves superior performance across a continuous sequence of NLP tasks compared to existing parameter-efficient methods, while effectively preventing catastrophic forgetting and enabling positive knowledge transfer between tasks.
Keywords: adapter-tuning; large language model; pre-trained language model; parameter-efficient fine-tuning; continual learning; mutual learning; mixture of experts
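
The abstract describes two standard ingredients: a frozen pre-trained backbone with trainable adapters, and an EWC-style quadratic penalty that anchors parameters important for earlier tasks. The sketch below illustrates those generic mechanisms in PyTorch; the fisher and old_params dictionaries, the "adapter" naming convention, and the lam coefficient are illustrative assumptions, not the paper's actual AdaptForever implementation.

    import torch

    def freeze_backbone(model):
        # Keep only adapter parameters trainable; the pre-trained weights stay frozen.
        # (Assumes adapter modules carry "adapter" in their parameter names.)
        for name, param in model.named_parameters():
            param.requires_grad = "adapter" in name

    def ewc_penalty(model, fisher, old_params, lam=0.4):
        # Quadratic EWC term: (lam / 2) * sum_i F_i * (theta_i - theta_i*)^2,
        # where F_i is the diagonal Fisher information estimated on earlier tasks
        # and theta_i* are the parameter values saved after learning those tasks.
        penalty = 0.0
        for name, param in model.named_parameters():
            if param.requires_grad and name in fisher:
                penalty = penalty + (fisher[name] * (param - old_params[name]) ** 2).sum()
        return 0.5 * lam * penalty

    # Typical use while training on a new task:
    #   loss = task_loss + ewc_penalty(model, fisher, old_params)
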