期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Multi-Task ConvMixer Networks with Triplet Attention for Low-Resource Keyword Spotting
1
作者 Alexander Rogath Kivaisi Qingjie Zhao Yuanbing Zou 《Tsinghua Science and Technology》 2025年第2期875-893,共19页
Customized keyword spotting needs to adapt quickly to small user samples.Current methods primarily solve the problem under moderate noise conditions.Recent work increases the level of difficulty in detecting keywords ... Customized keyword spotting needs to adapt quickly to small user samples.Current methods primarily solve the problem under moderate noise conditions.Recent work increases the level of difficulty in detecting keywords by introducing keyword interference.However,the current solution has been explored on large models with many parameters,making it unsuitable for deployment on small devices.When applying the current solution to lightweight models with minimal training data,the performance degrades compared to the baseline model.Therefore,we propose a light-weight multi-task architecture(<9.0×10^(4)parameters)created from integrating the triplet attention module in the ConvMixer networks and a new auxiliary mixed labeling encoding to address the challenge.The results of our experiment show that the proposed model outperforms similar light-weight models for keyword spotting,with accuracy gains ranging from 0.73%to 2.95%for a clean set and from 2.01%to 3.37%for a mixed set under different scales of training set.Furthermore,our model shows its robustness in different low-resource language datasets while converging faster. 展开更多
关键词 KeyWord Spotting(KWS) multi-task learning cross-dimension attention low-resource mixed speech
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部