期刊文献+

一种新的潜在语义分析语言模型 被引量:3

A new latent semantic analysis language model
在线阅读 下载PDF
导出
摘要 提出了基于聚类的方法实现词的快速量化表示,并由此导出潜在语义分析语言模型预测置信度,同时运用新提出的几何加权静态插值方式同三元文法模型相结合,构建了一种新的潜在语义分析语言模型,并将其应用于汉语语音识别.实验表明其效率和性能均优于传统基于奇异值分解的潜在语义分析语言模型,相比于三元文法模型,识别错误率相对下降为3.6%~7.1%左右,并为有效量化表示词对进一步提高潜在语义分析语言模型性能提供了新的途径. In this paper, latent semantic analysis automatically uncovered the salient semantic relationships between words in a given training corpus by a novel faster method for quantizing word via clustering, it was used for mandarin speech recognition through combining with trigram model via a new proposed static geometric weighting interpolation manner. Experiments show that it outperformed the traditional singular value decomposition-based latent semantic analysis model for its better efficiency and performance. Compared with the trigram model, the reduction of relative recognition error rate is about 3.6% -7.1%. Furthermore, it provides a novel approach for improving latent semantic analysis model through quantizing word pair effectively.
出处 《高技术通讯》 CAS CSCD 北大核心 2005年第8期1-5,共5页 Chinese High Technology Letters
基金 国家高技术研究发展计划(863计划)
关键词 语言模型 语音识别 N元文法 潜在语义分析 奇异值分解 汉语语音识别 模型性能 模型预测 插值方式 量化表 language model, speech recognition, N-gram, latent semantic analysis, singular value decomposition
  • 相关文献

参考文献7

  • 1Deerwester S, Dumais S T, Furnas G W, et al. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 1990, 41 : 391.
  • 2Bellegarda J R. A mtdtispan language modeling framework for large vocabtdary speech recognition. IEEE Trans Speech Audio Processing, 1998, 6:456.
  • 3Bellegarda J R. Exploiting latent semantic information in statistical language modeling. Proceedings of IEEE, 2000, 8:1279.
  • 4Berry M W. Large-scale sparse singular value computations.The International Journal of Supercomputer Applications, 1992,6:13.
  • 5Martin S, Liermann J, Ney H. Algorithms for bigram andtrigram word clustering. Speech Commutation, 1998, 1 : 19.
  • 6Coccaro N, Jurafsky D. Towards better integration ofsemantic predictors in statistical language modeling. In: Proceedings of ICSLP. Sydney, Australia, 1998,6:2403.
  • 7王作英.基于段长分布的HMM语音识别模型[A]..第二届全国汉字语音识别会议[C].庐山,1989..

共引文献3

同被引文献55

引证文献3

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部