期刊文献+

基于标点信息和统计语言模型的语音停顿预测 被引量:8

Prediction of Speech Pauses Based on Punctuation Information and Statistical Language Model
原文传递
导出
摘要 语音停顿被认为是有声语言的标点符号.在语言交流中,说话人会在韵律短语的边界处插入长短不同的停顿.利用这一性质,在调查标点符号停顿作用的基础上,提出基于标点信息预测语音停顿的思想,阐述基于标点和统计模型的训练语料自动获取以及语音停顿预测方法,讨论训练语料规模对模型性能的影响,并比较基于标点信息的自动获取语料与人工标注语料的性能.实验结果显示,汉语的标点提供有价值的停顿信息,基于汉语标点信息能够有效预测语音停顿. Speech pauses are considered as punctuation marks of spoken language. People always insert different pauses at the boundaries of rhythmic phrases when communicating by language. Based on this characteristic, the speech pause of punctuation marks is investigated and the concept of predicting speech pauses using punctuation information is proposed. The punctuation-based and SLM-based methods are introduced to obtain training corpus and predict speech pauses. The influence of training corpus size on the performance of model is discussed. And the performance of punctuation-based corpus and manually-labeled corpus is compared. Experimental results show that the Chinese punctuation supplies valuable information on pause, and the method based on punctuation information can predict the Chinese speech pauses effectively.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2008年第4期541-545,共5页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基会资助项目(No.60572159 60573184 60473139)
关键词 标点符号 语音停顿 统计语言模型 语料获取 Punctuation Marks, Speech Pause, Statistic Language Model, Corpus Obtaining
  • 相关文献

参考文献10

二级参考文献54

  • 1王洪君.汉语的韵律词与韵律短语[J].中国语文,2000(6):525-536. 被引量:106
  • 2蔡莲红,魏华武,周俏峰.汉语文-语转换中的语言学处理[J].中文信息学报,1995,9(1):31-36. 被引量:4
  • 3周强,俞士汶.汉语短语标注标记集的确定[J].中文信息学报,1996,10(4):1-11. 被引量:35
  • 4周强.一个汉语短语自动界定模型[J].软件学报,1996,7(A00):315-322. 被引量:9
  • 5叶军.停顿的声学征兆.第三界全国语音学研讨会论文集[M].北京:-,1996.21-22.
  • 6[1]Fodor J D. Prosodic disambiguation in silent reading. In: M Hirotani ed. Proceedings of the North East Linguistic Society 32. CSLA,University of Massachusetts, Amherst,2002
  • 7[3]Levelt W J M. Speaking: from intention to articulation. MTT Press, 1989
  • 8[4]Levelt W J M. Models of word production. Trends in Cognitive Sciences, 1999, 3(6): 223~232
  • 9[5]Sevald C A, Dell G S, Cole J S. Syllable structure in speech production: Are syllables chunks or schemas ? Journal of Memory and Language, 1995, 34:807~820
  • 10[6]Costa A, Sebastian-Gallés N. Abstract phonological structure in language production: evidence from Spanish. Journal of Experimental Psychology: Learning, Memory, and Cognition, 1998, 24(4): 886~903

共引文献84

同被引文献57

引证文献8

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部