
Research Advances in Computer Lip-Reading (计算机唇读研究进展)
Abstract: Computer lip-reading, an active topic in human-computer interaction, recognizes what a speaker says by analyzing visual speech information such as lip movement. It can be fused with audio speech information to further improve recognition accuracy, making human-computer interaction more natural. This paper surveys progress in the field by examining each stage of a computer lip-reading system, discusses the strengths and weaknesses of existing methods, and finally identifies open problems for further research.
Source: Journal of Data Acquisition and Processing (《数据采集与处理》), indexed in CSCD and the Peking University Core Journals list, 2007, No. 3, pp. 353-359 (7 pages)
Funding: Supported by the National Natural Science Foundation of China (60121101)
Keywords: lip-reading; location; feature extraction; information fusion