摘要
自动语音识别(ASR)技术的目的是让机器能够"听懂"人类的语音,将人类语音信息转化为可读的文字信息,是实现人机交互的关键技术,也是长期以来的研究热点。最近几年,随着深度神经网络的应用,加上海量大数据的使用和云计算的普及,语音识别取得了突飞猛进的进展,在多个行业突破了实用化的门槛,越来越多的语音技术产品进入了人们的日常生活,包括苹果的Siri、亚马逊的Alexa、讯飞语音输入法、叮咚智能音箱等都是其中的典型代表。对语音识别技术的发展情况、最近几年的关键突破性技术进行了介绍,并对语音识别技术的发展趋势做了展望。
The purpose of automatic speech recognition (ASR) is to make the machine to be able to "understand" the human speech and transform it to readable text information. ASR is one of the key technologies of human machine interaction and also a hot research domain for a long time. In recent years, due to the application of deep neural networks, the use of big data and the popularity of cloud computing, ASR has made great progress and break through the threshold of application in many industries. More and more products with ASR have entered people's daily life, such as Apple's Sift, Amazon's Alexa, IFLYTEK speech input method and Dingdong intelligent speaker and so on. The development status and key breakthrough technologies in recent years were introduced. Also, a forecast of ASR technologies' trend of development was given.
出处
《电信科学》
2018年第2期1-11,共11页
Telecommunications Science
关键词
自动语音识别
深度神经网络
声学模型
语言模型
automatic speech recognition, deep neural network, acoustic model, language model