Spoken dialogue systems are an active research field with wide applications. However, the distinctions in Chinese spoken dialogue systems are not as clear-cut as those in English systems. Chinese spoken dialogue exhibits many language phenomena. First, most utterances are ill-formed. Second, ellipsis, anaphora and negation are widely used. Determining how to extract semantic information from incomplete sentences and how to resolve negation, anaphora and ellipsis is therefore crucial. SHTQS (Shanghai Transportation Query System) is an intelligent telephone-based spoken dialogue system that provides information about the best route between any two sites in Shanghai. After a brief description of the system, the paper focuses on its natural language processing. Sentences produced by speech recognition unavoidably contain errors. In a sequential language processing pipeline, these errors are easily passed on to later stages, where they have a ripple effect. To detect and recover from these errors as early as possible, the language-processing strategies are specially designed. For errors caused by words being split during speech recognition, segmentation and POS tagging approaches that can rectify them are designed. Since most inquiry utterances are ill-formed, and negation, anaphora and ellipsis are common, language understanding must be sufficiently adaptive. A partial syntactic parsing scheme is therefore adopted, using a chart algorithm and a parser based on unification grammar. The semantic frame extracted from the best arc set of the chart represents the meaning of a sentence. Negation, anaphora and ellipsis are also analyzed, and corresponding processing approaches are presented. The accuracy of the language processing part is 88.39%, and the test results show that the language processing strategies are rational and effective.
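The semantic-frame idea in the abstract above can be illustrated with a minimal sketch. It is not the SHTQS implementation: the slot names (`origin`, `destination`, `negated_destination`), the `RouteFrame` class and the `merge_with_context` helper are all hypothetical, and the sketch assumes that ellipsis is resolved by inheriting missing slots from the previous turn and that negation simply blocks a rejected site from being inherited.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class RouteFrame:
    """Hypothetical semantic frame for a route query (slot names are assumed)."""
    origin: Optional[str] = None
    destination: Optional[str] = None
    negated_destination: Optional[str] = None  # site the user explicitly rejected


def merge_with_context(current: RouteFrame, previous: RouteFrame) -> RouteFrame:
    """Fill slots missing from an elliptical utterance using the previous frame."""
    origin = current.origin or previous.origin
    destination = current.destination or previous.destination
    # A site the user has just rejected must not end up as the destination.
    if destination == current.negated_destination:
        destination = None
    return RouteFrame(origin, destination, current.negated_destination)


# Turn 1: "From People's Square to Xujiahui."
turn1 = RouteFrame(origin="People's Square", destination="Xujiahui")
# Turn 2 (elliptical, with negation): "Not Xujiahui, to Pudong Airport."
turn2 = RouteFrame(destination="Pudong Airport", negated_destination="Xujiahui")
resolved = merge_with_context(turn2, turn1)

# Turn 2 variant with negation only: "Not Xujiahui."
only_negation = merge_with_context(RouteFrame(negated_destination="Xujiahui"), turn1)
```

In this sketch the second turn omits the origin, so it is inherited from the first turn, while the rejected destination is dropped rather than carried over. A real system would fill these slots from the parser's output rather than by hand.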
SHTQS is an intelligent telephone-based spoken dialogue system that provides information about the best route between two sites in Shanghai. Instead of treating speech decoding and language parsing as separate parts, SHTQS has them cooperate closely by integrating the automatic speech recognizer (ASR), language understanding, dialogue management and speech generator. In this way, erroneous analyses and uncertainty arising in earlier stages can be recovered and resolved accurately using high-level knowledge. Moreover, instead of shallow word-level analysis or simple keyword or key-phrase matching, the system performs a deeper analysis by integrating a robust parser and a semantic interpreter. The robust parser is particularly important for spontaneous speech input because most inquiry sentences and phrases are ill-formed. In addition, understanding users' inquiries is essential when designing a mixed-initiative dialogue system, and simply matching keywords or key phrases can hardly achieve this. A semantic interpreter is therefore incorporated into the system. The performance of the system is also evaluated. The dialogue efficiency is 4.4 sentences per query on average, and the case precision rate of the language understanding module is up to 81%. The results are satisfactory.
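Speech is a widely used input interface on today's embedded mobile devices. Although existing cloud service providers offer powerful Spoken Language Understanding (SLU) services, these services also pose a serious threat to user privacy. To address this, privacy-preserving encoders based on information disentanglement have been proposed to remove sensitive information from the speech signal without impairing SLU functionality. However, these encoders tend to require large amounts of memory and complex computation, which makes them impractical on small, resource-constrained devices. Based on extensive experiments, this paper observes a key phenomenon: SLU depends on global information across the whole utterance, whereas the recognition of privacy-sensitive words depends mostly on local information. Building on this observation, we propose SILENCE (SImpLe ENCodEr designed for efficient privacy-preserving SLU offloading), an efficient encoder system for speech intent understanding. We implement the system on an STM32H7 microcontroller unit and evaluate it under different attack scenarios. The experimental results show that SILENCE is comparable to conventional privacy-preserving encoders in both speech intent extraction performance and privacy protection, while achieving up to a 53.3x speedup and a 134.1x reduction in memory footprint. It is the first to deliver privacy-preserving SLU services on a microcontroller unit with only 1 MB of memory.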