636 articles found
VOICING DECISION USING CONTINUOUS NONLINEAR NETWORK
1
Authors: 周志杰, 胡光锐, 李群 《Journal of Shanghai Jiaotong University (Science)》 EI 1998, No. 2, pp. 50-53 (4 pages)
A voicing decision algorithm using a continuous nonlinear network is discussed. A five-dimensional feature vector describes the voicing characteristics of a speech segment, and a continuous network trained with a gradient descent algorithm serves as the voicing decision maker. Computer simulation shows that the algorithm makes voicing decisions effectively, reaching a correct rate of 97.8%.
Keywords: speech processing; neural network; voicing decision; pitch extraction
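The abstract above does not specify the network architecture or the five features, so the following is only a minimal sketch of the general idea: a single logistic unit trained by batch gradient descent on five-dimensional feature vectors. The feature values, the names `FRAMES` and `VOICED`, and the learning-rate and epoch settings are all hypothetical and are not taken from the paper.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(weights, bias, features):
    """Probability that a frame is voiced, given its feature vector."""
    return sigmoid(sum(w * x for w, x in zip(weights, features)) + bias)

def train(samples, labels, lr=0.5, epochs=2000):
    """Batch gradient descent on the mean cross-entropy loss."""
    dim = len(samples[0])
    weights, bias = [0.0] * dim, 0.0
    n = len(samples)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(samples, labels):
            err = predict(weights, bias, x) - y  # dLoss/dz for one sample
            for j in range(dim):
                grad_w[j] += err * x[j]
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias

# Hypothetical five-dimensional feature vectors, scaled to [0, 1].
FRAMES = [[1.0, 1.0, 1.0, 1.0, 1.0], [0.9, 0.8, 1.0, 0.7, 0.9],
          [0.0, 0.0, 0.0, 0.0, 0.0], [0.1, 0.2, 0.0, 0.1, 0.1]]
VOICED = [1, 1, 0, 0]  # 1 = voiced, 0 = unvoiced

weights, bias = train(FRAMES, VOICED)
```

A frame would then be labelled voiced when `predict(weights, bias, frame)` exceeds 0.5. A real system would need many labelled frames and probably a richer network than this single unit.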
Assessing the performance of mobile AI assistants in delivering medical advice for early knee osteoarthritis
2
Authors: Abasi Maimaitiabula, Wan-Bo Zhu, Mo Chen, Xian-Yue Shen, Xian-Zuo Zhang, Chen Zhu 《Medical Data Mining》 2025, No. 3, pp. 33-39 (7 pages)
Background: This study evaluates the ability of mobile AI voice assistants (AI-VAs) to provide accurate medical advice for early knee osteoarthritis (KOA) and compares their performance with conventional web searches and human clinicians. Methods: From September to October 2024, two AI-VAs (Apple’s Siri and Huawei’s Xiaoyi) were tested on 15 KOA-related questions in Chinese and English. The assessment focused on the accuracy of voice recognition, response capabilities, and medical advice. Siri was further tested in four international regions (USA, UK, Germany, Hong Kong) using localized languages. Results: In Chinese-language tests, Siri and Xiaoyi showed comparable voice recognition (recognition accuracy: 95.6% vs. 93.3%) and response ability (speech response: 88.9% vs. 85.7%). However, Siri provided significantly more accurate medical advice (medical advice: 95.6% vs. 53.3%; Z=2.762, P<0.001). External validation via Global Quality Score further confirmed Siri’s superiority (mean Global Quality Score=4.0 vs. Xiaoyi=0.9). Siri outperformed Xiaoyi in English-language tests (53.3% vs. 0%). While Siri’s medical advice accuracy (95.6%) surpassed that of non-specialist clinicians (Z=2.685, P=0.007), it primarily reflects filtered search results (Baidu/Google) rather than clinical synthesis. Claims of equivalence to junior surgeons (98.2%) must be interpreted cautiously, as AI-VAs lack diagnostic reasoning capabilities. This distinction is critical to avoid overstating their role in clinical decision-making. Conclusion: Current AI-VAs offer limited value in providing precise medical advice for KOA, primarily serving as intermediaries for web search results. Their performance varies across languages, regions, and search engines.
Keywords: mobile devices; voice assistant; Siri; Xiaoyi; knee osteoarthritis
Making China’s Voice Better Heard
3
Authors: CHINA TODAY 《China Today》 2025, No. 4, pp. 44-47 (4 pages)
Du Zhanyuan, Standing Committee member of the 14th CPPCC National Committee and CICG president, on how to tell engaging stories about China. As changes unseen in a century accelerate across the world, cultural exchange and mutual learning between civilizations are becoming increasingly important.
Keywords: becoming; voice; mutual
CLIP-ASN:A Multi-Model Deep Learning Approach to Recognize Dog Breeds
4
Authors: Asif Nawaz, Rana Saud Shoukat, Mohammad Shehab, Khalil El Hindi, Zohair Ahmed 《Computers, Materials & Continua》 2025, No. 12, pp. 4777-4793 (17 pages)
The kingdom Animalia encompasses multicellular, eukaryotic organisms known as animals. Currently, there are approximately 1.5 million identified species of living animals, including over 195 distinct breeds of dogs. Each breed possesses unique characteristics that can be challenging to distinguish. Various computer-based methods, including machine learning, deep learning, transfer learning, and robotics, are employed to identify dog breeds, focusing mainly on image or voice data. Voice-based techniques often face challenges such as noise, distortion, and changes in frequency or pitch, which can impair the model’s performance. Conversely, image-based methods may fail when dealing with blurred images, which can result from poor camera quality or photos taken from a distance. This research presents a hybrid model combining voice and image data for dog breed identification. The proposed method, Contrastive Language-Image Pre-Training-Audio Stacked Network (CLIP-ASN), improves robustness by compensating when one data type is compromised by noise or poor quality. By integrating diverse data types, the model can more effectively identify unique breed characteristics, making it superior to methods relying on a single data type. The key steps of the proposed model are data collection, image feature extraction based on Contrastive Language-Image Pre-Training, voice feature extraction based on the Audio Stacked Network, co-attention-based classification, and federated learning-based training and distribution. The experimental evaluation shows that the proposed method achieves an accuracy of 89.75%, far better than the existing benchmark methods.
Keywords: machine learning; ensemble methods; image detection; voice detection; animal breeds
Research on Intelligent Design from the Perspective of Age-Friendly Design: Taking “Remembering” as an Example
5
Authors: Wenrui Xie, Dan Ni, Xinyi Liu, Xiaoxiu Cong 《Journal of Electronic Research and Application》 2025, No. 3, pp. 152-158 (7 pages)
This study focuses on the elderly memo app “Remembering,” addressing memory decline and operational difficulties. It introduces a progressive interaction system with three core modules: dynamic font adjustment, intelligent voice reminders, and family warning systems. Health monitoring and remote care functions are also integrated, creating a simple operation process. The research highlights four design dimensions for elderly-friendly products: usability, security, emotionalization, and personalization. This innovation reduces the digital barrier and provides a model for smart elderly-friendly product development.
Keywords: elderly people; memorandum; voice interaction; intelligent reminder; age-friendly design
Speak up in a safe space: The role of inclusive leadership and collectivism in fostering upward voice
6
Authors: Longmei Wang, Jiawen Liu, Lei Lu 《Journal of Psychology in Africa》 2025, No. 3, pp. 309-317 (9 pages)
This study examined the relationship between inclusive leadership and subordinates’ upward voice, focusing on the mediating role of psychological safety and the moderating role of collectivism. Data were collected from 284 subordinates and supervisors across 11 organizations in China in three cross-lagged waves. Structural equation modeling results indicated that inclusive leadership was associated with subordinates’ upward voice via psychological safety. Moreover, collectivism strengthened the association between inclusive leadership and upward voice via psychological safety, leading to a higher upward voice. These findings highlight the importance of inclusive leadership in fostering an environment that promotes open communication and psychological safety between supervisors and subordinates, ultimately enhancing workplace health and well-being. The findings suggest that management practices should cultivate inclusive leadership behaviors to enhance psychological safety and encourage subordinates to voice their opinions for the overall success of the organization.
Keywords: inclusive leadership; psychological safety; collectivism; upward voice
Towards Friendly Digital Cities
7
Authors: GE LIJUN 《ChinAfrica》 2025, No. 9, pp. 48-50 (3 pages)
During the 2025 Beijing Digital Economy Experience Week from 27 June to 5 July, interactive projects combining technology and culture (illustrated books created by voice AI, augmented reality tours, and markerless motion capture) attracted many visitors. “More than a dozen themed areas offered the opportunity to dive into new worlds and discover the latest innovations from more than 50 companies,” Lu Yumin, a Beijing resident who visited the event, told ChinAfrica.
Keywords: interactive projects; innovations; friendly digital cities; culture; markerless motion capture; augmented reality; voice AI; technology
Deep Learning-Based Speech Emotion Recognition: Leveraging Diverse Datasets and Augmentation Techniques for Robust Modeling
8
Authors: Ayush Porwal, Praveen Kumar Tyagi, Ajay Sharma, Dheeraj Kumar Agarwal 《Journal of Harbin Institute of Technology (New Series)》 2025, No. 3, pp. 54-65 (12 pages)
In recent years, Speech Emotion Recognition (SER) has developed into an essential instrument for interpreting human emotions from auditory data. The proposed research focuses on the development of an SER system employing deep learning and multiple datasets containing samples of emotive speech. The primary objective is to investigate the use of Convolutional Neural Networks (CNNs) for sound feature extraction. Stretching, pitch manipulation, and noise injection are among the techniques used in this study to improve the data quality. Feature extraction methods including Zero Crossing Rate, Chroma_stft, Mel-scale Frequency Cepstral Coefficients (MFCC), Root Mean Square (RMS), and Mel-Spectrogram transform the audio signals into features that are used to train the model. The study also produces a thorough evaluation of the model’s performance: the model achieved an accuracy of 94.57% on the test dataset. The proposed work was also validated on the EMO-BD and IEMOCAP datasets. Future development paths include further data augmentation, feature engineering, and hyperparameter optimization. By following these development paths, SER systems will be able to be implemented in real-world scenarios with greater accuracy and resilience.
Keywords: voice signal; emotion recognition; deep learning; CNN
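Two of the augmentation steps named in the abstract above (stretching and noise injection) and one of its features (RMS) can be illustrated with a short standard-library sketch. This is not the authors' pipeline: the function names, the uniform noise model, and the linear-interpolation resampling (which changes speed and pitch together, unlike a true time stretch) are assumptions made for illustration.

```python
import math
import random

def inject_noise(samples, level, rng):
    """Add uniform noise in [-level, level] to every sample."""
    return [x + rng.uniform(-level, level) for x in samples]

def stretch(samples, rate):
    """Resample by linear interpolation; rate > 1 shortens the signal."""
    n_out = int(len(samples) / rate)
    out = []
    for i in range(n_out):
        pos = i * rate
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)  # clamp at the last sample
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out

def rms(frame):
    """Root mean square amplitude of one frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))

# Hypothetical usage on a short synthetic tone.
rng = random.Random(0)
tone = [math.sin(2 * math.pi * i / 40) for i in range(400)]
augmented = [inject_noise(tone, 0.05, rng), stretch(tone, 1.25), stretch(tone, 0.8)]
features = [rms(clip) for clip in augmented]
```

In practice each augmented clip would be passed through the full feature set (MFCC, chroma, and so on) by an audio library before training. Only RMS is shown here, to keep the sketch self-contained.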
The distinct speech and voice phenotypes among TCM constitution for adults: A cross-sectional study
9
Authors: ZHANG Weiqiang, SUN Xiaoru, ZHANG Menghan, TANG Dezhi, QIU Jian’ge, JIANG Binghua, WANG Yongjun, WANG Jiucun 《World Journal of Integrated Traditional and Western Medicine》 2025, No. 2, pp. 55-65 (11 pages)
Objectives: By investigating the distinct speech and voice phenotypes among TCM constitutions for adults, this study aims to provide a convenient and objective methodological reference for judging TCM constitution. Methods: Acoustic analysis and TCM constitution assessment were performed for all 620 participants using Praat software and the CCMQ, respectively. Results: For formant features, the speech duration of special constitution participants was shorter than that of neutral, phlegm-dampness, dampness-heat, Yin-deficiency, or Yang-deficiency participants when pronouncing the vowels /a/, /i/, and /u/. Compared with Yang-deficiency participants, Qi-deficiency participants had a shorter speech duration when pronouncing /i/. For /u/, blood-stasis participants exhibited a lower F1 value than neutral participants. For vocal features, special constitution participants showed higher local jitter than neutral, dampness-heat, and Yang-deficiency participants (for /a/, /i/, and /u/), and higher absolute local jitter than neutral or dampness-heat participants. Compared with neutral or Yang-deficiency participants, special constitution participants had a higher local shimmer (dB). Special constitution participants had a lower harmonicity autocorrelation than neutral, dampness-heat, or Yang-deficiency participants. Conclusions: Formant features may effectively differentiate special constitution from neutral, phlegm-dampness, dampness-heat, Yin-deficiency, or Yang-deficiency constitutions based on vowel duration measurements (/a/, /i/, /u/). For the vowel /u/, F1 values may help distinguish blood-stasis from neutral constitution. Vocal features appear particularly useful for distinguishing special constitution from neutral, dampness-heat, or Yang-deficiency constitution, with local jitter and harmonicity autocorrelation showing significant discriminatory power.
Keywords: speech and voice phenotype; acoustic feature; TCM constitution; Chinmedphenomics
Impact of nurse and beloved family member’s voice stimulus on the level of consciousness and physiological parameters in comatose patients
10
Authors: Smritikana ADAK, Rashmimala PRADHAN, Sujyotsna JENA, Subhalaxmi PRADHAN, Lulup Kumar SAHOO, Mamata SWAIN 《Journal of Integrative Nursing》 2025, No. 1, pp. 33-41 (9 pages)
Objective: The objective of this study was to compare the effect of a nurse’s and a beloved family member’s recorded voice on consciousness and physiological parameters in comatose patients. Materials and Methods: A parallel-group randomized controlled trial was conducted among 45 comatose patients in a medicine intensive care unit. Patients were divided into two intervention groups (a nurse voice stimulus group receiving a nurse’s voice with standard care, and a family member voice stimulus group receiving a beloved family member’s voice with standard care) and one control group receiving only standard care. The intervention was provided three times a day, each lasting 5 min, for 7 days in addition to standard care. Repeated measures analysis of variance and the independent t-test were used to compare within and between groups, respectively. Results: The study found significant differences in Glasgow coma scale (GCS) scores within both the nurse (F=2.78, P=0.042) and family member (F=10.27, P=0.0001) voice stimulus groups over 7 days. Comparing GCS scores between intervention groups showed significant variations before (P=0.028), during (P=0.047), and after (P=0.036) the intervention on day 7. Comparing GCS scores between the family members’ voice stimulus group and the control group, significant changes were observed on days 5 and 7 (P=0.043, 0.030, 0.030, and 0.014, 0.012, 0.012) before, during, and after the intervention. Conclusions: The use of beloved family members’ voices proved more effective in elevating the patients’ level of consciousness compared to both the nurse voice stimulus group and the control group.
Keywords: comatose patients; level of consciousness; physiological parameters; voice stimulus
Semi-Autonomous Navigation Based on Local Semantic Map for Mobile Robot
11
Authors: ZHAO Yanfei, XIAO Peng, WANG Jingchuan, GUO Rui 《Journal of Shanghai Jiaotong University (Science)》 2025, No. 1, pp. 27-33 (7 pages)
Mobile robots represented by smart wheelchairs can assist elderly people with mobility difficulties. This paper proposes a multi-mode semi-autonomous navigation system based on a local semantic map for mobile robots, which can help users navigate accurately (e.g., docking) in environments without prior maps. To overcome the repeated oscillations that traditional local path planning algorithms exhibit during docking, this paper adopts a mode-switching method and uses feedback control to perform docking when approaching semantic goals. Finally, comparative experiments were carried out in a real environment. Results show that the method is superior in terms of safety, comfort, and docking accuracy.
Keywords: semi-autonomous navigation; mobile robot; semantic map; voice interaction
Leader-employee calling congruence and voice behaviour: The mediating role of perceived insider status
12
Authors: Xiaolin Zhang, Shujie Li, Enguo Wang 《Journal of Psychology in Africa》 2025, No. 1, pp. 75-81 (7 pages)
This study examined the relationship between leader-employee calling congruence and employees’ voice behaviour. Participants were 173 leader-employee dyads from the Chinese service industry. They completed online surveys on calling, perceived insider status, and voice behaviour. Results from polynomial regression and response surface analysis showed that employees’ perceived insider status was weaker under low leader-low subordinate calling congruence and stronger under high leader-high subordinate calling congruence. Employees’ perceived insider status was stronger under low leader-high subordinate calling incongruence than under high leader-low subordinate calling incongruence. Perceived insider status mediated the relationship between calling congruence and voice behaviour. This study’s findings suggest pathways from calling congruence to voice behaviour, which are important for promoting employee voice behaviour and guiding organisational recruitment in the workplace.
Keywords: calling; voice behaviour; perceived insider status; response surface analysis
The Efficacy of Written Corrective Feedback Explicitness on the Grammatical Accuracy of Passive Voice Tenses
13
Authors: Syed Muhammad Mujtaba, Manjet Kaur Mehar Singh 《Chinese Journal of Applied Linguistics》 2025, No. 2, pp. 183-206, 320 (25 pages)
Although substantial research shows the effectiveness of written corrective feedback (WCF) in treating simple grammar structures, more research is still needed to refute Truscott’s claim that WCF may not work on complex grammar structures. Similarly, a previous body of research has shown that the degree of explicitness of feedback moderates the efficacy of WCF. However, most WCF studies have systematically manipulated only direct corrective feedback. The current study was therefore conducted to fill these gaps in the literature. To this end, five intact classes of Functional English were recruited and later randomly assigned to four treatment groups (DCF, DCF+ME, ICF, and ICF+ME) and one control group that received no feedback. All the groups took part in three WCF treatment sessions, during which they wrote two different pieces: a news report and a picture description. Later, only the treatment groups received the WCF. The WCF’s effectiveness was measured by writing tests and grammaticality judgment tasks (GJT). The results demonstrated that WCF helped L2 learners improve their grammatical accuracy of passive voice tenses. The study further showed that the group that received the most explicit type of WCF fared better than the ones that received the least explicit type of WCF. Important pedagogical implications for ESL/EFL teachers are discussed.
Keywords: written corrective feedback; direct corrective feedback; indirect corrective feedback; metalinguistic explanation; passive voice
Vibrotactile pattern recognition: Influence of interstimulus intervals
14
Authors: Nashmin YEGANEH, Ivan MAKAROV, Arni KRISTJANSSON, Runar UNNTHORSSON 《虚拟现实与智能硬件(中英文)》 2025, No. 5, pp. 483-500 (18 pages)
Background: Vibrotactile feedback systems are widely used in assistive technology, wearable devices, and virtual environments to deliver precise tactile information. The timing of interstimulus intervals (ISIs) plays a critical role in determining how accurately users perceive and interpret vibrotactile patterns. The optimal use of ISIs can increase the effectiveness of these systems, improve user interaction, and enable reliable, intuitive feedback in diverse applications. We examined how different ISIs impact the accuracy of vibrotactile pattern recognition. Methods: Participants wore a forearm-mounted device with six voice coil actuators arranged in a 3×2 grid, delivering Braille-based vibrotactile patterns sequentially at ISIs ranging from 10 to 2500 ms. Eight participants performed identification tasks involving Icelandic Braille patterns categorized as either short (2-3 actuators) or long (4-5 actuators). A repeated measures ANOVA was conducted to assess the effects of ISI, pattern type, and practice (across two testing blocks) on pattern recognition accuracy. Results: For short patterns, accuracy was highest (92%-98%) at ISIs of 50-700 ms, with peak performance at 300 ms. For long patterns, accuracy reached 86%-94% at ISIs of 100-500 ms, peaking at 400 ms. Participants were more accurate with short patterns, and performance improved significantly over time for both short and long patterns, highlighting the importance of training for vibrotactile pattern recognition. Conclusions: These results underscore the importance of careful selection of ISIs in vibrotactile feedback systems for accurate pattern identification. The findings provide valuable insights for conveying tactile information using wearable devices, contributing to better tactile feedback and performance in applications requiring precise vibrotactile information delivery.
Keywords: voice coil actuator; wearable vibrotactile device; vibratory stimulus; vibrotactile localization; vibrotactile frequency; vibrotactile discrimination; interstimulus interval (ISI)
Design of a VoiceXML-Based Voice E-mail System (Cited by: 4)
15
Authors: 吴英, 徐敬东, 吴功宜 《计算机工程》 EI CAS CSCD PKU Core 2005, No. 5, pp. 122-124 (3 pages)
The voice e-mail system designed here extends traditional e-mail service to wired or wireless, fixed or mobile telephone systems, so that users can conveniently receive their e-mail through an ordinary telephone. This paper discusses the development of a voice e-mail system based on the VoiceXML standard and studies the design of the voice e-mail gateway in depth.
Keywords: voice e-mail; VoiceXML; POP3
An Adaptive Voice Endpoint Detection Algorithm (Cited by: 6)
16
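Authors: 孙战先, 储飞黄, 王江 《计算机工程与应用》 CSCD 2014, No. 1, pp. 206-210 (5 pages)
Voice endpoint detection algorithms based on short-time energy and short-time zero-crossing rate cannot adapt to the environment and perform poorly at low signal-to-noise ratios. To address this, a new algorithm is proposed. It uses the minimum short-time energy to estimate the ambient noise and optimizes the parameter extraction, which improves the noise robustness and adaptability of the parameters themselves. Parameter fusion then balances the differences between syllables and amplifies the difference between speech and noise. Finally, a dynamic detection threshold achieves endpoint detection under different signal-to-noise ratios.
Keywords: voice activity detection (endpoint detection); adaptive; noise estimation; feature fusion; Voice Activity Detection (VAD)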
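The two classical parameters named in the abstract above, and the idea of estimating the noise level from the minimum short-time energy, can be sketched as follows. This is only an illustration under stated assumptions: the frame length, hop size, and `ratio` multiplier are hypothetical, the decision uses energy alone, and the paper's optimized parameter extraction and parameter fusion are not reproduced because the abstract does not describe them.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split a list of samples into overlapping frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def short_time_energy(frame):
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def detect_speech(samples, frame_len=160, hop=80, ratio=4.0):
    """Flag frames whose energy exceeds a threshold tied to the noise floor."""
    frames = frame_signal(samples, frame_len, hop)
    if not frames:
        return []
    energies = [short_time_energy(f) for f in frames]
    noise_floor = min(energies)              # minimum short-time energy as noise estimate
    threshold = ratio * noise_floor + 1e-12  # adapts to the recording's noise level
    return [e > threshold for e in energies]

# Synthetic check: low-level noise, then a tone, then noise again.
noise = [0.01 * (-1) ** i for i in range(800)]
tone = [math.sin(2 * math.pi * i / 40) for i in range(800)]
flags = detect_speech(noise + tone + noise)
```

Because the threshold is derived from the quietest frame of the recording itself, it rises and falls with the ambient noise. This is the sense in which such a detector is adaptive, but the sketch assumes every recording contains at least one noise-only frame.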
Design and Implementation of a Voice Information Portal (Cited by: 4)
17
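Authors: 周宽久, 曾琳铖曦, 李瑶 《计算机工程》 EI CAS CSCD PKU Core 2006, No. 9, pp. 101-103 (3 pages)
A voice portal is an important component that uses CTI technology to integrate the telephone network with the Internet, allowing users to access the Internet and obtain information through an ordinary telephone. It consists of four subsystems: IVR (Interactive Voice Response), TTS (Text To Speech), ASR (Automatic Speech Recognition), and VoiceXML. Based on a practical voice portal system, this paper discusses the system architecture and the design and implementation of the four modules. The design uses object-oriented and automaton techniques to integrate boards, channels, and resources such as speech synthesis and recognition into one system, which simplifies system design and functional extension.
Keywords: voice portal; interactive voice response; speech synthesis; speech recognition; VoiceXML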
Effect of Postoperative Voice Therapy on Phonatory Function after Phonomicrosurgery for Vocal Cord Polyps (Cited by: 4)
18
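Authors: 王杰, 胥斌, 付启红, 刘国旗, 柳庆君 《中国耳鼻咽喉头颈外科》 PKU Core 2009, No. 6, pp. 345-346 (2 pages)
Phonomicrosurgery is an endolaryngeal operation performed under a surgical microscope with advanced microsurgical techniques; it seeks to cure the disease while preserving phonatory function as far as possible. Removing the lesion and improving phonatory function are its two main goals. This study retrospectively analyzed 80 patients with vocal cord polyps who were treated by phonomicrosurgery between July 2006 and December 2006 and followed up for more than 3 months, 40 of whom received adjuvant postoperative voice therapy. All patients underwent dynamic laryngoscopy and acoustic voice analysis to evaluate the treatment outcome.
Keywords: Articulation Disorders; Microsurgery; Voice Training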
The ILBC Coding Algorithm and Its Application in VOIP (Cited by: 6)
19
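Authors: 郭廷廷, 李敬 《电子技术应用》 PKU Core 2006, No. 7, pp. 119-121 (3 pages)
VOIP is becoming a popular application area, and improving voice quality is one of its challenges. ILBC is a new low-bit-rate speech coding algorithm based on CELP (Code Excited Linear Predictive Coding). Its excellent voice quality, its distinctive long-term prediction method [1], and its packet loss concealment [2] (PLC) technique address the problem of transmitting speech over the Internet well. This paper discusses the implementation principles of ILBC in detail, analyzes its key techniques in some depth, presents experimental conclusions, and offers some prospects for its application.
Keywords: ILBC (Internet Low Bit Rate Codec); VOIP (Voice Over IP); PLC; packet loss concealment; dynamic codebook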
Research on a Voice-Operated Switch (VOX) Algorithm Based on Digital Signal Processing (Cited by: 2)
20
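Authors: 张天骐, 李伟, 林孝康, 刘林 《应用声学》 CSCD PKU Core 2005, No. 3, pp. 157-163 (7 pages)
This paper proposes a new voice-operated switch (VOX, Voice-Operated Transmit) algorithm based on digital spectral analysis. The algorithm is simple and practical. To some extent it overcomes the limitations of traditional VOX algorithms, such as complex structure and hard-to-tune parameters; it is fairly robust to noise; and it is easy to implement with digital signal processing. First, secondary processing of the signal power spectrum extracts the average amplitude envelope of the speech. The envelope is then thresholded and amplified with limiting, which yields the VOX function. Theoretical analysis and computer simulation show that the algorithm not only extracts the average amplitude envelope of the speech waveform fairly accurately but also works at low signal-to-noise ratios.
Keywords: digital signal processing; control switch; algorithm research; voice; signal power spectrum; secondary processing; thresholding; simulation results; speech waveform; VOX; spectral analysis; new algorithm; robustness; envelope; computer; signal-to-noise ratio; amplitude; average; extraction; limiting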
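The envelope-then-threshold structure described in the abstract above can be illustrated with a deliberately simplified stand-in. The paper derives the envelope from secondary processing of the power spectrum; the sketch below instead uses a time-domain moving average of the rectified signal, and the window length and threshold are hypothetical values chosen for the example.

```python
def amplitude_envelope(samples, window):
    """Moving average of the rectified signal (time-domain stand-in)."""
    envelope = []
    acc = 0.0
    for i, x in enumerate(samples):
        acc += abs(x)
        if i >= window:
            acc -= abs(samples[i - window])  # drop the sample leaving the window
        envelope.append(acc / min(i + 1, window))
    return envelope

def vox_gate(samples, window=2, threshold=0.4):
    """Return 1 where the envelope exceeds the threshold, else 0."""
    return [1 if e > threshold else 0
            for e in amplitude_envelope(samples, window)]

# Silence, a short burst, then silence again.
gate = vox_gate([0, 0, 0, 0, 1, -1, 1, -1, 0, 0, 0, 0])
```

The gate stays open for one sample after the burst ends because the moving average has not yet decayed. A practical VOX usually adds an explicit hang time so that short pauses between words do not close the switch.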