期刊文献+
共找到834篇文章
< 1 2 42 >
每页显示 20 50 100
A REAL-TIME IMPLEMENTATION OF 4.2Kb/s CELP SPEECH CODING
1
作者 Bao Changchun Dai Yisong Fan Changxin(information Science Institute, Xidian University, Xi’an 710071) (Dept. of Electronic Eng., Jilin University of technology 130025) 《Journal of Electronics(China)》 1997年第1期52-58,共7页
This paper presents a real-time implementation of 4.2Kb/s CELP speech coding on single DSP chip. An algorithm reducing search complexity for adaptive codebook is suggested; the solving method that the parameters are c... This paper presents a real-time implementation of 4.2Kb/s CELP speech coding on single DSP chip. An algorithm reducing search complexity for adaptive codebook is suggested; the solving method that the parameters are changed into LSP parameters is discussed. The realtime implementation process of this coding on a commercial development board with a single TMS320C30 is described. 展开更多
关键词 speech coding LINEAR prediction VECTOR QUANTIZATION
在线阅读 下载PDF
15.2kb/s LD-CELP语音编码算法及实时实现
2
作者 张雪英 张刚 《电路与系统学报》 CSCD 1999年第1期92-96,共5页
本文以G.728语音编码标准为基础,提出一个15.2kb/s LD-CELP语音编码算法。通过对波形码书进行重新设计,降低了算法复杂性,同时保持高质合成语音。最后,用双片TMS320C31高速DSP系统全双工实时实现... 本文以G.728语音编码标准为基础,提出一个15.2kb/s LD-CELP语音编码算法。通过对波形码书进行重新设计,降低了算法复杂性,同时保持高质合成语音。最后,用双片TMS320C31高速DSP系统全双工实时实现了该算法。 展开更多
关键词 语音编码 DSP ld-celp
在线阅读 下载PDF
Text-and-Timbre-Based Speech Semantic Coding for Ultra-Low-Bitrate Communications
3
作者 Yang Xiaoniu Qian Liping +2 位作者 Lyu Sikai Wang Qian Wang Wei 《China Communications》 2025年第1期7-24,共18页
To address the contradiction between the explosive growth of wireless data and the limited spectrum resources,semantic communication has been emerging as a promising communication paradigm.In this paper,we thus design... To address the contradiction between the explosive growth of wireless data and the limited spectrum resources,semantic communication has been emerging as a promising communication paradigm.In this paper,we thus design a speech semantic coded communication system,referred to as Deep-STS(i.e.,Deep-learning based Speech To Speech),for the lowbandwidth speech communication.Specifically,we first deeply compress the speech data through extracting the textual information from the speech based on the conformer encoder and connectionist temporal classification decoder at the transmitter side of Deep-STS system.In order to facilitate the final speech timbre recovery,we also extract the short-term timbre feature of speech signals only for the starting 2s duration by the long short-term memory network.Then,the Reed-Solomon coding and hybrid automatic repeat request protocol are applied to improve the reliability of transmitting the extracted text and timbre feature over the wireless channel.Third,we reconstruct the speech signal by the mel spectrogram prediction network and vocoder,when the extracted text is received along with the timbre feature at the receiver of Deep-STS system.Finally,we develop the demo system based on the USRP and GNU radio for the performance evaluation of Deep-STS.Numerical results show that the ac-Received:Jan.17,2024 Revised:Jun.12,2024 Editor:Niu Kai curacy of text extraction approaches 95%,and the mel cepstral distortion between the recovered speech signal and the original one in the spectrum domain is less than 10.Furthermore,the experimental results show that the proposed Deep-STS system can reduce the total delay of speech communication by 85%on average compared to the G.723 coding at the transmission rate of 5.4 kbps.More importantly,the coding rate of the proposed Deep-STS system is extremely low,only 0.2 kbps for continuous speech communication.It is worth noting that the Deep-STS with lower coding rate can support the low-zero-power speech communication,unveiling a new era in ultra-efficient coded communications. 展开更多
关键词 low coding rate semantic communication speech recognition speech synthesis
在线阅读 下载PDF
Realtime robust speech communication based on iterative joint source-channel decoding and demodulation algorithm for MELP vocoder
4
作者 彭坦 Cui Huijuan Tang Kun 《High Technology Letters》 EI CAS 2010年第2期111-116,共6页
Realtime speech communications require high efficient compression algorithms to encode speech signals. As the compressed speech parameters are highly sensitive to transmission errors, robust source and channel decodin... Realtime speech communications require high efficient compression algorithms to encode speech signals. As the compressed speech parameters are highly sensitive to transmission errors, robust source and channel decoding and demodulation schemes are both important and of practical use. In this paper, an it- erative joint souree-channel decoding and demodulation algorithm is proposed for mixed excited linear pre- diction (MELP) vocoder by both exploiting the residual redundancy and passing soft information through- out the receiver while introducing systematic global iteration process to further enhance the performance. Being fully compatible with existing transmitter structure, the proposed algorithm does not introduce addi- tional bandwidth expansion and transmission delay. Simulations show substantial error correcting perfor- mance and synthesized speech quality improvement over conventional separate designed systems in delay and bandwidth constraint channels by using the joint source-channel decoding and demodulation (JSCCM) algorithm. 展开更多
关键词 speech coding joint souree-channel coding and modulation (JSCCM) iterative decoding
在线阅读 下载PDF
基于Codec2语音编码的短波OFDM通信系统
5
作者 关晓磊 龚志红 《通信技术》 2025年第8期839-845,共7页
短波通信一般为窄带通信,传输速率严重受限,用于传输语音必须进行极低码率语音编码。Codec2声码器基于线性预测语音编码技术,采用语音信号基频及其谐波正弦信号构建浊音激励,采用白噪声信号构建清音激励,能够在保持话音质量的同时实现... 短波通信一般为窄带通信,传输速率严重受限,用于传输语音必须进行极低码率语音编码。Codec2声码器基于线性预测语音编码技术,采用语音信号基频及其谐波正弦信号构建浊音激励,采用白噪声信号构建清音激励,能够在保持话音质量的同时实现极低码率编码,且采用复杂度较低的开源算法,可以绕开各种语音编码专利的限制。因此,设计了一种基于Codec2语音编码的短波正交频分复用(Orthogonal Frequency Division Multiplexing,OFDM)通信系统,能够实现以极低的传输速率传输高质量话音,并搭建了实时仿真系统。通过测试发现,Codec2能够以较好的话音质量在短波3kHz带宽信道下进行传输,支持远距离的短波话音通信。 展开更多
关键词 谐波正弦激励 声码话 短波通信 OFDM
在线阅读 下载PDF
Noise Feedback Coding Revisited:Refurbished Legacy Codecs and New Coding Models 被引量:2
6
作者 Stéphane Ragot Balázs Kvesi Alain Le Guyader 《ZTE Communications》 2012年第2期34-44,共11页
Noise feedback coding (NFC) has attracted renewed interest with the recent standardization of backward-compatible enhancements for ITU-T G.711 and G.722. It has also been revisited with the emergence of proprietary ... Noise feedback coding (NFC) has attracted renewed interest with the recent standardization of backward-compatible enhancements for ITU-T G.711 and G.722. It has also been revisited with the emergence of proprietary speech codecs, such as BV16, BV32, and SILK, that have structures different from CELP coding. In this article, we review NFC and describe a novel coding technique that optimally shapes coding noise in embedded pulse-code modulation (PCM) and embedded adaptive differential PCM (ADPCM). We describe how this new technique was incorporated into the recent ITU-T G.711.1, G.711 App. III, and G.722 Annex B (G.722B) speech-coding standards. 展开更多
关键词 speech coding noise shaping noise feedback coding G.711 G.722
在线阅读 下载PDF
Implementation of a LD-CELP Codec with Echo Canceller Functions
7
作者 TianWenshun NiWeizhen 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 1995年第2期18-23,共6页
In this paper, the authors present a method to handle the Echo Canceller as an on-side job of LD-CELP codec and a circuitry to embed echo canceller into a LD-CELP codec. The Possibility to implement a system with t... In this paper, the authors present a method to handle the Echo Canceller as an on-side job of LD-CELP codec and a circuitry to embed echo canceller into a LD-CELP codec. The Possibility to implement a system with the integration of LD-CELP codec and echo canceller in real time by two chips of TMS320C30 isdiscussed. 展开更多
关键词 speech coding echo canceller signal processing
原文传递
Variable Rate Characteristic Waveform Interpolation Speech Coder Based on Phonetic Classification
8
作者 王晶 匡镜明 赵胜辉 《Journal of Beijing Institute of Technology》 EI CAS 2007年第2期187-192,共6页
A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.8 kbit/s average bit rate which integrates phonetic classification into characteristic waveform (CW) decomposition is p... A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.8 kbit/s average bit rate which integrates phonetic classification into characteristic waveform (CW) decomposition is proposed. Each input frame is classified into one of 4 phonetic classes. Non-speech frames are represented with Bark-band noise model. The extracted CWs become rapidly evolving waveforms (REWs) or slowly evolving waveforms (SEWs) in the cases of unvoiced or stationary voiced frames respectively, while mixed voiced frames use the same CW decomposition as that in the conventional CWI. Experimental results show that the proposed codec can eliminate most buzzy and noisy artifacts existing in the fixed-bit-rate characteristic waveform interpolation (FBR-CWI) speech codec, the average bit rate can be much lower, and its reconstructed speech quality is much better than FS 1 016 CELP at 4.8 kbit/s and similar to G. 723.1 ACELP at 5.3 kbit/s. 展开更多
关键词 variable bit rate speech coding characteristic waveform interpolation phonetic classification
在线阅读 下载PDF
A speech coding algorithm based on harmonic-stochastic excitation
9
作者 DING Qinghai, CAO Tieyong, ZAHNG Xiongwei, CHEN Xianzhi (Institute of Communication Engineering Nanjing Nanjing 210016) 《Chinese Journal of Acoustics》 2001年第3期250-256,共7页
A very low bit rate algorithm for encoding speech signals at 825 bps based on a mixed harmonic and stochastic modeling of the excitation signal is presented. The algorithm is more robust in the V/UV decision, reliable... A very low bit rate algorithm for encoding speech signals at 825 bps based on a mixed harmonic and stochastic modeling of the excitation signal is presented. The algorithm is more robust in the V/UV decision, reliable pitch estimation, and excitation signals synthesis. The bit allocation schedules in every case and the analysis-by-synthesis processes of the parameters are also described. The Diagnostic Rhyme Test (DRT) results show that the performance of the proposed algorithm is comparable to that of the MELP algorithm at 2.4 kbps, and the speech distinctness is 90.25%. 展开更多
关键词 LSP CODE A speech coding algorithm based on harmonic-stochastic excitation UV
原文传递
Lattice Vector Quantization Applied to Speech and Audio Coding 被引量:1
10
作者 Minjie Xie 《ZTE Communications》 2012年第2期25-33,共9页
Lattice vector quantization (LVQ) has been used for real-time speech and audio coding systems. Compared with conventional vector quantization, LVQ has two main advantages: It has a simple and fast encoding process,... Lattice vector quantization (LVQ) has been used for real-time speech and audio coding systems. Compared with conventional vector quantization, LVQ has two main advantages: It has a simple and fast encoding process, and it significantly reduces the amount of memory required. Therefore, LVQ is suitable for use in low-complexity speech and audio coding. In this paper, we describe the basic concepts of LVQ and its advantages over conventional vector quantization. We also describe some LVQ techniques that have been used in speech and audio coding standards of international standards developing organizations (SDOs). 展开更多
关键词 Vector quantization lattice vector quantization speech and audio coding transform coding
在线阅读 下载PDF
A Review of Speech Coding 被引量:3
11
作者 Bao ChangchunAssociate professor of Information Engineering, Beijing Polytechnic University, Ph.D, CIE senior member (Department of Electronic Engineering, Beijing Polytechnic University, Beijing 100022) Fan ChangxinProfessor with Xidian University, C 《通信学报》 EI CSCD 北大核心 1998年第5期45-56,共12页
AReviewofSpechCodingBaoChangchun(DepartmentofElectronicEngineering,BeijingPolytechnicUniversity,Beijing10002... AReviewofSpechCodingBaoChangchun(DepartmentofElectronicEngineering,BeijingPolytechnicUniversity,Beijing100022)FanChangxin?.. 展开更多
关键词 语音编码 线性估计 综合分析 波形编码
在线阅读 下载PDF
A FRACTAL INTERPOLATION SPEECH CODING ALGORITHM
12
作者 周志杰 胡光锐 《Journal of Shanghai Jiaotong university(Science)》 EI 1997年第1期55-59,共5页
It is supposed that speech is the output of a LPC filter which is excited by LPC residual. Consequently, speech can be reproduced if a signal, which occupies main characteristics of the LPC residual, excites the LPC f... It is supposed that speech is the output of a LPC filter which is excited by LPC residual. Consequently, speech can be reproduced if a signal, which occupies main characteristics of the LPC residual, excites the LPC filter. Based on this hypothesis, a new speech coding algorithm is proposed. Its excitation of synthesizer is the fractal interpolation of down sampled LPC residual with the same fractal dimension of LPC residual. Computer simulation shows that this speech coding algorithm can provide high quality coded speech at bit rate of 6.4 kb/s. Some essential issues are also presented to demonstrate this algorithm such as the calculation of fractal dimension, the implementation of fractal interpolation. 展开更多
关键词 speech coding FRACTAL CHAOS
在线阅读 下载PDF
A Novel Low-bit-rate Speech Coding Based on Decomposition of the Pitch-cycle Waveform of the Linear Predictive Residual
13
作者 Bao ChangchunAssociate professor of Information Engineering, Beijing Polytechnic University, Ph.D, CIE senior member (Department of Electronic Engineering, Beijing Polytechnic University, Beijing 100022) Fan ChangxinProfessor of Information Engineerin 《通信学报》 EI CSCD 北大核心 1998年第5期39-44,共6页
ANovelLowbitrateSpechCodingBasedonDecompositionofthePitchcycleWaveformoftheLinearPredictiveResidualBaoChangc... ANovelLowbitrateSpechCodingBasedonDecompositionofthePitchcycleWaveformoftheLinearPredictiveResidualBaoChangchun(Departm... 展开更多
关键词 线性估计 语音编码 失量量化 分解 节圈波形
在线阅读 下载PDF
老年性聋患者言语声诱发听性脑干反应特征分析
14
作者 陈鱼 张玥琦 +2 位作者 李培鸿 王舒雅 王巍 《中国耳鼻咽喉头颈外科》 2025年第2期72-75,共4页
目的分析老年性聋患者言语声诱发听性脑干反应(speech-evoked auditory brainstem response,s-ABR)特征,探讨其言语编码机制。方法选取老年性聋组、老年正常组及青年对照组各30例,以40 ms复合语音/da/为刺激声,记录s-ABR波形,分析各组... 目的分析老年性聋患者言语声诱发听性脑干反应(speech-evoked auditory brainstem response,s-ABR)特征,探讨其言语编码机制。方法选取老年性聋组、老年正常组及青年对照组各30例,以40 ms复合语音/da/为刺激声,记录s-ABR波形,分析各组潜伏期、振幅及V-A复合波斜率。结果老年性聋组波V、波A的潜伏期比老年正常组及青年对照组均明显延长(P<0.05),而老年正常组与青年对照组之间各波潜伏期差异无统计学意义(P>0.05);老年性聋组波A振幅及V-A复合波斜率均显著低于青年对照组(P<0.05),其余各波振幅差异均没有统计学意义。结论老年性聋患者s-ABR的特征提示老年性聋患者对刺激声时间响应同步性较差,对高频及快速变化声音信息的编码有缺陷,是老年性聋患者言语能力下降的可能机制之一。 展开更多
关键词 诱发电位 听觉 脑干 老年性聋 言语编码
暂未订购
基于Kolmogorov-Arnold网络的混合激励线性预测语声编码改进算法
15
作者 俞英健 王德清 +1 位作者 魏云龙 张龙基 《应用声学》 北大核心 2025年第6期1641-1651,共11页
高效、高质量的极低码率语声压缩技术,是水声通信等极端通信的迫切需求。混合激励线性预测(MELP)编码通过线谱频率(LSF)等参数的量化实现压缩,是常用的语声编码方法,但其低比特率量化会带来量化失真,导致语声质量下降。为了提高MELP编... 高效、高质量的极低码率语声压缩技术,是水声通信等极端通信的迫切需求。混合激励线性预测(MELP)编码通过线谱频率(LSF)等参数的量化实现压缩,是常用的语声编码方法,但其低比特率量化会带来量化失真,导致语声质量下降。为了提高MELP编码的语声质量和效率,该文提出一种基于Kolmogorov-Arnold网络(KAN)和残差向量量化技术的KAN-AE自编解码器网络,对MELP编码中LSF参数的量化进行改进,从参数层提取接近语义层次的信息,从而减少冗余信息,实现更高效的编码。实验结果表明,在1.96 kbit/s的码率,LSF参数量化比特低于15 bit时,该方法的语声质量感知评价评分约为2.6,短时客观可懂度评分约为0.68,在多种语言和不同噪声环境下的性能良好,在嵌入式STM32MP135F-DK开发板的测试模型运行时间为2.74 ms,实时性较好。 展开更多
关键词 深度学习 重构性 低速率语声编码 混合激励线性预测
在线阅读 下载PDF
自然言语理解中预测编码的神经计算与建模
16
作者 张新淼 张丹 《心理科学》 北大核心 2025年第4期861-875,共15页
预测编码被认为是实现高效、准确言语理解的核心机制。随着自然范式的发展和大语言模型的应用,研究者得以在具有高生态效度的语境中对语言预测过程进行探索。用于探索自然言语理解中预测编码机制的计算与建模方法近年来快速发展,包括基... 预测编码被认为是实现高效、准确言语理解的核心机制。随着自然范式的发展和大语言模型的应用,研究者得以在具有高生态效度的语境中对语言预测过程进行探索。用于探索自然言语理解中预测编码机制的计算与建模方法近年来快速发展,包括基于语言模型的预测编码计算、脑际视角下的预测编码计算以及预测编码的动态机制建模等。这些方法在三个层面推动了预测编码机制的实证研究和理论进步:在现象层面,预测加工在语言理解中普遍存在,且在复杂环境中具备稳健性;在计算层面,大脑能够并行整合多层级语境,生成多尺度预测输出;在神经机制层面,多频带神经振荡协同参与预测编码。对上述方法与进展的系统梳理有助于深化对语言理解中预测机制的认识。 展开更多
关键词 预测编码 自然言语理解 大语言模型 计算建模
原文传递
基于IP包拆分重组技术的混合语音压缩编码算法研究
17
作者 李凌云 李肖克 +2 位作者 陈奕钊 王国法 王辉 《电子技术应用》 2025年第2期70-74,共5页
针对某特殊通信网业务系统中,在10 kb/s的窄带信道上传输1路标准G.729编码格式的VoIP语音数据的特殊通信场景,提出一种基于IP包拆分重组技术的混合语音压缩编码算法,将G.729压缩后的语音数据进行解压缩,再通过AMBE进行二次压缩,结合IP... 针对某特殊通信网业务系统中,在10 kb/s的窄带信道上传输1路标准G.729编码格式的VoIP语音数据的特殊通信场景,提出一种基于IP包拆分重组技术的混合语音压缩编码算法,将G.729压缩后的语音数据进行解压缩,再通过AMBE进行二次压缩,结合IP包拆分重组技术,保留语音数据中有效载荷,剔除多余开销数据,减小语音数据传输所需带宽。仿真实验验证了该方法的有效性,当G.729和AMBE的语音压缩编码速率分别为8 kb/s、2.4 kb/s,载荷长度为20 ms,IP包打包周期为8包时,实验表明无论在何种光路状态下,平均句子可懂度达85%以上,话音信号等级达3级以上,满足话音传输系统要求。 展开更多
关键词 语音压缩编码 G.729 AMBE IP包拆分重组 窄带通信
在线阅读 下载PDF
基于SIP消息的三步四维法提升4G/5G语音用户感知 被引量:1
18
作者 胡坚 李福秧 +3 位作者 丁茂 刘毅 赵星海 张叶江 《电信工程技术与标准化》 2025年第5期54-61,共8页
随着4G/5G通信技术的快速发展,VoLTE和VoNR作为基于IMS架构的语音解决方案实现了高质量语音服务,正在逐步取代传统的电路交换语音服务。SIP作为VoLTE和VoNR中的核心信令协议,在语音呼叫过程中起着至关重要的作用,但由于缺乏快速定位分... 随着4G/5G通信技术的快速发展,VoLTE和VoNR作为基于IMS架构的语音解决方案实现了高质量语音服务,正在逐步取代传统的电路交换语音服务。SIP作为VoLTE和VoNR中的核心信令协议,在语音呼叫过程中起着至关重要的作用,但由于缺乏快速定位分析工具,导致在实际优化过程中难以准确定位问题根因和提升优化效率。本文从语音失败的SIP响应码入手,结合4G/5G语音网络架构,探索如何通过SIP消息来定位遇到的注册失败和呼叫建立失败差等问题,通过工具固化分析定位思路,达到提升优化效率和快速解决问题的目的。 展开更多
关键词 SIP消息 SIP响应码 VoNR语音感知
在线阅读 下载PDF
Improved Multiple Descriptions Sinusoidal Coder Adaptive to Packet Loss Rate
19
作者 郎玥 王晶 +1 位作者 赵胜辉 匡镜明 《Journal of Beijing Institute of Technology》 EI CAS 2008年第2期202-207,共6页
To make the multiple descriptions codec adaptive to the packet loss rate, which can minimize the final distortion, a novel adaptive multiple descriptions sinusoidal coder (AMDSC) is proposed, which is based on a sin... To make the multiple descriptions codec adaptive to the packet loss rate, which can minimize the final distortion, a novel adaptive multiple descriptions sinusoidal coder (AMDSC) is proposed, which is based on a sinusoidal model and a noise model. Firstly, the sinusoidal parameters are extracted in the sinusoidal model, and ordered in a decrease manner. Odd indexed and even indexed parameters are divided into two descriptions. Secondly, the output vector from the noise model is split vector quantized. And the two sub-vectors are placed into two descriptions too. Finally, the number of the extracted parameters and the redundancy between the two descriptions are adjusted according to the packet loss rate of the network. Analytical and experimental resuits show that the proposed AMDSC outperforms existing MD speech coders by taking network loss characteristics into account. Therefore, it is very suitable for unreliable channels 展开更多
关键词 speech coding multiple descriptions coding sinusoidal model
在线阅读 下载PDF
LD-CELP语音编码算法中矢量量化过程的改进 被引量:1
20
作者 黄德智 马尽文 《电子学报》 EI CAS CSCD 北大核心 2001年第10期1415-1417,共3页
本文介绍了LD CELP算法的基本原理 ,在分析其编码的矢量量化过程的基础上提出了一种旨在提高编码速度的改进方案 .模拟实验表明 ,改进算法的编码速度平均提高了一倍 .虽然信噪比有所下降 ,但下降幅度仅为1 2dB 。
关键词 ld-celp算法 矢量量化 语音编码
在线阅读 下载PDF
上一页 1 2 42 下一页 到第
使用帮助 返回顶部