期刊文献+
共找到477篇文章
< 1 2 24 >
每页显示 20 50 100
A Novel Frame Error Concealment Scheme Based on Gain Control for TCX Audio Codec
1
作者 XIANG Kai HU Ruimin 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2016年第2期133-138,共6页
A novel frame error concealment scheme is proposed to improve the decoded audio quality of the receiver for transform coded excitation(TCX)audio codec.This scheme,which is a gain control approach based on the stabil... A novel frame error concealment scheme is proposed to improve the decoded audio quality of the receiver for transform coded excitation(TCX)audio codec.This scheme,which is a gain control approach based on the stability of linear predictive coding(LPC)filter,predicts the lost frames by utilizing the linear spectrum frequency and different continuous attenuation factor of different kinds of lost frames.Signal noise ratio(SNR)test and multiple stimuli with hidden reference and anchor(MUSHRA)test are conducted to evaluate the performance of this approach in adaptive multi-rate wideband plus(AMR-WB+)audio codec.Compared with the original frame error concealment scheme,our scheme achieves better audio recovery quality in AMR-WB+audio codec. 展开更多
关键词 frame error concealment audio codec transform coded excitation (TCX)
原文传递
MPEG-4 Audio Version2新概念
2
作者 杨杰 陈健 《电声技术》 北大核心 2000年第7期3-6,共4页
介绍了MPEG-4音频第2版的新概念,包括容错健壮性,低延迟音频编码,精细的频段分级,参数音频编码,CELP静音压缩,扩展的HVXC等。通过与第一版的比较,提出了若干改进之处。
关键词 MPEG-4 音频编码 CELP静音压缩
在线阅读 下载PDF
MPEG AUDIO LAYER 3数字音频压缩编码原理深度分析
3
作者 王伟 《煤炭技术》 CAS 北大核心 2011年第12期200-201,共2页
MPEG AUDIO LAYER 3是目前为止开发得最为成功的数字音频压缩技术之一。从音频压缩理论的角度,阐述MPEG AUDIO LAYER 3数字音频压缩编码原理。
关键词 MPEG audio LAYER 3数字音频 压缩 编码原理
原文传递
Audio Vivid标准关键技术研究及系统试验 被引量:6
4
作者 周芸 庞超 +1 位作者 王喆 郭晓强 《广播与电视技术》 2023年第7期35-42,共8页
本文在对三维声行业标准《三维声编解码及渲染》(Audio Vivid)深入研究的基础上,分析三维声编解码和渲染端到端技术框架,介绍基于神经网络的通用码率音频编码、元数据编码、扬声器渲染和双耳渲染等关键技术,给出卡塔尔世界杯期间Audio V... 本文在对三维声行业标准《三维声编解码及渲染》(Audio Vivid)深入研究的基础上,分析三维声编解码和渲染端到端技术框架,介绍基于神经网络的通用码率音频编码、元数据编码、扬声器渲染和双耳渲染等关键技术,给出卡塔尔世界杯期间Audio Vivid标准端到端技术试验情况,为Audio Vivid标准应用部署提供技术参考。 展开更多
关键词 audio Vivid 三维声 编解码 渲染 HOA空间编码 基于神经网络的音频编码
在线阅读 下载PDF
Algorithm of Adaptive Bit Allocation Wavelet Transform Audio Coding 被引量:2
5
作者 Ma HongfeiMa Hongfei:associate professor, is with the Information Science Institute, Xidian University,Xi’an,China Fan ChangxinFan Changxin:professor, IEEE fellow, is with the Information Science Institute, Xidian University, Xi’an, China Song Guo 《通信学报》 EI CSCD 北大核心 1998年第5期80-83,共4页
AlgorithmofAdaptiveBitAlocationWaveletTransformAudioCodingMaHongfeiFanChangxinSongGuoxiang(XidianUniversity,... AlgorithmofAdaptiveBitAlocationWaveletTransformAudioCodingMaHongfeiFanChangxinSongGuoxiang(XidianUniversity,Xi’an71... 展开更多
关键词 声音编码 小波变换 心理模式 自适应位分布
在线阅读 下载PDF
Nonlinear Prediction with Deep Recurrent Neural Networks for Non-Blind Audio Bandwidth Extension 被引量:2
6
作者 Lin Jiang Ruimin Hu +2 位作者 Xiaochen Wang Weiping Tu Maosheng Zhang 《China Communications》 SCIE CSCD 2018年第1期72-85,共14页
Non-blind audio bandwidth extension is a standard technique within contemporary audio codecs to efficiently code audio signals at low bitrates. In existing methods, in most cases high frequencies signal is usually gen... Non-blind audio bandwidth extension is a standard technique within contemporary audio codecs to efficiently code audio signals at low bitrates. In existing methods, in most cases high frequencies signal is usually generated by a duplication of the corresponding low frequencies and some parameters of high frequencies. However, the perception quality of coding will significantly degrade if the correlation between high frequencies and low frequencies becomes weak. In this paper, we quantitatively analyse the correlation via computing mutual information value. The analysis results show the correlation also exists in low frequency signal of the context dependent frames besides the current frame. In order to improve the perception quality of coding, we propose a novel method of high frequency coarse spectrum generation to improve the conventional replication method. In the proposed method, the coarse high frequency spectrums are generated by a nonlinear mapping model using deep recurrent neural network. The experiments confirm that the proposed method shows better performance than the reference methods. 展开更多
关键词 audio CODING non-blind audiobandwidth EXTENSION context correlation deeprecurrent neural network
在线阅读 下载PDF
High Quality Audio Object Coding Framework Based on Non-Negative Matrix Factorization 被引量:1
7
作者 Tingzhao Wu Ruimin Hu +2 位作者 Xiaochen Wang Shanfa Ke Jinshan Wang 《China Communications》 SCIE CSCD 2017年第9期32-41,共10页
Object-based audio coding is the main technique of audio scene coding. It can effectively reconstruct each object trajectory, besides provide sufficient flexibility for personalized audio scene reconstruction. So more... Object-based audio coding is the main technique of audio scene coding. It can effectively reconstruct each object trajectory, besides provide sufficient flexibility for personalized audio scene reconstruction. So more and more attentions have been paid to the object-based audio coding. However, existing object-based techniques have poor sound quality because of low parameter frequency domain resolution. In order to achieve high quality audio object coding, we propose a new coding framework with introducing the non-negative matrix factorization(NMF) method. We extract object parameters with high resolution to improve sound quality, and apply NMF method to parameter coding to reduce the high bitrate caused by high resolution. And the experimental results have shown that the proposed framework can improve the coding quality by 25%, so it can provide a better solution to encode audio scene in a more flexible and higher quality way. 展开更多
关键词 object-based audio CODING non-negative matrix FACTORIZATION audio scenecoding
在线阅读 下载PDF
High throughput bandwidth optimized VLSI design for motion compensation in AVS HDTV decoder 被引量:1
8
作者 Kai LUO Dong-xiao LI Ming ZHANG 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2008年第6期822-832,共11页
In this paper we present a motion compensation (MC) design for the newest Audio Video coding Standard (AVS) of China. Because of compression-efficient techniques of variable block size (VBS) and sub-pixel interpolatio... In this paper we present a motion compensation (MC) design for the newest Audio Video coding Standard (AVS) of China. Because of compression-efficient techniques of variable block size (VBS) and sub-pixel interpolation, intensive pixel calculation and huge memory access are required. We propose a parallel serial filtering mixed luma interpolation data flow and a three-stage multiplication free chroma interpolation scheme. Compared to the conventional designs, the integrated architecture supports about 2.7 times filtering throughput. The proposed MC design utilizes Vertical Z processing order for reference data re-use and saves up to 30% memory bandwidth. The whole design requires 44.3k gates when synthesized at 108 MHz clock frequency using 0.18-μm CMOS technology and can support up to 1920×1088@30 fps AVS HDTV video decoding. 展开更多
关键词 audio Video coding Standard (AVS) Motion compensation (MC) INTERPOLATION VLSI Architecture
在线阅读 下载PDF
Lattice Vector Quantization Applied to Speech and Audio Coding 被引量:1
9
作者 Minjie Xie 《ZTE Communications》 2012年第2期25-33,共9页
Lattice vector quantization (LVQ) has been used for real-time speech and audio coding systems. Compared with conventional vector quantization, LVQ has two main advantages: It has a simple and fast encoding process,... Lattice vector quantization (LVQ) has been used for real-time speech and audio coding systems. Compared with conventional vector quantization, LVQ has two main advantages: It has a simple and fast encoding process, and it significantly reduces the amount of memory required. Therefore, LVQ is suitable for use in low-complexity speech and audio coding. In this paper, we describe the basic concepts of LVQ and its advantages over conventional vector quantization. We also describe some LVQ techniques that have been used in speech and audio coding standards of international standards developing organizations (SDOs). 展开更多
关键词 Vector quantization lattice vector quantization speech and audio coding transform coding
在线阅读 下载PDF
MPACodec移动音频编解码技术 被引量:1
10
作者 潘兴德 《电声技术》 2011年第2期75-77,共3页
利用波形编码和参数编码的优点,设计了一种移动音频编码技术。该技术采用子带滤波、预测编码、变换编码、频带扩展和参数立体声编码技术,以较低的复杂度实现了宽带音频信号的高效率编码。
关键词 音频编码 参数立体声 频带扩展 子带预测编码
在线阅读 下载PDF
Bark-Band Residual Noise Model for Parametric Audio Coding
11
作者 王晶 晋艳伟 +1 位作者 赵胜辉 匡镜明 《Journal of Beijing Institute of Technology》 EI CAS 2004年第S1期1-6,共6页
A Bark-band residual noise model integrated with the human hearing mechanism is proposed to efficiently complement sinusoidal model in parametric audio coding. The time-varying spectrum of the residual noise is retrie... A Bark-band residual noise model integrated with the human hearing mechanism is proposed to efficiently complement sinusoidal model in parametric audio coding. The time-varying spectrum of the residual noise is retrieved by Bark-scale piecewise constant magnitude estimates along with random phases. In the proposed noise model, Bark bands information is obtained by short-time FFT method and window overlap-add technique is exploited to remove boundary discontinuities. SVQ is also incorporated into parameter quantization process for the low bit-rate coding demand. Simulation results and informal listening tests show that when the sinusoidal model is combined with the Bark-band noise model, better synthesis audio quality can be achieved compared with the original sinusoidal modeling audio codec. 展开更多
关键词 parametric audio coding sinusoidal model: residual noise model Bark band equivalent rectangular band (ERB) split vector quantization (SVQ)
在线阅读 下载PDF
3D Audio Coding Approach Based on Spatial Perception Features
12
作者 Cheng Yang Ruimin Hu +3 位作者 Xiaochen Wang Yuhong Yang Maosheng Zhang Wei Chen 《China Communications》 SCIE CSCD 2017年第11期126-140,共15页
A new three-dimensional(3D) audio coding approach is presented to improve the spatial perceptual quality of 3D audio. Different from other audio coding approaches, the distance side information is also quantified, and... A new three-dimensional(3D) audio coding approach is presented to improve the spatial perceptual quality of 3D audio. Different from other audio coding approaches, the distance side information is also quantified, and the non-uniform perceptual quantization is proposed based on the spatial perception features of the human auditory system, which is named as concentric spheres spatial quantization(CSSQ) method. Comparison results were presented, which showed that a better distance perceptual quality of 3D audio can be enhanced by 5.7%~8.8% through extracting and coding the distance side information comparing with the directional audio coding, and the bit rate of our coding method is decreased of 8.07% comparing with the spatial squeeze surround audio coding. 展开更多
关键词 3D audio coding non-uniform perceptual quantization distance perceptual quality
在线阅读 下载PDF
Improved Sinusoid Analysis and Post-Processing in Parametric Audio Coding
13
作者 周宏 陈健 《Journal of Shanghai Jiaotong university(Science)》 EI 2003年第2期163-168,共6页
This paper proposed improvements to the low bit rate parametric audio coder with sinusoid model as its kernel. Firstly, we propose a new method to effectively order and select the perceptually most important sinusoids... This paper proposed improvements to the low bit rate parametric audio coder with sinusoid model as its kernel. Firstly, we propose a new method to effectively order and select the perceptually most important sinusoids. The sinusoid which contributes most to the reduction of overall NMR is chosen. Combined with our improved parametric psychoacoustic model and advanced peak riddling techniques, the number of sinusoids required can be greatly reduced and the coding efficiency can be greatly enhanced. A lightweight version is also given to reduce the amount of computation with only little sacrifice of performance. Secondly, we propose two enhancement techniques for sinusoid synthesis: bandwidth enhancement and line enhancement. With little overhead, the effective bandwidth can be extended one more octave; the timbre tends to sound much brighter, thicker and more beautiful. 展开更多
关键词 parametric audio coding SINUSOID POST-PROCESSING
在线阅读 下载PDF
Review of AVS Audio Coding Standard
14
作者 ZHANG Tao ZHANG Caixia ZHAO Xin 《ZTE Communications》 2016年第2期56-62,共7页
Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first standard for audio and video coding in China with independent intellectual property rights. Its performance has reached t... Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first standard for audio and video coding in China with independent intellectual property rights. Its performance has reached the international standard. Its coding efficiency is 2 to 3 times greater than that of MPEG -2. This technical solution is more simple, and it can greatly save channel resource. After more than ten years' development, AVS has achieved great success. The latest version of the AVS audio coding standard is ongoing and mainly aims at the increasing demand for low bitrate and high quality audio services. The paper reviews the history and recent development of AVS audio coding standard in terms of basic features, key techniques and performance. Finally, the future development of AVS audio coding standard is discussed. 展开更多
关键词 audio Video Coding Standard (AVS) audio coding AVS1 au-dio AVS2 audio
在线阅读 下载PDF
SCALABLE PERCEPTUAL AUDIO REPRESENTATION WITH AN ADAPTIVE THREE TIME-SCALE SINUSOIDAL SIGNAL MODEL
15
作者 Al-Moussawy Raed 《Journal of Electronics(China)》 2004年第3期213-221,共9页
This work is concerned with the development and optimization of a signal model for scalable perceptual audio coding at low bit rates. A complementary two-part signal model consisting of Sines plus Noise (SN) is descri... This work is concerned with the development and optimization of a signal model for scalable perceptual audio coding at low bit rates. A complementary two-part signal model consisting of Sines plus Noise (SN) is described. The paper presents essentially a fundamental enhancement to the sinusoidal modeling component. The enhancement involves an audio signal scheme based on carrying out overlap-add sinusoidal modeling at three successive time scales, large, medium, and small. The sinusoidal modeling is done in an analysis-by-synthesis overlap- add manner across the three scales by using a psychoacoustically weighted matching pursuits. The sinusoidal modeling residual at the first scale is passed to the smaller scales to allow for the modeling of various signal features at appropriate resolutions.This approach greatly helps to correct the pre-echo inherent in the sinusoidal model. This improves the perceptual audio quality upon our previous work of sinusoidal modeling while using tile same number of sinusoids. Tile most obvious application for the SN model is in scalable, high fidelity audio coding and signal modification. 展开更多
关键词 Multiresolution sinusoidal modeling Parametric audio coding Low-rate audio coding Signal modifications
在线阅读 下载PDF
Quantization of wavelet packet audio coding
16
作者 谭建国 Zhang +1 位作者 Wenjun LiuPeilin 《High Technology Letters》 EI CAS 2006年第3期295-299,共5页
Abstract The method of quantization noise control of audio coding in the wavelet domain is proposed. Using the inverse Discrete Fourier Transform (DFT), it converts the masking threshold coming from MPEG psycho-acou... Abstract The method of quantization noise control of audio coding in the wavelet domain is proposed. Using the inverse Discrete Fourier Transform (DFT), it converts the masking threshold coming from MPEG psycho-acoustic model in the frequency domain to the signal in the time domain; the Discrete Wavelet Packet Transform (DWPF) is performed; the energy in each subband is regarded as the maximum allowed quantization noise energy. The experimental result shows that the proposed method can attain the nearly transparent audio quality below 64kbps for the most testing audio signals. 展开更多
关键词 QUANTIZATION wavelet packet audio coding DFT
在线阅读 下载PDF
Analysis and application of error concealment tools in AVS-M decoder
17
作者 YANG Cheng SHI Lei WU Xiao-yang ZHANG Ci-xun 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2006年第z1期54-58,共5页
Audio Video coding Standard (AVS) is the latest audio and video coding standard of China. AVS Part 7 (also known as AVS-M) targets mobility applications where error concealment is of great importance. This paper first... Audio Video coding Standard (AVS) is the latest audio and video coding standard of China. AVS Part 7 (also known as AVS-M) targets mobility applications where error concealment is of great importance. This paper first briefly introduces the general concept of error concealment. Then two error concealment schemes are proposed and implemented on AVS-M decoder under different test conditions. Simulation results of the schemes and suggestions on how to use these tools are also provided. 展开更多
关键词 audio VIDEO coding Standard (AVS) Error concealment VIDEO communication
在线阅读 下载PDF
HI-FI AUDIO CODING TECHNOLOGY FOR ISDN
18
作者 黄晓利 陈健 《Journal of Shanghai Jiaotong university(Science)》 EI 1998年第2期63-67,共5页
A Hi Fi audio coding technology for ISDN and Internet is introduced. It is the ISO/MPEG Audio Layer III digital audio compression scheme coding at 64 kbit/s. First, the paper implements C language simulation accordin... A Hi Fi audio coding technology for ISDN and Internet is introduced. It is the ISO/MPEG Audio Layer III digital audio compression scheme coding at 64 kbit/s. First, the paper implements C language simulation according to the algorithm and gets satisfactory quality of the reconstructed music signal. The estimation of operation steps and simulation of decoder finished by a TMS 320C548 simulator are presented. The result is the same as that of the C language simulation. 展开更多
关键词 source CODING audio compression MPEG SIGNAL processing
在线阅读 下载PDF
MAP-based Audio Coding Compensation for Speaker Recognition
19
作者 Tao Jiang Jiqing Han 《Journal of Signal and Information Processing》 2011年第3期165-169,共5页
The performance of the speaker recognition system declines when training and testing audio codecs are mismatched. In this paper, based on analyzing the effect of mismatched audio codecs in the linear prediction cepstr... The performance of the speaker recognition system declines when training and testing audio codecs are mismatched. In this paper, based on analyzing the effect of mismatched audio codecs in the linear prediction cepstrum coefficients, a method of MAP-based audio coding compensation for speaker recognition is proposed. The proposed method firstly sets a standard codec as a reference and trains the speaker models in this codec format, then learns the deviation distributions between the standard codec format and the other ones, next gets the current bias via using a small number adaptive data and the MAP-based adaptive technique, and then adjusts the model parameters by the type of coming audio codec format and its related bias. During the test, the features of the coming speaker are used to match with the adjusted model. The experimental result shows that the accuracy reached 82.4% with just one second adaptive data, which is higher 5.5% than that in the baseline system. 展开更多
关键词 audio CODING COMPENSATION SPEAKER RECOGNITION MAP-Based
在线阅读 下载PDF
基于音视频编解码技术的高清视频会议终端设计
20
作者 郭斌 《无线互联科技》 2025年第22期69-72,共4页
针对当前高清视频会议终端在实际应用过程中存在的视频清晰度欠佳、流畅度不足等问题,文章提出一种基于音视频编解码技术的高清视频会议终端设计方案。该研究以开发板作为核心硬件,通过设计多样化接口,实现终端的数据交互功能。运用音... 针对当前高清视频会议终端在实际应用过程中存在的视频清晰度欠佳、流畅度不足等问题,文章提出一种基于音视频编解码技术的高清视频会议终端设计方案。该研究以开发板作为核心硬件,通过设计多样化接口,实现终端的数据交互功能。运用音视频编解码技术对视频流数据进行编码与解码处理并对解码后的视频开展可视化处理,从而完成高清视频会议终端的设计。实验结果表明,所设计终端的视频分辨率能够达到1080P,帧率高于100 fps,终端性能表现良好,可有效保障会议视频具备较高的清晰度与流畅度。 展开更多
关键词 音视频编解码技术 高清视频会议 终端
在线阅读 下载PDF
上一页 1 2 24 下一页 到第
使用帮助 返回顶部