Unequal Error Protection Based on Expanding Window Fountain for Object-Based 3D Audio
1
Authors: YANG Cheng, HU Ruimin, SONG Yucheng, SU Liuyue, WANG Xiaochen, CHEN Wei. Wuhan University Journal of Natural Sciences (CAS, CSCD), 2017, No. 4, pp. 323-328 (6 pages)
This paper proposes an unequal error protection (UEP) coding method based on expanding window fountain (EWF) to improve the transmission performance of three-dimensional (3D) audio. Unlike transmission schemes that apply equal error protection (EEP) to all 3D audio objects, the method extracts the important audio object and gives it more protection, while the normal audio objects receive comparatively less. Objective and subjective experiments show that the proposed UEP method outperforms the EEP method: the bit error rate (BER) of the important audio object decreases from 10^(–3) to 10^(–4), and the subjective quality of UEP is 14% better than that of EEP.
Keywords: object-based 3D audio; unequal error protection; equal error protection
High Quality Audio Object Coding Framework Based on Non-Negative Matrix Factorization (cited by: 1)
2
Authors: Tingzhao Wu, Ruimin Hu, Xiaochen Wang, Shanfa Ke, Jinshan Wang. China Communications (SCIE, CSCD), 2017, No. 9, pp. 32-41 (10 pages)
Object-based audio coding is the main technique of audio scene coding. It can effectively reconstruct each object's trajectory and provides sufficient flexibility for personalized audio scene reconstruction, so it has attracted increasing attention. However, existing object-based techniques have poor sound quality because the frequency-domain resolution of their parameters is low. To achieve high-quality audio object coding, we propose a new coding framework that introduces the non-negative matrix factorization (NMF) method. We extract object parameters at high resolution to improve sound quality, and apply NMF to parameter coding to reduce the high bitrate caused by the high resolution. Experimental results show that the proposed framework improves coding quality by 25%, providing a more flexible and higher-quality way to encode audio scenes.
Keywords: object-based audio coding; non-negative matrix factorization; audio scene coding
Lattice Vector Quantization Applied to Speech and Audio Coding (cited by: 1)
3
Author: Minjie Xie. ZTE Communications, 2012, No. 2, pp. 25-33 (9 pages)
Lattice vector quantization (LVQ) has been used for real-time speech and audio coding systems. Compared with conventional vector quantization, LVQ has two main advantages: it has a simple and fast encoding process, and it significantly reduces the amount of memory required. Therefore, LVQ is suitable for use in low-complexity speech and audio coding. In this paper, we describe the basic concepts of LVQ and its advantages over conventional vector quantization. We also describe some LVQ techniques that have been used in speech and audio coding standards of international standards developing organizations (SDOs).
Keywords: vector quantization; lattice vector quantization; speech and audio coding; transform coding
Algorithm of Adaptive Bit Allocation Wavelet Transform Audio Coding (cited by: 2)
4
Authors: Ma Hongfei (associate professor, Information Science Institute, Xidian University, Xi'an, China), Fan Changxin (professor, IEEE Fellow, Information Science Institute, Xidian University, Xi'an, China), Song Guoxiang. 《通信学报》 (Journal on Communications) (EI, CSCD, PKU Core), 1998, No. 5, pp. 80-83 (4 pages)
Algorithm of Adaptive Bit Allocation Wavelet Transform Audio Coding. Ma Hongfei, Fan Changxin, Song Guoxiang (Xidian University, Xi'an 71...
Keywords: audio coding; wavelet transform; psychoacoustic model; adaptive bit allocation
HI-FI AUDIO CODING TECHNOLOGY FOR ISDN
5
Authors: 黄晓利, 陈健. Journal of Shanghai Jiaotong University (Science) (EI), 1998, No. 2, pp. 63-67 (5 pages)
A Hi-Fi audio coding technology for ISDN and the Internet is introduced: the ISO/MPEG Audio Layer III digital audio compression scheme, coding at 64 kbit/s. First, the algorithm is simulated in C, and the reconstructed music signal is of satisfactory quality. An estimate of the number of operation steps and a decoder simulation run on a TMS320C548 simulator are then presented. The result is the same as that of the C language simulation.
Keywords: source coding; audio compression; MPEG; signal processing
Review of AVS Audio Coding Standard
6
Authors: ZHANG Tao, ZHANG Caixia, ZHAO Xin. ZTE Communications, 2016, No. 2, pp. 56-62 (7 pages)
Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first audio and video coding standard in China with independent intellectual property rights. Its performance has reached the international standard. Its coding efficiency is 2 to 3 times that of MPEG-2. The technical solution is simpler, and it can greatly save channel resources. After more than ten years of development, AVS has achieved great success. The latest version of the AVS audio coding standard is under development and mainly targets the increasing demand for low-bitrate, high-quality audio services. The paper reviews the history and recent development of the AVS audio coding standard in terms of basic features, key techniques, and performance. Finally, the future development of the AVS audio coding standard is discussed.
Keywords: Audio Video Coding Standard (AVS); audio coding; AVS1 audio; AVS2 audio
A Novel Frame Error Concealment Scheme Based on Gain Control for TCX Audio Codec
7
Authors: XIANG Kai, HU Ruimin. Wuhan University Journal of Natural Sciences (CAS, CSCD), 2016, No. 2, pp. 133-138 (6 pages)
A novel frame error concealment scheme is proposed to improve the decoded audio quality at the receiver for the transform coded excitation (TCX) audio codec. The scheme, a gain control approach based on the stability of the linear predictive coding (LPC) filter, predicts lost frames using the linear spectrum frequency and a different continuous attenuation factor for each kind of lost frame. Signal-to-noise ratio (SNR) tests and multiple stimuli with hidden reference and anchor (MUSHRA) tests are conducted to evaluate the approach in the adaptive multi-rate wideband plus (AMR-WB+) audio codec. Compared with the original frame error concealment scheme, our scheme achieves better audio recovery quality in the AMR-WB+ audio codec.
Keywords: frame error concealment; audio codec; transform coded excitation (TCX)
3D Audio Coding Approach Based on Spatial Perception Features
8
Authors: Cheng Yang, Ruimin Hu, Xiaochen Wang, Yuhong Yang, Maosheng Zhang, Wei Chen. China Communications (SCIE, CSCD), 2017, No. 11, pp. 126-140 (15 pages)
A new three-dimensional (3D) audio coding approach is presented to improve the spatial perceptual quality of 3D audio. Unlike other audio coding approaches, it also quantizes the distance side information, using a non-uniform perceptual quantization based on the spatial perception features of the human auditory system, named the concentric spheres spatial quantization (CSSQ) method. Comparison results show that extracting and coding the distance side information improves the distance perceptual quality of 3D audio by 5.7% to 8.8% compared with directional audio coding, and that the bit rate of the proposed coding method is 8.07% lower than that of spatial squeeze surround audio coding.
Keywords: 3D audio coding; non-uniform perceptual quantization; distance perceptual quality
Bark-Band Residual Noise Model for Parametric Audio Coding
9
Authors: 王晶, 晋艳伟, 赵胜辉, 匡镜明. Journal of Beijing Institute of Technology (EI, CAS), 2004, No. S1, pp. 1-6 (6 pages)
A Bark-band residual noise model integrated with the human hearing mechanism is proposed to efficiently complement the sinusoidal model in parametric audio coding. The time-varying spectrum of the residual noise is retrieved from Bark-scale piecewise-constant magnitude estimates along with random phases. In the proposed noise model, Bark-band information is obtained by the short-time FFT method, and the window overlap-add technique is used to remove boundary discontinuities. Split vector quantization (SVQ) is also incorporated into the parameter quantization process to meet the low-bit-rate coding demand. Simulation results and informal listening tests show that combining the sinusoidal model with the Bark-band noise model yields better synthesized audio quality than the original sinusoidal modeling audio codec.
Keywords: parametric audio coding; sinusoidal model; residual noise model; Bark band; equivalent rectangular band (ERB); split vector quantization (SVQ)
Improved Sinusoid Analysis and Post-Processing in Parametric Audio Coding
10
Authors: 周宏, 陈健. Journal of Shanghai Jiaotong University (Science) (EI), 2003, No. 2, pp. 163-168 (6 pages)
This paper proposes improvements to the low-bit-rate parametric audio coder with a sinusoid model as its kernel. First, we propose a new method to effectively order and select the perceptually most important sinusoids: the sinusoid that contributes most to the reduction of the overall noise-to-mask ratio (NMR) is chosen. Combined with our improved parametric psychoacoustic model and advanced peak riddling techniques, this greatly reduces the number of sinusoids required and greatly enhances coding efficiency. A lightweight version is also given that reduces the amount of computation with only a small sacrifice in performance. Second, we propose two enhancement techniques for sinusoid synthesis: bandwidth enhancement and line enhancement. With little overhead, the effective bandwidth can be extended by one more octave, and the timbre tends to sound much brighter, thicker and more beautiful.
Keywords: parametric audio coding; sinusoid; post-processing
Quantization of wavelet packet audio coding
11
Authors: 谭建国, Zhang Wenjun, Liu Peilin. High Technology Letters (EI, CAS), 2006, No. 3, pp. 295-299 (5 pages)
A method of quantization noise control for audio coding in the wavelet domain is proposed. Using the inverse discrete Fourier transform (DFT), it converts the masking threshold from the MPEG psychoacoustic model in the frequency domain into a time-domain signal; the discrete wavelet packet transform (DWPT) is then performed, and the energy in each subband is regarded as the maximum allowed quantization noise energy. Experimental results show that the proposed method attains nearly transparent audio quality below 64 kbps for most test audio signals.
Keywords: quantization; wavelet packet; audio coding; DFT
MAP-based Audio Coding Compensation for Speaker Recognition
12
Authors: Tao Jiang, Jiqing Han. Journal of Signal and Information Processing, 2011, No. 3, pp. 165-169 (5 pages)
The performance of a speaker recognition system declines when the training and testing audio codecs are mismatched. In this paper, based on an analysis of the effect of mismatched audio codecs on the linear prediction cepstrum coefficients, a MAP-based audio coding compensation method for speaker recognition is proposed. The method first sets a standard codec as a reference and trains the speaker models in that codec format, then learns the deviation distributions between the standard codec format and the others, next estimates the current bias from a small amount of adaptation data using the MAP-based adaptation technique, and finally adjusts the model parameters according to the type of the incoming audio codec format and its related bias. During testing, the features of the incoming speaker are matched against the adjusted model. Experimental results show that accuracy reaches 82.4% with just one second of adaptation data, 5.5% higher than the baseline system.
Keywords: audio coding; compensation; speaker recognition; MAP-based
Interpolation-Based Reversible Data Hiding in Encrypted Audio with Scalable Embedding Capacity
13
Authors: Yuan-Yu Tsai, Alfrindo Lin, Wen-Ting Jao, Yi-Hui Chen. Computers, Materials & Continua, 2025, No. 7, pp. 681-697 (17 pages)
With the rapid expansion of multimedia data, protecting digital information has become increasingly critical. Reversible data hiding offers an effective solution by allowing sensitive information to be embedded in multimedia files while enabling full recovery of the original data after extraction. Audio, as a vital medium in communication, entertainment, and information sharing, demands the same level of security as images. However, embedding data in encrypted audio poses unique challenges due to the trade-offs between security, data integrity, and embedding capacity. This paper presents a novel interpolation-based reversible data hiding algorithm for encrypted audio that achieves scalable embedding capacity. By increasing sample density through interpolation, embedding opportunities are significantly enhanced while maintaining encryption throughout the process. The method further integrates multiple most significant bit (multi-MSB) prediction and Huffman coding to optimize compression and embedding efficiency. Experimental results on standard audio datasets demonstrate the proposed algorithm's ability to embed up to 12.47 bits per sample, with over 9.26 bits per sample available for pure embedding capacity, while preserving full reversibility. These results confirm the method's suitability for secure applications that demand high embedding capacity and perfect reconstruction of the original audio. This work advances reversible data hiding in encrypted audio by offering a secure, efficient, and fully reversible data hiding framework.
Keywords: reversible data hiding; encrypted audio; interpolation sampling; multi-MSB prediction; Huffman coding
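New Concepts in MPEG-4 Audio Version 2
14
Authors: 杨杰, 陈健. 《电声技术》 (Audio Engineering) (PKU Core), 2000, No. 7, pp. 3-6 (4 pages)
This paper introduces the new concepts in MPEG-4 Audio Version 2, including error robustness, low-delay audio coding, fine-grained scalability, parametric audio coding, CELP silence compression, and extended HVXC. Several improvements are identified through comparison with Version 1.
Keywords: MPEG-4; audio coding; CELP silence compression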
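In-Depth Analysis of the Principles of MPEG Audio Layer 3 Digital Audio Compression Coding
15
Author: 王伟. 《煤炭技术》 (Coal Technology) (CAS, PKU Core), 2011, No. 12, pp. 200-201 (2 pages)
MPEG Audio Layer 3 is one of the most successful digital audio compression technologies developed to date. This paper explains the principles of MPEG Audio Layer 3 digital audio compression coding from the perspective of audio compression theory.
Keywords: MPEG Audio Layer 3; digital audio; compression; coding principles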
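Research on Key Technologies of the Audio Vivid Standard and System Trials (cited by: 6)
16
Authors: 周芸, 庞超, 王喆, 郭晓强. 《广播与电视技术》 (Radio & TV Broadcast Engineering), 2023, No. 7, pp. 35-42 (8 pages)
Based on an in-depth study of the 3D audio industry standard "Three-Dimensional Audio Coding, Decoding and Rendering" (《三维声编解码及渲染》, Audio Vivid), this paper analyzes the end-to-end technical framework for 3D audio coding, decoding, and rendering. It introduces key technologies including neural-network-based general-bitrate audio coding, metadata coding, loudspeaker rendering, and binaural rendering. It also reports on end-to-end technical trials of the Audio Vivid standard during the Qatar World Cup, providing a technical reference for deploying the standard.
Keywords: Audio Vivid; 3D audio; coding and decoding; rendering; HOA spatial coding; neural-network-based audio coding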
Nonlinear Prediction with Deep Recurrent Neural Networks for Non-Blind Audio Bandwidth Extension (cited by: 2)
17
Authors: Lin Jiang, Ruimin Hu, Xiaochen Wang, Weiping Tu, Maosheng Zhang. China Communications (SCIE, CSCD), 2018, No. 1, pp. 72-85 (14 pages)
Non-blind audio bandwidth extension is a standard technique in contemporary audio codecs for coding audio signals efficiently at low bitrates. In most existing methods, the high-frequency signal is generated by duplicating the corresponding low frequencies and applying some high-frequency parameters. However, the perceptual quality of the coding degrades significantly if the correlation between high and low frequencies becomes weak. In this paper, we quantitatively analyse the correlation by computing the mutual information. The analysis shows that the correlation also exists with the low-frequency signal of context-dependent frames, not only the current frame. To improve the perceptual quality of coding, we propose a novel method of coarse high-frequency spectrum generation to improve on the conventional replication method. In the proposed method, the coarse high-frequency spectra are generated by a nonlinear mapping model using a deep recurrent neural network. Experiments confirm that the proposed method performs better than the reference methods.
Keywords: audio coding; non-blind audio bandwidth extension; context correlation; deep recurrent neural network
High throughput bandwidth optimized VLSI design for motion compensation in AVS HDTV decoder (cited by: 1)
18
Authors: Kai LUO, Dong-xiao LI, Ming ZHANG. Journal of Zhejiang University-Science A (Applied Physics & Engineering) (SCIE, EI, CAS, CSCD), 2008, No. 6, pp. 822-832 (11 pages)
In this paper we present a motion compensation (MC) design for the newest Audio Video coding Standard (AVS) of China. Because of the compression-efficient techniques of variable block size (VBS) and sub-pixel interpolation, intensive pixel calculation and heavy memory access are required. We propose a mixed parallel-serial filtering data flow for luma interpolation and a three-stage multiplication-free chroma interpolation scheme. Compared with conventional designs, the integrated architecture supports about 2.7 times the filtering throughput. The proposed MC design uses a Vertical Z processing order for reference data re-use and saves up to 30% of memory bandwidth. The whole design requires 44.3k gates when synthesized at a 108 MHz clock frequency in 0.18-μm CMOS technology and supports up to 1920×1088@30 fps AVS HDTV video decoding.
Keywords: Audio Video coding Standard (AVS); motion compensation (MC); interpolation; VLSI architecture
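MPACodec Mobile Audio Codec Technology (cited by: 1)
19
Author: 潘兴德. 《电声技术》 (Audio Engineering), 2011, No. 2, pp. 75-77 (3 pages)
Drawing on the advantages of both waveform coding and parametric coding, a mobile audio coding technique is designed. It uses subband filtering, predictive coding, transform coding, bandwidth extension, and parametric stereo coding to encode wideband audio signals efficiently at relatively low complexity.
Keywords: audio coding; parametric stereo; bandwidth extension; subband predictive coding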
Analysis and application of error concealment tools in AVS-M decoder
20
Authors: YANG Cheng, SHI Lei, WU Xiao-yang, ZHANG Ci-xun. Journal of Zhejiang University-Science A (Applied Physics & Engineering) (SCIE, EI, CAS, CSCD), 2006, No. z1, pp. 54-58 (5 pages)
Audio Video coding Standard (AVS) is the latest audio and video coding standard of China. AVS Part 7 (also known as AVS-M) targets mobility applications, where error concealment is of great importance. This paper first briefly introduces the general concept of error concealment. Two error concealment schemes are then proposed and implemented on an AVS-M decoder under different test conditions. Simulation results for the schemes and suggestions on how to use these tools are also provided.
Keywords: Audio Video coding Standard (AVS); error concealment; video communication