期刊文献+
共找到488篇文章
< 1 2 25 >
每页显示 20 50 100
Unequal Error Protection Based on Expanding Window Fountain for Object-Based 3D Audio
1
作者 YANG Cheng HU Ruimin +3 位作者 SONG Yucheng SU Liuyue WANG Xiaochen CHEN Wei 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2017年第4期323-328,共6页
This paper proposes an unequal error protection(UEP)coding method to improve the transmission performance of three-dimensional(3D)audio based on expanding window fountain(EWF).Different from other transmissions ... This paper proposes an unequal error protection(UEP)coding method to improve the transmission performance of three-dimensional(3D)audio based on expanding window fountain(EWF).Different from other transmissions with equal error protection(EEP)when transmitting the 3D audio objects.An approach of extracting the important audio object is presented,and more protection is given to more important audio object and comparatively less protection is given to the normal audio objects.Objective and subjective experiments have shown that the proposed UEP method achieves better performance than equal error protection method,while the bits error rates(BER)of the important audio object can decrease from 10^(–3) to 10^(–4),and the subjective quality of UEP is better than that of EEP by 14%. 展开更多
关键词 object-based 3D audio unequal error protection equal error protection
原文传递
Interpolation-Based Reversible Data Hiding in Encrypted Audio with Scalable Embedding Capacity
2
作者 Yuan-Yu Tsai Alfrindo Lin +1 位作者 Wen-Ting Jao Yi-Hui Chen 《Computers, Materials & Continua》 2025年第7期681-697,共17页
With the rapid expansion of multimedia data,protecting digital information has become increasingly critical.Reversible data hiding offers an effective solution by allowing sensitive information to be embedded in multi... With the rapid expansion of multimedia data,protecting digital information has become increasingly critical.Reversible data hiding offers an effective solution by allowing sensitive information to be embedded in multimedia files while enabling full recovery of the original data after extraction.Audio,as a vital medium in communication,entertainment,and information sharing,demands the same level of security as images.However,embedding data in encrypted audio poses unique challenges due to the trade-offs between security,data integrity,and embedding capacity.This paper presents a novel interpolation-based reversible data hiding algorithm for encrypted audio that achieves scalable embedding capacity.By increasing sample density through interpolation,embedding opportunities are significantly enhanced while maintaining encryption throughout the process.The method further integrates multiple most significant bit(multi-MSB)prediction and Huffman coding to optimize compression and embedding efficiency.Experimental results on standard audio datasets demonstrate the proposed algorithm’s ability to embed up to 12.47 bits per sample with over 9.26 bits per sample available for pure embedding capacity,while preserving full reversibility.These results confirm the method’s suitability for secure applications that demand high embedding capacity and perfect reconstruction of original audio.This work advances reversible data hiding in encrypted audio by offering a secure,efficient,and fully reversible data hiding framework. 展开更多
关键词 Reversible data hiding encrypted audio INTERPOLATION sampling multi-MSB prediction Huffman coding
在线阅读 下载PDF
High Quality Audio Object Coding Framework Based on Non-Negative Matrix Factorization 被引量:1
3
作者 Tingzhao Wu Ruimin Hu +2 位作者 Xiaochen Wang Shanfa Ke Jinshan Wang 《China Communications》 SCIE CSCD 2017年第9期32-41,共10页
Object-based audio coding is the main technique of audio scene coding. It can effectively reconstruct each object trajectory, besides provide sufficient flexibility for personalized audio scene reconstruction. So more... Object-based audio coding is the main technique of audio scene coding. It can effectively reconstruct each object trajectory, besides provide sufficient flexibility for personalized audio scene reconstruction. So more and more attentions have been paid to the object-based audio coding. However, existing object-based techniques have poor sound quality because of low parameter frequency domain resolution. In order to achieve high quality audio object coding, we propose a new coding framework with introducing the non-negative matrix factorization(NMF) method. We extract object parameters with high resolution to improve sound quality, and apply NMF method to parameter coding to reduce the high bitrate caused by high resolution. And the experimental results have shown that the proposed framework can improve the coding quality by 25%, so it can provide a better solution to encode audio scene in a more flexible and higher quality way. 展开更多
关键词 object-based audio coding non-negative matrix FACTORIZATION audio scenecoding
在线阅读 下载PDF
Lattice Vector Quantization Applied to Speech and Audio Coding 被引量:1
4
作者 Minjie Xie 《ZTE Communications》 2012年第2期25-33,共9页
Lattice vector quantization (LVQ) has been used for real-time speech and audio coding systems. Compared with conventional vector quantization, LVQ has two main advantages: It has a simple and fast encoding process,... Lattice vector quantization (LVQ) has been used for real-time speech and audio coding systems. Compared with conventional vector quantization, LVQ has two main advantages: It has a simple and fast encoding process, and it significantly reduces the amount of memory required. Therefore, LVQ is suitable for use in low-complexity speech and audio coding. In this paper, we describe the basic concepts of LVQ and its advantages over conventional vector quantization. We also describe some LVQ techniques that have been used in speech and audio coding standards of international standards developing organizations (SDOs). 展开更多
关键词 Vector quantization lattice vector quantization speech and audio coding transform coding
在线阅读 下载PDF
Algorithm of Adaptive Bit Allocation Wavelet Transform Audio Coding 被引量:2
5
作者 Ma HongfeiMa Hongfei:associate professor, is with the Information Science Institute, Xidian University,Xi’an,China Fan ChangxinFan Changxin:professor, IEEE fellow, is with the Information Science Institute, Xidian University, Xi’an, China Song Guo 《通信学报》 EI CSCD 北大核心 1998年第5期80-83,共4页
AlgorithmofAdaptiveBitAlocationWaveletTransformAudioCodingMaHongfeiFanChangxinSongGuoxiang(XidianUniversity,... AlgorithmofAdaptiveBitAlocationWaveletTransformAudioCodingMaHongfeiFanChangxinSongGuoxiang(XidianUniversity,Xi’an71... 展开更多
关键词 声音编码 小波变换 心理模式 自适应位分布
在线阅读 下载PDF
HI-FI AUDIO CODING TECHNOLOGY FOR ISDN
6
作者 黄晓利 陈健 《Journal of Shanghai Jiaotong university(Science)》 EI 1998年第2期63-67,共5页
A Hi Fi audio coding technology for ISDN and Internet is introduced. It is the ISO/MPEG Audio Layer III digital audio compression scheme coding at 64 kbit/s. First, the paper implements C language simulation accordin... A Hi Fi audio coding technology for ISDN and Internet is introduced. It is the ISO/MPEG Audio Layer III digital audio compression scheme coding at 64 kbit/s. First, the paper implements C language simulation according to the algorithm and gets satisfactory quality of the reconstructed music signal. The estimation of operation steps and simulation of decoder finished by a TMS 320C548 simulator are presented. The result is the same as that of the C language simulation. 展开更多
关键词 source coding audio compression MPEG SIGNAL processing
在线阅读 下载PDF
Review of AVS Audio Coding Standard
7
作者 ZHANG Tao ZHANG Caixia ZHAO Xin 《ZTE Communications》 2016年第2期56-62,共7页
Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first standard for audio and video coding in China with independent intellectual property rights. Its performance has reached t... Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first standard for audio and video coding in China with independent intellectual property rights. Its performance has reached the international standard. Its coding efficiency is 2 to 3 times greater than that of MPEG -2. This technical solution is more simple, and it can greatly save channel resource. After more than ten years' development, AVS has achieved great success. The latest version of the AVS audio coding standard is ongoing and mainly aims at the increasing demand for low bitrate and high quality audio services. The paper reviews the history and recent development of AVS audio coding standard in terms of basic features, key techniques and performance. Finally, the future development of AVS audio coding standard is discussed. 展开更多
关键词 audio Video coding Standard (AVS) audio coding AVS1 au-dio AVS2 audio
在线阅读 下载PDF
A Novel Frame Error Concealment Scheme Based on Gain Control for TCX Audio Codec
8
作者 XIANG Kai HU Ruimin 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2016年第2期133-138,共6页
A novel frame error concealment scheme is proposed to improve the decoded audio quality of the receiver for transform coded excitation(TCX)audio codec.This scheme,which is a gain control approach based on the stabil... A novel frame error concealment scheme is proposed to improve the decoded audio quality of the receiver for transform coded excitation(TCX)audio codec.This scheme,which is a gain control approach based on the stability of linear predictive coding(LPC)filter,predicts the lost frames by utilizing the linear spectrum frequency and different continuous attenuation factor of different kinds of lost frames.Signal noise ratio(SNR)test and multiple stimuli with hidden reference and anchor(MUSHRA)test are conducted to evaluate the performance of this approach in adaptive multi-rate wideband plus(AMR-WB+)audio codec.Compared with the original frame error concealment scheme,our scheme achieves better audio recovery quality in AMR-WB+audio codec. 展开更多
关键词 frame error concealment audio codec transform coded excitation (TCX)
原文传递
3D Audio Coding Approach Based on Spatial Perception Features
9
作者 Cheng Yang Ruimin Hu +3 位作者 Xiaochen Wang Yuhong Yang Maosheng Zhang Wei Chen 《China Communications》 SCIE CSCD 2017年第11期126-140,共15页
A new three-dimensional(3D) audio coding approach is presented to improve the spatial perceptual quality of 3D audio. Different from other audio coding approaches, the distance side information is also quantified, and... A new three-dimensional(3D) audio coding approach is presented to improve the spatial perceptual quality of 3D audio. Different from other audio coding approaches, the distance side information is also quantified, and the non-uniform perceptual quantization is proposed based on the spatial perception features of the human auditory system, which is named as concentric spheres spatial quantization(CSSQ) method. Comparison results were presented, which showed that a better distance perceptual quality of 3D audio can be enhanced by 5.7%~8.8% through extracting and coding the distance side information comparing with the directional audio coding, and the bit rate of our coding method is decreased of 8.07% comparing with the spatial squeeze surround audio coding. 展开更多
关键词 3D audio coding non-uniform perceptual quantization distance perceptual quality
在线阅读 下载PDF
Bark-Band Residual Noise Model for Parametric Audio Coding
10
作者 王晶 晋艳伟 +1 位作者 赵胜辉 匡镜明 《Journal of Beijing Institute of Technology》 EI CAS 2004年第S1期1-6,共6页
A Bark-band residual noise model integrated with the human hearing mechanism is proposed to efficiently complement sinusoidal model in parametric audio coding. The time-varying spectrum of the residual noise is retrie... A Bark-band residual noise model integrated with the human hearing mechanism is proposed to efficiently complement sinusoidal model in parametric audio coding. The time-varying spectrum of the residual noise is retrieved by Bark-scale piecewise constant magnitude estimates along with random phases. In the proposed noise model, Bark bands information is obtained by short-time FFT method and window overlap-add technique is exploited to remove boundary discontinuities. SVQ is also incorporated into parameter quantization process for the low bit-rate coding demand. Simulation results and informal listening tests show that when the sinusoidal model is combined with the Bark-band noise model, better synthesis audio quality can be achieved compared with the original sinusoidal modeling audio codec. 展开更多
关键词 parametric audio coding sinusoidal model: residual noise model Bark band equivalent rectangular band (ERB) split vector quantization (SVQ)
在线阅读 下载PDF
Improved Sinusoid Analysis and Post-Processing in Parametric Audio Coding
11
作者 周宏 陈健 《Journal of Shanghai Jiaotong university(Science)》 EI 2003年第2期163-168,共6页
This paper proposed improvements to the low bit rate parametric audio coder with sinusoid model as its kernel. Firstly, we propose a new method to effectively order and select the perceptually most important sinusoids... This paper proposed improvements to the low bit rate parametric audio coder with sinusoid model as its kernel. Firstly, we propose a new method to effectively order and select the perceptually most important sinusoids. The sinusoid which contributes most to the reduction of overall NMR is chosen. Combined with our improved parametric psychoacoustic model and advanced peak riddling techniques, the number of sinusoids required can be greatly reduced and the coding efficiency can be greatly enhanced. A lightweight version is also given to reduce the amount of computation with only little sacrifice of performance. Secondly, we propose two enhancement techniques for sinusoid synthesis: bandwidth enhancement and line enhancement. With little overhead, the effective bandwidth can be extended one more octave; the timbre tends to sound much brighter, thicker and more beautiful. 展开更多
关键词 parametric audio coding SINUSOID POST-PROCESSING
在线阅读 下载PDF
Quantization of wavelet packet audio coding
12
作者 谭建国 Zhang +1 位作者 Wenjun LiuPeilin 《High Technology Letters》 EI CAS 2006年第3期295-299,共5页
Abstract The method of quantization noise control of audio coding in the wavelet domain is proposed. Using the inverse Discrete Fourier Transform (DFT), it converts the masking threshold coming from MPEG psycho-acou... Abstract The method of quantization noise control of audio coding in the wavelet domain is proposed. Using the inverse Discrete Fourier Transform (DFT), it converts the masking threshold coming from MPEG psycho-acoustic model in the frequency domain to the signal in the time domain; the Discrete Wavelet Packet Transform (DWPF) is performed; the energy in each subband is regarded as the maximum allowed quantization noise energy. The experimental result shows that the proposed method can attain the nearly transparent audio quality below 64kbps for the most testing audio signals. 展开更多
关键词 QUANTIZATION wavelet packet audio coding DFT
在线阅读 下载PDF
MAP-based Audio Coding Compensation for Speaker Recognition
13
作者 Tao Jiang Jiqing Han 《Journal of Signal and Information Processing》 2011年第3期165-169,共5页
The performance of the speaker recognition system declines when training and testing audio codecs are mismatched. In this paper, based on analyzing the effect of mismatched audio codecs in the linear prediction cepstr... The performance of the speaker recognition system declines when training and testing audio codecs are mismatched. In this paper, based on analyzing the effect of mismatched audio codecs in the linear prediction cepstrum coefficients, a method of MAP-based audio coding compensation for speaker recognition is proposed. The proposed method firstly sets a standard codec as a reference and trains the speaker models in this codec format, then learns the deviation distributions between the standard codec format and the other ones, next gets the current bias via using a small number adaptive data and the MAP-based adaptive technique, and then adjusts the model parameters by the type of coming audio codec format and its related bias. During the test, the features of the coming speaker are used to match with the adjusted model. The experimental result shows that the accuracy reached 82.4% with just one second adaptive data, which is higher 5.5% than that in the baseline system. 展开更多
关键词 audio coding COMPENSATION SPEAKER RECOGNITION MAP-Based
在线阅读 下载PDF
MPEG-4 Audio Version2新概念
14
作者 杨杰 陈健 《电声技术》 北大核心 2000年第7期3-6,共4页
介绍了MPEG-4音频第2版的新概念,包括容错健壮性,低延迟音频编码,精细的频段分级,参数音频编码,CELP静音压缩,扩展的HVXC等。通过与第一版的比较,提出了若干改进之处。
关键词 MPEG-4 音频编码 CELP静音压缩
在线阅读 下载PDF
面向4K超高清电视广播的AVS3视频编解码技术应用
15
作者 高燕 张林 《电视技术》 2025年第7期226-228,共3页
针对提高4K超高清电视广播视频压缩效率,减少存储空间和网络带宽这一需求,提出应用音视频编码标准3(Audio Video Coding Standard 3,AVS3)视频编解码技术,利用AVS3编解码标准的高效性,提升4K超高清视频内容的编码质量和传输效率。首先,... 针对提高4K超高清电视广播视频压缩效率,减少存储空间和网络带宽这一需求,提出应用音视频编码标准3(Audio Video Coding Standard 3,AVS3)视频编解码技术,利用AVS3编解码标准的高效性,提升4K超高清视频内容的编码质量和传输效率。首先,从节目制播环节入手,通过扩展帧内角度预测和信源动态编码,在保持图像质量的同时降低编码比特率,优化编码过程,节省传输带宽;其次,通过视频分层并行解码和音频解码还放声音技术,提升接收解码及呈现的速度和质量;最后,通过应用实验验证AVS3视频编解码技术在4K超高清电视广播领域的实际效果。 展开更多
关键词 4K超高清 音视频编码标准3(AVS3) 视频编解码
在线阅读 下载PDF
MPEG AUDIO LAYER 3数字音频压缩编码原理深度分析
16
作者 王伟 《煤炭技术》 CAS 北大核心 2011年第12期200-201,共2页
MPEG AUDIO LAYER 3是目前为止开发得最为成功的数字音频压缩技术之一。从音频压缩理论的角度,阐述MPEG AUDIO LAYER 3数字音频压缩编码原理。
关键词 MPEG audio LAYER 3数字音频 压缩 编码原理
原文传递
流动电台系统实时音视频数据传输与处理技术研究
17
作者 李国 《电视技术》 2025年第9期223-225,共3页
随着移动互联网的发展,流动电台系统的实时音视频传输面临高速移动和网络不稳定等挑战。为了满足低延迟和高稳定性的传输需求,针对H.264视频编码、高级音频编码(Advanced Audio Coding,AAC)和安全可靠传输(Secure Reliable Transport,S... 随着移动互联网的发展,流动电台系统的实时音视频传输面临高速移动和网络不稳定等挑战。为了满足低延迟和高稳定性的传输需求,针对H.264视频编码、高级音频编码(Advanced Audio Coding,AAC)和安全可靠传输(Secure Reliable Transport,SRT)协议展开分析。研究表明,H.264视频编码通过采用帧间预测、运动补偿等技术,实现高压缩比与优质画面;AAC通过改进离散余弦变换(Modified Discrete Cosine Transform,MDCT)及动态码率控制,适应带宽波动并保持音质;SRT协议通过融合自动重传请求(Automatic Repeat reQuest,ARQ)与前向纠错(Forward Error Correction,FEC)机制,保障音视频低延迟稳定传输。三者协同可为流动电台系统提供关键技术支撑。 展开更多
关键词 流动电台 音视频传输 H.264 高级音频编码(AAC)
在线阅读 下载PDF
应急广播中数字音频信号快速接入相关技术应用研究
18
作者 孔庆海 《电声技术》 2025年第7期22-24,共3页
突发事件应急处置对信息传播的时效性和覆盖面提出更高要求。文章在分析应急广播音频接入需求的基础上,从优化音频数据打包与传输方案、改进音频编解码算法和数据压缩技术、简化会话发起协议(Session Initiation Protocol,SIP)呼叫建立... 突发事件应急处置对信息传播的时效性和覆盖面提出更高要求。文章在分析应急广播音频接入需求的基础上,从优化音频数据打包与传输方案、改进音频编解码算法和数据压缩技术、简化会话发起协议(Session Initiation Protocol,SIP)呼叫建立流程、优化会话描述协议(Session Description Protocol,SDP)媒体协商机制等方面,系统阐述应急广播中的数字音频快速接入技术策略。 展开更多
关键词 应急广播 数字音频 快速接入 自适应编码
在线阅读 下载PDF
提升卫星传输音频质量的编码调制策略分析
19
作者 刘毅 《电声技术》 2025年第8期163-165,共3页
在卫星传输场景下,编码调制方法与音频质量关联紧密。剖析压缩编码算法复杂度、调制方式频谱效率、信道编码增益等因素对音频传输的作用机制,提出自适应编码协同、高阶调制优化等策略。经仿真模型与实测数据验证,在多径衰落环境中,这些... 在卫星传输场景下,编码调制方法与音频质量关联紧密。剖析压缩编码算法复杂度、调制方式频谱效率、信道编码增益等因素对音频传输的作用机制,提出自适应编码协同、高阶调制优化等策略。经仿真模型与实测数据验证,在多径衰落环境中,这些策略可使音频信噪比提升6~8 dB,主观音质评分提高15%~20%,为卫星音频通信系统优化设计提供了理论与技术支撑。 展开更多
关键词 卫星传输 音频质量 编码调制 自适应算法 信道优化
在线阅读 下载PDF
基于可逆网络双嵌入和攻击层的鲁棒音频水印方法
20
作者 张旭龙 瞿晓阳 +2 位作者 李鹏程 肖春光 王健宗 《大数据》 2025年第4期87-101,共15页
数字音频水印技术旨在将信息嵌入音频,并从含水印的音频中准确提取信息。传统方法依靠专家经验设计的算法将水印嵌入信号的时域或变换域。随着深度神经网络的发展,基于深度学习的神经音频水印技术应运而生。与传统算法相比,神经音频水... 数字音频水印技术旨在将信息嵌入音频,并从含水印的音频中准确提取信息。传统方法依靠专家经验设计的算法将水印嵌入信号的时域或变换域。随着深度神经网络的发展,基于深度学习的神经音频水印技术应运而生。与传统算法相比,神经音频水印通过在训练过程中考虑各种攻击,实现了更强的鲁棒性。然而,当前的神经音频水印方法存在容量低、不可感知性差等问题。此外,水印定位问题在神经音频水印中尤为重要,却未得到充分研究。设计了一种基于可逆网络用于高效定位的双嵌入水印模型,并在鲁棒性训练中考虑了攻击层对可逆神经网络的影响,从而提高了模型的合理性和稳定性。实验表明,与现有方法相比,所提出的方法能够承受各种攻击,具有更高的容量和更有效的定位能力。 展开更多
关键词 神经音频水印 可逆神经网络 双嵌入 同步码 鲁棒性
在线阅读 下载PDF
上一页 1 2 25 下一页 到第
使用帮助 返回顶部