The Head-Related Transfer Function(HRTF)describes the effects of sound reflection and scattering caused by the environment and the human body when sound signals are transmitted from a source to the human ear.It contai...The Head-Related Transfer Function(HRTF)describes the effects of sound reflection and scattering caused by the environment and the human body when sound signals are transmitted from a source to the human ear.It contains a significant amount of auditory cue information used for sound localization.Consequently,HRTF renders 3D audio accurately in numerous immersive multimedia applications.Because HRTF is high-dimensional,complex,and nonlinear,it is a relatively large and intricate dataset,typically consisting of hundreds of thousands of samples.Storing HRTF requires a significant amount of storage space in practical applications.Based on this,high-dimensional,complex,and nonlinear HRTFs need to be compressed and reconstructed.In this study,inspired by the conventional common-pole/zero model,we propose a method for representing HRTF based on the common-pole/zero model and principal component analysis(PCA).Our method utilizes human auditory features and extends the traditional Common-Acoustical-Pole/Zero(CAPZ)method to estimate the common pole and zero coefficients across multiple subjects.Subsequently,the zero coefficients are compressed using the PCA procedure.Experimental results on the CIPIC database show that the compression ratio can reach 9.5%when the average spectral distortion is less than 2 dB.展开更多
文摘The Head-Related Transfer Function(HRTF)describes the effects of sound reflection and scattering caused by the environment and the human body when sound signals are transmitted from a source to the human ear.It contains a significant amount of auditory cue information used for sound localization.Consequently,HRTF renders 3D audio accurately in numerous immersive multimedia applications.Because HRTF is high-dimensional,complex,and nonlinear,it is a relatively large and intricate dataset,typically consisting of hundreds of thousands of samples.Storing HRTF requires a significant amount of storage space in practical applications.Based on this,high-dimensional,complex,and nonlinear HRTFs need to be compressed and reconstructed.In this study,inspired by the conventional common-pole/zero model,we propose a method for representing HRTF based on the common-pole/zero model and principal component analysis(PCA).Our method utilizes human auditory features and extends the traditional Common-Acoustical-Pole/Zero(CAPZ)method to estimate the common pole and zero coefficients across multiple subjects.Subsequently,the zero coefficients are compressed using the PCA procedure.Experimental results on the CIPIC database show that the compression ratio can reach 9.5%when the average spectral distortion is less than 2 dB.