The Perceptual Spectrum Distortion (PSD), based on auditory properties of human being, is presented to measure speech distortion. The PSD measure calculates the speech distortion distance by simulating the auditory p...The Perceptual Spectrum Distortion (PSD), based on auditory properties of human being, is presented to measure speech distortion. The PSD measure calculates the speech distortion distance by simulating the auditory properties of human being and converting short-time speech power spectrum to auditory perceptual spectrum. Preliminary simulative experiments in comparison with the Itakura measure have been done. The results show that the PSD measure is a perferable speech distortion measure and more consistent with subjective assessment of speech quality.展开更多
In order to investigate the influence of dummy head on measuring speech intelligi- bility, the objective and subjective speech intelligibility evaluation experiments were respectively carried out for different spatial...In order to investigate the influence of dummy head on measuring speech intelligi- bility, the objective and subjective speech intelligibility evaluation experiments were respectively carried out for different spatial configurations of a target source and a noise source in the horizontal plane. The differences between standard STIPA measured without a dummy head and binaural STIPA measured with a dummy head were compared and the correlation of subjective speech intelligibility and objective STIPA was analyzed. It is showed that the position of sound source affects significantly on binaural STIPA and subjective intelligibility measured by a dummy head or measured in a real-life scenario. The standard STIPA is closer to the lower value of the two binaural STIPA values. The speech intelligibility is higher for a single ear which is on the same side with the target source or on the other side of the noise source. Binaural speech intelligibility is always the lowest when both target and noise sources are at the same place but once apart the speech intelligibility will increase sharply. It is also found that the subjective intelligibility measured by a dummy head or measured in a real-life scenario is uncorrelated with standard STIPA, but correlated highly with STIPA measured with a dummy head. The subjective intelligibility of one single ear is correlated highly with STIPA measured at the same ear, and the binaural speech intelligibility is in well agreement with the higher value of the two binaural STIPA values.展开更多
Objective:To discuss the current clinical application and usefulness,shortcomings and future directions of traditional and artificial intelligence(AI)-driven acoustic assessment techniques to detect voice dysfunction....Objective:To discuss the current clinical application and usefulness,shortcomings and future directions of traditional and artificial intelligence(AI)-driven acoustic assessment techniques to detect voice dysfunction.Data Sources:Literature review.Conclusion:AI-based acoustic voice analysis techniques have huge potential to improve the early recognition,diagnosis,and tracking of treatment success in patients with voice disorders or diseases affecting voice function.Through smartphones,wearable devices,and server-based solutions,acoustic voice assessment techniques have become widely available and may be extended to workplace and private settings.However,the transformative potential is thwarted by several limitations including a lack of(a)consistent data collection and reporting standards,leading to heterogeneity of current databases and literature;(b)characterization what acoustic analysis techniques including AI can detect or track reliably,and whether the derived outcomes serve as a reliable marker of dysfunction,pathology,or an improvement thereof;(c)clinical validation studies in unselected patients;and(d)ethical and legal controversies.Thus,substantial effort to research,define and establish guidelines for the collection,storage,and processing of acoustic data and valid clinical applications is warranted to designsensible strategies for analysis and use.展开更多
文摘The Perceptual Spectrum Distortion (PSD), based on auditory properties of human being, is presented to measure speech distortion. The PSD measure calculates the speech distortion distance by simulating the auditory properties of human being and converting short-time speech power spectrum to auditory perceptual spectrum. Preliminary simulative experiments in comparison with the Itakura measure have been done. The results show that the PSD measure is a perferable speech distortion measure and more consistent with subjective assessment of speech quality.
基金supported by the National Nature Science Foundation of China(11204278)
文摘In order to investigate the influence of dummy head on measuring speech intelligi- bility, the objective and subjective speech intelligibility evaluation experiments were respectively carried out for different spatial configurations of a target source and a noise source in the horizontal plane. The differences between standard STIPA measured without a dummy head and binaural STIPA measured with a dummy head were compared and the correlation of subjective speech intelligibility and objective STIPA was analyzed. It is showed that the position of sound source affects significantly on binaural STIPA and subjective intelligibility measured by a dummy head or measured in a real-life scenario. The standard STIPA is closer to the lower value of the two binaural STIPA values. The speech intelligibility is higher for a single ear which is on the same side with the target source or on the other side of the noise source. Binaural speech intelligibility is always the lowest when both target and noise sources are at the same place but once apart the speech intelligibility will increase sharply. It is also found that the subjective intelligibility measured by a dummy head or measured in a real-life scenario is uncorrelated with standard STIPA, but correlated highly with STIPA measured with a dummy head. The subjective intelligibility of one single ear is correlated highly with STIPA measured at the same ear, and the binaural speech intelligibility is in well agreement with the higher value of the two binaural STIPA values.
文摘Objective:To discuss the current clinical application and usefulness,shortcomings and future directions of traditional and artificial intelligence(AI)-driven acoustic assessment techniques to detect voice dysfunction.Data Sources:Literature review.Conclusion:AI-based acoustic voice analysis techniques have huge potential to improve the early recognition,diagnosis,and tracking of treatment success in patients with voice disorders or diseases affecting voice function.Through smartphones,wearable devices,and server-based solutions,acoustic voice assessment techniques have become widely available and may be extended to workplace and private settings.However,the transformative potential is thwarted by several limitations including a lack of(a)consistent data collection and reporting standards,leading to heterogeneity of current databases and literature;(b)characterization what acoustic analysis techniques including AI can detect or track reliably,and whether the derived outcomes serve as a reliable marker of dysfunction,pathology,or an improvement thereof;(c)clinical validation studies in unselected patients;and(d)ethical and legal controversies.Thus,substantial effort to research,define and establish guidelines for the collection,storage,and processing of acoustic data and valid clinical applications is warranted to designsensible strategies for analysis and use.