The proposed mass model of vocal fold vibration holds a significant importance in the auxiliary diagnosis and treatment of human vocal fold disorders.Mathematical models are proposed in aerodynamics and acoustics to s...The proposed mass model of vocal fold vibration holds a significant importance in the auxiliary diagnosis and treatment of human vocal fold disorders.Mathematical models are proposed in aerodynamics and acoustics to simulate vocal fold vibration during phonation.This has always been a hot topic in pathological linguistics research.Over the past few decades,researchers have designed various types of mass models of vocal fold vibration based on experiments.These models differ in principles,computational complexity,and degrees of freedom.Therefore,we classify and describe the mass models according to modeling methods.We summarize the research status and characteristics of different models,and based on this,we look forward to future research directions for vocal fold mass models.展开更多
Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing de...Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing details about the speaker’s goals and desires, as well as their internal condition. Certain vocal characteristics reveal the speaker’s mood, intention, and motivation, while word study assists the speaker’s demand to be understood. Voice emotion recognition has become an essential component of modern HCC networks. Integrating findings from the various disciplines involved in identifying vocal emotions is also challenging. Many sound analysis techniques were developed in the past. Learning about the development of artificial intelligence (AI), and especially Deep Learning (DL) technology, research incorporating real data is becoming increasingly common these days. Thus, this research presents a novel selfish herd optimization-tuned long/short-term memory (SHO-LSTM) strategy to identify vocal emotions in human communication. The RAVDESS public dataset is used to train the suggested SHO-LSTM technique. Mel-frequency cepstral coefficient (MFCC) and wiener filter (WF) techniques are used, respectively, to remove noise and extract features from the data. LSTM and SHO are applied to the extracted data to optimize the LSTM network’s parameters for effective emotion recognition. Python Software was used to execute our proposed framework. In the finding assessment phase, Numerous metrics are used to evaluate the proposed model’s detection capability, Such as F1-score (95%), precision (95%), recall (96%), and accuracy (97%). The suggested approach is tested on a Python platform, and the SHO-LSTM’s outcomes are contrasted with those of other previously conducted research. Based on comparative assessments, our suggested approach outperforms the current approaches in vocal emotion recognition.展开更多
Variation in the vocal behavior of nonhuman vertebrates includes graded transitions and more dramatic changes.Wapiti males produce a reproductive bugle that has a fundamental frequency that surpasses 2,000 Hz with evi...Variation in the vocal behavior of nonhuman vertebrates includes graded transitions and more dramatic changes.Wapiti males produce a reproductive bugle that has a fundamental frequency that surpasses 2,000 Hz with evidence of biphonation and other nonlinear phenomena.Here,we analyze the acoustic structure of captive wapiti vocalizations to compare the male bugle with 3 categories of distress vocalizations:neonate distress(capture)calls,calf isolation calls,and adult female isolation calls.These 4 high-arousal call categories serve a common general function in recruiting conspecifics but occur in different behavioral contexts(capture,isolation,reproduction).Our goal was to distinguish characteristics that vary in graded steps that may correspond to an animal’s age or size from characteristics that are unique to the bugle.Characteristics of the high and loud fundamental(G0)varied in an age/size-graded manner with a decrease in minimum G0,an increase in the maximum and range of G0,with no evidence of sex differences.The nonlinear phenomena of deterministic chaos,biphonation,and frequency jumps were present in all 4 call categories and became more common from the distress vocalizations of neonates to calves to adult females to the male bugle.Two temporal characteristics sharply distinguished the bugle from the 3 categories of distress vocalizations:these included a prolonged call duration and a maximum G0 that occurred much later in the call for the bugle than for distress vocalizations.Our results suggest that distress vocalizations of different age groups and the reproductive bugle of wapiti share a high G0,with age/size-graded changes in G0 and nonlinear phenomena,but differ sharply in temporal characteristics.展开更多
With the development of new media technology and the popularity of the TikTok platform in China,a large number of popular vocal music teachers have flocked to online platforms for teaching.Online vocal music education...With the development of new media technology and the popularity of the TikTok platform in China,a large number of popular vocal music teachers have flocked to online platforms for teaching.Online vocal music education in China is undergoing a transformation and facing challenges.This study adopts an exploratory research approach,interviewing students learning pop vocal music,and observing popular pop teachers on TikTok.The advantages,disadvantages,techniques,and methods of domestic TikTok pop vocal music teaching were investigated and studied,and a series of experiences and suggestions for optimizing TikTok teaching were put forward.The results of this study are helpful for understanding the advantages and disadvantages of TikTok pop vocal music teaching and grasping the correct development direction.These guidance and suggestions can stimulate teachers’creativity and improve their vocal music teaching level.展开更多
目的:探讨三维超声VOCAL技术评估胰岛素抵抗不孕患者子宫内膜容受性的临床价值。方法:纳入2023年2月至2024年6月我院收治的胰岛素抵抗不孕患者50例(研究组)及同期正常孕龄妇女50例(对照组),均行三维超声VOCAL检查,比较两组内膜容积、VI/...目的:探讨三维超声VOCAL技术评估胰岛素抵抗不孕患者子宫内膜容受性的临床价值。方法:纳入2023年2月至2024年6月我院收治的胰岛素抵抗不孕患者50例(研究组)及同期正常孕龄妇女50例(对照组),均行三维超声VOCAL检查,比较两组内膜容积、VI/FI/VFI参数及血清SHBG、HOMA-IR等指标。结果:研究组内膜容积([2.45±0.31)cm^(3)]、VI(7.74±1.36)、FI(20.46±3.67)、VFI(158.14±10.20)均显著低于对照组(P<0.05),血清SHBG([45.69±5.78)nmol/L]、HDL-C([1.36±0.35)mmol/L]水平亦低于对照组(P<0.05)。研究组血清T([8.58±2.58)ng/mL]、TG([2.11±0.36)mmol/L]、FPG([7.69±1.13)mmol/L]、2 h PG([10.24±1.69)mmol/L]、FINS([27.41±3.16)μIU/mL]、HOMA-IR(2.24±0.35)水平高于对照组(P<0.05)。Pearson相关性分析显示,内膜容积与血清SHBG(r=0.536,P<0.001)、HDL-C(r=0.574,P<0.001)水平呈正相关,与血清T(r=-0.584,P<0.001)、TG(r=-0.496,P<0.001)、FPG(r=-0.547,P<0.001)、2 h PG(r=-0.558,P<0.001)、FINS(r=-0.569,P<0.001)、HOMA-IR(r=-0.585,P<0.001)均呈负相关。VI与血清SHBG(r=0.587,P<0.001)、HDL-C(r=0.604,P<0.001)水平呈正相关,与血清T(r=-0.512,P<0.001)、TG(r=-0.536,P<0.001)、FPG(r=-0.523,P<0.001)、2 h PG(r=-0.514,P<0.001)、FINS(r=-0.525,P<0.001)、HOMA-IR(r=-0.496,P<0.001)均呈负相关。FI与血清SHBG(r=0.601,P<0.001)、HDL-C(r=0.551,P<0.001)水平呈正相关,与血清T(r=-0.555,P<0.001)、TG(r=-0.587,P<0.001)、FPG(r=-0.546,P<0.001)、2 h PG(r=-0.567,P<0.001)、FINS(r=-0.516,P<0.001)、HOMA-IR(r=-0.478,P<0.001)均呈负相关。VFI与血清SHBG(r=0.598,P<0.001)、HDL-C(r=0.527,P<0.001)水平呈正相关,与血清T(r=-0.511,P<0.001)、TG(r=-0.571,P<0.001)、FPG(r=-0.532,P<0.001)、2 h PG(r=-0.510,P<0.001)、FINS(r=-0.536,P<0.001)、HOMA-IR(r=-0.546,P<0.001)均呈负相关。结论:三维超声VOCAL技术可有效评估胰岛素抵抗不孕患者的子宫内膜容受性,为个体化治疗提供影像学依据。展开更多
With the rapid development of information technology and the advancement of educational modernization,the teaching mode of vocal music in colleges and universities is undergoing a new transformation,which complies wit...With the rapid development of information technology and the advancement of educational modernization,the teaching mode of vocal music in colleges and universities is undergoing a new transformation,which complies with the trend of digital age and brings new challenges.This paper explores the specific implementation path of artificial intelligence technology,virtual reality technology,big data technology and intelligent interaction technology in vocal music teaching in colleges and universities,aiming to inject new vitality into the traditional teaching mode and improve teaching quality and efficiency.展开更多
KHOOMEI is a traditional throat-singing art created by the Mongolian ethnic group and is regarded as one of the oldest forms of this low rumbling vocal technique in the world.In 2006,Khoomei was included in China’s f...KHOOMEI is a traditional throat-singing art created by the Mongolian ethnic group and is regarded as one of the oldest forms of this low rumbling vocal technique in the world.In 2006,Khoomei was included in China’s first national list of intangible cultural heritage items,followed in 2009 by its inclusion on UNESCO’s Representative List of the Intangible Cultural Heritage of Humanity.展开更多
Bird vocalizations are pivotal for ecological monitoring,providing insights into biodiversity and ecosystem health.Traditional recognition methods often neglect phase information,resulting in incomplete feature repres...Bird vocalizations are pivotal for ecological monitoring,providing insights into biodiversity and ecosystem health.Traditional recognition methods often neglect phase information,resulting in incomplete feature representation.In this paper,we introduce a novel approach to bird vocalization recognition(BVR)that integrates both amplitude and phase information,leading to enhanced species identification.We propose MHARes Net,a deep learning(DL)model that employs residual blocks and a multi-head attention mechanism to capture salient features from logarithmic power(POW),Instantaneous Frequency(IF),and Group Delay(GD)extracted from bird vocalizations.Experiments on three bird vocalization datasets demonstrate our method's superior performance,achieving accuracy rates of 94%,98.9%,and 87.1%respectively.These results indicate that our approach provides a more effective representation of bird vocalizations,outperforming existing methods.This integration of phase information in BVR is innovative and significantly advances the field of automatic bird monitoring technology,offering valuable tools for ecological research and conservation efforts.展开更多
The aims of this research were (1) to provide a description of spurfowl Pternistis spp. calls and their social context;(2) to describe the divergence of advertisement calls;and (3) to appropri-ate 23 spurfowl species ...The aims of this research were (1) to provide a description of spurfowl Pternistis spp. calls and their social context;(2) to describe the divergence of advertisement calls;and (3) to appropri-ate 23 spurfowl species to homologous sound groups which have been synthesized with recognized monophyletic groups within Pternistis spurfowls. Sound group partitioning was primarily based on male advertisement calls. A total of 218 recordings (rendering^300 identifiable calls) were analyzed covering 22 out of 23 spurfowl species in Africa. One species was assessed from written accounts. The repertoire size per spurfowl varies between 7 and 11 calls. Spurfowl calls were arranged into three broad categories including (1) advertisement calls;(2) maintenance calls including distress calls, juve-nile whining (“mews”), cheeps and comfort calls;and (3) male-female and female-offspring bonding calls. Spurfowl species were set out in eight sound groups of which five were more or less congruent with the monophyletic groups of Hall (1963), but sound groups produced more partitioning as Hall described only five groups relevant to Pternistis spp. The divergence of advertisement calls appar-ently minimizes hybridization between sympatric species but the“genetic distance”between spurfowl species is relatively small causing hybridization among spurfowl species. Despite the vocalizations of Hartlaub’s Spurfowl (P. hartlaubi) differing significantly from the rest of the spurfowls, sound analy-ses suggest that it remains within Pternistis.展开更多
Signals within animals’vocal communication are considered functional referential and context-specific.Even in the absence of the context,receivers are expected to acquire the information of calls and respond specific...Signals within animals’vocal communication are considered functional referential and context-specific.Even in the absence of the context,receivers are expected to acquire the information of calls and respond specificallyWhereas the framework was supported by plenty of evidence,its exhaustivity in describing all animal vocalisations has been questioned.Here,we investigated the vocal repertoire of a cooperatively breeding species,Azure-winged Magpie(Cyanopica cyanus),to present evidence for referential signals.The results showed that Azure-winged Magpies had a relatively large vocal repertoire,consisting of twelve distinct calls.These calls were associated with the context including movement,begging for food,contact,vigilance against predators,etc.However,even the predator-specific alarm calls would induce various responses of receivers.This implies that multiple pieces of information are involved in the vocalisation,which could be utilised by the receiver to select an appropriate response based on the surroundings.Our study gives a detailed description of the context and function of the vocal repertoire in Azure-winged Magpies,laying the foundation for further investigation on the developmental mechanisms of bird vocalisations.This study also suggests that the referential signals of animal vocalisations may not be limited to the context-specific responses of receivers and need to be discussed from a broader perspective.展开更多
A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of di...A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of different pathological factors of vocal cords on model parameters are studied. Some typical spectrum distribution of the simulated speech signals are given. Moreover, hoarse speech signals of some typical cases are analyzed by the methods of digital signal processing, including FFT, LPC, Cepstrum technique, Pseudocolor encoding, etc. The experiment results show that the three mass model analysis of vocal cords is an efficient method for analysis of hoarse speech signals.展开更多
Ultrasonic communication in vertebrates is attracting increasing research interest.To determine if ultrasonic vocalization is common in birds,we recorded their vocalizations with ultrasound detectors in the Dongzhai N...Ultrasonic communication in vertebrates is attracting increasing research interest.To determine if ultrasonic vocalization is common in birds,we recorded their vocalizations with ultrasound detectors in the Dongzhai National Nature Reserve of Henan Province,China.We found varying degrees of high frequency components in the vocalizations of 14 species and in several of these species,the frequency of harmonics was up to the range of ultrasound.We suggest that more studies are required to determine whether the high frequency components in avian vocalizations have functions and what these functions are.In addition,the ability of birds to hear sounds in the high frequency range also requires re-examination.展开更多
Background:In the last decade,enigmatic male-like cuckoo calls have been reported several times in East Asia.These calls exhibited a combination of vocal traits of both Oriental Cuckoo(Cuculus optatus)and Common Cucko...Background:In the last decade,enigmatic male-like cuckoo calls have been reported several times in East Asia.These calls exhibited a combination of vocal traits of both Oriental Cuckoo(Cuculus optatus)and Common Cuckoo(Cuculus canorus)advertising calls,and some authors therefore suggested that the enigmatic calls were produced by either Common×Oriental Cuckoo male hybrids or Common Cuckoo males having a gene mutation.However,the exact identity of calling birds are still unknown.Methods:We recorded previously unknown male-like calls from three captive Oriental Cuckoo females,and compared these calls with enigmatic vocalizations recorded in the wild as well as with advertising vocalizations of Common and Oriental Cuckoo males.To achieve this,we measured calls automatically.Besides,we video-recorded captive female emitting male-like calls,and compared these recordings with the YouTube recordings of calling males of both Common and Oriental Cuckoos to get insight into the mechanism of call production.Results:The analysis showed that female male-like calls recorded in captivity were similar to enigmatic calls recorded in the wild.Therefore,Oriental Cuckoo females might produce the latter calls.Two features of these female calls appeared to be unusual among birds.First,females produced male-like calls at the time of spring and autumn migratory activity and on migration in the wild.Because of this,functional significance of this call remained puzzling.Secondly,the male-like female call unexpectedly combined features of both closed-mouth(closed beak and simultaneous inflation of the‘throat sac')and open-mouth(prominent harmonic spectrum and the maximum neck extension observed at the beginning of a sound)vocal behaviors.Conclusions:The Cuculus vocalizations outside the reproductive season remain poorly understood.Here,we found for the first time that Oriental Cuckoo females can produce male-like calls in that time.Because of its rarity,this call might be an atavism.Indeed,female male-like vocalizations are still known in non-parasitic tropical and apparently more basal cuckoos only.Therefore,our findings may shed light on the evolution of vocal communication in avian brood parasites.展开更多
With the continuous development of social economy in our country,communication between international culture,artistic and other aspects is also increasingly frequent.The improvement of material life inevitably bring t...With the continuous development of social economy in our country,communication between international culture,artistic and other aspects is also increasingly frequent.The improvement of material life inevitably bring the change of mental life.As a cross discipline between musical aesthetics and artistic aesthetics,vocal music aesthetics has gradually become the core content of vocal music art,and has been paid more and more attention by experts and scholars,the research on vocal music aesthetics has become an important subject in the ? eld of theoretical research.By discussing the value of vocal music aesthetics in vocal music art,this paper further explores the connotation and function of vocal music aesthetics,in order to make a contribution to the prosperity and development of vocal music art of our country.展开更多
基金the Shanghai Educational Sciences Research Program(No.C2021016)。
文摘The proposed mass model of vocal fold vibration holds a significant importance in the auxiliary diagnosis and treatment of human vocal fold disorders.Mathematical models are proposed in aerodynamics and acoustics to simulate vocal fold vibration during phonation.This has always been a hot topic in pathological linguistics research.Over the past few decades,researchers have designed various types of mass models of vocal fold vibration based on experiments.These models differ in principles,computational complexity,and degrees of freedom.Therefore,we classify and describe the mass models according to modeling methods.We summarize the research status and characteristics of different models,and based on this,we look forward to future research directions for vocal fold mass models.
基金The author Dr.Arshiya S.Ansari extends the appreciation to the Deanship of Postgraduate Studies and Scientific Research at Majmaah University for funding this research work through the project number(R-2025-1538).
文摘Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing details about the speaker’s goals and desires, as well as their internal condition. Certain vocal characteristics reveal the speaker’s mood, intention, and motivation, while word study assists the speaker’s demand to be understood. Voice emotion recognition has become an essential component of modern HCC networks. Integrating findings from the various disciplines involved in identifying vocal emotions is also challenging. Many sound analysis techniques were developed in the past. Learning about the development of artificial intelligence (AI), and especially Deep Learning (DL) technology, research incorporating real data is becoming increasingly common these days. Thus, this research presents a novel selfish herd optimization-tuned long/short-term memory (SHO-LSTM) strategy to identify vocal emotions in human communication. The RAVDESS public dataset is used to train the suggested SHO-LSTM technique. Mel-frequency cepstral coefficient (MFCC) and wiener filter (WF) techniques are used, respectively, to remove noise and extract features from the data. LSTM and SHO are applied to the extracted data to optimize the LSTM network’s parameters for effective emotion recognition. Python Software was used to execute our proposed framework. In the finding assessment phase, Numerous metrics are used to evaluate the proposed model’s detection capability, Such as F1-score (95%), precision (95%), recall (96%), and accuracy (97%). The suggested approach is tested on a Python platform, and the SHO-LSTM’s outcomes are contrasted with those of other previously conducted research. Based on comparative assessments, our suggested approach outperforms the current approaches in vocal emotion recognition.
基金The University of Winnipeg and The University of Winnipeg Foundation contributed funding to this research.
文摘Variation in the vocal behavior of nonhuman vertebrates includes graded transitions and more dramatic changes.Wapiti males produce a reproductive bugle that has a fundamental frequency that surpasses 2,000 Hz with evidence of biphonation and other nonlinear phenomena.Here,we analyze the acoustic structure of captive wapiti vocalizations to compare the male bugle with 3 categories of distress vocalizations:neonate distress(capture)calls,calf isolation calls,and adult female isolation calls.These 4 high-arousal call categories serve a common general function in recruiting conspecifics but occur in different behavioral contexts(capture,isolation,reproduction).Our goal was to distinguish characteristics that vary in graded steps that may correspond to an animal’s age or size from characteristics that are unique to the bugle.Characteristics of the high and loud fundamental(G0)varied in an age/size-graded manner with a decrease in minimum G0,an increase in the maximum and range of G0,with no evidence of sex differences.The nonlinear phenomena of deterministic chaos,biphonation,and frequency jumps were present in all 4 call categories and became more common from the distress vocalizations of neonates to calves to adult females to the male bugle.Two temporal characteristics sharply distinguished the bugle from the 3 categories of distress vocalizations:these included a prolonged call duration and a maximum G0 that occurred much later in the call for the bugle than for distress vocalizations.Our results suggest that distress vocalizations of different age groups and the reproductive bugle of wapiti share a high G0,with age/size-graded changes in G0 and nonlinear phenomena,but differ sharply in temporal characteristics.
文摘With the development of new media technology and the popularity of the TikTok platform in China,a large number of popular vocal music teachers have flocked to online platforms for teaching.Online vocal music education in China is undergoing a transformation and facing challenges.This study adopts an exploratory research approach,interviewing students learning pop vocal music,and observing popular pop teachers on TikTok.The advantages,disadvantages,techniques,and methods of domestic TikTok pop vocal music teaching were investigated and studied,and a series of experiences and suggestions for optimizing TikTok teaching were put forward.The results of this study are helpful for understanding the advantages and disadvantages of TikTok pop vocal music teaching and grasping the correct development direction.These guidance and suggestions can stimulate teachers’creativity and improve their vocal music teaching level.
文摘目的:探讨三维超声VOCAL技术评估胰岛素抵抗不孕患者子宫内膜容受性的临床价值。方法:纳入2023年2月至2024年6月我院收治的胰岛素抵抗不孕患者50例(研究组)及同期正常孕龄妇女50例(对照组),均行三维超声VOCAL检查,比较两组内膜容积、VI/FI/VFI参数及血清SHBG、HOMA-IR等指标。结果:研究组内膜容积([2.45±0.31)cm^(3)]、VI(7.74±1.36)、FI(20.46±3.67)、VFI(158.14±10.20)均显著低于对照组(P<0.05),血清SHBG([45.69±5.78)nmol/L]、HDL-C([1.36±0.35)mmol/L]水平亦低于对照组(P<0.05)。研究组血清T([8.58±2.58)ng/mL]、TG([2.11±0.36)mmol/L]、FPG([7.69±1.13)mmol/L]、2 h PG([10.24±1.69)mmol/L]、FINS([27.41±3.16)μIU/mL]、HOMA-IR(2.24±0.35)水平高于对照组(P<0.05)。Pearson相关性分析显示,内膜容积与血清SHBG(r=0.536,P<0.001)、HDL-C(r=0.574,P<0.001)水平呈正相关,与血清T(r=-0.584,P<0.001)、TG(r=-0.496,P<0.001)、FPG(r=-0.547,P<0.001)、2 h PG(r=-0.558,P<0.001)、FINS(r=-0.569,P<0.001)、HOMA-IR(r=-0.585,P<0.001)均呈负相关。VI与血清SHBG(r=0.587,P<0.001)、HDL-C(r=0.604,P<0.001)水平呈正相关,与血清T(r=-0.512,P<0.001)、TG(r=-0.536,P<0.001)、FPG(r=-0.523,P<0.001)、2 h PG(r=-0.514,P<0.001)、FINS(r=-0.525,P<0.001)、HOMA-IR(r=-0.496,P<0.001)均呈负相关。FI与血清SHBG(r=0.601,P<0.001)、HDL-C(r=0.551,P<0.001)水平呈正相关,与血清T(r=-0.555,P<0.001)、TG(r=-0.587,P<0.001)、FPG(r=-0.546,P<0.001)、2 h PG(r=-0.567,P<0.001)、FINS(r=-0.516,P<0.001)、HOMA-IR(r=-0.478,P<0.001)均呈负相关。VFI与血清SHBG(r=0.598,P<0.001)、HDL-C(r=0.527,P<0.001)水平呈正相关,与血清T(r=-0.511,P<0.001)、TG(r=-0.571,P<0.001)、FPG(r=-0.532,P<0.001)、2 h PG(r=-0.510,P<0.001)、FINS(r=-0.536,P<0.001)、HOMA-IR(r=-0.546,P<0.001)均呈负相关。结论:三维超声VOCAL技术可有效评估胰岛素抵抗不孕患者的子宫内膜容受性,为个体化治疗提供影像学依据。
基金Education Department of Hainan Province(Project No.:Hnjg2024-112&Hnjg2025ZC-80)。
文摘With the rapid development of information technology and the advancement of educational modernization,the teaching mode of vocal music in colleges and universities is undergoing a new transformation,which complies with the trend of digital age and brings new challenges.This paper explores the specific implementation path of artificial intelligence technology,virtual reality technology,big data technology and intelligent interaction technology in vocal music teaching in colleges and universities,aiming to inject new vitality into the traditional teaching mode and improve teaching quality and efficiency.
文摘KHOOMEI is a traditional throat-singing art created by the Mongolian ethnic group and is regarded as one of the oldest forms of this low rumbling vocal technique in the world.In 2006,Khoomei was included in China’s first national list of intangible cultural heritage items,followed in 2009 by its inclusion on UNESCO’s Representative List of the Intangible Cultural Heritage of Humanity.
基金supported by the Beijing Natural Science Foundation (5252014)the National Natural Science Foundation of China (62303063)。
文摘Bird vocalizations are pivotal for ecological monitoring,providing insights into biodiversity and ecosystem health.Traditional recognition methods often neglect phase information,resulting in incomplete feature representation.In this paper,we introduce a novel approach to bird vocalization recognition(BVR)that integrates both amplitude and phase information,leading to enhanced species identification.We propose MHARes Net,a deep learning(DL)model that employs residual blocks and a multi-head attention mechanism to capture salient features from logarithmic power(POW),Instantaneous Frequency(IF),and Group Delay(GD)extracted from bird vocalizations.Experiments on three bird vocalization datasets demonstrate our method's superior performance,achieving accuracy rates of 94%,98.9%,and 87.1%respectively.These results indicate that our approach provides a more effective representation of bird vocalizations,outperforming existing methods.This integration of phase information in BVR is innovative and significantly advances the field of automatic bird monitoring technology,offering valuable tools for ecological research and conservation efforts.
文摘The aims of this research were (1) to provide a description of spurfowl Pternistis spp. calls and their social context;(2) to describe the divergence of advertisement calls;and (3) to appropri-ate 23 spurfowl species to homologous sound groups which have been synthesized with recognized monophyletic groups within Pternistis spurfowls. Sound group partitioning was primarily based on male advertisement calls. A total of 218 recordings (rendering^300 identifiable calls) were analyzed covering 22 out of 23 spurfowl species in Africa. One species was assessed from written accounts. The repertoire size per spurfowl varies between 7 and 11 calls. Spurfowl calls were arranged into three broad categories including (1) advertisement calls;(2) maintenance calls including distress calls, juve-nile whining (“mews”), cheeps and comfort calls;and (3) male-female and female-offspring bonding calls. Spurfowl species were set out in eight sound groups of which five were more or less congruent with the monophyletic groups of Hall (1963), but sound groups produced more partitioning as Hall described only five groups relevant to Pternistis spp. The divergence of advertisement calls appar-ently minimizes hybridization between sympatric species but the“genetic distance”between spurfowl species is relatively small causing hybridization among spurfowl species. Despite the vocalizations of Hartlaub’s Spurfowl (P. hartlaubi) differing significantly from the rest of the spurfowls, sound analy-ses suggest that it remains within Pternistis.
基金supported by the National Key Research and Development Program of China(2022YFC3202104)Natural Science Foundation of Jiangsu Province,China(BK20211151)。
文摘Signals within animals’vocal communication are considered functional referential and context-specific.Even in the absence of the context,receivers are expected to acquire the information of calls and respond specificallyWhereas the framework was supported by plenty of evidence,its exhaustivity in describing all animal vocalisations has been questioned.Here,we investigated the vocal repertoire of a cooperatively breeding species,Azure-winged Magpie(Cyanopica cyanus),to present evidence for referential signals.The results showed that Azure-winged Magpies had a relatively large vocal repertoire,consisting of twelve distinct calls.These calls were associated with the context including movement,begging for food,contact,vigilance against predators,etc.However,even the predator-specific alarm calls would induce various responses of receivers.This implies that multiple pieces of information are involved in the vocalisation,which could be utilised by the receiver to select an appropriate response based on the surroundings.Our study gives a detailed description of the context and function of the vocal repertoire in Azure-winged Magpies,laying the foundation for further investigation on the developmental mechanisms of bird vocalisations.This study also suggests that the referential signals of animal vocalisations may not be limited to the context-specific responses of receivers and need to be discussed from a broader perspective.
文摘A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of different pathological factors of vocal cords on model parameters are studied. Some typical spectrum distribution of the simulated speech signals are given. Moreover, hoarse speech signals of some typical cases are analyzed by the methods of digital signal processing, including FFT, LPC, Cepstrum technique, Pseudocolor encoding, etc. The experiment results show that the three mass model analysis of vocal cords is an efficient method for analysis of hoarse speech signals.
基金supported by the National Basic Research Program of China(No.2007CB411606)
文摘Ultrasonic communication in vertebrates is attracting increasing research interest.To determine if ultrasonic vocalization is common in birds,we recorded their vocalizations with ultrasound detectors in the Dongzhai National Nature Reserve of Henan Province,China.We found varying degrees of high frequency components in the vocalizations of 14 species and in several of these species,the frequency of harmonics was up to the range of ultrasound.We suggest that more studies are required to determine whether the high frequency components in avian vocalizations have functions and what these functions are.In addition,the ability of birds to hear sounds in the high frequency range also requires re-examination.
基金performed within the frameworks of state contract with the Institute of Plant and Animal Ecology,Ural Branch,Russian Academy of Sciences(project number 18-9-4-22)a part of Program of the Russian Academy of Sciences 2013–2020,No.AAAA-A18-118042690110-1[0109-2019-0003]‘Ecological and evolutionary aspects of animal behavior and communication’supported by the Russian Science Foundation(grant number 20-14-00058)。
文摘Background:In the last decade,enigmatic male-like cuckoo calls have been reported several times in East Asia.These calls exhibited a combination of vocal traits of both Oriental Cuckoo(Cuculus optatus)and Common Cuckoo(Cuculus canorus)advertising calls,and some authors therefore suggested that the enigmatic calls were produced by either Common×Oriental Cuckoo male hybrids or Common Cuckoo males having a gene mutation.However,the exact identity of calling birds are still unknown.Methods:We recorded previously unknown male-like calls from three captive Oriental Cuckoo females,and compared these calls with enigmatic vocalizations recorded in the wild as well as with advertising vocalizations of Common and Oriental Cuckoo males.To achieve this,we measured calls automatically.Besides,we video-recorded captive female emitting male-like calls,and compared these recordings with the YouTube recordings of calling males of both Common and Oriental Cuckoos to get insight into the mechanism of call production.Results:The analysis showed that female male-like calls recorded in captivity were similar to enigmatic calls recorded in the wild.Therefore,Oriental Cuckoo females might produce the latter calls.Two features of these female calls appeared to be unusual among birds.First,females produced male-like calls at the time of spring and autumn migratory activity and on migration in the wild.Because of this,functional significance of this call remained puzzling.Secondly,the male-like female call unexpectedly combined features of both closed-mouth(closed beak and simultaneous inflation of the‘throat sac')and open-mouth(prominent harmonic spectrum and the maximum neck extension observed at the beginning of a sound)vocal behaviors.Conclusions:The Cuculus vocalizations outside the reproductive season remain poorly understood.Here,we found for the first time that Oriental Cuckoo females can produce male-like calls in that time.Because of its rarity,this call might be an atavism.Indeed,female male-like vocalizations are still known in non-parasitic tropical and apparently more basal cuckoos only.Therefore,our findings may shed light on the evolution of vocal communication in avian brood parasites.
文摘With the continuous development of social economy in our country,communication between international culture,artistic and other aspects is also increasingly frequent.The improvement of material life inevitably bring the change of mental life.As a cross discipline between musical aesthetics and artistic aesthetics,vocal music aesthetics has gradually become the core content of vocal music art,and has been paid more and more attention by experts and scholars,the research on vocal music aesthetics has become an important subject in the ? eld of theoretical research.By discussing the value of vocal music aesthetics in vocal music art,this paper further explores the connotation and function of vocal music aesthetics,in order to make a contribution to the prosperity and development of vocal music art of our country.