4 articles found
Foundation models for digital mental health: igniting the dawn
1
Authors: Kun Qian, Haojie Zhang, Xin Jing, Bin Hu, Yoshiharu Yamamoto, Bjorn W. Schuller. Source: Medicine Plus, 2025, Issue 2, pp. 19-22 (4 pages)
In the past decade, information technologies (e.g., artificial intelligence (AI), big data, wearables) have deeply influenced the field of mental health. As a typical breaking-through idea, computational psychophysiology (CPP) has changed the paradigm of mental healthcare from traditional “symptom description-driven” to “data-driven”.
Keywords: mental health; mental healthcare; symptom description-driven; digital mental health; wearables; big data; artificial intelligence; computational psychophysiology
Audio Enhancement for Computer Audition—An Iterative Training Paradigm Using Sample Importance (cited by: 1)
2
Authors: Manuel Milling, Shuo Liu, Andreas Triantafyllopoulos, Ilhan Aslan, Björn W. Schuller. Source: Journal of Computer Science & Technology (SCIE, EI, CSCD), 2024, Issue 4, pp. 895-911 (17 pages)
Neural network models for audio tasks, such as automatic speech recognition (ASR) and acoustic scene classification (ASC), are susceptible to noise contamination for real-life applications. To improve audio quality, an enhancement module, which can be developed independently, is explicitly used at the front-end of the target audio applications. In this paper, we present an end-to-end learning solution to jointly optimise the models for audio enhancement (AE) and the subsequent applications. To guide the optimisation of the AE module towards a target application, and especially to overcome difficult samples, we make use of the sample-wise performance measure as an indication of sample importance. In experiments, we consider four representative applications to evaluate our training paradigm, i.e., ASR, speech command recognition (SCR), speech emotion recognition (SER), and ASC. These applications are associated with speech and nonspeech tasks concerning semantic and non-semantic features, transient and global information, and the experimental results indicate that our proposed approach can considerably boost the noise robustness of the models, especially at low signal-to-noise ratios, for a wide range of computer audition tasks in everyday-life noisy environments.
Keywords: audio enhancement; computer audition; joint optimisation; multi-task learning; voice suppression
Federated Abnormal Heart Sound Detection with Weak to No Labels (cited by: 1)
3
Authors: Wanyong Qiu, Chen Quan, Yongzi Yu, Eda Kara, Kun Qian, Bin Hu, Bjorn W. Schuller, Yoshiharu Yamamoto. Source: Cyborg and Bionic Systems, 2024, Issue 1, pp. 91-107 (17 pages)
Cardiovascular diseases are a prominent cause of mortality, emphasizing the need for early prevention and diagnosis. Utilizing artificial intelligence (AI) models, heart sound analysis emerges as a noninvasive and universally applicable approach for assessing cardiovascular health conditions. However, real-world medical data are dispersed across medical institutions, forming “data islands” due to data sharing limitations for security reasons. To this end, federated learning (FL) has been extensively employed in the medical field, which can effectively model across multiple institutions. Additionally, conventional supervised classification methods require fully labeled data classes, e.g., binary classification requires labeling of positive and negative samples. Nevertheless, the process of labeling healthcare data is time-consuming and labor-intensive, leading to the possibility of mislabeling negative samples. In this study, we validate an FL framework with a naive positive-unlabeled (PU) learning strategy. Semisupervised FL model can directly learn from a limited set of positive samples and an extensive pool of unlabeled samples. Our emphasis is on vertical-FL to enhance collaboration across institutions with different medical record feature spaces. Additionally, our contribution extends to feature importance analysis, where we explore 6 methods and provide practical recommendations for detecting abnormal heart sounds. The study demonstrated an impressive accuracy of 84%, comparable to outcomes in supervised learning, thereby advancing the application of FL in abnormal heart sound detection.
Keywords: federated learning; semi-supervised learning; feature importance analysis; vertical federated learning; abnormal heart sound detection; artificial intelligence (AI) models; heart sound analysis; cardiovascular diseases; weak labels
Learning Representations from Heart Sound: A Comparative Study on Shallow and Deep Models
4
Authors: Kun Qian, Zhihao Bao, Zhonghao Zhao, Tomoya Koike, Fengquan Dong, Maximilian Schmitt, Qunxi Dong, Jian Shen, Weipeng Jiang, Yajuan Jiang, Bo Dong, Zhenyu Dai, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto. Source: Cyborg and Bionic Systems, 2024, Issue 1, pp. 687-698 (12 pages)
Leveraging the power of artificial intelligence to facilitate an automatic analysis and monitoring of heart sounds has increasingly attracted tremendous efforts in the past decade. Nevertheless, lacking on standard open-access database made it difficult to maintain a sustainable and comparable research before the first release of the PhysioNet CinC Challenge Dataset. However, inconsistent standards on data collection, annotation, and partition are still restraining a fair and efficient comparison between different works. To this line, we introduced and benchmarked a first version of the Heart Sounds Shenzhen (HSS) corpus. Motivated and inspired by the previous works based on HSS, we redefined the tasks and make a comprehensive investigation on shallow and deep models in this study. First, we segmented the heart sound recording into shorter recordings (10 s), which makes it more similar to the human auscultation case. Second, we redefined the classification tasks. Besides using the 3 class categories (normal, moderate, and mild/severe) adopted in HSS, we added a binary classification task in this study, i.e., normal and abnormal. In this work, we provided detailed benchmarks based on both the classic machine learning and the state-of-the-art deep learning technologies, which are reproducible by using open-source toolkits. Last but not least, we analyzed the feature contributions of best performance achieved by the benchmark to make the results more convincing and interpretable.
Keywords: deep learning; PhysioNet CinC Challenge Dataset; heart sound; classification tasks; analysis and monitoring of heart sounds; shallow models; deep models; machine learning