期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Interpretable Visual Neural Decoding with Unsupervised Semantic Disentanglement
1
作者 Qiongyi Zhou Changde Du +3 位作者 Dan Li Bincheng Wen Le Chang Huiguang He 《Machine Intelligence Research》 2025年第3期553-570,共18页
In the field of brain decoding research,reconstructing visual perception from neural recordings is a challenging but crucial task.With the use of superior algorithms,many methods have been dedicated to enhancing decod... In the field of brain decoding research,reconstructing visual perception from neural recordings is a challenging but crucial task.With the use of superior algorithms,many methods have been dedicated to enhancing decoding performance.However,these models that map neural activities onto semantically entangled feature space are difficult to interpret.It is hard to understand the connections between neural activities and these abstract features.In this paper,we propose an interpretable neural decoding model that projects neural activities onto a semantically disentangled feature space with each dimension representing distinct visual attributes,such as gender and facial pose.A two-stage algorithm is designed to achieve this goal.First,a deep generative model learns semantically-disentangled image representations in an unsupervised way.Second,neural activities are linearly embedded into the semantic space,which the generator uses to reconstruct visual stimuli.Due to modality heterogeneity,it is challenging to learn such a neural embedded high-level semantic representation.We induce pixel,feature,and semantic alignment to ensure reconstruction quality.Three experimental fMRI datasets containing handwritten digits,characters,and human face stimuli are used to evaluate the neural decoding performance of our model.We also demonstrate the model interpretability through a reconstructed image editing application.The experimental results indicate that our model maintains a competitive decoding performance while remaining interpretable. 展开更多
关键词 visual neural decoding disentangled representation learning model interpretability cross-modal generation generative adversarial networks
原文传递
Deep Learning-Based Lip-Reading for Vocal Impaired Patient Rehabilitation
2
作者 Chiara Innocente Matteo Boemio +6 位作者 Gianmarco Lorenzetti Ilaria Pulito Diego Romagnoli Valeria Saponaro Giorgia Marullo Luca Ulrich Enrico Vezzetti 《Computer Modeling in Engineering & Sciences》 2025年第5期1355-1379,共25页
Lip-reading technology,based on visual speech decoding and automatic speech recognition,offers a promising solution to overcoming communication barriers,particularly for individuals with temporary or permanent speech ... Lip-reading technology,based on visual speech decoding and automatic speech recognition,offers a promising solution to overcoming communication barriers,particularly for individuals with temporary or permanent speech impairments.However,most Visual Speech Recognition(VSR)research has primarily focused on the English language and general-purpose applications,limiting its practical applicability in medical and rehabilitative settings.This study introduces the first Deep Learning(DL)based lip-reading system for the Italian language designed to assist individuals with vocal cord pathologies in daily interactions,facilitating communication for patients recovering from vocal cord surgeries,whether temporarily or permanently impaired.To ensure relevance and effectiveness in real-world scenarios,a carefully curated vocabulary of twenty-five Italian words was selected,encompassing critical semantic fields such as Needs,Questions,Answers,Emergencies,Greetings,Requests,and Body Parts.These words were chosen to address both essential daily communication and urgent medical assistance requests.Our approach combines a spatiotemporal Convolutional Neural Network(CNN)with a bidirectional Long Short-Term Memory(BiLSTM)recurrent network,and a Connectionist Temporal Classification(CTC)loss function to recognize individual words,without requiring predefined words boundaries.The experimental results demonstrate the system’s robust performance in recognizing target words,reaching an average accuracy of 96.4%in individual word recognition,suggesting that the system is particularly well-suited for offering support in constrained clinical and caregiving environments,where quick and reliable communication is critical.In conclusion,the study highlights the importance of developing language-specific,application-driven VSR solutions,particularly for non-English languages with limited linguistic resources.By bridging the gap between deep learning-based lip-reading and real-world clinical needs,this research advances assistive communication technologies,paving the way for more inclusive and medically relevant applications of VSR in rehabilitation and healthcare. 展开更多
关键词 LIP-READING deep learning automatic speech recognition visual speech decoding 3D convolutional neural network
在线阅读 下载PDF
Neural Decoding of Visual Information Across Different Neural Recording Modalities and Approaches 被引量:4
3
作者 Yi-Jun Zhang Zhao-Fei Yu +1 位作者 Jian.K.Liu Tie-Jun Huang 《Machine Intelligence Research》 EI CSCD 2022年第5期350-365,共16页
Vision plays a peculiar role in intelligence.Visual information,forming a large part of the sensory information,is fed into the human brain to formulate various types of cognition and behaviours that make humans becom... Vision plays a peculiar role in intelligence.Visual information,forming a large part of the sensory information,is fed into the human brain to formulate various types of cognition and behaviours that make humans become intelligent agents.Recent advances have led to the development of brain-inspired algorithms and models for machine vision.One of the key components of these methods is the utilization of the computational principles underlying biological neurons.Additionally,advanced experimental neuroscience techniques have generated different types of neural signals that carry essential visual information.Thus,there is a high demand for mapping out functional models for reading out visual information from neural signals.Here,we briefly review recent progress on this issue with a focus on how machine learning techniques can help in the development of models for contending various types of neural signals,from fine-scale neural spikes and single-cell calcium imaging to coarse-scale electroencephalography(EEG)and functional magnetic resonance imaging recordings of brain signals. 展开更多
关键词 Neural decoding machine learning deep learning visual decoding brain-inspired vision
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部