Journal literature: 24,230 articles found
Adaptive Reinforcement Learning with Multi-Modal Perception for Autonomous Formation Control and Exploration in Large-Scale Multi-UAV Swarms
1
Authors: Ziyuan Ma, Huajun Gong, Xinhua Wang. Journal of Beijing Institute of Technology, 2026, No. 1, pp. 63-83 (21 pages)
To address the challenge of achieving decentralized, scalable, and adaptive control for large-scale multiple unmanned aerial vehicle (multi-UAV) swarms in dynamic urban environments with obstacles and wind perturbations, we propose a hybrid framework integrating adaptive reinforcement learning (RL), multi-modal perception fusion, and enhanced pigeon flock optimization (PFO) with curiosity-driven exploration to enable robust autonomous and formation control. The framework leverages meta-learning to optimize RL policies for real-time adaptation, fuses sensor data for precise state estimation, and enhances PFO with learned leader-follower dynamics and exploration rewards to maintain cohesive formations and explore uncertain areas. For swarms of 10–30 UAVs, it achieves 34% faster convergence, 61% lower stability root mean square error (RMSE), 88% fewer collisions, and 85.6%–92.3% success rates in target detection and encirclement, outperforming standard multi-agent RL, pure PFO, and single-modality RL. Three-dimensional trajectory visualizations confirm cohesive formations, collision-free maneuvers, and efficient exploration in urban search-and-rescue scenarios. Innovations include meta-RL for rapid adaptation, multi-modal fusion for robust perception, and curiosity-driven PFO for scalable, decentralized control, advancing real-world multi-UAV swarm autonomy and coordination.
Keywords: multiple unmanned aerial vehicle (multi-UAV) swarm; autonomous control; reinforcement learning (RL); multi-modal perception; pigeon flock optimization (PFO)
GaitMAFF: Adaptive Multi-Modal Fusion of Skeleton Maps and Silhouettes for Robust Gait Recognition in Complex Scenarios
2
Authors: Zhongbin Luo, Zhaoyang Guan, Wenxing You, Yunteng Wang, Yanqiu Bi. Computers, Materials & Continua, 2026, No. 5, pp. 540-558 (19 pages)
Gait recognition is a key biometric for long-distance identification, yet its performance is severely degraded by real-world challenges such as varying clothing, carrying conditions, and changing viewpoints. While combining silhouette and skeleton data is a promising direction, effectively fusing these heterogeneous modalities and adaptively weighting their contributions in response to diverse conditions remains a central problem. This paper introduces GaitMAFF, a novel Multi-modal Adaptive Feature Fusion Network, to address this challenge. Our approach first transforms discrete skeleton joints into a dense Skeleton Map representation to align with silhouettes, then employs an attention-based module to dynamically learn the fusion weights between the two modalities. These fused features are processed by a powerful spatio-temporal backbone with Weighted Global-Local Feature Fusion Modules (WFFM) to learn a discriminative representation. Extensive experiments on the challenging CCPG and Gait3D datasets show that GaitMAFF achieves state-of-the-art performance, with an average Rank-1 accuracy of 84.6% on CCPG and 58.7% on Gait3D. These results demonstrate that our adaptive fusion strategy effectively integrates complementary multimodal information, significantly enhancing gait recognition robustness and accuracy in complex scenes and providing a practical solution for real-world applications.
Keywords: gait recognition; multi-modal fusion; adaptive feature fusion; skeleton map; silhouette
Sensilla Trichoidea-Inspired, High-Temperature, and Omnidirectional Vibration Perception Based on Monolayer Graphene
3
Authors: Yuning Li, Danke Chen, Xiaoqiu Tang, Peizhi Yu, Jingye Sun, Xue Li, Qing You, Mingqiang Zhu, Chang Gao, Linan Li, He Tian, Tao Deng. Nano-Micro Letters, 2026, No. 6, pp. 350-365 (16 pages)
With the convergence of sensor technology, artificial intelligence, and the Internet of Things, intelligent vibration monitoring systems are undergoing transformative development. This evolution imposes stringent demands on the miniaturization, low power consumption, high integration, and environmental adaptability of transducers. Graphene, renowned for its superlative physicochemical attributes, holds significant promise for application in micro- and nanoelectromechanical systems (M/NEMS). However, the inherent central symmetry of graphene restricts its utility in piezoelectric devices. Inspired by the sensilla trichoidea of spiders, a three-dimensional (3D) cilia-like monolayer graphene omnidirectional vibration transducer (CGVT) based on a stress-induced self-assembly mechanism is fabricated, demonstrating notable performance and high-temperature resistance. Furthermore, 3D vibration vector decoding is realized via an omnidirectional decoupling algorithm based on one-dimensional convolutional neural networks (1DCNN) to achieve precise discrimination of vibration directions. The 3D bionic vibration-sensing system incorporates a spider web structure into a bionic cilia MEMS chip through a gold wire bonding process, enabling the realization of three distinct mechanisms for vibration detection and recognition. In particular, these devices are manufactured using silicon-based semiconductor processing techniques and MEMS fabrication methodologies, leading to a substantial reduction in the dimensions of individual components compared to traditional counterparts.
Keywords: bioinspired; 3D graphene; vibration perception; monolithic integration
Neuromorphic devices for intelligent visual perception
4
Authors: Yixin Zhu, Xiangjing Wang, Yuqing Hu, Xinli Chen, Xianhao Le, Changjin Wan, Qing Wan. International Journal of Extreme Manufacturing, 2026, No. 1, pp. 186-219 (34 pages)
Neuromorphic visual perception, by emulating the efficient information processing mechanisms of biological vision systems and integrating innovations in materials and device architectures, offers novel solutions for artificial intelligence sensing. For instance, the incorporation of low-dimensional materials (e.g., quantum dots, carbon nanotubes, and two-dimensional materials) optimizes device optoelectronic properties, while the synergistic design of organic semiconductors and oxide materials balances flexibility with complementary metal-oxide-semiconductor (CMOS) compatibility. Representative neuromorphic devices such as memristors and neuromorphic transistors address the data transmission latency and energy consumption bottlenecks of traditional vision systems via near-sensor and in-sensor architectures, offering a new paradigm for highly integrated, energy-efficient real-time perception. However, critical challenges, including device non-uniformity caused by material interface defects, system instability induced by memristor conductance drift, and environmental adaptability under complex illumination, remain barriers to scalable applications. This review comprehensively examines neuromorphic visual perception devices from the perspectives of device structure, operational mechanisms, materials, and applications. It explores the pivotal roles of memristors, electrolyte-gated transistors, and other neuromorphic devices in optical signal perception and information processing, with a focus on their implementations in visual perception tasks and future prospects.
Keywords: neuromorphic visual perception; neuromorphic computing; memristor; transistor
Visual perception and density-sensitive interaction in active agent system
5
Authors: Fei Meng, Weiqiang Ma, Run Cheng, Jun Wang. Chinese Physics B, 2026, No. 1, pp. 608-614 (7 pages)
This study extends the self-propelled particle (SPP) model by incorporating a limited vision cone and local density sensing. The results reveal that clusters can simultaneously exhibit velocity polarization and spatial cohesion within specific ranges of vision angle and density threshold. The dependence of the dynamical features, including the order parameter and density variation, on the threshold and visual cone is investigated. Furthermore, a critical threshold is identified, which governs the transition between ordered and disordered states and is closely linked to density fluctuations and noise intensity. The clustering results show that the model's behavior is explained by a chasing mechanism that is responsible for cluster formation, density, and shape. These results may stimulate practical applications in swarm maneuvering.
Keywords: active matter; Vicsek model; visual perception; threshold
Multi-modal data analysis for autism spectrum disorder in children: State of the art and trends
6
Authors: Lukai Pang, Xiaoke Zhao, Lulu Zhao, Jianqing Li, Fengyi Kuo, Hongxing Wang, Chengyu Liu. EngMedicine, 2026, No. 1, pp. 47-56 (10 pages)
Autism spectrum disorder (ASD) is a highly heterogeneous neurodevelopmental disorder. Early diagnosis and intervention are crucial for improving outcomes. Traditional single-modality diagnostic methods are subjective, limited, and struggle to reveal the underlying pathological mechanisms. In contrast, multimodal data analysis integrates behavioral, physiological, and neuroimaging information with advanced machine-learning and deep-learning algorithms to overcome these limitations. In this review, we surveyed the recent pediatric ASD literature, highlighting artificial intelligence-driven diagnostic techniques, multimodal data fusion strategies, and emerging trends in ASD assessment. We surveyed studies that integrated two or more modalities and summarized the fusion levels, learning paradigms, tasks, datasets, and metrics. Multimodal approaches outperform single-modality baselines in classification, severity estimation, and subtyping by leveraging complementary information and reducing modality-specific biases. Multimodal approaches significantly enhance diagnostic accuracy and comprehensiveness, enabling early screening of ASD, symptom subtyping, severity assessment, and personalized interventions. Advances in multimodal fusion techniques have promoted progress in precision medicine for the treatment of ASD.
Keywords: autism spectrum disorder; multi-modal data; machine learning; early screening; symptom subtyping
Mapping the Light Atlas: A Phenomenological Parable on the Human Perception of Spacetime in the Anthropocene
7
Author: Nicolas Vantis. Philosophy Study, 2026, No. 2, pp. 172-182 (11 pages)
What is spacetime? How do we perceive this medium? How can we fit it into our everyday linear lives? How can we situate ourselves within it in our post-industrial worldview, in an unsustainable world? This philosophical essay adopts a phenomenological method to interrogate the meaning of this fundamental dimension of reality. Spacetime is interpreted not merely as a physical structure but as a plastic field whose instability shapes inner and social life. Yet the contemporary human condition is marked by a profound alienation, much of which derives from a self-inflicted existential disorientation: I once chose exile and moved to a remote island in the Atlantic Ocean, becoming my own research material. In search of genuine contact with nature, the nonverbal appeared as a necessity. I turned to music as an archetypal language, in the Romantic sense of a medium offering pre-conceptual access to the real. I composed Light Atlas, a six-movement work aiming to capture the flight of seagulls and the eternal struggle between light and darkness. This led me back to physics, to my original question: the lived perception of spacetime.
Keywords: phenomenology; philosophy of physics; spacetime perception; philosophy of music; Anthropocene
MDGET-MER: Multi-Level Dynamic Gating and Emotion Transfer for Multi-Modal Emotion Recognition
8
Authors: Musheng Chen, Qiang Wen, Xiaohong Qiu, Junhua Wu, Wenqing Fu. Computers, Materials & Continua, 2026, No. 3, pp. 872-893 (22 pages)
In multi-modal emotion recognition, excessive reliance on historical context often impedes the detection of emotional shifts, while modality heterogeneity and unimodal noise limit recognition performance. Existing methods struggle to dynamically adjust cross-modal complementary strength to optimize fusion quality and lack effective mechanisms to model the dynamic evolution of emotions. To address these issues, we propose a multi-level dynamic gating and emotion transfer framework for multi-modal emotion recognition. A dynamic gating mechanism is applied across unimodal encoding, cross-modal alignment, and emotion transfer modeling, substantially improving noise robustness and feature alignment. First, we construct a unimodal encoder based on gated recurrent units and feature-selection gating to suppress intra-modal noise and enhance contextual representation. Second, we design a gated-attention cross-modal encoder that dynamically calibrates the complementary contributions of visual and audio modalities to the dominant textual features and eliminates redundant information. Finally, we introduce a gated enhanced emotion transfer module that explicitly models the temporal dependence of emotional evolution in dialogues via transfer gating and optimizes continuity modeling with a comparative learning loss. Experimental results demonstrate that the proposed method outperforms state-of-the-art models on the public MELD and IEMOCAP datasets.
Keywords: multi-modal emotion recognition; dynamic gating; emotion transfer module; cross-modal dynamic alignment; noise robustness
The Influence of Discrimination Perception on the Psychological Resilience among Vocational High School Students: Longitudinal Mediating Effect of Vocational Identity
9
Authors: Lingyan Zhang, Yuying Yang, Zhuoxuan Huang. International Journal of Mental Health Promotion, 2026, No. 2, pp. 112-124 (13 pages)
Objectives: Psychological resilience is a critical resource for vocational high school students navigating social biases and fostering mental well-being. This six-month longitudinal study investigated the developmental trajectories of discrimination perception, vocational identity, and psychological resilience in this population. It further examined the longitudinal mediating role of vocational identity in the relationship between discrimination perception and psychological resilience. Methods: A total of 526 students from five vocational high schools in Guangdong, China, were assessed via convenience sampling at two time points: baseline (T1, September 2023) and six-month follow-up (T2, March 2024). Measures of discrimination perception, psychological resilience, and vocational identity were administered. Data were analyzed using a cross-lagged panel model to test for bidirectional relationships. Results: Over the six-month period, students showed significant decreases in discrimination perception and vocational identity, but a significant increase in psychological resilience. The cross-lagged model revealed significant bidirectional relationships: discrimination perception and psychological resilience negatively predicted each other over time (β = −0.124, p < 0.01; β = −0.200, p < 0.001), while psychological resilience and vocational identity positively predicted each other (β = 0.084, p < 0.05; β = 0.076, p < 0.05). The mediation analysis revealed a dual-pathway mechanism. T1 discrimination perception exerted both a significant direct negative effect on T2 psychological resilience (β = −0.332, p < 0.001) and a significant indirect positive effect via T1 vocational identity (indirect effect = 0.020, 95% CI [0.001, 0.046]). This confirms a partial mediating role, indicating that vocational identity functions as a compensatory mechanism, transforming the experience of discrimination perception into a potential source of psychological resilience. Conclusions: For vocational high school students, perception of discrimination directly undermines psychological resilience, but also indirectly fosters it through the positive development of vocational identity. These findings highlight vocational identity as a pivotal mechanism in the complex relationship between social adversity and mental resilience.
Keywords: vocational high school students; vocational identity; discrimination perception; psychological resilience
Real-Time 3D Scene Perception in Dynamic Urban Environments via Street Detection Gaussians
10
Authors: Yu Du, Runwei Guan, Ho-Pun Lam, Jeremy Smith, Yutao Yue, KaLok Man, Yan Li. Computers, Materials & Continua, 2026, No. 4, pp. 1384-1402 (19 pages)
As a cornerstone for applications such as autonomous driving, 3D urban perception is a burgeoning field of study. Enhancing the performance and robustness of these perception systems is crucial for ensuring the safety of next-generation autonomous vehicles. In this work, we introduce a novel neural scene representation called Street Detection Gaussians (SDGs), which redefines urban 3D perception through an integrated architecture unifying reconstruction and detection. At its core lies the dynamic Gaussian representation, where time-conditioned parameterization enables simultaneous modeling of static environments and dynamic objects through physically constrained Gaussian evolution. The framework’s radar-enhanced perception module learns cross-modal correlations between sparse radar data and dense visual features, resulting in a 22% reduction in occlusion errors compared to vision-only systems. A breakthrough differentiable rendering pipeline back-propagates semantic detection losses throughout the entire 3D reconstruction process, enabling the optimization of both geometric and semantic fidelity. Evaluated on the Waymo Open Dataset and the KITTI Dataset, the system achieves real-time performance (135 frames per second (FPS)), photorealistic quality (peak signal-to-noise ratio (PSNR) of 34.9 dB), and state-of-the-art detection accuracy (78.1% mean average precision (mAP)), demonstrating a 3.8× end-to-end improvement over existing hybrid approaches while enabling seamless integration with autonomous driving stacks.
Keywords: radar-vision fusion; differentiable rendering; autonomous driving perception; 3D reconstruction; occlusion robustness
A Bio-inspired Bubble Artificial Muscles and TacTip Perception-driven Tri-legged Robot for Obstacle Avoidance
11
Authors: Chaoqun Xiang, Zhengwei Zhong, Wenqiang Wu, Xiaocong Chen, Yisheng Guan, Tao Zou. Journal of Bionic Engineering, 2026, No. 1, pp. 175-191 (17 pages)
Legged robots have considerable potential for traversing unstructured environments; nonetheless, their inflexible frameworks often constrain adaptability and obstacle negotiation. This article presents a revolutionary Soft Tri-Legged Robot (STLR) that improves movement and obstacle-avoidance skills by using a bio-inspired pneumatic artificial muscle (Bubble Artificial Muscles, BAMs) and a bio-inspired tactile sensor (TacTip). The STLR is actuated by BAMs, which are flexible, pneumatic-driven actuators that provide fine control over forward, backward, and steering movements. Obstacle identification and avoidance are facilitated by the TacTip sensor, which delivers tactile input for traversing unstructured terrains. We delineate the mechanical features of the BAMs, assess the functionality of the robot's legs, and elaborate on the incorporation of the tactile sensing system. Experimental results demonstrate that the STLR can effectively achieve multi-directional flexible movement and obstacle avoidance through a cross-modal perception-actuation mechanism. This study highlights the promise of soft robotics for search and rescue, medical aid, and autonomous exploration, while delineating difficulties and opportunities for future improvements in functionality and efficiency.
Keywords: legged robot; bio-inspired bubble artificial muscles; bio-inspired TacTip sensor; foot tactile perception; obstacle avoidance
Special Section on Perception, Control, and Decision-Making of Embodied Intelligent Systems
12
Journal of Systems Engineering and Electronics, 2026, No. 1, p. F0002 (1 page)
Embodied intelligent systems integrate perception, control, and decision-making within physical agents, and have become a cornerstone of modern aerospace, autonomous driving, and cooperative robotic applications. When operating in uncertain and dynamic environments, such systems must address challenges arising from incomplete sensing, unpredictable maneuvers, communication constraints, disturbances, and evolving network structures.
Keywords: incomplete sensing; decision making; embodied intelligent systems; aerospace; autonomous driving; control; cooperative robotic applications; evolving network structures; perception
Information perception and feedback mechanism and key techniques of multi-modality human-robot interaction for service robots (cited by: 1)
13
Author: 赵其杰 (Zhao Qijie). Journal of Shanghai University (English Edition) [CAS], 2006, No. 3, p. 281 (1 page)
With the growth of the elderly population and rising health care costs, the role of service robots in aiding the disabled and the elderly is becoming important. Many researchers around the world have paid much attention to healthcare robots and rehabilitation robots. To achieve natural and harmonious communication between the user and a service robot, the information perception/feedback ability and the interaction ability of service robots have become increasingly important among the key issues.
Keywords: service robot; multi-modality; human-robot interaction; user model; interaction protocol; information perception and feedback
Perceptions and emotions in postoperative recovery of patients with perianal diseases (cited by: 1)
14
Authors: Bryan Adrian Priego-Parra, Jose Maria Remes-Troche. World Journal of Psychiatry [SCIE], 2025, No. 1, pp. 179-184 (6 pages)
This article examines the complex relationship between disease perception, negative emotions, and their impact on postoperative recovery in patients with perianal diseases. These conditions not only cause physical discomfort, but also carry a significant emotional burden, often exacerbated by social stigma. Psychological factors, including stress, anxiety, and depression, activate neuroendocrine pathways, such as the hypothalamic–pituitary–adrenal axis, disrupting the gut microbiota and leading to dysbiosis. This disruption can delay wound healing, prolong hospital stay, and intensify pain. Drawing on the findings of Hou et al, our article highlights the critical role of illness perception and negative emotions in shaping recovery outcomes. It advocates for a holistic approach that integrates psychological support and gut microbiota modulation, to enhance healing and improve overall patient outcomes.
Keywords: perianal disease; illness perception; gut microbiota; post-surgical outcomes; microbiota
Healthcare providers’ perceptions of artificial intelligence in diabetes care: A cross-sectional study in China (cited by: 6)
15
Authors: Yongzhen Mo, Fang Zhao, Li Yuan, Qiuling Xing, Yingxia Zhou, Quanying Wu, Caihong Li, Juan Lin, Haidi Wu, Shunzhi Deng, Mingxia Zhang. International Journal of Nursing Sciences, 2025, No. 3, pp. 218-224, I0003 (8 pages)
Objectives: Diabetes remains a major global health challenge in China. Artificial intelligence (AI) has demonstrated considerable potential in improving diabetes management. This study aimed to assess healthcare providers’ perceptions regarding AI in diabetes care across China. Methods: A cross-sectional survey was conducted using snowball sampling from November 12 to November 24, 2024. We selected 514 physicians and nurses from healthcare providers across 30 cities or provinces in China. The self-developed questionnaire comprised five sections with 19 questions assessing medical workers’ demographic characteristics, AI-related experience and interest, awareness, attitudes, and concerns regarding AI in diabetes care. Statistical analysis was performed using t-test, analysis of variance (ANOVA), and linear regression. Results: Of the respondents, 20.0% and 48.1% had participated in AI-related research and training, respectively, while 85.4% expressed moderate to high interest in AI training for diabetes care. Most respondents reported partial awareness of AI in diabetes care, and only 12.6% exhibited a comprehensive or substantial understanding. Attitudes toward AI in diabetes care were generally positive, with a mean score of 24.50 ± 3.38. Nurses demonstrated significantly higher scores than physicians (P < 0.05). Greater awareness, prior AI training experience, and higher interest in AI training in diabetes care were strongly associated with more positive attitudes (P < 0.05). Key concerns regarding AI included trust issues from AI-clinician inconsistencies (77.2%), increased workload and clinical workflow disruptions (63.4%), and incomplete legal and regulatory frameworks (60.3%). Only 34.2% of respondents expressed concerns about job displacement, indicating general confidence in their professional roles. Conclusions: While Chinese healthcare providers show moderate awareness of AI in diabetes care, their attitudes are generally positive, and they are considerably interested in future training. Tailored, role-specific AI training is essential for equitable and effective integration into clinical practice. Additionally, transparent, reliable, ethical AI models must be prioritized to alleviate practitioners’ concerns.
Keywords: artificial intelligence; attitudes; diabetes; medical workers; nursing; perceptions
Construction and evaluation of a predictive model for the degree of coronary artery occlusion based on adaptive weighted multi-modal fusion of traditional Chinese and western medicine data (cited by: 2)
16
Authors: Jiyu ZHANG, Jiatuo XU, Liping TU, Hongyuan FU. Digital Chinese Medicine, 2025, No. 2, pp. 163-173 (11 pages)
Objective: To develop a non-invasive predictive model for coronary artery stenosis severity based on adaptive multi-modal integration of traditional Chinese and western medicine data. Methods: Clinical indicators, echocardiographic data, traditional Chinese medicine (TCM) tongue manifestations, and facial features were collected from patients who underwent coronary computed tomography angiography (CTA) in the Cardiac Care Unit (CCU) of Shanghai Tenth People's Hospital between May 1, 2023 and May 1, 2024. An adaptive weighted multi-modal data fusion (AWMDF) model based on deep learning was constructed to predict the severity of coronary artery stenosis. The model was evaluated using metrics including accuracy, precision, recall, F1 score, and the area under the receiver operating characteristic (ROC) curve (AUC). Further performance assessment was conducted through comparisons with six ensemble machine learning methods, data ablation, model component ablation, and various decision-level fusion strategies. Results: A total of 158 patients were included in the study. The AWMDF model achieved excellent predictive performance (AUC = 0.973, accuracy = 0.937, precision = 0.937, recall = 0.929, and F1 score = 0.933). Compared with model ablation, data ablation experiments, and various traditional machine learning models, the AWMDF model demonstrated superior performance. Moreover, the adaptive weighting strategy outperformed alternative approaches, including simple weighting, averaging, voting, and fixed-weight schemes. Conclusion: The AWMDF model demonstrates potential clinical value in the non-invasive prediction of coronary artery disease and could serve as a tool for clinical decision support.
Keywords: coronary artery disease; deep learning; multi-modal; clinical prediction; traditional Chinese medicine diagnosis
TCM network pharmacology: new perspective integrating network target with artificial intelligence and multi-modal multi-omics technologies (cited by: 1)
17
Authors: Ziyi Wang, Tingyu Zhang, Boyang Wang, Shao Li. Chinese Journal of Natural Medicines, 2025, No. 11, pp. 1425-1434 (10 pages)
Traditional Chinese medicine (TCM) demonstrates distinctive advantages in disease prevention and treatment. However, analyzing its biological mechanisms through the modern medical research paradigm of “single drug, single target” presents significant challenges due to its holistic approach. Network pharmacology and its core theory of network targets connect drugs and diseases from a holistic and systematic perspective based on biological networks, overcoming the limitations of reductionist research models and showing considerable value in TCM research. Recent integration of network target computational and experimental methods with artificial intelligence (AI) and multi-modal multi-omics technologies has substantially enhanced network pharmacology methodology. The advancement in computational and experimental techniques provides complementary support for network target theory in decoding TCM principles. This review, centered on network targets, examines the progress of network target methods combined with AI in predicting disease molecular mechanisms and drug-target relationships, alongside the application of multi-modal multi-omics technologies in analyzing TCM formulae, syndromes, and toxicity. Looking forward, network target theory is expected to incorporate emerging technologies while developing novel approaches aligned with its unique characteristics, potentially leading to significant breakthroughs in TCM research and advancing scientific understanding and innovation in TCM.
Keywords: Network pharmacology; Traditional Chinese medicine; Network target; Artificial intelligence; multi-modal; Multi-omics
MMGC-Net: Deep neural network for classification of mineral grains using multi-modal polarization images (cited by: 1)
18
Authors: Jun Shu, Xiaohai He, Qizhi Teng, Pengcheng Yan, Haibo He, Honggang Chen. Journal of Rock Mechanics and Geotechnical Engineering, 2025, No. 6, pp. 3894-3909 (16 pages)
The multi-modal characteristics of mineral particles play a pivotal role in enhancing the classification accuracy, which is critical for obtaining a profound understanding of the Earth's composition and ensuring effective exploitation and utilization of its resources. However, the existing methods for classifying mineral particles do not fully utilize these multi-modal features, thereby limiting the classification accuracy. Furthermore, when conventional multi-modal image classification methods are applied to plane-polarized and cross-polarized sequence images of mineral particles, they encounter issues such as information loss, misaligned features, and challenges in spatiotemporal feature extraction. To address these challenges, we propose a multi-modal mineral particle polarization image classification network (MMGC-Net) for precise mineral particle classification. Initially, MMGC-Net employs a two-dimensional (2D) backbone network with shared parameters to extract features from two types of polarized images to ensure feature alignment. Subsequently, a cross-polarized intra-modal feature fusion module is designed to refine the spatiotemporal features from the extracted features of the cross-polarized sequence images. Ultimately, the inter-modal feature fusion module integrates the two types of modal features to enhance the classification precision. Quantitative and qualitative experimental results indicate that when compared with the current state-of-the-art multi-modal image classification methods, MMGC-Net demonstrates marked superiority in terms of mineral particle multi-modal feature learning and four classification evaluation metrics. It also demonstrates better stability than the existing models.
Keywords: Mineral particles; multi-modal image classification; Shared parameters; Feature fusion; Spatiotemporal feature
Advancing depth perception in spatial computing with binocular metalenses (cited by: 1)
19
Authors: Junkyeong Park, Gyeongtae Kim, Junsuk Rho. Opto-Electronic Advances, 2025, No. 1, pp. 1-3 (3 pages)
Spatial computing and augmented reality are advancing rapidly, with the goal of seamlessly blending virtual and physical worlds. However, traditional depth-sensing systems are bulky and energy-intensive, limiting their use in wearable devices. To overcome this, recent research by X. Liu et al. presents a compact binocular metalens-based depth perception system that integrates efficient edge detection through an advanced neural network. This system enables accurate, real-time depth mapping even in complex environments, enhancing potential applications in augmented reality, robotics, and autonomous systems.
Keywords: metasurface; metalens; deep learning; depth perception; edge detection
Multi-modal intelligent situation awareness in real-time air traffic control: Control intent understanding and flight trajectory prediction (cited by: 1)
20
Authors: Dongyue GUO, Jianwei ZHANG, Bo YANG, Yi LIN. Chinese Journal of Aeronautics, 2025, No. 6, pp. 41-57 (17 pages)
With the advent of the next-generation Air Traffic Control (ATC) system, there is growing interest in using Artificial Intelligence (AI) techniques to enhance Situation Awareness (SA) for ATC Controllers (ATCOs), i.e., Intelligent SA (ISA). However, the existing AI-based SA approaches often rely on unimodal data and lack a comprehensive description and benchmark of the ISA tasks utilizing multi-modal data for real-time ATC environments. To address this gap, by analyzing the situation awareness procedure of the ATCOs, the ISA task is refined to the processing of the two primary elements, i.e., spoken instructions and flight trajectories. Subsequently, the ISA is further formulated into Controlling Intent Understanding (CIU) and Flight Trajectory Prediction (FTP) tasks. For the CIU task, an innovative automatic speech recognition and understanding framework is designed to extract the controlling intent from unstructured and continuous ATC communications. For the FTP task, the single- and multi-horizon FTP approaches are investigated to support the high-precision prediction of the situation evolution. A total of 32 unimodal/multi-modal advanced methods with extensive evaluation metrics are introduced to conduct the benchmarks on the real-world multi-modal ATC situation dataset. Experimental results demonstrate the effectiveness of AI-based techniques in enhancing ISA for the ATC environment.
Keywords: Air traffic control; Automatic speech recognition and understanding; Flight trajectory prediction; multi-modal; Situation awareness