期刊文献+
共找到10,630篇文章
< 1 2 250 >
每页显示 20 50 100
Semantic-Guided Stereo Matching Network Based on Parallax Attention Mechanism and Seg Former
1
作者 Zeyuan Chen Yafei Xie +2 位作者 Jinkun Li Song Wang Yingqiang Ding 《Computers, Materials & Continua》 2026年第4期1322-1340,共19页
Stereo matching is a pivotal task in computer vision,enabling precise depth estimation from stereo image pairs,yet it encounters challenges in regions with reflections,repetitive textures,or fine structures.In this pa... Stereo matching is a pivotal task in computer vision,enabling precise depth estimation from stereo image pairs,yet it encounters challenges in regions with reflections,repetitive textures,or fine structures.In this paper,we propose a Semantic-Guided Parallax Attention Stereo Matching Network(SGPASMnet)that can be trained in unsupervised manner,building upon the Parallax Attention Stereo Matching Network(PASMnet).Our approach leverages unsupervised learning to address the scarcity of ground truth disparity in stereo matching datasets,facilitating robust training across diverse scene-specific datasets and enhancing generalization.SGPASMnet incorporates two novel components:a Cross-Scale Feature Interaction(CSFI)block and semantic feature augmentation using a pre-trained semantic segmentation model,SegFormer,seamlessly embedded into the parallax attention mechanism.The CSFI block enables effective fusion ofmulti-scale features,integrating coarse and fine details to enhance disparity estimation accuracy.Semantic features,extracted by SegFormer,enrich the parallax attention mechanism by providing high-level scene context,significantly improving performance in ambiguous regions.Our model unifies these enhancements within a cohesive architecture,comprising semantic feature extraction,an hourglass network,a semantic-guided cascaded parallax attentionmodule,outputmodule,and a disparity refinement network.Evaluations on the KITTI2015 dataset demonstrate that our unsupervised method achieves a lower error rate compared to the original PASMnet,highlighting the effectiveness of our enhancements in handling complex scenes.By harnessing unsupervised learning without ground truth disparity needed,SGPASMnet offers a scalable and robust solution for accurate stereo matching,with superior generalization across varied real-world applications. 展开更多
关键词 Stereo matching parallax attention unsupervised learning convolutional neural network stereo correspondence
在线阅读 下载PDF
基于CNN-Attention-LSTM的液压系统故障诊断网络
2
作者 张旭峰 马硕 +2 位作者 易飞彤 刘庆同 纪辉 《机电工程》 北大核心 2026年第2期238-247,共10页
针对液压系统故障时信号复杂难以诊断、维护成本高等问题,提出了一种融合卷积神经网络(CNN)、注意力机制(Attention)和长短时记忆网络(LSTM)的深度学习模型(CNN-Attention-LSTM),对液压系统进行了故障诊断。首先,采用CNN提取了液压系统... 针对液压系统故障时信号复杂难以诊断、维护成本高等问题,提出了一种融合卷积神经网络(CNN)、注意力机制(Attention)和长短时记忆网络(LSTM)的深度学习模型(CNN-Attention-LSTM),对液压系统进行了故障诊断。首先,采用CNN提取了液压系统传感器信号的局部特征,结合LSTM提取了时序依赖关系,将Attention融入LSTM网络中,增强了对关键故障特征的关注度;然后,使用来自UCI网站的液压系统运行数据作为数据集,对不同采样频率的数据进行了处理,保证了所有传感器的采样点数保持一致;最后,针对冷却器、阀门、泵和蓄能器四类元件故障类别,评估了CNN-Attention-LSTM模型的故障预测准确性。研究结果表明:在预测的样本数量增多的情况下,CNN-Attention-LSTM模型对冷却器、阀门和泵三类故障的预测准确率达99%以上,对蓄能器故障的预测准确率达98%,验证了CNN-Attention-LSTM模型的有效性且证明其具备较强的泛化能力。该模型对故障状态识别能力明显优于传统的LSTM模型、支持向量机(SVM)网络、反向传播(BP)神经网络和循环神经网络(RNN)模型,为维护液压系统的稳定运行提供了新方法。 展开更多
关键词 液压传动系统 故障识别模型 多传感器信息融合 卷积神经网络 长短时记忆网络 注意力机制
在线阅读 下载PDF
优化变分模态分解下NRBO-LSTM-Attention修正预测风速的风电功率短期预测
3
作者 杨渊文 黄曌 +2 位作者 王欣 郭智薇 张柳 《太阳能学报》 北大核心 2026年第1期441-449,共9页
为提高数值天气预报(NWP)预测风速的精确性,将NWP风速与实际风电场风速输入到全局搜索策略鲸鱼算法(GSWOA)优化的变分模态分解(VMD)进行分解。分解后的实际风速分量作为训练目标,对应的NWP风速分量则输入基于牛顿-拉夫逊优化算法-长短... 为提高数值天气预报(NWP)预测风速的精确性,将NWP风速与实际风电场风速输入到全局搜索策略鲸鱼算法(GSWOA)优化的变分模态分解(VMD)进行分解。分解后的实际风速分量作为训练目标,对应的NWP风速分量则输入基于牛顿-拉夫逊优化算法-长短期记忆网络加注意力机制(NRBO-LSTM-Attention)模型,将输出的各分量线性叠加后替换原NWP风速。之后,通过孤立森林和Ransac算法等对修正后的NWP与风电场数据进行异常值清洗,最终输入NRBO-LSTM-Attention模型,用于预测未来功率。仿真结果表明:修正后的NWP风速更接近实际风速,评估指标平均绝对误差(MAE)和均方根误差(RMSE)分别降低11.45%和19.82%,R^(2)提升31.24%;预测功率模型的性能更优,MAE和RMSE分别降低11.36%和10.43%,R^(2)提升3.42%。 展开更多
关键词 风电场 风速 变分模态分解 神经网络 牛顿-拉夫逊优化算法 注意力机制 功率预测
原文传递
Keyword Spotting Based on Dual-Branch Broadcast Residual and Time-Frequency Coordinate Attention
4
作者 Zeyu Wang Jian-Hong Wang Kuo-Chun Hsu 《Computers, Materials & Continua》 2026年第4期333-352,共20页
In daily life,keyword spotting plays an important role in human-computer interaction.However,noise often interferes with the extraction of time-frequency information,and achieving both computational efficiency and rec... In daily life,keyword spotting plays an important role in human-computer interaction.However,noise often interferes with the extraction of time-frequency information,and achieving both computational efficiency and recognition accuracy on resource-constrained devices such as mobile terminals remains a major challenge.To address this,we propose a novel time-frequency dual-branch parallel residual network,which integrates a Dual-Branch Broadcast Residual module and a Time-Frequency Coordinate Attention module.The time-domain and frequency-domain branches are designed in parallel to independently extract temporal and spectral features,effectively avoiding the potential information loss caused by serial stacking,while enhancing information flow and multi-scale feature fusion.In terms of training strategy,a curriculum learning approach is introduced to progressively improve model robustness fromeasy to difficult tasks.Experimental results demonstrate that the proposed method consistently outperforms existing lightweight models under various signal-to-noise ratio(SNR)conditions,achieving superior far-field recognition performance on the Google Speech Commands V2 dataset.Notably,the model maintains stable performance even in low-SNR environments such as–10 dB,and generalizes well to unseen SNR conditions during training,validating its robustness to novel noise scenarios.Furthermore,the proposed model exhibits significantly fewer parameters,making it highly suitable for deployment on resource-limited devices.Overall,the model achieves a favorable balance between performance and parameter efficiency,demonstrating strong potential for practical applications. 展开更多
关键词 Keyword spotting convolutional neural network residual learning attention small footprint noisy far-field
在线阅读 下载PDF
Attention-Enhanced ResNet-LSTM Model with Wind-Regime Clustering for Wind Speed Forecasting
5
作者 Weiqi Mao Enbo Yu +1 位作者 Guoji Xu Xiaozhen Li 《Computer Modeling in Engineering & Sciences》 2026年第1期781-811,共31页
Accurate wind speed prediction is crucial for stabilizing power grids with high wind energy penetration.This study presents a novel machine learning model that integrates clustering,deep learning,and transfer learning... Accurate wind speed prediction is crucial for stabilizing power grids with high wind energy penetration.This study presents a novel machine learning model that integrates clustering,deep learning,and transfer learning to mitigate accuracy degradation in 24-h forecasting.Initially,an optimized DB-SCAN(Density-Based Spatial Clustering of Applications with Noise)algorithm clusters wind fields based on wind direction,probability density,and spectral features,enhancing physical interpretability and reducing training complexity.Subsequently,a ResNet(Residual Network)extracts multi-scale patterns from decomposed wind signals,while transfer learning adapts the backbone network across clusters,cutting training time by over 90%.Finally,a CBAM(Convolutional Block Attention Module)attention mechanism is employed to prioritize features for LSTM-based prediction.Tested on the 2015 Jena wind speed dataset,the model demonstrates superior accuracy and robustness compared to state-of-the-art baselines.Key innovations include:(a)Physics-informed clustering for interpretable wind regime classification;(b)Transfer learning with deep feature extraction,preserving accuracy while minimizing training time;and(c)On the 2016 Jena wind speed dataset,the model achieves MAPE(Mean Absolute Percentage Error)values of 16.82%and 18.02%for the Weibull-shaped and Gaussian-shaped wind speed clusters,respectively,demonstrating the model’s robust generalization capacity.This framework offers an efficient and effective solution for long-term wind forecasting. 展开更多
关键词 Wind speed prediction residual network transfer learning long short-term memory attention mechanism
在线阅读 下载PDF
SIM-Net:A Multi-Scale Attention-Guided Deep Learning Framework for High-Precision PCB Defect Detection
6
作者 Ping Fang Mengjun Tong 《Computers, Materials & Continua》 2026年第4期1754-1770,共17页
Defect detection in printed circuit boards(PCB)remains challenging due to the difficulty of identifying small-scale defects,the inefficiency of conventional approaches,and the interference from complex backgrounds.To ... Defect detection in printed circuit boards(PCB)remains challenging due to the difficulty of identifying small-scale defects,the inefficiency of conventional approaches,and the interference from complex backgrounds.To address these issues,this paper proposes SIM-Net,an enhanced detection framework derived from YOLOv11.The model integrates SPDConv to preserve fine-grained features for small object detection,introduces a novel convolutional partial attention module(C2PAM)to suppress redundant background information and highlight salient regions,and employs a multi-scale fusion network(MFN)with a multi-grain contextual module(MGCT)to strengthen contextual representation and accelerate inference.Experimental evaluations demonstrate that SIM-Net achieves 92.4%mAP,92%accuracy,and 89.4%recall with an inference speed of 75.1 FPS,outperforming existing state-of-the-art methods.These results confirm the robustness and real-time applicability of SIM-Net for PCB defect inspection. 展开更多
关键词 Deep learning small object detection PCB defect detection attention mechanism multi-scale fusion network
在线阅读 下载PDF
An Integrated Attention-BiLSTM Approach for Probabilistic Remaining Useful Life Prediction
7
作者 Bo Zhu Enzhi Dong +3 位作者 Zhonghua Cheng Kexin Jiang Chiming Guo Shuai Yue 《Computers, Materials & Continua》 2026年第4期966-984,共19页
Accurate prediction of remaining useful life serves as a reliable basis for maintenance strategies,effectively reducing both the frequency of failures and associated costs.As a core component of PHM,RUL prediction pla... Accurate prediction of remaining useful life serves as a reliable basis for maintenance strategies,effectively reducing both the frequency of failures and associated costs.As a core component of PHM,RUL prediction plays a crucial role in preventing equipment failures and optimizing maintenance decision-making.However,deep learning models often falter when processing raw,noisy temporal signals,fail to quantify prediction uncertainty,and face challenges in effectively capturing the nonlinear dynamics of equipment degradation.To address these issues,this study proposes a novel deep learning framework.First,a newbidirectional long short-termmemory network integrated with an attention mechanism is designed to enhance temporal feature extraction with improved noise robustness.Second,a probabilistic prediction framework based on kernel density estimation is constructed,incorporating residual connections and stochastic regularization to achieve precise RUL estimation.Finally,extensive experiments on the C-MAPSS dataset demonstrate that our method achieves competitive performance in terms of RMSE and Score metrics compared to state-of-the-artmodels.More importantly,the probabilistic output provides a quantifiablemeasure of prediction confidence,which is crucial for risk-informed maintenance planning,enabling managers to optimize maintenance strategies based on a quantifiable understanding of failure risk. 展开更多
关键词 Bidirectional long short-term memory network attention mechanism kernel density estimation remaining useful life prediction
在线阅读 下载PDF
Feature pyramid attention network for audio-visual scene classification 被引量:1
8
作者 Liguang Zhou Yuhongze Zhou +3 位作者 Xiaonan Qi Junjie Hu Tin Lun Lam Yangsheng Xu 《CAAI Transactions on Intelligence Technology》 2025年第2期359-374,共16页
Audio-visual scene classification(AVSC)poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals,coupled with the complex spatial patterns of objects and text... Audio-visual scene classification(AVSC)poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals,coupled with the complex spatial patterns of objects and textures found in visual images.The focus of recent studies has predominantly revolved around extracting features from diverse neural network structures,inadvertently neglecting the acquisition of semantically meaningful regions and crucial components within audio-visual data.The authors present a feature pyramid attention network(FPANet)for audio-visual scene understanding,which extracts semantically significant characteristics from audio-visual data.The authors’approach builds multi-scale hierarchical features of sound spectrograms and visual images using a feature pyramid representation and localises the semantically relevant regions with a feature pyramid attention module(FPAM).A dimension alignment(DA)strategy is employed to align feature maps from multiple layers,a pyramid spatial attention(PSA)to spatially locate essential regions,and a pyramid channel attention(PCA)to pinpoint significant temporal frames.Experiments on visual scene classification(VSC),audio scene classification(ASC),and AVSC tasks demonstrate that FPANet achieves performance on par with state-of-the-art(SOTA)approaches,with a 95.9 F1-score on the ADVANCE dataset and a relative improvement of 28.8%.Visualisation results show that FPANet can prioritise semantically meaningful areas in audio-visual signals. 展开更多
关键词 dimension alignment feature pyramid attention network pyramid channel attention pyramid spatial attention semantic relevant regions
在线阅读 下载PDF
基于自适应Kalman滤波与GWO-LSTM-Attention的温室温湿度预测方法
9
作者 蔡玉琴 刘大铭 +2 位作者 徐琴 李波洋 刘博杰 《智慧农业(中英文)》 2026年第1期148-155,共8页
[目的/意义]针对温室温湿度预测中多传感器数据融合可靠性低、传统模型忽略温湿度动态耦合,以及参数调优依赖人工经验等问题。[方法]首先,对传统卡尔曼(Kalman)滤波算法实施改进,通过动态调整过程噪声协方差和观测噪声协方差,结合新息... [目的/意义]针对温室温湿度预测中多传感器数据融合可靠性低、传统模型忽略温湿度动态耦合,以及参数调优依赖人工经验等问题。[方法]首先,对传统卡尔曼(Kalman)滤波算法实施改进,通过动态调整过程噪声协方差和观测噪声协方差,结合新息方差动态分配多传感器权重。其次,针对温湿度的强耦合性及其协同控制的需求,构建多输出长短期记忆-注意力机制(Long Short-Term Memory-Attention,LSTM-Attention)模型,以温湿度协同预测为目标,引入注意力机制自适应加权关键环境因子,并采用灰狼优化算法(Grey Wolf Optimizer,GWO)自动对超参数进行寻优。[结果和讨论]提出的自适应卡尔曼滤波算法在多点温湿度融合中的平均绝对偏差分别为1.59℃和8.64%,比传统卡尔曼滤波算法分别降低1.24%、8.57%。以该算法融合结果作为模型训练集,模型在温湿度预测中决定系数R2分别达到98.2%和99.3%,比传统Kalman提升4.7%和4.3%。GWO-LSTM-Atten⁃tion模型的温湿度预测均方根误差分别为0.7768℃和2.0564%,比LSTM、LSTM-Attention时间序列预测模型分别降低15.6%、6.6%,湿度分别降低29.2%、5.7%。[结论]提出的自适应卡尔曼融合算法能够有效抑制异常值影响,可在非平稳环境变化下实现多传感器数据可靠融合。在温室多环境因子预测中,GWO-LSTM-Attention模型温湿度预测值在未来可作为控制温室环境的重要参考,进而实现对温室环境的实时调控。 展开更多
关键词 日光温室 卡尔曼滤波 灰狼优化算法 长短期记忆神经网络 注意力机制
在线阅读 下载PDF
Machine Learning Enabled Reusable Adhesion,Entangled Network‑Based Hydrogel for Long‑Term,High‑Fidelity EEG Recording and Attention Assessment 被引量:1
10
作者 Kai Zheng Chengcheng Zheng +9 位作者 Lixian Zhu Bihai Yang Xiaokun Jin Su Wang Zikai Song Jingyu Liu Yan Xiong Fuze Tian Ran Cai Bin Hu 《Nano-Micro Letters》 2025年第11期514-529,共16页
Due to their high mechanical compliance and excellent biocompatibility,conductive hydrogels exhibit significant potential for applications in flexible electronics.However,as the demand for high sensitivity,superior me... Due to their high mechanical compliance and excellent biocompatibility,conductive hydrogels exhibit significant potential for applications in flexible electronics.However,as the demand for high sensitivity,superior mechanical properties,and strong adhesion performance continues to grow,many conventional fabrication methods remain complex and costly.Herein,we propose a simple and efficient strategy to construct an entangled network hydrogel through a liquid-metal-induced cross-linking reaction,hydrogel demonstrates outstanding properties,including exceptional stretchability(1643%),high tensile strength(366.54 kPa),toughness(350.2 kJ m^(−3)),and relatively low mechanical hysteresis.The hydrogel exhibits long-term stable reusable adhesion(104 kPa),enabling conformal and stable adhesion to human skin.This capability allows it to effectively capture high-quality epidermal electrophysiological signals with high signal-to-noise ratio(25.2 dB)and low impedance(310 ohms).Furthermore,by integrating advanced machine learning algorithms,achieving an attention classification accuracy of 91.38%,which will significantly impact fields like education,healthcare,and artificial intelligence. 展开更多
关键词 Entangled network Reusable adhesion Epidermal sensor Machine learning attention assessment
在线阅读 下载PDF
Multi-Head Attention Enhanced Parallel Dilated Convolution and Residual Learning for Network Traffic Anomaly Detection 被引量:1
11
作者 Guorong Qi Jian Mao +2 位作者 Kai Huang Zhengxian You Jinliang Lin 《Computers, Materials & Continua》 2025年第2期2159-2176,共18页
Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract loc... Abnormal network traffic, as a frequent security risk, requires a series of techniques to categorize and detect it. Existing network traffic anomaly detection still faces challenges: the inability to fully extract local and global features, as well as the lack of effective mechanisms to capture complex interactions between features;Additionally, when increasing the receptive field to obtain deeper feature representations, the reliance on increasing network depth leads to a significant increase in computational resource consumption, affecting the efficiency and performance of detection. Based on these issues, firstly, this paper proposes a network traffic anomaly detection model based on parallel dilated convolution and residual learning (Res-PDC). To better explore the interactive relationships between features, the traffic samples are converted into two-dimensional matrix. A module combining parallel dilated convolutions and residual learning (res-pdc) was designed to extract local and global features of traffic at different scales. By utilizing res-pdc modules with different dilation rates, we can effectively capture spatial features at different scales and explore feature dependencies spanning wider regions without increasing computational resources. Secondly, to focus and integrate the information in different feature subspaces, further enhance and extract the interactions among the features, multi-head attention is added to Res-PDC, resulting in the final model: multi-head attention enhanced parallel dilated convolution and residual learning (MHA-Res-PDC) for network traffic anomaly detection. Finally, comparisons with other machine learning and deep learning algorithms are conducted on the NSL-KDD and CIC-IDS-2018 datasets. The experimental results demonstrate that the proposed method in this paper can effectively improve the detection performance. 展开更多
关键词 network traffic anomaly detection multi-head attention parallel dilated convolution residual learning
在线阅读 下载PDF
SA-ResNet:An Intrusion Detection Method Based on Spatial Attention Mechanism and Residual Neural Network Fusion 被引量:1
12
作者 Zengyu Cai Yuming Dai +1 位作者 Jianwei Zhang Yuan Feng 《Computers, Materials & Continua》 2025年第5期3335-3350,共16页
The rapid development and widespread adoption of Internet technology have significantly increased Internet traffic,highlighting the growing importance of network security.Intrusion Detection Systems(IDS)are essential ... The rapid development and widespread adoption of Internet technology have significantly increased Internet traffic,highlighting the growing importance of network security.Intrusion Detection Systems(IDS)are essential for safeguarding network integrity.To address the low accuracy of existing intrusion detection models in identifying network attacks,this paper proposes an intrusion detection method based on the fusion of Spatial Attention mechanism and Residual Neural Network(SA-ResNet).Utilizing residual connections can effectively capture local features in the data;by introducing a spatial attention mechanism,the global dependency relationships of intrusion features can be extracted,enhancing the intrusion recognition model’s focus on the global features of intrusions,and effectively improving the accuracy of intrusion recognition.The proposed model in this paper was experimentally verified on theNSL-KDD dataset.The experimental results showthat the intrusion recognition accuracy of the intrusion detection method based on SA-ResNet has reached 99.86%,and its overall accuracy is 0.41% higher than that of traditional Convolutional Neural Network(CNN)models. 展开更多
关键词 Intrusion detection deep learning residual neural network spatial attention mechanism
在线阅读 下载PDF
一种基于Attention-TCN的跌倒预测算法
13
作者 王宏宇 潘巨龙 +1 位作者 周辰 宋炜 《传感技术学报》 北大核心 2026年第1期58-65,共8页
随着全球人口老龄化的加剧,老年人的跌倒事件日益频发,严重威胁他们的身心健康。尽管已有许多研究致力于跌倒检测与预测,但大多数研究主要分析惯性传感器获取的时序数据的时间特征,对空间特征方面研究不足。为更有效地预测老年人跌倒事... 随着全球人口老龄化的加剧,老年人的跌倒事件日益频发,严重威胁他们的身心健康。尽管已有许多研究致力于跌倒检测与预测,但大多数研究主要分析惯性传感器获取的时序数据的时间特征,对空间特征方面研究不足。为更有效地预测老年人跌倒事件并及时激活保护装置(如安全气囊),提出了一种基于Attention-TCN的跌倒预测算法,结合注意力机制和时序卷积网络(TCN),提取跌倒时序数据的全局时间、空间特征,并通过自适应特征融合方法自主融合时空特征,为跌倒预测提供精准依据。同时,利用下采样技术提高模型预测性能,减小了模型的大小和推理时间。在SisFall公开跌倒数据集上进行的PC端离线实验中,该算法取得了98.67%的准确率、98.89%的敏感度和98.52%的特异度,跌倒预测平均前置时间为221.16 ms,模型推理时间为0.19±0.05 ms,模型大小为673 KB,验证了所提算法在预测老年人跌倒事件中的高效性和实用性。 展开更多
关键词 深度学习 跌倒预测 时序卷积网络 注意力机制 惯性传感器
在线阅读 下载PDF
M2ANet:Multi-branch and multi-scale attention network for medical image segmentation 被引量:1
14
作者 Wei Xue Chuanghui Chen +3 位作者 Xuan Qi Jian Qin Zhen Tang Yongsheng He 《Chinese Physics B》 2025年第8期547-559,共13页
Convolutional neural networks(CNNs)-based medical image segmentation technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities.However,due to ... Convolutional neural networks(CNNs)-based medical image segmentation technologies have been widely used in medical image segmentation because of their strong representation and generalization abilities.However,due to the inability to effectively capture global information from images,CNNs can easily lead to loss of contours and textures in segmentation results.Notice that the transformer model can effectively capture the properties of long-range dependencies in the image,and furthermore,combining the CNN and the transformer can effectively extract local details and global contextual features of the image.Motivated by this,we propose a multi-branch and multi-scale attention network(M2ANet)for medical image segmentation,whose architecture consists of three components.Specifically,in the first component,we construct an adaptive multi-branch patch module for parallel extraction of image features to reduce information loss caused by downsampling.In the second component,we apply residual block to the well-known convolutional block attention module to enhance the network’s ability to recognize important features of images and alleviate the phenomenon of gradient vanishing.In the third component,we design a multi-scale feature fusion module,in which we adopt adaptive average pooling and position encoding to enhance contextual features,and then multi-head attention is introduced to further enrich feature representation.Finally,we validate the effectiveness and feasibility of the proposed M2ANet method through comparative experiments on four benchmark medical image segmentation datasets,particularly in the context of preserving contours and textures. 展开更多
关键词 medical image segmentation convolutional neural network multi-branch attention multi-scale feature fusion
原文传递
3D medical image segmentation using the serial-parallel convolutional neural network and transformer based on crosswindow self-attention 被引量:1
15
作者 Bin Yu Quan Zhou +3 位作者 Li Yuan Huageng Liang Pavel Shcherbakov Xuming Zhang 《CAAI Transactions on Intelligence Technology》 2025年第2期337-348,共12页
Convolutional neural network(CNN)with the encoder-decoder structure is popular in medical image segmentation due to its excellent local feature extraction ability but it faces limitations in capturing the global featu... Convolutional neural network(CNN)with the encoder-decoder structure is popular in medical image segmentation due to its excellent local feature extraction ability but it faces limitations in capturing the global feature.The transformer can extract the global information well but adapting it to small medical datasets is challenging and its computational complexity can be heavy.In this work,a serial and parallel network is proposed for the accurate 3D medical image segmentation by combining CNN and transformer and promoting feature interactions across various semantic levels.The core components of the proposed method include the cross window self-attention based transformer(CWST)and multi-scale local enhanced(MLE)modules.The CWST module enhances the global context understanding by partitioning 3D images into non-overlapping windows and calculating sparse global attention between windows.The MLE module selectively fuses features by computing the voxel attention between different branch features,and uses convolution to strengthen the dense local information.The experiments on the prostate,atrium,and pancreas MR/CT image datasets consistently demonstrate the advantage of the proposed method over six popular segmentation models in both qualitative evaluation and quantitative indexes such as dice similarity coefficient,Intersection over Union,95%Hausdorff distance and average symmetric surface distance. 展开更多
关键词 convolution neural network cross window self‐attention medical image segmentation transformer
在线阅读 下载PDF
Dual networks with hierarchical attention for fine-grained image classification
16
作者 YANG Tao WANG Gaihua 《中国科学院大学学报(中英文)》 北大核心 2025年第6期806-813,共8页
In this paper,we propose hierarchical attention dual network(DNet)for fine-grained image classification.The DNet can randomly select pairs of inputs from the dataset and compare the differences between them through hi... In this paper,we propose hierarchical attention dual network(DNet)for fine-grained image classification.The DNet can randomly select pairs of inputs from the dataset and compare the differences between them through hierarchical attention feature learning,which are used simultaneously to remove noise and retain salient features.In the loss function,it considers the losses of difference in paired images according to the intra-variance and inter-variance.In addition,we also collect the disaster scene dataset from remote sensing images and apply the proposed method to disaster scene classification,which contains complex scenes and multiple types of disasters.Compared to other methods,experimental results show that the DNet with hierarchical attention is robust to different datasets and performs better. 展开更多
关键词 dual network(DNet) fine-grained image classification hierarchical attention features
在线阅读 下载PDF
基于MSCNN+Attention模型的轴承故障诊断方法研究
17
作者 付志鹏 么洪飞 《齐齐哈尔大学学报(自然科学版)》 2026年第1期9-16,43,共9页
针对传统故障诊断方法特征提取能力不足以及诊断精度低的问题,提出一种融合通道注意力与自注意力机制的轴承故障诊断模型。该模型通过多层卷积与注意力机制提取关键特征,并利用自注意力模块进行全局特征融合,构建残差结构增强特征表达能... 针对传统故障诊断方法特征提取能力不足以及诊断精度低的问题,提出一种融合通道注意力与自注意力机制的轴承故障诊断模型。该模型通过多层卷积与注意力机制提取关键特征,并利用自注意力模块进行全局特征融合,构建残差结构增强特征表达能力,诊断模型通过Softmax分类器识别故障。通过凯斯西储大学的轴承数据验证窗口长度与优化器选择的合理性,结果表明,当窗口长度为1024,采用Adam优化器(学习率0.001)时模型性能最佳。通过准确率、ROC曲线和混淆矩阵指标对模型性能进行全面评估。实验结果显示,模型的故障识别准确率达99.4%~100%,显著优于RF模型(96.8%)、GRU模型(97.5%)和LSTM模型(92.3%),在窗口长度为1024时,分类准确率提升最明显,且AUC均超过0.99,综合分析表明该模型的特征提取能力和诊断精度相比传统模型显著提升。 展开更多
关键词 注意力机制 滚动轴承 特征提取 卷积神经网络
在线阅读 下载PDF
基于CNN-LSTM-Attention的公共建筑能耗预测
18
作者 张子瑞 郭伟 《机电工程技术》 2026年第2期67-72,共6页
随着城市化进程的不断加速,公共建筑能耗问题日益成为社会各界关注的焦点。准确预测公共建筑能耗,对于优化能源管理、提高能源利用效率具有不可估量的重要性。传统的能耗预测方法往往受限于数据处理的复杂性和模型的局限性,难以满足当... 随着城市化进程的不断加速,公共建筑能耗问题日益成为社会各界关注的焦点。准确预测公共建筑能耗,对于优化能源管理、提高能源利用效率具有不可估量的重要性。传统的能耗预测方法往往受限于数据处理的复杂性和模型的局限性,难以满足当前复杂多变的能耗预测需求,深度学习技术的快速发展为能耗预测提供了新的解决方案。结合CNN的空间特征提取能力和LSTM的时间序列处理能力,并引入注意力机制以增强模型对关键信息的捕捉能力,构建了CNN-LSTM-Attention公共建筑能耗预测模型。实验结果表明,CNN-LSTM-Attention的loss值在训练轮次达到70次之后,基本稳定在一个较低的水平,最终稳点在0.0002左右,并与其他模型对比后评价指标都是最优,其中MAPE为0.0052,R^(2)为0.9977,充分证明了其在公共建筑能耗预测中的精确度。 展开更多
关键词 公共建筑能耗 卷积神经网络 长短期记忆网络 注意力机制
在线阅读 下载PDF
基于VMD-CNN-BiGRU-Attention的澳门地区城市淹没水位预测模型研究
19
作者 唐静 许鹏飞 +1 位作者 郑泳杰 李栋 《海洋预报》 北大核心 2026年第1期36-44,共9页
澳门地区由于地理位置特殊、气候湿润以及高度城市化的特点,在台风风暴潮期间频繁遭受由降水、河道洪水、倒灌海水等多因素叠加引发的复合型洪涝事件。为了精准分析这类灾害下的城市淹没水位的变化规律,提出一种基于变分模态分解(VMD)... 澳门地区由于地理位置特殊、气候湿润以及高度城市化的特点,在台风风暴潮期间频繁遭受由降水、河道洪水、倒灌海水等多因素叠加引发的复合型洪涝事件。为了精准分析这类灾害下的城市淹没水位的变化规律,提出一种基于变分模态分解(VMD)、卷积神经网络(CNN)、双向门控循环单元(BiGRU)和注意力机制的城市淹没水位预测模型。首先,采用VMD对历史城市淹没水位数据进行分解,得到一系列相对平稳的子序列;然后,将CNN用于对多环境因素数据进行特征提取;特征提取完成后,使用BiGRU网络进行双向循环训练;最后,通过注意力机制为BiGRU输出分配相应权重,并加权求和得到最终的城市淹没水位预测结果。实验结果表明,该模型在城市淹没水位变化预测中表现优异。 展开更多
关键词 风暴潮 城市淹没水位预测 变分模态分解 卷积神经网络 双向门控制循环单元 注意力机制
在线阅读 下载PDF
Infrared road object detection algorithm based on spatial depth channel attention network and improved YOLOv8
20
作者 LI Song SHI Tao +1 位作者 JING Fangke CUI Jie 《Optoelectronics Letters》 2025年第8期491-498,共8页
Aiming at the problems of low detection accuracy and large model size of existing object detection algorithms applied to complex road scenes,an improved you only look once version 8(YOLOv8)object detection algorithm f... Aiming at the problems of low detection accuracy and large model size of existing object detection algorithms applied to complex road scenes,an improved you only look once version 8(YOLOv8)object detection algorithm for infrared images,F-YOLOv8,is proposed.First,a spatial-to-depth network replaces the traditional backbone network's strided convolution or pooling layer.At the same time,it combines with the channel attention mechanism so that the neural network focuses on the channels with large weight values to better extract low-resolution image feature information;then an improved feature pyramid network of lightweight bidirectional feature pyramid network(L-BiFPN)is proposed,which can efficiently fuse features of different scales.In addition,a loss function of insertion of union based on the minimum point distance(MPDIoU)is introduced for bounding box regression,which obtains faster convergence speed and more accurate regression results.Experimental results on the FLIR dataset show that the improved algorithm can accurately detect infrared road targets in real time with 3%and 2.2%enhancement in mean average precision at 50%IoU(mAP50)and mean average precision at 50%—95%IoU(mAP50-95),respectively,and 38.1%,37.3%and 16.9%reduction in the number of model parameters,the model weight,and floating-point operations per second(FLOPs),respectively.To further demonstrate the detection capability of the improved algorithm,it is tested on the public dataset PASCAL VOC,and the results show that F-YOLO has excellent generalized detection performance. 展开更多
关键词 feature pyramid network infrared road object detection infrared imagesf yolov backbone networks channel attention mechanism spatial depth channel attention network object detection improved YOLOv
原文传递
上一页 1 2 250 下一页 到第
使用帮助 返回顶部