期刊文献+
共找到528篇文章
< 1 2 27 >
每页显示 20 50 100
SA-ResNet:An Intrusion Detection Method Based on Spatial Attention Mechanism and Residual Neural Network Fusion 被引量:1
1
作者 Zengyu Cai Yuming Dai +1 位作者 Jianwei Zhang Yuan Feng 《Computers, Materials & Continua》 2025年第5期3335-3350,共16页
The rapid development and widespread adoption of Internet technology have significantly increased Internet traffic,highlighting the growing importance of network security.Intrusion Detection Systems(IDS)are essential ... The rapid development and widespread adoption of Internet technology have significantly increased Internet traffic,highlighting the growing importance of network security.Intrusion Detection Systems(IDS)are essential for safeguarding network integrity.To address the low accuracy of existing intrusion detection models in identifying network attacks,this paper proposes an intrusion detection method based on the fusion of Spatial Attention mechanism and Residual Neural Network(SA-ResNet).Utilizing residual connections can effectively capture local features in the data;by introducing a spatial attention mechanism,the global dependency relationships of intrusion features can be extracted,enhancing the intrusion recognition model’s focus on the global features of intrusions,and effectively improving the accuracy of intrusion recognition.The proposed model in this paper was experimentally verified on theNSL-KDD dataset.The experimental results showthat the intrusion recognition accuracy of the intrusion detection method based on SA-ResNet has reached 99.86%,and its overall accuracy is 0.41% higher than that of traditional Convolutional Neural Network(CNN)models. 展开更多
关键词 Intrusion detection deep learning residual neural network spatial attention mechanism
在线阅读 下载PDF
Infrared road object detection algorithm based on spatial depth channel attention network and improved YOLOv8
2
作者 LI Song SHI Tao +1 位作者 JING Fangke CUI Jie 《Optoelectronics Letters》 2025年第8期491-498,共8页
Aiming at the problems of low detection accuracy and large model size of existing object detection algorithms applied to complex road scenes,an improved you only look once version 8(YOLOv8)object detection algorithm f... Aiming at the problems of low detection accuracy and large model size of existing object detection algorithms applied to complex road scenes,an improved you only look once version 8(YOLOv8)object detection algorithm for infrared images,F-YOLOv8,is proposed.First,a spatial-to-depth network replaces the traditional backbone network's strided convolution or pooling layer.At the same time,it combines with the channel attention mechanism so that the neural network focuses on the channels with large weight values to better extract low-resolution image feature information;then an improved feature pyramid network of lightweight bidirectional feature pyramid network(L-BiFPN)is proposed,which can efficiently fuse features of different scales.In addition,a loss function of insertion of union based on the minimum point distance(MPDIoU)is introduced for bounding box regression,which obtains faster convergence speed and more accurate regression results.Experimental results on the FLIR dataset show that the improved algorithm can accurately detect infrared road targets in real time with 3%and 2.2%enhancement in mean average precision at 50%IoU(mAP50)and mean average precision at 50%—95%IoU(mAP50-95),respectively,and 38.1%,37.3%and 16.9%reduction in the number of model parameters,the model weight,and floating-point operations per second(FLOPs),respectively.To further demonstrate the detection capability of the improved algorithm,it is tested on the public dataset PASCAL VOC,and the results show that F-YOLO has excellent generalized detection performance. 展开更多
关键词 feature pyramid network infrared road object detection infrared imagesf yolov backbone networks channel attention mechanism spatial depth channel attention network object detection improved YOLOv
原文传递
Feature pyramid attention network for audio-visual scene classification 被引量:1
3
作者 Liguang Zhou Yuhongze Zhou +3 位作者 Xiaonan Qi Junjie Hu Tin Lun Lam Yangsheng Xu 《CAAI Transactions on Intelligence Technology》 2025年第2期359-374,共16页
Audio-visual scene classification(AVSC)poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals,coupled with the complex spatial patterns of objects and text... Audio-visual scene classification(AVSC)poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals,coupled with the complex spatial patterns of objects and textures found in visual images.The focus of recent studies has predominantly revolved around extracting features from diverse neural network structures,inadvertently neglecting the acquisition of semantically meaningful regions and crucial components within audio-visual data.The authors present a feature pyramid attention network(FPANet)for audio-visual scene understanding,which extracts semantically significant characteristics from audio-visual data.The authors’approach builds multi-scale hierarchical features of sound spectrograms and visual images using a feature pyramid representation and localises the semantically relevant regions with a feature pyramid attention module(FPAM).A dimension alignment(DA)strategy is employed to align feature maps from multiple layers,a pyramid spatial attention(PSA)to spatially locate essential regions,and a pyramid channel attention(PCA)to pinpoint significant temporal frames.Experiments on visual scene classification(VSC),audio scene classification(ASC),and AVSC tasks demonstrate that FPANet achieves performance on par with state-of-the-art(SOTA)approaches,with a 95.9 F1-score on the ADVANCE dataset and a relative improvement of 28.8%.Visualisation results show that FPANet can prioritise semantically meaningful areas in audio-visual signals. 展开更多
关键词 dimension alignment feature pyramid attention network pyramid channel attention pyramid spatial attention semantic relevant regions
在线阅读 下载PDF
Integrating multi-modal information to detect spatial domains of spatial transcriptomics by graph attention network 被引量:1
4
作者 Yuying Huo Yilang Guo +4 位作者 Jiakang Wang Huijie Xue Yujuan Feng Weizheng Chen Xiangyu Li 《Journal of Genetics and Genomics》 SCIE CAS CSCD 2023年第9期720-733,共14页
Recent advances in spatially resolved transcriptomic technologies have enabled unprecedented opportunities to elucidate tissue architecture and function in situ.Spatial transcriptomics can provide multimodal and compl... Recent advances in spatially resolved transcriptomic technologies have enabled unprecedented opportunities to elucidate tissue architecture and function in situ.Spatial transcriptomics can provide multimodal and complementary information simultaneously,including gene expression profiles,spatial locations,and histology images.However,most existing methods have limitations in efficiently utilizing spatial information and matched high-resolution histology images.To fully leverage the multi-modal information,we propose a SPAtially embedded Deep Attentional graph Clustering(SpaDAC)method to identify spatial domains while reconstructing denoised gene expression profiles.This method can efficiently learn the low-dimensional embeddings for spatial transcriptomics data by constructing multi-view graph modules to capture both spatial location connectives and morphological connectives.Benchmark results demonstrate that SpaDAC outperforms other algorithms on several recent spatial transcriptomics datasets.SpaDAC is a valuable tool for spatial domain detection,facilitating the comprehension of tissue architecture and cellular microenvironment.The source code of SpaDAC is freely available at Github(https://github.com/huoyuying/SpaDAC.git). 展开更多
关键词 spatialtranscriptomics spatial domaindetection Multi-modal integration Graph attention network
原文传递
考虑空间相关性的MSCNN LSTM Attention能见度预测模型 被引量:1
5
作者 王小建 苏彤 +6 位作者 马飞 林智婕 白元旦 郭庆元 魏俊涛 黄凯 徐玉凤 《安全与环境学报》 北大核心 2025年第4期1622-1632,共11页
准确预测能见度对保障交通运输安全具有重要意义。针对现有方法在能见度预测时对影响因素空间相关性考虑不足导致预测精度较低的问题,研究构建了一种考虑空间相关性的能见度预测模型。利用一维多尺度卷积神经网络(Multi-Scale Convoluti... 准确预测能见度对保障交通运输安全具有重要意义。针对现有方法在能见度预测时对影响因素空间相关性考虑不足导致预测精度较低的问题,研究构建了一种考虑空间相关性的能见度预测模型。利用一维多尺度卷积神经网络(Multi-Scale Convolutional Neural Network, MSCNN)提取能见度以预测各影响因素下不同精细度的空间特征,并将其进行线性融合得到多因素空间特征,实现对能见度预测影响因素的空间特征提取;利用Attention机制加强对关键信息关注的优势以对长短期记忆神经网络(Long-Short Term Memory Neural Network, LSTM)方法进行改进,进而增强模型对重要时序信息关注的能力和模型预测的准确性,实现在考虑影响因素空间相关性下对能见度的预测。以2021—2023年西安市逐时气象数据和污染物数据为试验数据,采用均方根误差(RMSE)、平均绝对误差(MAE)和R2指标对模型进行评价。试验结果显示,研究模型MAE下降26.3%~39.1%,RMSE下降25%~40%,R2提升3.7%~16.4%,能见度预测精度较高。 展开更多
关键词 环境科学技术基础学科 能见度预测 空间相关性 一维多尺度卷积神经网络 长短期记忆神经网络 注意力机制
原文传递
DCA-YOLO:Detection Algorithm for YOLOv8 Pulmonary Nodules Based on Attention Mechanism Optimization 被引量:2
6
作者 SONG Yongsheng LIU Guohua 《Journal of Donghua University(English Edition)》 2025年第1期78-87,共10页
Pulmonary nodules represent an early manifestation of lung cancer.However,pulmonary nodules only constitute a small portion of the overall image,posing challenges for physicians in image interpretation and potentially... Pulmonary nodules represent an early manifestation of lung cancer.However,pulmonary nodules only constitute a small portion of the overall image,posing challenges for physicians in image interpretation and potentially leading to false positives or missed detections.To solve these problems,the YOLOv8 network is enhanced by adding deformable convolution and atrous spatial pyramid pooling(ASPP),along with the integration of a coordinate attention(CA)mechanism.This allows the network to focus on small targets while expanding the receptive field without losing resolution.At the same time,context information on the target is gathered and feature expression is enhanced by attention modules in different directions.It effectively improves the positioning accuracy and achieves good results on the LUNA16 dataset.Compared with other detection algorithms,it improves the accuracy of pulmonary nodule detection to a certain extent. 展开更多
关键词 pulmonary nodule YOLOv8 network object detection deformable convolution atrous spatial pyramid pooling(ASPP) coordinate attention(CA)mechanism
在线阅读 下载PDF
An attention-based prototypical network for forest fire smoke few-shot detection 被引量:3
7
作者 Tingting Li Haowei Zhu +1 位作者 Chunhe Hu Junguo Zhang 《Journal of Forestry Research》 SCIE CAS CSCD 2022年第5期1493-1504,共12页
Existing almost deep learning methods rely on a large amount of annotated data, so they are inappropriate for forest fire smoke detection with limited data. In this paper, a novel hybrid attention-based few-shot learn... Existing almost deep learning methods rely on a large amount of annotated data, so they are inappropriate for forest fire smoke detection with limited data. In this paper, a novel hybrid attention-based few-shot learning method, named Attention-Based Prototypical Network, is proposed for forest fire smoke detection. Specifically, feature extraction network, which consists of convolutional block attention module, could extract high-level and discriminative features and further decrease the false alarm rate resulting from suspected smoke areas. Moreover, we design a metalearning module to alleviate the overfitting issue caused by limited smoke images, and the meta-learning network enables achieving effective detection via comparing the distance between the class prototype of support images and the features of query images. A series of experiments on forest fire smoke datasets and miniImageNet dataset testify that the proposed method is superior to state-of-the-art few-shot learning approaches. 展开更多
关键词 Forest fire smoke detection Few-shot learning Channel attention module spatial attention module Prototypical network
在线阅读 下载PDF
Image Inpainting Detection Based on High-Pass Filter Attention Network
8
作者 Can Xiao Feng Li +3 位作者 Dengyong Zhang Pu Huang Xiangling Ding Victor S.Sheng 《Computer Systems Science & Engineering》 SCIE EI 2022年第12期1145-1154,共10页
Image inpainting based on deep learning has been greatly improved.The original purpose of image inpainting was to repair some broken photos, suchas inpainting artifacts. However, it may also be used for malicious oper... Image inpainting based on deep learning has been greatly improved.The original purpose of image inpainting was to repair some broken photos, suchas inpainting artifacts. However, it may also be used for malicious operations,such as destroying evidence. Therefore, detection and localization of imageinpainting operations are essential. Recent research shows that high-pass filteringfull convolutional network (HPFCN) is applied to image inpainting detection andachieves good results. However, those methods did not consider the spatial location and channel information of the feature map. To solve these shortcomings, weintroduce the squeezed excitation blocks (SE) and propose a high-pass filter attention full convolutional network (HPACN). In feature extraction, we apply concurrent spatial and channel attention (scSE) to enhance feature extraction and obtainmore information. Channel attention (cSE) is introduced in upsampling toenhance detection and localization. The experimental results show that the proposed method can achieve improvement on ImageNet. 展开更多
关键词 Image inpainting detection spatial attention channel attention full convolutional network high-pass filter
在线阅读 下载PDF
Lightweight Cross-Modal Multispectral Pedestrian Detection Based on Spatial Reweighted Attention Mechanism
9
作者 Lujuan Deng Ruochong Fu +3 位作者 Zuhe Li Boyi Liu Mengze Xue Yuhao Cui 《Computers, Materials & Continua》 SCIE EI 2024年第3期4071-4089,共19页
Multispectral pedestrian detection technology leverages infrared images to provide reliable information for visible light images, demonstrating significant advantages in low-light conditions and background occlusion s... Multispectral pedestrian detection technology leverages infrared images to provide reliable information for visible light images, demonstrating significant advantages in low-light conditions and background occlusion scenarios. However, while continuously improving cross-modal feature extraction and fusion, ensuring the model’s detection speed is also a challenging issue. We have devised a deep learning network model for cross-modal pedestrian detection based on Resnet50, aiming to focus on more reliable features and enhance the model’s detection efficiency. This model employs a spatial attention mechanism to reweight the input visible light and infrared image data, enhancing the model’s focus on different spatial positions and sharing the weighted feature data across different modalities, thereby reducing the interference of multi-modal features. Subsequently, lightweight modules with depthwise separable convolution are incorporated to reduce the model’s parameter count and computational load through channel-wise and point-wise convolutions. The network model algorithm proposed in this paper was experimentally validated on the publicly available KAIST dataset and compared with other existing methods. The experimental results demonstrate that our approach achieves favorable performance in various complex environments, affirming the effectiveness of the multispectral pedestrian detection technology proposed in this paper. 展开更多
关键词 Multispectral pedestrian detection convolutional neural networks depth separable convolution spatially reweighted attention mechanism
在线阅读 下载PDF
Channel attention based wavelet cascaded network for image super-resolution
10
作者 CHEN Jian HUANG Detian HUANG Weiqin 《High Technology Letters》 EI CAS 2022年第2期197-207,共11页
Convolutional neural networks(CNNs) have shown great potential for image super-resolution(SR).However,most existing CNNs only reconstruct images in the spatial domain,resulting in insufficient high-frequency details o... Convolutional neural networks(CNNs) have shown great potential for image super-resolution(SR).However,most existing CNNs only reconstruct images in the spatial domain,resulting in insufficient high-frequency details of reconstructed images.To address this issue,a channel attention based wavelet cascaded network for image super-resolution(CWSR) is proposed.Specifically,a second-order channel attention(SOCA) mechanism is incorporated into the network,and the covariance matrix normalization is utilized to explore interdependencies between channel-wise features.Then,to boost the quality of residual features,the non-local module is adopted to further improve the global information integration ability of the network.Finally,taking the image loss in the spatial and wavelet domains into account,a dual-constrained loss function is proposed to optimize the network.Experimental results illustrate that CWSR outperforms several state-of-the-art methods in terms of both visual quality and quantitative metrics. 展开更多
关键词 image super-resolution(SR) wavelet transform convolutional neural network(CNN) second-order channel attention(SOCA) non-local self-similarity
在线阅读 下载PDF
面向交通流量预测的时空Graph-CoordAttention网络 被引量:2
11
作者 刘建松 康雁 +2 位作者 李浩 王韬 王海宁 《计算机科学》 CSCD 北大核心 2023年第S01期558-564,共7页
交通预测是城市智能交通系统的一个重要研究组成部分,使人们的出行更加效率和安全。由于复杂的时间和空间依赖性,准确预测交通流量仍然是一个巨大的挑战。近年来,图卷积网络(GCN)在交通预测方面表现出巨大的潜力,但基于GCN的模型往往侧... 交通预测是城市智能交通系统的一个重要研究组成部分,使人们的出行更加效率和安全。由于复杂的时间和空间依赖性,准确预测交通流量仍然是一个巨大的挑战。近年来,图卷积网络(GCN)在交通预测方面表现出巨大的潜力,但基于GCN的模型往往侧重于单独捕捉时间和空间的依赖性,忽视了时间和空间依赖性之间的动态关联性,不能很好地融合它们。此外,以前的方法使用现实世界的静态交通网络来构建空间邻接矩阵,这可能忽略了动态的空间依赖性。为了克服这些局限性,并提高模型的性能,提出了一种新颖的时空Graph-CoordAttention网络(STGCA)。具体来说,提出了时空同步模块,用来建模不同时刻的时空依赖交融关系。然后,提出了一种动态图学习的方案,基于车流量之间数据关联,挖掘出潜在的图信息。在4个公开的数据集上和现有基线模型进行对比实验,STGCA表现了优异的性能。 展开更多
关键词 交通流量预测 时空预测 图卷积网络 注意力机制 时空依赖
在线阅读 下载PDF
基于深度学习的Attention U-Net语义分割模型研究 被引量:3
12
作者 薛泽民 邹连旭 +3 位作者 黄志威 冉杰 余若岩 郑国勋 《长春工程学院学报(自然科学版)》 2023年第4期97-101,共5页
针对当前深度神经网络在处理图像分割过程中普遍存在的处理耗时长、实时性低和分割准确率不高的问题,提出了一种融入注意力机制的U-Net网络对GAN扩充的数据集进行训练的模型,试验结果表明:相较于U-Net++、SegNet和DeepLabV1等传统模型,... 针对当前深度神经网络在处理图像分割过程中普遍存在的处理耗时长、实时性低和分割准确率不高的问题,提出了一种融入注意力机制的U-Net网络对GAN扩充的数据集进行训练的模型,试验结果表明:相较于U-Net++、SegNet和DeepLabV1等传统模型,提出模型的平均损失约为129%,与U-Net++、DeepLabV1模型较为接近;平均精确度约为95.4%,比U-Net++提高了1.7%,比SegNet提高了6%,比DeepLabV1提高了1.7%。 展开更多
关键词 数据增强 语义分割 空间注意力机制 生成对抗网络
在线阅读 下载PDF
Exploring the Influence of Tourism Network Attention on the Development of Tourism in the Yangtze River Delta:A Spatial Analysis 被引量:1
13
作者 WANG Yuewei DI Jiao +1 位作者 CHEN Hang AN Lidan 《Journal of Resources and Ecology》 2025年第4期1103-1115,共13页
This study incorporates both positive and negative tourism network attention into a comprehensive framework to examine their distinct effects on tourism development in the Yangtze River Delta(YRD).In particular,this s... This study incorporates both positive and negative tourism network attention into a comprehensive framework to examine their distinct effects on tourism development in the Yangtze River Delta(YRD).In particular,this study uses a spatial econometric model to accurately examine the relationship between positive and negative tourism network attention and regional tourism development,including the impact of tourism network attention on local and neighboring areas.In addition,the framework also uses fuzzy set qualitative comparative analysis(fsQCA)to explore the path combination of network attention and other factors that affect varied stages of tourism development in each city of the YRD,and expounds its driving mechanism.Research findings reveal:(1)Positive tourism network attention has a“U-shaped”influence on regional tourism development.(2)Positive tourism network attention significantly promotes tourism development of both local and neighboring areas,while negative tourism network attention both hinders local tourism development and adversely affects neighboring areas via spillover effects.(3)Multiple paths for tourism development exist in the region,including four modes:Demand-facility driven,demand-resource-facility-transportation driven,word of mouth-transportation driven,and traffic-resource driven.Using the YRD as a case study,this research offers empirical evidence and theoretical insights into how positive and negative tourism network attention influence tourism development in the region. 展开更多
关键词 spatial effect network attention regional tourism fsQCA
原文传递
Person Re-Identification Based on Spatial Feature Learning and Multi-Granularity Feature Fusion
14
作者 DIAO Zijian CAO Shuai +4 位作者 LI Wenwei LIANG Jianan WEN Guilin HUANG Weici ZHANG Shouming 《Journal of Shanghai Jiaotong university(Science)》 2025年第2期363-374,共12页
In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestri... In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestrian re-identification tasks,a person re-identification method combining spatial feature learning and multi-granularity feature fusion was proposed.First,an attention spatial transformation network(A-STN)is proposed to learn spatial features and solve the problem of misalignment of pedestrian spatial features.Then the network was divided into a global branch,a local coarse-grained fusion branch,and a local fine-grained fusion branch to extract pedestrian global features,coarse-grained fusion features,and fine-grained fusion features,respectively.Among them,the global branch enriches the global features by fusing different pooling features.The local coarse-grained fusion branch uses an overlay pooling to enhance each local feature while learning the correlation relationship between multi-granularity features.The local fine-grained fusion branch uses a differential pooling to obtain the differential features that were fused with global features to learn the relationship between pedestrian local features and pedestrian global features.Finally,the proposed method was compared on three public datasets:Market1501,DukeMTMC-ReID and CUHK03.The experimental results were better than those of the comparative methods,which verifies the effectiveness of the proposed method. 展开更多
关键词 pedestrian re-identification spatial features attention spatial transformation network multi-branch network relation features
原文传递
SpaGRA:Graph augmentation facilitates domain identification for spatially resolved transcriptomics
15
作者 Xue Sun Wei Zhang +8 位作者 Wenrui Li Na Yu Daoliang Zhang Qi Zou Qiongye Dong Xianglin Zhang Zhiping Liu Zhiyuan Yuan Rui Gao 《Journal of Genetics and Genomics》 2025年第1期93-104,共12页
Recent advances in spatially resolved transcriptomics(SRT)have provided new opportunities for characterizing spatial structures of various tissues.Graph-based geometric deep learning has gained widespread adoption for... Recent advances in spatially resolved transcriptomics(SRT)have provided new opportunities for characterizing spatial structures of various tissues.Graph-based geometric deep learning has gained widespread adoption for spatial domain identification tasks.Currently,most methods define adjacency relation between cells or spots by their spatial distance in SRT data,which overlooks key biological interactions like gene expression similarities,and leads to inaccuracies in spatial domain identification.To tackle this challenge,we propose a novel method,SpaGRA(https://github.com/sunxue-yy/SpaGRA),for automatic multi-relationship construction based on graph augmentation.SpaGRA uses spatial distance as prior knowledge and dynamically adjusts edge weights with multi-head graph attention networks(GATs).This helps SpaGRA to uncover diverse node relationships and enhance message passing in geometric contrastive learning.Additionally,SpaGRA uses these multi-view relationships to construct negative samples,addressing sampling bias posed by random selection.Experimental results show that SpaGRA presents superior domain identification performance on multiple datasets generated from different protocols.Using SpaGRA,we analyze the functional regions in the mouse hypothalamus,identify key genes related to heart development in mouse embryos,and observe cancer-associated fibroblasts enveloping cancer cells in the latest Visium HD data.Overall,SpaGRA can effectively characterize spatial structures across diverse SRT datasets. 展开更多
关键词 spatial domain identification spatially resolved transcriptomics Multi-head graph attention networks Graph augmentation Geometric contrastive learning
原文传递
考虑谐波激励的电工钢片SAMCNN-BiLSTM磁致伸缩特性精细预测方法
16
作者 肖飞 杨北超 +4 位作者 王瑞田 范学鑫 陈俊全 张新生 王崇 《中国电机工程学报》 北大核心 2026年第3期1274-1285,I0034,共13页
针对不同磁密幅值、频率、谐波组合等复杂激励工况下磁致伸缩建模面临的精准性问题,该文利用空间注意力机制(spatial attention mechanism,SAM)对传统的卷积神经网络(convolutional neural network,CNN)进行改进,将SAM嵌套入CNN网络中,... 针对不同磁密幅值、频率、谐波组合等复杂激励工况下磁致伸缩建模面临的精准性问题,该文利用空间注意力机制(spatial attention mechanism,SAM)对传统的卷积神经网络(convolutional neural network,CNN)进行改进,将SAM嵌套入CNN网络中,建立SAMCNN改进型网络。再结合双向长短期记忆(bidirectional long short-term memory,BiLSTM)网络,提出电工钢片SAMCNN-BiLSTM磁致伸缩模型。首先,利用灰狼优化算法(grey wolf optimization,GWO)寻优神经网络结构的参数,实现复杂工况下磁致伸缩效应的准确表征;然后,建立中低频范围单频与叠加谐波激励等复杂工况下的磁致伸缩应变数据库,开展数据预处理与特征分析;最后,对SAMCNN-BiLSTM模型开展对比验证。对比叠加3次谐波激励下的磁致伸缩应变频谱主要分量,SAMCNN-BiLSTM模型计算值最大相对误差为3.70%,其比Jiles-Atherton-Sablik(J-A-S)、二次畴转等模型能更精确地表征电工钢片的磁致伸缩效应。 展开更多
关键词 磁致伸缩效应 谐波激励 卷积神经网络 空间注意力机制 双向长短期记忆网络
原文传递
基于局部相关性和多尺度空间注意力的人脸表情识别
17
作者 胡黄水 曹禹 +1 位作者 刘名扬 康琪儿 《吉林大学学报(理学版)》 北大核心 2026年第1期104-112,共9页
针对遮挡、姿势变化和光照等因素对人脸表情识别的影响,提出一种基于局部相关性和多尺度空间注意力的人脸表情识别方法.首先,通过局部相关性模块,将局部特征与全局特征相结合,并增强局部特征之间的联系,从而提高模型在复杂环境下的识别... 针对遮挡、姿势变化和光照等因素对人脸表情识别的影响,提出一种基于局部相关性和多尺度空间注意力的人脸表情识别方法.首先,通过局部相关性模块,将局部特征与全局特征相结合,并增强局部特征之间的联系,从而提高模型在复杂环境下的识别性能.其次,采用多尺度空间注意力机制,提取并融合不同层次的空间结构信息,提升模型的鲁棒性.实验结果表明,该方法在数据集RAF-DB和AffectNet上展现了优越的人脸表情识别效果,从而验证了该方法的有效性和泛化能力. 展开更多
关键词 人脸表情识别 空间注意力 多尺度网络 局部相关性
在线阅读 下载PDF
CDA-Net:Cross dimensional attention network for wetland bird detection
18
作者 Jia'nan Lv Changchun Zhang +1 位作者 Jiangjian Xie Junguo Zhang 《Avian Research》 2026年第1期216-227,共12页
Monitoring waterbirds is vital for evaluating the ecological health of wetlands,and object detection offers an automated solution for identifying birds in monitoring imagery.However,conventional detection methods ofte... Monitoring waterbirds is vital for evaluating the ecological health of wetlands,and object detection offers an automated solution for identifying birds in monitoring imagery.However,conventional detection methods often overlook the multi-scale nature of bird targets,limiting their ability to capture rich contextual information across different scales.To address this,we propose a cross-dimensional attention network(CDA-Net)for bird detection that integrates spatial and channel information to improve species recognition.The proposed CDA-Net partitions feature maps into multiple channel wise sub-features.Spatial and channel attention are applied to each subfeature,and the resulting features are fused using the Hadamard product.The fused features are then forwarded to the detection head to generate the final detection results.This approach effectively captures and integrates information across spatial and channel dimensions.Experiments on our self-constructed Nanhai Wetland Waterbird Dataset and the public CUB-200-2011 dataset yield precision scores of 91.32%and 81.99%,respectively,outperforming existing methods.Our approach effectively handles scale variation in bird detection and provides a valuable tool for advancing automated wetland waterbird monitoring. 展开更多
关键词 Bird detection Channel and spatial attention Cross dimensional network Feature integration Multi sizes object
在线阅读 下载PDF
基于高阶空间交互的盲超分辨率图像重建算法
19
作者 王晓峰 谭文雅 +1 位作者 沈紫璇 黄俊俊 《计算机工程与设计》 北大核心 2026年第2期309-315,共7页
为了克服盲超分辨率领域中生成对抗网络模型在生成细节和抑制伪影方面的局限性,提出了一种新型的具有高阶交互能力的Real-GSRGAN模型。该模型包括3个关键组成部分:高阶退化模型、基于残差门控注意力模块的Transformer生成器和U-Net鉴别... 为了克服盲超分辨率领域中生成对抗网络模型在生成细节和抑制伪影方面的局限性,提出了一种新型的具有高阶交互能力的Real-GSRGAN模型。该模型包括3个关键组成部分:高阶退化模型、基于残差门控注意力模块的Transformer生成器和U-Net鉴别器。在生成器中,采用了通道空间自注意力模块来捕捉多维特征,并通过递归门控卷积实现全局依赖和局部细节的高阶交互。前馈网络引入门控机制添加空间建模信息。为抑制伪影和图像过于平滑的现象,添加了去伪影损失函数。实验结果表明,该方法在多个数据集上表现出更优的视觉重建效果,还通过高阶交互机制显著提升了整体性能,优于现有方法。 展开更多
关键词 生成对抗网络 盲超分辨率 注意力机制 前馈网络 递归门控卷积 高阶空间交互 高阶特征
在线阅读 下载PDF
VIFusion:低光场景下可见光与红外图像的互补融合模型
20
作者 张晓滨 牛燕皓 陈金广 《西安工程大学学报》 2026年第1期126-135,共10页
针对低光场景下可见光与红外图像融合算法存在时序信息丢失、特征图通道冗余、细节模糊等问题,本文基于Vision Transformer框架,提出了一种低光场景下可见光与红外图像的互补融合模型VIFusion。该模型通过包含的双时态特征聚合(dual tem... 针对低光场景下可见光与红外图像融合算法存在时序信息丢失、特征图通道冗余、细节模糊等问题,本文基于Vision Transformer框架,提出了一种低光场景下可见光与红外图像的互补融合模型VIFusion。该模型通过包含的双时态特征聚合(dual temporal feature aggregation,DTFA)模块、特征细化前馈网络(feature refinement feedforward network,FRFN)模块和空间通道注意力机制(spatial channel attention,SCA)模块提升了融合图像的质量和信息表达能力。其中,DTFA模块使用分组卷积保持特征空间完整性,然后进行时序对齐与融合,以增强时序一致性并减少信息损失。FRFN模块对提取的特征进行逐层优化,减少通道冗余。SCA模块通过自适应建模图像空间和通道关系,突出关键特征,提高信息表达能力、增强边缘、纹理等细节信息。实验结果表明:在LLVIP数据集上,VIFusion模型在客观指标(AG、CC、EN、SF、SSIM、VIF、MI)上优于传统方法和深度学习模型(如GTF、TarDAL、DenseFuse等)。在数据集TNO上的泛化实验中,生成的融合图像在细节保留和目标突出上也表现更佳。VIFusion模型为低光场景下的多模态图像融合提供了一种高效实用的解决方案。 展开更多
关键词 双时态特征聚合 特征细化前馈网络 空间通道注意力 图像融合
在线阅读 下载PDF
上一页 1 2 27 下一页 到第
使用帮助 返回顶部