497 articles found
Infrared road object detection algorithm based on spatial depth channel attention network and improved YOLOv8
1
Authors: LI Song, SHI Tao, JING Fangke, CUI Jie. Optoelectronics Letters, 2025, No. 8, pp. 491-498 (8 pages)
Aiming at the problems of low detection accuracy and large model size of existing object detection algorithms applied to complex road scenes, an improved you only look once version 8 (YOLOv8) object detection algorithm for infrared images, F-YOLOv8, is proposed. First, a space-to-depth network replaces the strided convolution or pooling layer of the traditional backbone network. It is combined with a channel attention mechanism so that the neural network focuses on the channels with large weight values and better extracts feature information from low-resolution images. Then an improved feature pyramid network, the lightweight bidirectional feature pyramid network (L-BiFPN), is proposed, which efficiently fuses features of different scales. In addition, an intersection over union loss based on the minimum point distance (MPDIoU) is introduced for bounding box regression, which gives faster convergence and more accurate regression results. Experimental results on the FLIR dataset show that the improved algorithm can accurately detect infrared road targets in real time, with 3% and 2.2% gains in mean average precision at 50% IoU (mAP50) and mean average precision at 50% to 95% IoU (mAP50-95), respectively, and 38.1%, 37.3% and 16.9% reductions in the number of model parameters, the model weight size, and floating-point operations (FLOPs), respectively. To further demonstrate the detection capability of the improved algorithm, it is tested on the public PASCAL VOC dataset, and the results show that F-YOLOv8 has excellent generalized detection performance.
Keywords: feature pyramid network; infrared road object detection; infrared images; F-YOLOv8; backbone networks; channel attention mechanism; spatial depth channel attention network; object detection; improved YOLOv8
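The space-to-depth step mentioned in the abstract above can be illustrated with a minimal sketch. It assumes a plain nested-list feature map in height x width x channel layout; the function name `space_to_depth`, the `block` parameter, and the channel ordering are illustrative assumptions, not the authors' implementation.

```python
def space_to_depth(feature_map, block=2):
    """Rearrange an H x W x C nested list into (H/block) x (W/block) x (C*block*block).

    Hypothetical helper for illustration: no pixel is discarded, the spatial
    resolution is traded for extra channels.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    if height % block or width % block:
        raise ValueError("height and width must be divisible by block")
    out = []
    for i in range(0, height, block):
        row = []
        for j in range(0, width, block):
            channels = []
            # Stack every pixel of the block x block patch along the channel axis.
            for di in range(block):
                for dj in range(block):
                    channels.extend(feature_map[i + di][j + dj])
            row.append(channels)
        out.append(row)
    return out


# A 4 x 4 single-channel map whose values are 0..15 in row-major order.
fmap = [[[4 * r + c] for c in range(4)] for r in range(4)]
packed = space_to_depth(fmap, block=2)  # 2 x 2 map with 4 channels per position
```

Unlike a stride-2 convolution or pooling layer, this downsampling keeps all 16 input values, which is the property the abstract relies on for low-resolution images.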
SA-ResNet:An Intrusion Detection Method Based on Spatial Attention Mechanism and Residual Neural Network Fusion
2
Authors: Zengyu Cai, Yuming Dai, Jianwei Zhang, Yuan Feng. Computers, Materials & Continua, 2025, No. 5, pp. 3335-3350 (16 pages)
The rapid development and widespread adoption of Internet technology have significantly increased Internet traffic, highlighting the growing importance of network security. Intrusion Detection Systems (IDS) are essential for safeguarding network integrity. To address the low accuracy of existing intrusion detection models in identifying network attacks, this paper proposes an intrusion detection method based on the fusion of a Spatial Attention mechanism and a Residual Neural Network (SA-ResNet). Residual connections effectively capture local features in the data; introducing a spatial attention mechanism extracts the global dependency relationships of intrusion features, strengthening the model's focus on the global features of intrusions and improving the accuracy of intrusion recognition. The proposed model was experimentally verified on the NSL-KDD dataset. The experimental results show that the intrusion recognition accuracy of the SA-ResNet method reaches 99.86%, and its overall accuracy is 0.41% higher than that of traditional Convolutional Neural Network (CNN) models.
Keywords: intrusion detection; deep learning; residual neural network; spatial attention mechanism
Feature pyramid attention network for audio-visual scene classification
3
Authors: Liguang Zhou, Yuhongze Zhou, Xiaonan Qi, Junjie Hu, Tin Lun Lam, Yangsheng Xu. CAAI Transactions on Intelligence Technology, 2025, No. 2, pp. 359-374 (16 pages)
Audio-visual scene classification (AVSC) poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals, coupled with the complex spatial patterns of objects and textures found in visual images. Recent studies have predominantly focused on extracting features from diverse neural network structures, neglecting the acquisition of semantically meaningful regions and crucial components within audio-visual data. The authors present a feature pyramid attention network (FPANet) for audio-visual scene understanding, which extracts semantically significant characteristics from audio-visual data. The authors' approach builds multi-scale hierarchical features of sound spectrograms and visual images using a feature pyramid representation and localises the semantically relevant regions with a feature pyramid attention module (FPAM). A dimension alignment (DA) strategy is employed to align feature maps from multiple layers, a pyramid spatial attention (PSA) to spatially locate essential regions, and a pyramid channel attention (PCA) to pinpoint significant temporal frames. Experiments on visual scene classification (VSC), audio scene classification (ASC), and AVSC tasks demonstrate that FPANet achieves performance on par with state-of-the-art (SOTA) approaches, with a 95.9 F1-score on the ADVANCE dataset and a relative improvement of 28.8%. Visualisation results show that FPANet can prioritise semantically meaningful areas in audio-visual signals.
Keywords: dimension alignment; feature pyramid attention network; pyramid channel attention; pyramid spatial attention; semantic relevant regions
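An MSCNN-LSTM-Attention visibility prediction model considering spatial correlation
4
Authors: 王小建, 苏彤, 马飞, 林智婕, 白元旦, 郭庆元, 魏俊涛, 黄凯, 徐玉凤. 《安全与环境学报》 (Journal of Safety and Environment) [PKU Core], 2025, No. 4, pp. 1622-1632 (11 pages)
Accurate visibility prediction is important for ensuring transportation safety. To address the low prediction accuracy of existing methods, which give insufficient consideration to the spatial correlation of influencing factors, this study builds a visibility prediction model that accounts for spatial correlation. A one-dimensional multi-scale convolutional neural network (MSCNN) extracts spatial features at different granularities for each factor influencing visibility and fuses them linearly into multi-factor spatial features, achieving spatial feature extraction for the influencing factors. The attention mechanism's strength in focusing on key information is used to improve the long short-term memory neural network (LSTM), enhancing the model's ability to attend to important temporal information and its prediction accuracy, so that visibility is predicted with the spatial correlation of influencing factors taken into account. Hourly meteorological and pollutant data for Xi'an from 2021 to 2023 serve as the experimental data, and the model is evaluated with root mean square error (RMSE), mean absolute error (MAE), and R2. The results show that the model's MAE falls by 26.3% to 39.1%, its RMSE falls by 25% to 40%, and its R2 rises by 3.7% to 16.4%, indicating high visibility prediction accuracy.
Keywords: basic disciplines of environmental science and technology; visibility prediction; spatial correlation; one-dimensional multi-scale convolutional neural network; long short-term memory neural network; attention mechanism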
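The three evaluation metrics named in the abstract above (RMSE, MAE, and R2) follow standard definitions and can be sketched as below. The function name `regression_metrics` and the sample values are illustrative assumptions and are not data from the paper.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return (RMSE, MAE, R2) for two equal-length sequences of numbers."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    ss_res = sum(e * e for e in errors)                 # residual sum of squares
    rmse = math.sqrt(ss_res / n)
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    r2 = 1.0 - ss_res / ss_tot
    return rmse, mae, r2


# Made-up observed and predicted values, for illustration only.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.0, 2.0, 3.0, 6.0]
rmse, mae, r2 = regression_metrics(observed, predicted)
```

Lower RMSE and MAE and a higher R2 indicate a better fit, which is the direction of the changes reported in the abstract.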
DCA-YOLO: Detection Algorithm for YOLOv8 Pulmonary Nodules Based on Attention Mechanism Optimization (Cited by: 1)
5
Authors: SONG Yongsheng, LIU Guohua. Journal of Donghua University (English Edition), 2025, No. 1, pp. 78-87 (10 pages)
Pulmonary nodules represent an early manifestation of lung cancer. However, pulmonary nodules constitute only a small portion of the overall image, posing challenges for physicians in image interpretation and potentially leading to false positives or missed detections. To solve these problems, the YOLOv8 network is enhanced by adding deformable convolution and atrous spatial pyramid pooling (ASPP), along with a coordinate attention (CA) mechanism. This allows the network to focus on small targets while expanding the receptive field without losing resolution. At the same time, context information on the target is gathered and feature expression is enhanced by attention modules in different directions. The method effectively improves positioning accuracy and achieves good results on the LUNA16 dataset. Compared with other detection algorithms, it improves the accuracy of pulmonary nodule detection to a certain extent.
Keywords: pulmonary nodule; YOLOv8 network; object detection; deformable convolution; atrous spatial pyramid pooling (ASPP); coordinate attention (CA) mechanism
Lightweight Cross-Modal Multispectral Pedestrian Detection Based on Spatial Reweighted Attention Mechanism
6
Authors: Lujuan Deng, Ruochong Fu, Zuhe Li, Boyi Liu, Mengze Xue, Yuhao Cui. Computers, Materials & Continua [SCIE, EI], 2024, No. 3, pp. 4071-4089 (19 pages)
Multispectral pedestrian detection technology leverages infrared images to provide reliable information for visible light images, demonstrating significant advantages in low-light conditions and background occlusion scenarios. However, while continuously improving cross-modal feature extraction and fusion, ensuring the model's detection speed is also a challenging issue. We have devised a deep learning network model for cross-modal pedestrian detection based on ResNet50, aiming to focus on more reliable features and enhance the model's detection efficiency. This model employs a spatial attention mechanism to reweight the input visible light and infrared image data, enhancing the model's focus on different spatial positions and sharing the weighted feature data across modalities, thereby reducing the interference of multi-modal features. Subsequently, lightweight modules with depthwise separable convolution are incorporated to reduce the model's parameter count and computational load through channel-wise and point-wise convolutions. The proposed network model was experimentally validated on the publicly available KAIST dataset and compared with other existing methods. The experimental results demonstrate that our approach achieves favorable performance in various complex environments, affirming the effectiveness of the proposed multispectral pedestrian detection technology.
Keywords: multispectral pedestrian detection; convolutional neural networks; depth separable convolution; spatially reweighted attention mechanism
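The parameter saving from depthwise separable convolution mentioned in the abstract above can be checked with simple arithmetic. The sketch below counts weights only (biases ignored); the function names and the layer sizes (3 x 3 kernel, 64 input channels, 128 output channels) are illustrative assumptions, not values from the paper.

```python
def standard_conv_params(kernel, c_in, c_out):
    """Weights in a standard convolution: one kernel x kernel x c_in filter per output channel."""
    return kernel * kernel * c_in * c_out


def depthwise_separable_params(kernel, c_in, c_out):
    """Weights in a depthwise (channel-wise) convolution followed by a 1 x 1 point-wise convolution."""
    depthwise = kernel * kernel * c_in  # one kernel x kernel filter per input channel
    pointwise = c_in * c_out            # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


std = standard_conv_params(3, 64, 128)
sep = depthwise_separable_params(3, 64, 128)
ratio = sep / std  # equals 1/c_out + 1/kernel**2
```

For these assumed sizes the separable layer needs roughly 12% of the weights of the standard layer, which is the kind of reduction a lightweight module aims for.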
Integrating multi-modal information to detect spatial domains of spatial transcriptomics by graph attention network (Cited by: 1)
7
Authors: Yuying Huo, Yilang Guo, Jiakang Wang, Huijie Xue, Yujuan Feng, Weizheng Chen, Xiangyu Li. Journal of Genetics and Genomics [SCIE, CAS, CSCD], 2023, No. 9, pp. 720-733 (14 pages)
Recent advances in spatially resolved transcriptomic technologies have enabled unprecedented opportunities to elucidate tissue architecture and function in situ. Spatial transcriptomics can provide multimodal and complementary information simultaneously, including gene expression profiles, spatial locations, and histology images. However, most existing methods have limitations in efficiently utilizing spatial information and matched high-resolution histology images. To fully leverage the multi-modal information, we propose a SPAtially embedded Deep Attentional graph Clustering (SpaDAC) method to identify spatial domains while reconstructing denoised gene expression profiles. This method can efficiently learn low-dimensional embeddings for spatial transcriptomics data by constructing multi-view graph modules that capture both spatial location connectives and morphological connectives. Benchmark results demonstrate that SpaDAC outperforms other algorithms on several recent spatial transcriptomics datasets. SpaDAC is a valuable tool for spatial domain detection, facilitating the comprehension of tissue architecture and cellular microenvironment. The source code of SpaDAC is freely available on GitHub (https://github.com/huoyuying/SpaDAC.git).
Keywords: spatial transcriptomics; spatial domain detection; multi-modal integration; graph attention network
An attention-based prototypical network for forest fire smoke few-shot detection (Cited by: 3)
8
Authors: Tingting Li, Haowei Zhu, Chunhe Hu, Junguo Zhang. Journal of Forestry Research [SCIE, CAS, CSCD], 2022, No. 5, pp. 1493-1504 (12 pages)
Almost all existing deep learning methods rely on a large amount of annotated data, so they are inappropriate for forest fire smoke detection with limited data. In this paper, a novel hybrid attention-based few-shot learning method, named Attention-Based Prototypical Network, is proposed for forest fire smoke detection. Specifically, the feature extraction network, which incorporates a convolutional block attention module, extracts high-level and discriminative features and further decreases the false alarm rate resulting from suspected smoke areas. Moreover, we design a meta-learning module to alleviate the overfitting caused by limited smoke images; the meta-learning network achieves effective detection by comparing the distance between the class prototype of the support images and the features of the query images. A series of experiments on forest fire smoke datasets and the miniImageNet dataset shows that the proposed method is superior to state-of-the-art few-shot learning approaches.
Keywords: forest fire smoke detection; few-shot learning; channel attention module; spatial attention module; prototypical network
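The prototype comparison described in the abstract above (class prototypes from support images, distance to query features) can be sketched as follows. This is a minimal illustration assuming embeddings are already computed; the function names, the two-dimensional embeddings, and the class labels are hypothetical and do not come from the paper.

```python
import math


def class_prototypes(support):
    """Map each label to the mean of its support embeddings (the class prototype)."""
    prototypes = {}
    for label, vectors in support.items():
        dim = len(vectors[0])
        prototypes[label] = [sum(v[k] for v in vectors) / len(vectors) for k in range(dim)]
    return prototypes


def classify(query, prototypes):
    """Assign the query embedding to the label of the nearest prototype (Euclidean distance)."""
    return min(prototypes, key=lambda label: math.dist(query, prototypes[label]))


# Made-up 2-D embeddings: two support examples per class.
support = {
    "smoke": [[1.0, 1.0], [3.0, 1.0]],
    "no_smoke": [[-2.0, 0.0], [-2.0, -2.0]],
}
prototypes = class_prototypes(support)
label = classify([1.5, 0.5], prototypes)
```

Because only class means are stored, a new class can be added from a handful of support examples without retraining a classifier head, which is what makes the approach suited to limited data.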
Image Inpainting Detection Based on High-Pass Filter Attention Network
9
Authors: Can Xiao, Feng Li, Dengyong Zhang, Pu Huang, Xiangling Ding, Victor S. Sheng. Computer Systems Science & Engineering [SCIE, EI], 2022, No. 12, pp. 1145-1154 (10 pages)
Image inpainting based on deep learning has been greatly improved. The original purpose of image inpainting was to repair broken photos, such as by inpainting artifacts. However, it may also be used for malicious operations, such as destroying evidence. Therefore, detection and localization of image inpainting operations are essential. Recent research shows that a high-pass filtering full convolutional network (HPFCN) applied to image inpainting detection achieves good results. However, those methods did not consider the spatial location and channel information of the feature map. To address these shortcomings, we introduce squeeze-and-excitation (SE) blocks and propose a high-pass filter attention full convolutional network (HPACN). In feature extraction, we apply concurrent spatial and channel attention (scSE) to enhance feature extraction and obtain more information. Channel attention (cSE) is introduced in upsampling to enhance detection and localization. The experimental results show that the proposed method achieves an improvement on ImageNet.
Keywords: image inpainting detection; spatial attention; channel attention; full convolutional network; high-pass filter
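The channel attention (cSE) idea referred to in the abstract above can be sketched in a few lines: pool each channel to a scalar, pass the scalars through a small bottleneck, and rescale the channels by the resulting gates. This is a simplified illustration without biases; the function name, weight matrices, and feature values are made-up assumptions, not the authors' HPACN implementation.

```python
import math


def channel_attention(feature_map, w_reduce, w_expand):
    """Squeeze-and-excitation style channel gating.

    feature_map: list of C channels, each an H x W nested list.
    w_reduce: bottleneck weights, one row of C values per hidden unit.
    w_expand: expansion weights, one row of hidden-size values per channel.
    """
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feature_map]
    # Excitation: bottleneck layer with ReLU, then expansion layer with sigmoid.
    hidden = [max(0.0, sum(w * s for w, s in zip(ws, squeezed))) for ws in w_reduce]
    gates = [1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(ws, hidden)))) for ws in w_expand]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [[[g * v for v in row] for row in ch] for g, ch in zip(gates, feature_map)]
    return scaled, gates


# Two 2 x 2 channels with made-up values and hand-picked weights.
fmap = [[[1.0, 1.0], [1.0, 1.0]], [[3.0, 3.0], [3.0, 3.0]]]
w_reduce = [[0.5, 0.5]]     # 2 channels -> 1 hidden unit
w_expand = [[1.0], [-1.0]]  # 1 hidden unit -> 2 channel gates
scaled, gates = channel_attention(fmap, w_reduce, w_expand)
```

Each gate lies strictly between 0 and 1, so the block can only attenuate or pass channels; in a trained network the weights are learned so that informative channels receive gates near 1.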
Channel attention based wavelet cascaded network for image super-resolution
10
Authors: CHEN Jian, HUANG Detian, HUANG Weiqin. High Technology Letters [EI, CAS], 2022, No. 2, pp. 197-207 (11 pages)
Convolutional neural networks (CNNs) have shown great potential for image super-resolution (SR). However, most existing CNNs only reconstruct images in the spatial domain, resulting in insufficient high-frequency details in reconstructed images. To address this issue, a channel attention based wavelet cascaded network for image super-resolution (CWSR) is proposed. Specifically, a second-order channel attention (SOCA) mechanism is incorporated into the network, and covariance matrix normalization is utilized to explore interdependencies between channel-wise features. Then, to boost the quality of residual features, a non-local module is adopted to further improve the global information integration ability of the network. Finally, taking the image loss in the spatial and wavelet domains into account, a dual-constrained loss function is proposed to optimize the network. Experimental results illustrate that CWSR outperforms several state-of-the-art methods in terms of both visual quality and quantitative metrics.
Keywords: image super-resolution (SR); wavelet transform; convolutional neural network (CNN); second-order channel attention (SOCA); non-local self-similarity
Person Re-Identification Based on Spatial Feature Learning and Multi-Granularity Feature Fusion
11
Authors: DIAO Zijian, CAO Shuai, LI Wenwei, LIANG Jianan, WEN Guilin, HUANG Weici, ZHANG Shouming. Journal of Shanghai Jiaotong University (Science), 2025, No. 2, pp. 363-374 (12 pages)
In view of the weak ability of convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestrian re-identification tasks, a person re-identification method combining spatial feature learning and multi-granularity feature fusion is proposed. First, an attention spatial transformation network (A-STN) is proposed to learn spatial features and solve the misalignment of pedestrian spatial features. Then the network is divided into a global branch, a local coarse-grained fusion branch, and a local fine-grained fusion branch to extract pedestrian global features, coarse-grained fusion features, and fine-grained fusion features, respectively. The global branch enriches the global features by fusing different pooling features. The local coarse-grained fusion branch uses an overlay pooling to enhance each local feature while learning the correlation between multi-granularity features. The local fine-grained fusion branch uses a differential pooling to obtain differential features, which are fused with global features to learn the relationship between pedestrian local features and pedestrian global features. Finally, the proposed method is compared with other methods on three public datasets: Market1501, DukeMTMC-ReID and CUHK03. The experimental results are better than those of the comparative methods, which verifies the effectiveness of the proposed method.
Keywords: pedestrian re-identification; spatial features; attention spatial transformation network; multi-branch network; relation features
Exploring the Influence of Tourism Network Attention on the Development of Tourism in the Yangtze River Delta:A Spatial Analysis
12
Authors: WANG Yuewei, DI Jiao, CHEN Hang, AN Lidan. Journal of Resources and Ecology, 2025, No. 4, pp. 1103-1115 (13 pages)
This study incorporates both positive and negative tourism network attention into a comprehensive framework to examine their distinct effects on tourism development in the Yangtze River Delta (YRD). In particular, this study uses a spatial econometric model to examine the relationship between positive and negative tourism network attention and regional tourism development, including the impact of tourism network attention on local and neighboring areas. In addition, the framework uses fuzzy set qualitative comparative analysis (fsQCA) to explore the path combinations of network attention and other factors that affect varied stages of tourism development in each city of the YRD, and expounds the driving mechanism. Research findings reveal: (1) Positive tourism network attention has a "U-shaped" influence on regional tourism development. (2) Positive tourism network attention significantly promotes tourism development in both local and neighboring areas, while negative tourism network attention both hinders local tourism development and adversely affects neighboring areas via spillover effects. (3) Multiple paths for tourism development exist in the region, including four modes: demand-facility driven, demand-resource-facility-transportation driven, word of mouth-transportation driven, and traffic-resource driven. Using the YRD as a case study, this research offers empirical evidence and theoretical insights into how positive and negative tourism network attention influence tourism development in the region.
Keywords: spatial effect; network attention; regional tourism; fsQCA
SpaGRA:Graph augmentation facilitates domain identification for spatially resolved transcriptomics
13
Authors: Xue Sun, Wei Zhang, Wenrui Li, Na Yu, Daoliang Zhang, Qi Zou, Qiongye Dong, Xianglin Zhang, Zhiping Liu, Zhiyuan Yuan, Rui Gao. Journal of Genetics and Genomics, 2025, No. 1, pp. 93-104 (12 pages)
Recent advances in spatially resolved transcriptomics (SRT) have provided new opportunities for characterizing spatial structures of various tissues. Graph-based geometric deep learning has gained widespread adoption for spatial domain identification tasks. Currently, most methods define the adjacency relation between cells or spots by their spatial distance in SRT data, which overlooks key biological interactions like gene expression similarities and leads to inaccuracies in spatial domain identification. To tackle this challenge, we propose a novel method, SpaGRA (https://github.com/sunxue-yy/SpaGRA), for automatic multi-relationship construction based on graph augmentation. SpaGRA uses spatial distance as prior knowledge and dynamically adjusts edge weights with multi-head graph attention networks (GATs). This helps SpaGRA to uncover diverse node relationships and enhance message passing in geometric contrastive learning. Additionally, SpaGRA uses these multi-view relationships to construct negative samples, addressing the sampling bias posed by random selection. Experimental results show that SpaGRA presents superior domain identification performance on multiple datasets generated from different protocols. Using SpaGRA, we analyze the functional regions in the mouse hypothalamus, identify key genes related to heart development in mouse embryos, and observe cancer-associated fibroblasts enveloping cancer cells in the latest Visium HD data. Overall, SpaGRA can effectively characterize spatial structures across diverse SRT datasets.
Keywords: spatial domain identification; spatially resolved transcriptomics; multi-head graph attention networks; graph augmentation; geometric contrastive learning
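A spatio-temporal Graph-CoordAttention network for traffic flow prediction (Cited by: 2)
14
Authors: 刘建松, 康雁, 李浩, 王韬, 王海宁. 《计算机科学》 (Computer Science) [CSCD, PKU Core], 2023, No. S01, pp. 558-564 (7 pages)
Traffic prediction is an important research component of urban intelligent transportation systems, making travel more efficient and safer. Because of complex temporal and spatial dependencies, accurately predicting traffic flow remains a major challenge. In recent years, graph convolutional networks (GCN) have shown great potential in traffic prediction, but GCN-based models tend to capture temporal and spatial dependencies separately, overlooking the dynamic correlation between them and failing to fuse them well. In addition, previous methods build the spatial adjacency matrix from the static real-world traffic network, which may ignore dynamic spatial dependencies. To overcome these limitations and improve model performance, a novel spatio-temporal Graph-CoordAttention network (STGCA) is proposed. Specifically, a spatio-temporal synchronization module is proposed to model the interplay of spatio-temporal dependencies across different time steps. A dynamic graph learning scheme is then proposed to mine latent graph information from the data correlations among traffic flows. In comparative experiments against existing baseline models on four public datasets, STGCA shows excellent performance.
Keywords: traffic flow prediction; spatio-temporal prediction; graph convolutional network; attention mechanism; spatio-temporal dependency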
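Research on a deep-learning-based Attention U-Net semantic segmentation model (Cited by: 2)
15
Authors: 薛泽民, 邹连旭, 黄志威, 冉杰, 余若岩, 郑国勋. 《长春工程学院学报(自然科学版)》 (Journal of Changchun Institute of Technology, Natural Sciences Edition), 2023, No. 4, pp. 97-101 (5 pages)
To address the long processing time, poor real-time performance, and low segmentation accuracy common to current deep neural networks in image segmentation, a model is proposed in which a U-Net incorporating an attention mechanism is trained on a GAN-augmented dataset. Experimental results show that, compared with traditional models such as U-Net++, SegNet, and DeepLabV1, the proposed model's average loss is about 129%, close to that of the U-Net++ and DeepLabV1 models; its average precision is about 95.4%, which is 1.7% higher than U-Net++, 6% higher than SegNet, and 1.7% higher than DeepLabV1.
Keywords: data augmentation; semantic segmentation; spatial attention mechanism; generative adversarial network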
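An improved YOLOv8-based fall detection algorithm: CASL-YOLO (Cited by: 1)
16
Authors: 徐慧英, 赵蕊, 朱信忠, 黄晓. 《浙江师范大学学报(自然科学版)》 (Journal of Zhejiang Normal University, Natural Sciences) [CAS], 2025, No. 1, pp. 36-44 (9 pages)
Falls are extremely harmful to the elderly and are the leading cause of disability and injury-related death among people aged over 65 in China. However, current mainstream fall detection techniques are strongly affected by the environment, have low detection accuracy in complex scenes such as object occlusion and illumination changes, and have high parameter counts and computational cost, which keeps costs high and hinders deployment in real-life scenarios. To address these problems, CASL-YOLO, a lightweight fall detection algorithm for complex environments based on an improved YOLOv8 model, is proposed. First, the model introduces a space-to-depth convolution (SPD-Conv) module in place of the traditional convolution module; by convolving each feature map, it retains all the information in the channel dimension, improving performance on low-resolution images and small objects. Second, a position-information-based attention mechanism is introduced to capture cross-channel, direction-aware, and position-aware information, so that human targets are located and recognized more accurately. Finally, a large selective kernel (LSKNet) is introduced in the feature extraction module to adjust the receptive field dynamically, handling the complex environmental information in fall detection scenes and improving the network's perception and detection accuracy. Experimental results show that on the public Human Fall dataset, CASL-YOLO reaches an mAP@0.5 of 96.8%, outperforming the baseline YOLOv8n, with only 3.4×MiB of parameters and 11.7×10^9 of computation. Compared with other detection algorithms, CASL-YOLO achieves higher accuracy and performance with only a small increase in parameters and computation, while meeting the deployment requirements of real scenarios.
Keywords: fall detection; YOLOv8; attention mechanism; space-to-depth convolution; large selective kernel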
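Research on an improved ST-GCN human action recognition method fusing spatio-temporal attention (Cited by: 1)
17
Authors: 雷建云, 梁钧, 夏梦, 张慧丽, 田祚汉. 《中南民族大学学报(自然科学版)》 (Journal of South-Central Minzu University, Natural Science Edition), 2025, No. 4, pp. 526-535 (10 pages)
To address the failure of existing skeleton-based human action recognition algorithms to fully exploit the spatio-temporal features of motion, an improved graph convolutional network model fusing spatio-temporal attention is proposed. The model contains a spatial attention mechanism and a temporal attention mechanism, which extract global spatio-temporal features of actions along the spatial and temporal dimensions, respectively. The two are fused into a unified spatial-temporal graph convolutional network (ST-GCN) framework, enabling end-to-end training. Comparative experiments on two public datasets, Kinetics and NTU RGB+D, show that on NTU RGB+D the improved model achieves 82.37% Top-1 accuracy under the CS protocol and 89.84% Top-1 accuracy under the CV protocol, improvements of 0.87% in Top-1 accuracy and 1.54% in Top-5 accuracy, respectively, over the original ST-GCN. On Kinetics, the improved model achieves 31.78% accuracy, 1.08% higher than ST-GCN. This verifies the effectiveness of the improved method.
Keywords: graph convolutional network; skeleton data; action recognition; spatio-temporal attention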
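Spatial mismatch between tourism resource abundance and network attention and its driving mechanism in the context of the online-traffic economy: A case study of Yunnan Province (Cited by: 1)
18
Authors: 赵书虹, 孔营营, 李晓光, 李佳懿. 《自然资源学报》 (Journal of Natural Resources) [PKU Core], 2025, No. 4, pp. 934-953 (20 pages)
Network attention is a concentrated expression of market demand, and in the context of the online-traffic economy it offers a new lever for releasing tourism consumption potential and optimizing the allocation of tourism resources. Given that current hotspots of network attention are not fully aligned with the spatially based abundance of resource distribution, and that the use value of tourism resources is insufficiently converted into product market value, this study combines the gravity center model, the spatial mismatch index, and the geographical detector method to analyze, at overall and local scales, the spatial mismatch between tourism resource abundance and network attention in Yunnan Province from 2013 to 2022 and its driving mechanism. The results show: (1) In terms of overall mismatch, during the study period the gravity centers of tourism resource abundance and tourism network attention moved overall toward the southeast and the southwest of Chuxiong Yi Autonomous Prefecture, respectively; the distance between the two centers fluctuated widely, alternating repeatedly in an "approaching, receding, approaching" pattern. (2) In terms of regional mismatch, the spatial mismatch shows clear regional characteristics, with an alternating spatial pattern of "positive mismatch zone, negative mismatch zone, positive mismatch zone" from northwest to southeast. (3) In terms of driving mechanism, four intertwined driving forces, namely the foundational force of resource endowment, the potential force of human resources, the pulling force of economic development, and the catalytic force of information dissemination, jointly drive the evolution of tourism resource abundance and network attention in Yunnan Province from spatial mismatch toward spatial fit. The results not only respond to the practical need and academic concern for the matched development of the "quantity and quality" of regional tourism resources and network attention in the online-traffic economy, but also advance the theoretical explanation of the driving mechanism of their spatial mismatch.
Keywords: tourism resource abundance; network attention; spatial mismatch; driving mechanism; online-traffic economy; Yunnan Province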
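Gait phase prediction for exoskeleton robots based on a channel attention enhanced DGNN (Cited by: 1)
19
Authors: 颜建军, 许赢家, 林越, 金理, 江金林. 《华东理工大学学报(自然科学版)》 (Journal of East China University of Science and Technology, Natural Science Edition) [PKU Core], 2025, No. 1, pp. 110-118 (9 pages)
A gait phase prediction method for exoskeleton robots based on a Channel Attention Enhanced Directed Graph Neural Network (CA-DGNN) is used to improve the accuracy and reliability of gait phase prediction. First, a device for acquiring human lower-limb posture information was developed to collect walking gait data of the lower limbs and build a skeleton model of the human lower limbs. Next, a CA-DGNN-based gait phase prediction model was established to extract the motion features of the human gait phase and predict the gait phase at future time steps from data at the current time step. Finally, the effect of the sliding window size on algorithm performance was examined. This work improves the accuracy and robustness of gait phase prediction for exoskeleton robots and offers a new idea and method for research in this direction.
Keywords: gait phase prediction; inertial sensor; skeleton; spatio-temporal graph convolutional network; channel attention mechanism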
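Multi-scale convolutional vehicle trajectory prediction fusing a spatio-temporal attention mechanism (Cited by: 1)
20
Authors: 闫建红, 刘芝妍, 王震. 《计算机工程》 (Computer Engineering) [PKU Core], 2025, No. 8, pp. 406-414 (9 pages)
Vehicle trajectory prediction is an important part of autonomous driving, and improving its reliability and accuracy greatly benefits autonomous driving safety. Vehicles on the road are affected by the traffic environment. Considering traffic environment factors such as the motion and relative spatial positions of neighboring vehicles, a spatio-temporal attention mechanism is introduced on top of a long short-term memory (LSTM) neural network encoder-decoder model: the temporal attention layer attends to the historical trajectories of the target vehicle and neighboring vehicles, and the spatial attention layer attends to the relative spatial positions of vehicles. To strengthen feature extraction and achieve more complete feature fusion, multi-scale convolutional social pooling is used to enlarge the receptive field and fuse multi-scale features, and MCS-STA-LSTM, a vehicle trajectory prediction model based on the LSTM encoder-decoder architecture that fuses multi-scale convolutional social pooling and the spatio-temporal attention mechanism, is proposed. By learning the interdependencies of vehicle motion, the model obtains a multimodal predictive distribution over the target vehicle's future trajectory based on maneuver classes. The model is trained, validated, and tested on the public NGSIM dataset. Experimental results show that, compared with other trajectory prediction models, the method reduces root mean square error by 9.35% on average within 3 s and by 5.53% on average within 5 s, improving trajectory prediction accuracy and showing a greater advantage in short- and medium-term prediction.
Keywords: multi-scale convolutional social pooling; trajectory prediction; long short-term memory neural network; spatio-temporal attention mechanism; multi-scale feature fusion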