期刊文献+
共找到588篇文章
< 1 2 30 >
每页显示 20 50 100
Hybrid receptive field network for small object detection on drone view 被引量:1
1
作者 Zhaodong CHEN Hongbing JI +2 位作者 Yongquan ZHANG Wenke LIU Zhigang ZHU 《Chinese Journal of Aeronautics》 2025年第2期322-338,共17页
Drone-based small object detection is of great significance in practical applications such as military actions, disaster rescue, transportation, etc. However, the severe scale differences in objects captured by drones... Drone-based small object detection is of great significance in practical applications such as military actions, disaster rescue, transportation, etc. However, the severe scale differences in objects captured by drones and lack of detail information for small-scale objects make drone-based small object detection a formidable challenge. To address these issues, we first develop a mathematical model to explore how changing receptive fields impacts the polynomial fitting results. Subsequently, based on the obtained conclusions, we propose a simple but effective Hybrid Receptive Field Network (HRFNet), whose modules include Hybrid Feature Augmentation (HFA), Hybrid Feature Pyramid (HFP) and Dual Scale Head (DSH). Specifically, HFA employs parallel dilated convolution kernels of different sizes to extend shallow features with different receptive fields, committed to improving the multi-scale adaptability of the network;HFP enhances the perception of small objects by capturing contextual information across layers, while DSH reconstructs the original prediction head utilizing a set of high-resolution features and ultrahigh-resolution features. In addition, in order to train HRFNet, the corresponding dual-scale loss function is designed. Finally, comprehensive evaluation results on public benchmarks such as VisDrone-DET and TinyPerson demonstrate the robustness of the proposed method. Most impressively, the proposed HRFNet achieves a mAP of 51.0 on VisDrone-DET with 29.3 M parameters, which outperforms the extant state-of-the-art detectors. HRFNet also performs excellently in complex scenarios captured by drones, achieving the best performance on the CS-Drone dataset we built. 展开更多
关键词 Drone remote sensing Object detection on drone view Small object detector Hybrid receptive field Feature pyramid network Feature augmentation Multi-scale object detection
原文传递
Detection of the movement direction by the cells with directional receptive fields in the primary visual cortex of the cat
2
作者 Ausra Daugirdiene Algimantas Svegzda +1 位作者 Romualdas Satinskas Henrikas Vaitkevicius 《Health》 2010年第10期1232-1237,共6页
The study was performed on neurons with direction selective (DS) receptive fields (RFs) in the primary visual cortex of the cat. Preferred directions (PDs) of these cells to a single light spot and a system of two ide... The study was performed on neurons with direction selective (DS) receptive fields (RFs) in the primary visual cortex of the cat. Preferred directions (PDs) of these cells to a single light spot and a system of two identical light spots moving across the RF with a given angle between them were compared. Directional interactions appeared when the angles between the directions of the two moving spots were 30o or 60o. PD for 56% of the cells coincided with bisectors of these angles. These cells responded to a combination of the two moving stimuli as if only one stimulus moved in the RF in an intermediate direction. This direction coincided with PD of the DS neuron to a single spot. Also, the investigation revealed that DS neurons responded to stimuli moving at such angles as 180o (to preferred and opposite directions simultaneously). In the further experiment we investigated responses of the DS cells in the primary visual cortex of RF. The angle between the directions of the two moving spots was 60o. These cells responded to a combination of the two moving stimuli as if only one stimulus moved in RF in an intermediate direction. The more relative luminance of one of spots in pair was, the closer the intermediate direction approached to the direction of this spot). 展开更多
关键词 CAT PRIMARY Visual CORTEX Directionally SELECTIVE CELLS receptive field (RF)
暂未订购
Coal rock image recognition method based on improved CLBP and receptive field theory 被引量:3
3
作者 Chuanmeng Sun Ruijia Xu +2 位作者 Chong Wang Tiehua Ma Jiaxin Chen 《Deep Underground Science and Engineering》 2022年第2期165-173,共9页
Rapid coal-rock identification is one of the key technologies for intelligent and unmanned coal mining.Currently,the existing image recognition algorithms cannot satisfy practical needs in terms of recognition speed a... Rapid coal-rock identification is one of the key technologies for intelligent and unmanned coal mining.Currently,the existing image recognition algorithms cannot satisfy practical needs in terms of recognition speed and accuracy.In view of the evident differences between coal and rock in visual attributes such as color,gloss and texture,the complete local binary pattern(CLBP)image feature descriptor is introduced for coal and rock image recognition.Given that the original algorithm oversimplifies local texture features by ignoring imaging information from higher-order pixels and the concave and convex areas between adjacent sampling points,this paper proposes a higher-order differential median CLBP image feature descriptor to replace the original CLBP center pixel gray with a local gray median,and replace the binary differential with a second-order differential.Meanwhile,for the high dimensionality of CLBP descriptor histogram and feature redundancy,deep learning perceptual field theory is introduced to realize data nonlinear dimensionality reduction and deep feature extraction.With relevant experiments conducted,the following conclusion can be drawn:(1)Compared with that of the original CLBP,the recognition accuracy of the improved CLBP algorithm is greatly improved and finally stabilized above 94.3%under strong noise interference;(2)Compared with that of the original CLBP model,the single image recognition time of the coal rock image recognition model fusing the improved CLBP and the receptive field theory is 0.0035 s,a reduction of 71.0%;compared with the improved CLBP model(without the fusion of receptive field theory),it can shorten the recognition time by 97.0%,but the accuracy rate still maintains more than 98.5%.The method offers a valuable technical reference for the fields of mineral development and deep mining. 展开更多
关键词 coal-rock identification complete local binary pattern receptive field texture feature
原文传递
Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network 被引量:1
4
作者 Xuan Zhou Jianping Yi 《Intelligent Automation & Soft Computing》 SCIE 2023年第8期2103-2116,共14页
Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recog-nition.Previous action recognition methods utilize a fixed spatiotemporal windo... Mining more discriminative temporal features to enrich temporal context representation is considered the key to fine-grained action recog-nition.Previous action recognition methods utilize a fixed spatiotemporal window to learn local video representation.However,these methods failed to capture complex motion patterns due to their limited receptive field.To solve the above problems,this paper proposes a lightweight Temporal Pyramid Excitation(TPE)module to capture the short,medium,and long-term temporal context.In this method,Temporal Pyramid(TP)module can effectively expand the temporal receptive field of the network by using the multi-temporal kernel decomposition without significantly increasing the computational cost.In addition,the Multi Excitation module can emphasize temporal importance to enhance the temporal feature representation learning.TPE can be integrated into ResNet50,and building a compact video learning framework-TPENet.Extensive validation experiments on several challenging benchmark(Something-Something V1,Something-Something V2,UCF-101,and HMDB51)datasets demonstrate that our method achieves a preferable balance between computation and accuracy. 展开更多
关键词 fine-grained action recognition temporal pyramid excitation module temporal receptive multi-excitation module
在线阅读 下载PDF
基于YOLOv8s的路面缺陷检测算法研究
5
作者 朱灵茜 于泳波 +2 位作者 毛健 李庆党 孙振 《电子设计工程》 2026年第1期150-154,共5页
针对路面缺陷数据尺寸差距较大,数据类别之间的距离较近,背景复杂,漏检误检率高等问题,提出一种基于改进YOLOv8s的路面缺陷检测算法。在主干使用感受野块来全面感知输入数据的内容,充分提取上下文信息,并引入注意力机制,以关注网络有用... 针对路面缺陷数据尺寸差距较大,数据类别之间的距离较近,背景复杂,漏检误检率高等问题,提出一种基于改进YOLOv8s的路面缺陷检测算法。在主干使用感受野块来全面感知输入数据的内容,充分提取上下文信息,并引入注意力机制,以关注网络有用信息,抑制无用信息;在颈部使用DAMO-YOLO的高效重参数化广义特征金字塔网络(RepGFPN),将高级语义信息和低级空间信息进行充分交互,传递有效的信息,提高检测精度;在颈部使用轻量级的组合混合卷积(GSConv)替换常规卷积,并且引入到C2f模块中,在降低参数量的同时保持检测精度。算法在RDD2022数据集上进行验证,实验结果表明,改进后的YOLOv8s平均检测精度(mAP@0.5)达到78.1%,相比于原模型提高了3.5%,参数量降低了24%,满足路面缺陷检测在精度和速度上的要求。 展开更多
关键词 路面缺陷检测 YOLOv8s 感受野块 广义特征金字塔网络 组合混合卷积
在线阅读 下载PDF
重叠感受野间隙耦合的方向差分运动感知神经网络
6
作者 陶星宇 胡滨 《计算机科学与探索》 北大核心 2026年第1期122-142,共21页
认知神经科学研究发现,脊椎动物视网膜存在方向差分神经元(OMS-DS),具有局部-全局方向差异偏好的视觉神经响应特性,有助于动态视觉场景中区分前景局部和背景全局之间的运动差异,但目前鲜有该神经特性在视觉运动感知问题研究的计算模型... 认知神经科学研究发现,脊椎动物视网膜存在方向差分神经元(OMS-DS),具有局部-全局方向差异偏好的视觉神经响应特性,有助于动态视觉场景中区分前景局部和背景全局之间的运动差异,但目前鲜有该神经特性在视觉运动感知问题研究的计算模型报道。针对该问题,基于蝾螈视网膜差分运动响应特性,提出一种方向差分运动感知神经网络(dirDMPNN)。所提出的神经网络包含突触前和突触后两部分。突触前网络感知运动变化在视野域中引发的低阶视觉线索;突触后网络基于重叠感受野间隙连接耦合机制以对前景、背景方向差分实现响应输出。系统性实验研究表明,dirDMPNN能感知视野域中平移自运动的前景-背景方向运动线索,并对其差分运动模式输出强烈神经尖峰响应。该工作涉及生物视脑神经机制启发的视觉信息加工处理,可为自运动视觉场景,诸如空间飞行器、无人驾驶、机器人自主导航等自治环境下的运动感知与识别、目标检测与跟踪问题的研究与解决提供新思想、新方法。 展开更多
关键词 方向差分感知 间隙连接耦合 重叠感受野 方向差分神经元 视网膜神经 视觉运动感知 自运动视觉场景
在线阅读 下载PDF
基于改进YOLOv8n的砀山酥梨识别算法
7
作者 赵天 曹浩 +1 位作者 董梦婷 赵旭 《安徽科技学院学报》 2026年第1期66-74,共9页
针对砀山酥梨在识别过程中由于果实之间的间距较小且识别背景复杂,容易造成检测算法的误识别和漏检等问题,通过改进YOLOv8n模型,以提升对酥梨密集果实的识别精度,从而更有效地提升砀山酥梨的识别精度。本文采用感受野注意力卷积提高Back... 针对砀山酥梨在识别过程中由于果实之间的间距较小且识别背景复杂,容易造成检测算法的误识别和漏检等问题,通过改进YOLOv8n模型,以提升对酥梨密集果实的识别精度,从而更有效地提升砀山酥梨的识别精度。本文采用感受野注意力卷积提高Backbone的特征提取能力,提升小目标检测性能;引入高级筛选特征融合金字塔网络增强多尺度特征融合能力,提高不同尺度目标的检测效果;使用混合注意力变化器检测头结合CNN与Transformer,增强高分辨率目标的检测能力。结果表明,改进后的YOLOv8n-RHH相较于原始YOLOv8n模型,mAP@50从83.7%提升至88.2%,召回率从74.7%提升至80.3%,显示出更高的鲁棒性和可靠性。改进后的YOLOv8n-RHH模型在砀山酥梨识别任务中表现较为出色,尤其对多尺度目标的识别较为准确。 展开更多
关键词 混合注意力变化器检测头 高级筛选特征融合金字塔网络 感受野注意力卷积 目标识别
在线阅读 下载PDF
Local field potentials,spiking activity,and receptive fields in human visual cortex 被引量:1
8
作者 Lu Luo Xiongfei Wang +5 位作者 Junshi Lu Guanpeng Chen Guoming Luan Wu Li Qian Wang Fang Fang 《Science China(Life Sciences)》 SCIE CAS CSCD 2024年第3期543-554,共12页
The concept of receptive field(RF) is central to sensory neuroscience. Neuronal RF properties have been substantially studied in animals,while those in humans remain nearly unexplored. Here, we measured neuronal RFs w... The concept of receptive field(RF) is central to sensory neuroscience. Neuronal RF properties have been substantially studied in animals,while those in humans remain nearly unexplored. Here, we measured neuronal RFs with intracranial local field potentials(LFPs) and spiking activity in human visual cortex(V1/V2/V3). We recorded LFPs via macro-contacts and discovered that RF sizes estimated from lowfrequency activity(LFA, 0.5–30 Hz) were larger than those estimated from low-gamma activity(LGA, 30–60 Hz) and high-gamma activity(HGA, 60–150 Hz). We then took a rare opportunity to record LFPs and spiking activity via microwires in V1 simultaneously. We found that RF sizes and temporal profiles measured from LGA and HGA closely matched those from spiking activity. In sum, this study reveals that spiking activity of neurons in human visual cortex could be well approximated by LGA and HGA in RF estimation and temporal profile measurement, implying the pivotal functions of LGA and HGA in early visual information processing. 展开更多
关键词 human visual cortex receptive field intracranial EEG local field potential spiking activity
暂未订购
Spatiotemporal organization of simple-cell receptive fields in area 18 of cat's cortex
9
作者 雷静江 李朝义 《Science China(Life Sciences)》 SCIE CAS 1998年第1期1-8,共8页
Spatiotemporal structures of receptive fields (RF) have been studied for simple cells in area 18 of cat by measuring the temporal transfer function (TTF) over different locations (subregions) within the RF. The tempor... Spatiotemporal structures of receptive fields (RF) have been studied for simple cells in area 18 of cat by measuring the temporal transfer function (TTF) over different locations (subregions) within the RF. The temporal characteristics of different subregions differed from each other in the absolute phase shift (APS) to visual stimuli. Two types of relationships can be seen: (i)The APS varied continuously from one subregion to the next; (ii) A 180° phase jump was seen as the stimulus position changed somewhere within the receptive field. Spatiotemporal receptive field profiles have been determined by applying reverse Fourier analysis to responses in the frequency domain. For the continuous type, spatial and temporal characteristics cannot be dissociated (space time inseparable) and the spatiotemporal structure is oriented. On the contrary, the spatial and temporal characteristics for the jumping type can be dissociated (space time separable) and the structure is not oriented in the space time plane. Based on the APSs measured at different subregions, the optimal direction of motion and optimal spatial frequency of neurons can be predicted. 展开更多
关键词 visual CORTEX SIMPLE cell receptive field SPATIOTEMPORAL characteristics response phase direction selectivity.
原文传递
Learning adaptive receptive fields for deep image parsing networks
10
作者 Zhen Wei Yao Sun +1 位作者 Junyu Lin Si Liu 《Computational Visual Media》 CSCD 2018年第3期231-244,共14页
In this paper,we introduce a novel approach to automatically regulate receptive fields in deep image parsing networks.Unlike previous work which placed much importance on obtaining better receptive fields using manual... In this paper,we introduce a novel approach to automatically regulate receptive fields in deep image parsing networks.Unlike previous work which placed much importance on obtaining better receptive fields using manually selected dilated convolutional kernels,our approach uses two affine transformation layers in the network’s backbone and operates on feature maps.Feature maps are inflated or shrunk by the new layer,thereby changing the receptive fields in the following layers.By use of end-to-end training,the whole framework is data-driven,without laborious manual intervention.The proposed method is generic across datasets and different tasks.We have conducted extensive experiments on both general image parsing tasks,and face parsing tasks as concrete examples,to demonstrate the method’s superior ability to regulate over manual designs. 展开更多
关键词 semantic segmentation receptive field DATA-DRIVEN face parsing
原文传递
Hebbian-based mean shift for learning the diverse shapes of V1 simple cell receptive fields
11
作者 Jiqian Liu Yunde Jia 《Chinese Science Bulletin》 SCIE EI CAS 2014年第4期452-458,共7页
The L0-norm constraint in sparse coding has the advantage of producing the same diversity of receptive field shapes as physiology data,but is difficult for analysis.It remains a challenging issue to understand how the... The L0-norm constraint in sparse coding has the advantage of producing the same diversity of receptive field shapes as physiology data,but is difficult for analysis.It remains a challenging issue to understand how the diverse shapes of V1 simple cell receptive fields emerge in visual cortex.This paper presents a biologically plausible learning algorithm,named Hebbian-based mean shift,for this problem.The L0-norm constraint optimizes the number of basis functions rather than their coefficients.We report that the optimization procedure is essentially a 0–1 programming of the selection of basis functions.By assuming that the basis functions are independently selected from a basis set,we find the spatial distribution of input samples containing a special basis function has a star shape and peaks at this basis function.Thus,learning the basis functions for sparse coding with the L0-norm can be interpreted as mode detection where the basis functions are the modes of the kernel density estimate.We employ mean shift to detect modes and prove that the updating rule for the mean shift is Hebbian.The simulation results demonstrate the robustness of the proposed algorithm in producing both Gabor-like and blob-like basis functions. 展开更多
关键词 移动学习 感受野 单细胞 形状 均值 范数约束 稀疏编码 学习算法
在线阅读 下载PDF
改进YOLOv11n的无人机小目标检测算法 被引量:19
12
作者 李彬 李生林 《计算机工程与应用》 北大核心 2025年第7期96-104,共9页
为了有效应对无人机航拍中小目标检测面临的复杂背景、目标密集、目标微小化和移动端部署等挑战,对YOLOv11n模型进行了改进。使用RFCBAMConv模块改进C3k2,增强了特征提取能力。设计了膨胀特征金字塔卷积(dilated featurepyramidconvolut... 为了有效应对无人机航拍中小目标检测面临的复杂背景、目标密集、目标微小化和移动端部署等挑战,对YOLOv11n模型进行了改进。使用RFCBAMConv模块改进C3k2,增强了特征提取能力。设计了膨胀特征金字塔卷积(dilated featurepyramidconvolution,DFPC)模块,替代了原有的SPPF层。通过多尺度膨胀卷积,加强了对无人机小目标细节特征的提取。提出了一种新的特征金字塔结构,在P2层增加160×160尺寸的特征图输出,以提取小目标特征信息。该方法替代了传统通过添加P2小目标检测头的做法。引入了CSPOK模块和ContextGuidedBlock_Down(CGBD)卷积,显著提升了全局特征的提取能力和多尺度特征的融合能力。采用动态检测头(DyHead)替代了原有的检测头,提升了模型的目标检测精度。实验结果表明,改进模型在VisDrone数据集上的mAP@0.5和mAP@0.5:0.95指标分别提高了0.071和0.049。此外,在AI-TOD和SODA-A等数据集上的泛化实验也显示,改进模型在mAP@0.5上分别获得0.055和0.048的提升,充分验证了模型的有效性和泛用性。 展开更多
关键词 小目标检测 YOLOv11 特征提取 感受野注意力
在线阅读 下载PDF
基于RFCARep-YOLOv8n的光伏电池缺陷检测算法 被引量:6
13
作者 张冀 王文彬 余洋 《计算机工程与应用》 北大核心 2025年第3期131-143,共13页
针对光伏电池缺陷图像存在目标遮掩、复杂背景以及人眼难以分辨的小目标缺陷等问题,提出一种基于感受野坐标注意力和重参数的YOLOv8n光伏电池缺陷检测算法,简记为RFCARep-YOLOv8n。提出一种基于感受野坐标注意力的重参数模块代替瓶颈模... 针对光伏电池缺陷图像存在目标遮掩、复杂背景以及人眼难以分辨的小目标缺陷等问题,提出一种基于感受野坐标注意力和重参数的YOLOv8n光伏电池缺陷检测算法,简记为RFCARep-YOLOv8n。提出一种基于感受野坐标注意力的重参数模块代替瓶颈模块进行特征提取,扩大对全局信息的关注度提高语义表达能力,抑制遮掩物和复杂背景的干扰;在快速空间金字塔池化后添加可分离大核聚集模块,通过提高长距离特征依赖增强全局特征信息融合;在特征融合部分使用多尺度序列特征融合颈部网络,结合多尺度辅助检测头,减少细节特征丢失,提高小目标缺陷检测能力。实验结果表明,该模型在PASCAL VOC数据集中较基准模型mAP@0.5和mAP@0.5:0.95分别提升2.3和2.1个百分点,同时在光伏缺陷数据集中mAP@0.5达到87.6%,较基准模型提升3.5个百分点,参数量为3.23×10^(6),保持了基准模型的轻量参数同时提高检测性能。 展开更多
关键词 光伏缺陷 YOLOv8n 感受野注意力 特征融合 重参数
在线阅读 下载PDF
基于大内核自适应融合的小目标检测算法 被引量:1
14
作者 王磊 胡君红 任洋 《计算机工程》 北大核心 2025年第6期65-73,共9页
针对当前基于卷积神经网络的单阶段目标检测算法(YOLO系列、VFNet等)在高空拍摄场景下目标背景复杂、检测精度低、特征混叠等问题,提出一种端到端的目标检测算法CSPENet。首先,采用基于大内核深度卷积CSPNeXt作为模型主干,提高模型捕捉... 针对当前基于卷积神经网络的单阶段目标检测算法(YOLO系列、VFNet等)在高空拍摄场景下目标背景复杂、检测精度低、特征混叠等问题,提出一种端到端的目标检测算法CSPENet。首先,采用基于大内核深度卷积CSPNeXt作为模型主干,提高模型捕捉全局上下文的能力;其次,通过引入特征细化模块(FRM)在空间和通道维度上生成自适应权重,可有效抑制混叠特征,并在特征融合阶段添加基于移动网络的感受野注意力(RFA)机制解决大内核参数共享问题;最后,采用EIoU损失函数作为模型的回归损失函数,并拆分预测框和真实框纵横比的影响因子,以提高模型收敛速度并改善定位效果。实验结果表明,CSPENet在VisDrone-DET数据集上相对于DINO算法平均准确率均值提升4.4百分点,为小目标检测算法的研究及其应用提供新的参考方案。 展开更多
关键词 大内核 小目标 上下文信息 特征细化 自适应融合 感受野
在线阅读 下载PDF
上下文感知多感受野融合网络的定向遥感目标检测 被引量:1
15
作者 姚婷婷 肇恒鑫 +1 位作者 冯子豪 胡青 《电子与信息学报》 北大核心 2025年第1期233-243,共11页
以广距鸟瞰视角拍摄获取的遥感图像通常具有目标种类多、尺度变化大以及背景信息丰富等特点,为目标检测任务带来巨大挑战。针对遥感图像成像特点,该文设计一种上下文感知多感受野融合网络,通过充分挖掘深度网络中遥感图像在不同尺寸特... 以广距鸟瞰视角拍摄获取的遥感图像通常具有目标种类多、尺度变化大以及背景信息丰富等特点,为目标检测任务带来巨大挑战。针对遥感图像成像特点,该文设计一种上下文感知多感受野融合网络,通过充分挖掘深度网络中遥感图像在不同尺寸特征描述下所包含的上下文关联信息,提高图像特征描述力,进而提升遥感目标检测精度。首先,在特征金字塔前4层网络中构建了感受野扩张模块,通过扩大网络在不同尺度特征图上的感受野范围,增强网络对不同尺度遥感目标的感知能力;进一步,构建了高层特征聚合模块,通过将特征金字塔网络中高层语义信息聚合到低层特征中,从而将特征图中所包含的多尺度上下文信息进行有效融合;最后,在双阶段定向目标检测框架下设计了特征细化区域建议网络。通过对一阶段提案进行精细化处理,提升提案准确性,进而提高二阶段兴趣区域对齐网络得到的不同成像方向下的遥感目标检测性能。在公测数据集DIOR-R和HRSC2016上的定性和定量的对比实验结果证明,所提方法对不同种类和尺度大小的遥感目标均能实现更加准确的检测。 展开更多
关键词 遥感图像 深度学习 目标检测 多感受野融合
在线阅读 下载PDF
基于改进YOLOv7tiny的无人机小目标检测 被引量:1
16
作者 倪健 申奥 王峥 《计算机工程与设计》 北大核心 2025年第11期3065-3073,共9页
针对航拍图像中小目标密集遮挡等问题,提出一种基于YOLOv7tiny改进的小目标检测算法。增加一个微小目标检测层,增强模型对特征的捕捉能力;使用自适应空间融合改进FPN结构,促进主干网络输出的非相邻层特征图融合;提出多尺度感知卷积MSACo... 针对航拍图像中小目标密集遮挡等问题,提出一种基于YOLOv7tiny改进的小目标检测算法。增加一个微小目标检测层,增强模型对特征的捕捉能力;使用自适应空间融合改进FPN结构,促进主干网络输出的非相邻层特征图融合;提出多尺度感知卷积MSAConv,增强卷积神经网络捕获目标特征信息的能力;引入RFCA注意力机制模块,解决参数共享问题,增强特征提取能力。实验结果表明,改进后模型在VisDrone2019数据集上mAP50达到了40.6%,较基准模型提升了5.2%。 展开更多
关键词 无人机 YOLOv7tiny 小目标检测 自适应空间融合 感受野注意力 多尺度特征信息 深度学习
在线阅读 下载PDF
面向无人机载平台的轻量级小目标检测算法
17
作者 党兰学 李赞 +2 位作者 苗长伟 崔金华 赵雅靓 《河南大学学报(自然科学版)》 北大核心 2025年第1期1-11,共11页
无人机载平台中的目标检测在军事和民用领域具有重要的应用价值.然而,现有的检测方法通常侧重于多尺度目标检测,缺乏对小目标的优化,且模型复杂度过高,难以在资源受限的机载平台中应用.为此,本文提出了一种面向无人机载平台的轻量级小... 无人机载平台中的目标检测在军事和民用领域具有重要的应用价值.然而,现有的检测方法通常侧重于多尺度目标检测,缺乏对小目标的优化,且模型复杂度过高,难以在资源受限的机载平台中应用.为此,本文提出了一种面向无人机载平台的轻量级小目标检测算法YOLOH(You Only Look One Head).首先,针对小目标对基准网络优化,移除深层特征以减少模型参数量,增加浅层特征以获取小目标信息.其次,在特征融合部分加入NAM注意力,增强对小目标的感知能力.接着,设计了多感受野聚焦模块MRFF,以挖掘特征图的感受野信息,增强模型的多尺度检测能力.最后,使用LAMP算法对模型剪枝,去除冗余神经元以压缩模型.实验结果表明,与YOLOv8s相比,YOLOH的模型参数量和计算量分别减少了92%和35%,FPS提高了57%.在VisDrone2019和CARPK数据集上AP_(S)分别提高了3.3%和3.7%.与其他轻量级模型相比,所提YOLOH具有最佳的整体性能,同时平衡了模型大小、精度和推理速度,为无人机载平台的目标检测提供了有效的解决方案. 展开更多
关键词 机载平台 YOLOH 小目标检测 轻量级 多感受野
原文传递
自适应采样与重影多尺度特征融合的轻量化焊缝缺陷检测
18
作者 鲁斌 杨烜 +1 位作者 杨振宇 高啸天 《系统仿真学报》 北大核心 2025年第8期1978-1990,共13页
为提升焊接缺陷识别的准确率和速度,并实现模型的轻量化,提出了一种基于YOLOv8的轻量化焊缝缺陷检测网络LAW-YOLO(light adaptive-weight sampling-YOLO)。设计了一种轻量级自适应权重采样LAWS模块,通过学习感受野区域内交互的特征来构... 为提升焊接缺陷识别的准确率和速度,并实现模型的轻量化,提出了一种基于YOLOv8的轻量化焊缝缺陷检测网络LAW-YOLO(light adaptive-weight sampling-YOLO)。设计了一种轻量级自适应权重采样LAWS模块,通过学习感受野区域内交互的特征来构建自适应权重注意力特征图。采用优化的高效加权双向特征金字塔网络作为LAW-YOLO中的特征提取网络,设计重影多尺度采样模块并引用了混合注意力机制,以增强对小目标缺陷的检测能力。实验结果表明:该方法在SteelTube数据集中mAP0.5达到97.6%,处理数据速度可达91帧/s,比基线模型提高了5.5%的平均精度及4.6%的处理速度,在保持高效性能的同时减少了25.3%的计算量和50%的模型大小,更便于部署在边缘设备上进行场景作业。 展开更多
关键词 缺陷检测 YOLOv8 重影多尺度卷积 感受野空间特征 混合注意力机制
原文传递
基于改进YOLO的光伏组件缺陷检测算法
19
作者 王红君 杨溢 +1 位作者 赵辉 岳有军 《计算机工程与设计》 北大核心 2025年第11期3317-3325,共9页
针对光伏组件红外图像检测中的误检、漏检和低精度问题,提出改进的YOLOv8算法。使用VanillaNet简化主干网络,提出双层transformer的DBoTNet提升场景理解能力,结合感受野注意力、CBAM和V7DownSampling模块增强特征提取能力,改用Focaler-W... 针对光伏组件红外图像检测中的误检、漏检和低精度问题,提出改进的YOLOv8算法。使用VanillaNet简化主干网络,提出双层transformer的DBoTNet提升场景理解能力,结合感受野注意力、CBAM和V7DownSampling模块增强特征提取能力,改用Focaler-WIoU损失函数进一步提升检测性能。在Solar2024数据集上的实验结果表明,改进模型mAP提升4.6%至75.5%,参数量仅1.92 M,验证了方法有效性。 展开更多
关键词 无人机 缺陷检测 深度学习 多头自注意力机制 损失函数 感受野注意力机制 轻量化
在线阅读 下载PDF
上下文信息和多尺度特征序列引导的遥感图像检测
20
作者 肖振久 李士博 +1 位作者 曲海成 李富坤 《中国图象图形学报》 北大核心 2025年第7期2570-2583,共14页
目的针对遥感图像(remote sensing image,RSI)检测中目标尺寸小且密集、尺度变化大,尤其在复杂背景信息下容易出现漏检和误检问题,提出一种上下文信息和多尺度特征序列引导的遥感图像检测方法,以提升遥感图像的检测精度。方法首先,设计... 目的针对遥感图像(remote sensing image,RSI)检测中目标尺寸小且密集、尺度变化大,尤其在复杂背景信息下容易出现漏检和误检问题,提出一种上下文信息和多尺度特征序列引导的遥感图像检测方法,以提升遥感图像的检测精度。方法首先,设计自适应大感受野机制(adaptive large receptive field,ALRF)用于特征提取。该机制通过级联不同扩张率的深度卷积进行分层特征提取,并利用通道和空间注意力对提取的特征进行通道加权和空间融合,使模型能够自适应地调整感受野大小,从而实现遥感图像上下文信息的有效利用。其次,为解决颈部网络特征融合过程中小目标语义信息丢失问题,设计多尺度特征序列融合架构(multi-scale feature fusion,MFF)。该架构通过构建多尺度特征序列,并结合浅层语义特征信息,实现复杂背景下多尺度全局信息的有效融合,从而减轻深层网络中特征模糊性对小目标局部细节捕捉的影响。最后,因传统交并比(intersection over union,IoU)对小目标位置偏差过于敏感,引入归一化Wasserstein距离(normalized Wasserstein distance,NWD)。NWD将边界框建模为二维高斯分布,计算这些分布间的Wasserstein距离来衡量边界框的相似性,从而降低小目标位置偏差敏感性。结果在NWPU VHR-10(Northwestern Polytechnical University very high resolution10)和DIOR(dataset for object detection in aerial images)数据集上与10种方法进行综合比较,结果表明,提出的方法优于对比方法,平均精度(average precision,AP)分别达到93.15%和80.89%,相较于基准模型YOLOv8n(you only look once version 8 nano),提升了5.48%和2.97%,同时参数量下降6.96%。结论提出一种上下文信息和多尺度特征序列引导的遥感图像检测方法,该方法提升目标的定位能力,改善复杂背景下遥感图像检测中的漏检和误检问题。 展开更多
关键词 遥感图像(RSI) 目标检测 感受野(RF) 特征融合 归一化Wasserstein距离(NWD)
原文传递
上一页 1 2 30 下一页 到第
使用帮助 返回顶部