期刊文献+
共找到161篇文章
< 1 2 9 >
每页显示 20 50 100
CT-MFENet:Context Transformer and Multi-Scale Feature Extraction Network via Global-Local Features Fusion for Retinal Vessels Segmentation
1
作者 SHAO Dangguo YANG Yuanbiao +1 位作者 MA Lei YI Sanli 《Journal of Shanghai Jiaotong university(Science)》 2025年第4期668-682,共15页
Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete v... Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete vessel segmentation and poor continuity.In this study,we propose CT-MFENet to address the aforementioned issues.First,the use of context transformer(CT)allows for the integration of contextual feature information,which helps establish the connection between pixels and solve the problem of incomplete vessel continuity.Second,multi-scale dense residual networks are used instead of traditional CNN to address the issue of inadequate local feature extraction when the model encounters vessels at multiple scales.In the decoding stage,we introduce a local-global fusion module.It enhances the localization of vascular information and reduces the semantic gap between high-and low-level features.To address the class imbalance in retinal images,we propose a hybrid loss function that enhances the segmentation ability of the model for topological structures.We conducted experiments on the publicly available DRIVE,CHASEDB1,STARE,and IOSTAR datasets.The experimental results show that our CT-MFENet performs better than most existing methods,including the baseline U-Net. 展开更多
关键词 retinal vessel segmentation context transformer(CT) multi-scale dense residual hybrid loss function global-local fusion
原文传递
Fault diagnosis of rolling bearing based on two-dimensional composite multi-scale ensemble Gramian dispersion entropy
2
作者 Wenqing Ding Jinde Zheng +3 位作者 Jianghong Li Haiyang Pan Jian Cheng Jinyu Tong 《Chinese Journal of Mechanical Engineering》 2026年第1期125-144,共20页
One-dimensional ensemble dispersion entropy(EDE1D)is an effective nonlinear dynamic analysis method for complexity measurement of time series.However,it is only restricted to assessing the complexity of one-di-mension... One-dimensional ensemble dispersion entropy(EDE1D)is an effective nonlinear dynamic analysis method for complexity measurement of time series.However,it is only restricted to assessing the complexity of one-di-mensional time series(TS1d)with the extracted complexity features only at a single scale.Aiming at these problems,a new nonlinear dynamic analysis method termed two-dimensional composite multi-scale ensemble Gramian dispersion entropy(CMEGDE_(2D))is proposed in this paper.First,the TS_(1D) is transformed into a two-dimensional image(I_(2D))by using Gramian angular fields(GAF)with more internal data structures and geometri features,which preserve the global characteristics and time dependence of vibration signals.Second,the I2D is analyzed at multiple scales through the composite coarse-graining method,which overcomes the limitation of a single scale and provides greater stability compared to traditional coarse-graining methods.Subsequently,a new fault diagnosis method of rolling bearing is proposed based on the proposed CMEGDE_(2D) for fault feature ex-traction and the chicken swarm algorithm optimized support vector machine(CsO-SvM)for fault pattern identification.The simulation signals and two data sets of rolling bearings are utilized to verify the effectiveness of the proposed fault diagnosis method.The results demonstrate that the proposed method has stronger dis-crimination ability,higher fault diagnosis accuracy and better stability than the other compared methods. 展开更多
关键词 Composite multi-scale ensemble Gramian dispersion entropy Dispersion entropy Fault diagnosis Rolling bearing Feature extraction
在线阅读 下载PDF
Feature Extraction by Multi-Scale Principal Component Analysis and Classification in Spectral Domain 被引量:2
3
作者 Shengkun Xie Anna T. Lawnizak +1 位作者 Pietro Lio Sridhar Krishnan 《Engineering(科研)》 2013年第10期268-271,共4页
Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (... Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals. 展开更多
关键词 multi-scale Principal Component Analysis Discrete WAVELET TRANSFORM FEATURE extraction Signal CLASSIFICATION Empirical CLASSIFICATION
在线阅读 下载PDF
MSC-YOLO:Improved YOLOv7 Based on Multi-Scale Spatial Context for Small Object Detection in UAV-View
4
作者 Xiangyan Tang Chengchun Ruan +2 位作者 Xiulai Li Binbin Li Cebin Fu 《Computers, Materials & Continua》 SCIE EI 2024年第4期983-1003,共21页
Accurately identifying small objects in high-resolution aerial images presents a complex and crucial task in thefield of small object detection on unmanned aerial vehicles(UAVs).This task is challenging due to variati... Accurately identifying small objects in high-resolution aerial images presents a complex and crucial task in thefield of small object detection on unmanned aerial vehicles(UAVs).This task is challenging due to variations inUAV flight altitude,differences in object scales,as well as factors like flight speed and motion blur.To enhancethe detection efficacy of small targets in drone aerial imagery,we propose an enhanced You Only Look Onceversion 7(YOLOv7)algorithm based on multi-scale spatial context.We build the MSC-YOLO model,whichincorporates an additional prediction head,denoted as P2,to improve adaptability for small objects.We replaceconventional downsampling with a Spatial-to-Depth Convolutional Combination(CSPDC)module to mitigatethe loss of intricate feature details related to small objects.Furthermore,we propose a Spatial Context Pyramidwith Multi-Scale Attention(SCPMA)module,which captures spatial and channel-dependent features of smalltargets acrossmultiple scales.This module enhances the perception of spatial contextual features and the utilizationof multiscale feature information.On the Visdrone2023 and UAVDT datasets,MSC-YOLO achieves remarkableresults,outperforming the baseline method YOLOv7 by 3.0%in terms ofmean average precision(mAP).The MSCYOLOalgorithm proposed in this paper has demonstrated satisfactory performance in detecting small targets inUAV aerial photography,providing strong support for practical applications. 展开更多
关键词 Small object detection YOLOv7 multi-scale attention spatial context
在线阅读 下载PDF
A multi-scale convolutional auto-encoder and its application in fault diagnosis of rolling bearings 被引量:12
5
作者 Ding Yunhao Jia Minping 《Journal of Southeast University(English Edition)》 EI CAS 2019年第4期417-423,共7页
Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on ... Aiming at the difficulty of fault identification caused by manual extraction of fault features of rotating machinery,a one-dimensional multi-scale convolutional auto-encoder fault diagnosis model is proposed,based on the standard convolutional auto-encoder.In this model,the parallel convolutional and deconvolutional kernels of different scales are used to extract the features from the input signal and reconstruct the input signal;then the feature map extracted by multi-scale convolutional kernels is used as the input of the classifier;and finally the parameters of the whole model are fine-tuned using labeled data.Experiments on one set of simulation fault data and two sets of rolling bearing fault data are conducted to validate the proposed method.The results show that the model can achieve 99.75%,99.3%and 100%diagnostic accuracy,respectively.In addition,the diagnostic accuracy and reconstruction error of the one-dimensional multi-scale convolutional auto-encoder are compared with traditional machine learning,convolutional neural networks and a traditional convolutional auto-encoder.The final results show that the proposed model has a better recognition effect for rolling bearing fault data. 展开更多
关键词 fault diagnosis deep learning convolutional auto-encoder multi-scale convolutional kernel feature extraction
在线阅读 下载PDF
Multi-Scale Mixed Attention Tea Shoot Instance Segmentation Model 被引量:1
6
作者 Dongmei Chen Peipei Cao +5 位作者 Lijie Yan Huidong Chen Jia Lin Xin Li Lin Yuan Kaihua Wu 《Phyton-International Journal of Experimental Botany》 SCIE 2024年第2期261-275,共15页
Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often... Tea leaf picking is a crucial stage in tea production that directly influences the quality and value of the tea.Traditional tea-picking machines may compromise the quality of the tea leaves.High-quality teas are often handpicked and need more delicate operations in intelligent picking machines.Compared with traditional image processing techniques,deep learning models have stronger feature extraction capabilities,and better generalization and are more suitable for practical tea shoot harvesting.However,current research mostly focuses on shoot detection and cannot directly accomplish end-to-end shoot segmentation tasks.We propose a tea shoot instance segmentation model based on multi-scale mixed attention(Mask2FusionNet)using a dataset from the tea garden in Hangzhou.We further analyzed the characteristics of the tea shoot dataset,where the proportion of small to medium-sized targets is 89.9%.Our algorithm is compared with several mainstream object segmentation algorithms,and the results demonstrate that our model achieves an accuracy of 82%in recognizing the tea shoots,showing a better performance compared to other models.Through ablation experiments,we found that ResNet50,PointRend strategy,and the Feature Pyramid Network(FPN)architecture can improve performance by 1.6%,1.4%,and 2.4%,respectively.These experiments demonstrated that our proposed multi-scale and point selection strategy optimizes the feature extraction capability for overlapping small targets.The results indicate that the proposed Mask2FusionNet model can perform the shoot segmentation in unstructured environments,realizing the individual distinction of tea shoots,and complete extraction of the shoot edge contours with a segmentation accuracy of 82.0%.The research results can provide algorithmic support for the segmentation and intelligent harvesting of premium tea shoots at different scales. 展开更多
关键词 Tea shoots attention mechanism multi-scale feature extraction instance segmentation deep learning
在线阅读 下载PDF
Ship recognition based on HRRP via multi-scale sparse preserving method
7
作者 YANG Xueling ZHANG Gong SONG Hu 《Journal of Systems Engineering and Electronics》 SCIE CSCD 2024年第3期599-608,共10页
In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) ba... In order to extract the richer feature information of ship targets from sea clutter, and address the high dimensional data problem, a method termed as multi-scale fusion kernel sparse preserving projection(MSFKSPP) based on the maximum margin criterion(MMC) is proposed for recognizing the class of ship targets utilizing the high-resolution range profile(HRRP). Multi-scale fusion is introduced to capture the local and detailed information in small-scale features, and the global and contour information in large-scale features, offering help to extract the edge information from sea clutter and further improving the target recognition accuracy. The proposed method can maximally preserve the multi-scale fusion sparse of data and maximize the class separability in the reduced dimensionality by reproducing kernel Hilbert space. Experimental results on the measured radar data show that the proposed method can effectively extract the features of ship target from sea clutter, further reduce the feature dimensionality, and improve target recognition performance. 展开更多
关键词 ship target recognition high-resolution range profile(HRRP) multi-scale fusion kernel sparse preserving projection(MSFKSPP) feature extraction dimensionality reduction
在线阅读 下载PDF
Underwater Image Enhancement Based on Multi-scale Adversarial Network
8
作者 ZENG Jun-yang SI Zhan-jun 《印刷与数字媒体技术研究》 CAS 北大核心 2024年第5期70-77,共8页
In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of ea... In this study,an underwater image enhancement method based on multi-scale adversarial network was proposed to solve the problem of detail blur and color distortion in underwater images.Firstly,the local features of each layer were enhanced into the global features by the proposed residual dense block,which ensured that the generated images retain more details.Secondly,a multi-scale structure was adopted to extract multi-scale semantic features of the original images.Finally,the features obtained from the dual channels were fused by an adaptive fusion module to further optimize the features.The discriminant network adopted the structure of the Markov discriminator.In addition,by constructing mean square error,structural similarity,and perceived color loss function,the generated image is consistent with the reference image in structure,color,and content.The experimental results showed that the enhanced underwater image deblurring effect of the proposed algorithm was good and the problem of underwater image color bias was effectively improved.In both subjective and objective evaluation indexes,the experimental results of the proposed algorithm are better than those of the comparison algorithm. 展开更多
关键词 Underwater image enhancement Generative adversarial network multi-scale feature extraction Residual dense block
在线阅读 下载PDF
Global Context Fusion Network for SAR Ship Detection
9
作者 Boya Zhang Yong Wang 《Journal of Beijing Institute of Technology》 2025年第6期577-589,共13页
Ship detection in synthetic aperture radar(SAR)image is crucial for marine surveillance and navigation.The application of detection network based on deep learning has achieved a promising result in SAR ship detection.... Ship detection in synthetic aperture radar(SAR)image is crucial for marine surveillance and navigation.The application of detection network based on deep learning has achieved a promising result in SAR ship detection.However,the existing networks encounters challenges due to the complex backgrounds,diverse scales and irregular distribution of ship targets.To address these issues,this article proposes a detection algorithm that integrates global context of the images(GCF-Net).First,we construct a global feature extraction module in the backbone network of GCF-Net,which encodes features along different spatial directions.Then,we incorporate bi-directional feature pyramid network(BiFPN)in the neck network to fuse the multi-scale features selectively.Finally,we design a convolution and transformer mixed(CTM)detection head to obtain contextual information of targets and concentrate network attention on the most informative regions of the images.Experimental results demonstrate that the proposed method achieves more accurate detection of ship targets in SAR images. 展开更多
关键词 synthetic aperture radar(SAR) ship detection global context fusion convolutional neural network feature extraction
在线阅读 下载PDF
基于特征融合及多尺度上下文提取的实时语义分割
10
作者 刘伯红 刘磊 《重庆邮电大学学报(自然科学版)》 北大核心 2026年第1期167-176,共10页
针对实时语义分割算法常用的双分支结构存在空间分支和上下文分支特征融合不充分、多尺度上下文信息提取不全面等问题,提出基于特征融合及多尺度上下文提取的实时语义分割网络。设计空间-多尺度双向注意力融合模块,使用空间注意力机制... 针对实时语义分割算法常用的双分支结构存在空间分支和上下文分支特征融合不充分、多尺度上下文信息提取不全面等问题,提出基于特征融合及多尺度上下文提取的实时语义分割网络。设计空间-多尺度双向注意力融合模块,使用空间注意力机制和多尺度特征融合模块实现双分支交互融合,促进空间特征以及语义特征在双分支上的流动;在上下文分支末端设计了串联聚合金字塔池化模块,更精确地捕捉细节信息;聚合空间分支不同阶段特征,增强模型对图像特征的全面理解能力,促进深层特征与浅层特征的深度融合;使用多尺度注意力特征融合模块引导空间分支和上下文分支融合。实验结果表明,构建的网络在Cityscapes数据集上平均交并比(mean intersection over union,MIoU)达到78.0%,推理速度为104.5 Frame/s;在CamVid数据集上,MIoU达到75.9%,推理速度为224.6 Frame/s。 展开更多
关键词 实时语义分割 特征融合 注意力机制 多尺度上下文提取
在线阅读 下载PDF
基于级联注意力的结肠息肉图像分割算法研究
11
作者 周孟然 陆鹏 《重庆工商大学学报(自然科学版)》 2026年第1期1-10,共10页
目的针对现有Transformer模型在息肉图像分割中存在注意力分散以及作为编码器提取的多级特征在融合时易产生信息丢失导致的分割精度不高的问题,提出一种新的分割模型PVT-CAMNet。方法在该模型中,使用金字塔式Transformer(Pyramid Vision... 目的针对现有Transformer模型在息肉图像分割中存在注意力分散以及作为编码器提取的多级特征在融合时易产生信息丢失导致的分割精度不高的问题,提出一种新的分割模型PVT-CAMNet。方法在该模型中,使用金字塔式Transformer(Pyramid Vision Transformer,PVT)作为编码器,接着设计了多尺度特征注意力提取模块(Multi-scale Feature Attention Extraction,MFAE)和层间注意力聚合模块(Inter-layer Attention Aggregation,IA)。其中,PVT通过其自注意力机制保证了模型的泛化能力,MFAE使用不同大小的滤波器多尺度提取特征,旨在缓解注意力分散问题;IA交互融合不同层级特征,有效解决多级特征融合产生的信息丢失问题;最后引入全局上下文模块(Global Context,GC)使模型更好地理解特征图之间的像素依赖关系。结果在Kvasir、CVC-ClinicDB、CVC-ColonDB和ETIS数据集上进行了评估,相较于最优基线模型,mDice、mIoU分别提高了1.76%、0.81%、1.51%、1.74%、3.15%、2.65%和1.73%、3.84%。结论PVT-CAMNet的学习性能和泛化性能均优于其他先进方法,在息肉图像分割上具有一定的应用价值。 展开更多
关键词 息肉图像分割 多尺度注意力提取 层间注意力聚合 全局上下文
在线阅读 下载PDF
基于自适应空间混合的大规模点云语义分割
12
作者 黄和炜 周树东 +1 位作者 方立 张静茹 《微电子学与计算机》 2026年第3期56-64,共9页
针对点云语义分割在保持分割精度的同时提升大规模点云场景实时性的需求,提出了一种基于自适应空间混合的点云语义分割算法。首先,在嵌入层融合点特征和领域特征,增强原始点数据并捕获局部几何信息。随后,通过空间和通道混合层对嵌入点... 针对点云语义分割在保持分割精度的同时提升大规模点云场景实时性的需求,提出了一种基于自适应空间混合的点云语义分割算法。首先,在嵌入层融合点特征和领域特征,增强原始点数据并捕获局部几何信息。随后,通过空间和通道混合层对嵌入点特征进行迭代更新。在空间混合阶段,利用稀疏矩阵投影点云,引入轴感知特征混合模块捕获跨维度上下文信息,并实现本地特征与查询特征的自适应融合,融合后与嵌入特征残差连接后输入通道混合阶段。在通道混合阶段,设计一种低参数量的低秩近似MLP模块替代传统MLP,对特征进行高效处理。经多次交替混合后,利用卷积对更新后的特征进行分类。在SemanticKITTI和SemanticPOSS数据集上的实验结果表明:所提方法在效率与精度之间取得良好平衡,验证集平均交并比分别达到64.0%与52.2%。 展开更多
关键词 大规模点云 MLP-mixer 语义分割 自适应 空间混合 上下文提取 低秩近似
在线阅读 下载PDF
DL-YOLO:AMulti-Scale Feature Fusion Detection Algorithm for Low-Light Environments
13
作者 Yuanmeng Chang Hongmei Liu 《Computers, Materials & Continua》 2026年第5期1901-1915,共15页
Driven by rapid advances in deep learning,object detection has been widely adopted across diverse application scenarios.However,in low-light conditions,critical visual cues of target objects are severely degraded,posi... Driven by rapid advances in deep learning,object detection has been widely adopted across diverse application scenarios.However,in low-light conditions,critical visual cues of target objects are severely degraded,posing a significant challenge for accurate low-light object detection.Existing methods struggle to preserve discriminative features while maintaining semantic consistency between low-light and normal-light images.For this purpose,this study proposes a DL-YOLO model specially tailored for low-light detection.To mitigate target feature attenuation introduced by repeated downsampling,we design aMulti-Scale FeatureConvolution(MSF-Conv)module that captures rich,multi-level details via multi-scale feature learning,thereby reducing model complexity and computational cost.For feature fusion,we integrated the C3k2-DWRmodule by embedding the Dilation-wise Residual(DWR)mechanism into the 2-core optimized Cross Stage Partial(C3)framework,achieving efficient feature integration.In addition,we replace conventional localization losses with WIoU(Weighted Intersection over Union),which dynamically adjusts gradient gain according to sample quality,thereby improving localization robustness and precision.Experiments on the ExDark dataset demonstrate that DL-YOLO delivers strong low-light detection performance.The relevant code is published at https://github.com/cym0997/DL-YOLO. 展开更多
关键词 multi-scale feature extraction object detection low-light environments ExDark dataset
在线阅读 下载PDF
跨尺度特征融合的遥感微小目标检测算法 被引量:1
14
作者 邵凯 李浩刚 +2 位作者 梁燕 宁婧 陈戊 《系统工程与电子技术》 北大核心 2025年第5期1421-1431,共11页
针对遥感图像微小目标检测中存在的浅层细化特征、深层语义表征和多尺度信息提取3个问题,提出一种综合运用多项技术的跨尺度YOLOv7(cross-scale YOLOv7,CSYOLOv7)网络。首先,设计跨阶段特征提取模块(cross-stage feature extraction mod... 针对遥感图像微小目标检测中存在的浅层细化特征、深层语义表征和多尺度信息提取3个问题,提出一种综合运用多项技术的跨尺度YOLOv7(cross-scale YOLOv7,CSYOLOv7)网络。首先,设计跨阶段特征提取模块(cross-stage feature extraction module,CFEM)和感受野特征增强模块(receptive field feature enhancement module,RFFEM)。CFEM提高模型细化特征提取能力并抑制浅层下采样过程中特征的丢失,RFFEM加大网络对深层语义特征的提取力度,增强模型对目标上下文信息获取能力。其次,设计跨梯度空间金字塔池化模块(cross-gradient space pyramid pool module,CSPPM)有效融合微小目标多尺度的全局和局部特征。最后,用形状感知交并比(shape-aware intersection over union,Shape IoU)替换完全交并比(complete intersection over union,CIoU),提高模型在边界框定位任务中的精确度。实验结果表明,CSYOLOv7网络在DIOR(dataset for image object recognition)数据集和NWPU VHR-10(Northwestern Polytechnical University Very High Resolution-10)数据集上分别取得了74%和89.6%的检测精度,有效提升遥感图像微小目标的检测效果。 展开更多
关键词 遥感图像 微小目标 特征提取 上下文信息
在线阅读 下载PDF
基于跨度和图卷积网络的篇章级事件抽取模型 被引量:2
15
作者 廖涛 牛冰宇 《湖北民族大学学报(自然科学版)》 2025年第1期108-113,共6页
为解决现有的事件抽取方法在实体抽取子任务中难以充分利用上下文信息,导致事件抽取精度较低的问题,提出了基于跨度和图卷积网络的篇章级事件抽取(document-level event extraction based on span and graph convolutional network, DEE... 为解决现有的事件抽取方法在实体抽取子任务中难以充分利用上下文信息,导致事件抽取精度较低的问题,提出了基于跨度和图卷积网络的篇章级事件抽取(document-level event extraction based on span and graph convolutional network, DEESG)模型。首先,设计中间线性层对编码的向量进行线性处理,并结合标注信息计算最佳跨度,通过提升对跨度开始位置和结束位置判断的准确度来提高实体抽取的精度;接着,提出异构图的构建方法,使用池化策略将实体与句子表示为图的节点,根据提出的建边规则构建异构图,以此建立全局信息的交互,并利用多层图卷积网络(graph convolutional network, GCN)对异构图进行卷积,获得具有上下文信息的实体表示和句子表示,以此解决上下文信息利用不充分的问题;然后,利用多头注意力机制进行事件类型的检测;最后,为组合中的实体分配论元角色,完成事件抽取任务。在中文金融公告(Chinese financial announcements, ChFinAnn)数据集上进行实验。结果表明,与拥有追踪器的异构图交互模型(graph-based interaction model with a tracker, GIT)相比,DEESG模型的F1分数提升了1.3个百分点。该研究证实DEESG模型能有效应用于篇章级事件抽取领域。 展开更多
关键词 事件抽取 跨度 实体抽取 异构图 图卷积网络 上下文信息
在线阅读 下载PDF
基于双目BEV与改进YOLOv8的路面裂缝识别方法 被引量:1
16
作者 谢海波 邱杨航 +4 位作者 黄莹颖 张汇祥 蔡生勇 王培玉 萧白 《长沙理工大学学报(自然科学版)》 2025年第5期1-16,共16页
【目的】针对路面裂缝检测过程中目标尺度不一致、全局特征建模能力不足的问题,提出一种基于双目鸟瞰图(bird’s eye view,BEV)与改进YOLOv8模型的路面裂缝识别方法,旨在实现高效、精准的裂缝检测与分割。【方法】首先,通过双目立体视... 【目的】针对路面裂缝检测过程中目标尺度不一致、全局特征建模能力不足的问题,提出一种基于双目鸟瞰图(bird’s eye view,BEV)与改进YOLOv8模型的路面裂缝识别方法,旨在实现高效、精准的裂缝检测与分割。【方法】首先,通过双目立体视觉与逆透视变换技术生成高精度BEV图像,解决传统视角下目标尺度不一致问题;其次,提出的C2f-DRR模块利用区域残差化-语义残差化的解耦策略有效捕获裂缝的多尺度上下文信息,通过大核卷积与小核空洞卷积协同作用丰富图像的细节信息并减少背景干扰;最后,引入上下文锚点注意力机制,使模型能够动态聚焦裂缝中心区域,并实现对远距离像素间的长程依赖关系的建模。【结果】为验证改进模型的有效性,在测试集上进行了对比试验。改进后模型的平均精度均值MAP50达83.7%,准确率P达83.9%,F1分数达83.5%,较原始的YOLOv8n模型的分别提升4.4、4.0、1.8个百分点。在公开数据集UAV-PDD2023上验证的MAP50达70.5%,召回率R达64.8%,准确度P达74.1%,较原模型的分别提升了3.5、4.5、0.6个百分点。改进模型在识别精度、鲁棒性、泛化学习能力方面均优于原始模型。【结论】本研究提出的基于双目BEV视角的裂缝分割方法有效提升了模型在复杂道路场景下的检测精度与泛化能力,为自动化路面病害检测提供了技术支持。 展开更多
关键词 路面裂缝识别 双目立体视觉 鸟瞰图 YOLOv8 上下文锚点注意力 特征提取
在线阅读 下载PDF
上下文协同与混合注意力机制驱动的图像去雾算法
17
作者 赵志强 何进良 《计算机应用研究》 北大核心 2025年第9期2875-2880,共6页
现有的去雾模型在处理雾霾图像时,常因雾霾在不同像素和局部区域的分布不均匀,导致特征提取不充分,进而削弱了对图像纹理和边缘信息的捕捉,影响去雾效果。针对这一问题,提出了一种全新的多域注意力机制与上下文协同的图像去雾方法——HA... 现有的去雾模型在处理雾霾图像时,常因雾霾在不同像素和局部区域的分布不均匀,导致特征提取不充分,进而削弱了对图像纹理和边缘信息的捕捉,影响去雾效果。针对这一问题,提出了一种全新的多域注意力机制与上下文协同的图像去雾方法——HACNet。HACNet创新性地提出了以多域注意力机制(MDA)为基础的混合注意力融合算法(HAF)用于整合局部和全局特征,实现了特征的精细化处理与自适应权重分配。同时提出了上下文自适应感知网络(CAF)协同多尺度膨胀卷积(CAAC),增强了模型对多尺度特征的感知能力,能有效应对雾霾浓度的区域差异。实验结果表明,HACNet在RESIDE、NH-Haze和Dense-Haze等数据集上均优于目前的先进去雾模型。HACNet通过多尺度雾霾特征捕捉与局部-全局细节的平衡,有效提高了去雾性能,具有较强的应用潜力。源代码发布在https://github.com/ruicys/HACNet。 展开更多
关键词 图像去雾 注意力机制 上下文协同 特征提取
在线阅读 下载PDF
融合动态蛇形卷积的山区道路提取
18
作者 戴激光 马争 +2 位作者 李宛潼 秦志伟 王继承 《遥感信息》 北大核心 2025年第4期11-18,共8页
针对山区道路曲率变化大、阴影遮挡等特点导致的提取精度低的问题,提出了一种新的山区道路提取模型。该方法以MANet为基础,首先,采用动态蛇形卷积(dynamic snake convolution,DSCov)自适应聚焦细长和弯曲的局部结构,来准确捕捉道路结构... 针对山区道路曲率变化大、阴影遮挡等特点导致的提取精度低的问题,提出了一种新的山区道路提取模型。该方法以MANet为基础,首先,采用动态蛇形卷积(dynamic snake convolution,DSCov)自适应聚焦细长和弯曲的局部结构,来准确捕捉道路结构的特征,聚焦道路曲率变化大的问题;其次,提出多卷积上下文提取模块(multiple convolution context extraction,MCCE)关注道路的远程依赖关系,有效地捕获长距离的道路环境,增强阴影遮挡情况下模型的稳定性与泛化性。为验证该方法的有效性,在自主绘制的GF-2山区道路数据集和Massachusetts数据集上进行实验,F1分数分别达到了82.41%、88.87%;同时,在GF-2山区道路数据集上进行消融实验,F1分数相较于MANet提高2.24个百分点。通过对比分析,该方法在道路曲率大和遮挡处的提取效果均优于其他模型。 展开更多
关键词 山区道路提取 动态蛇形卷积 多卷积上下文提取 深度学习 卷积神经网络
在线阅读 下载PDF
基于显著性目标检测的图像前景提取算法研究
19
作者 贾小云 杨振英 +1 位作者 邵帆 罗豪才 《计算技术与自动化》 2025年第4期110-115,共6页
为解决自然场景中深度学习抠像算法依赖于人机交互且前景提取效果不佳的问题,提出了通过检测显著性目标的方式实现图像前景提取算法。该算法由显著性目标检测和alpha估计两个网络组成,前者在基准网络的基础上引入混合损失函数和深监督... 为解决自然场景中深度学习抠像算法依赖于人机交互且前景提取效果不佳的问题,提出了通过检测显著性目标的方式实现图像前景提取算法。该算法由显著性目标检测和alpha估计两个网络组成,前者在基准网络的基础上引入混合损失函数和深监督策略提高图像前景提取的完整度,后者通过增加上下文注意力引导模块的方式优化基准网络,实现对前景图像细节的精确恢复。改进算法以自然场景中人像作为待提取前景进行研究,在自制数据集上训练,公开数据集Composition-1K上测试。改进后算法在均方误差(MSE)指标上低至2.329×10-4平方像素,梯度误差指标上相较基准算法减小近60%。实验表明,改进后算法提升了头发、玻璃等半透明区域的alpha值估计的准确度从而提高图像中抠像精度,并且具有较强的鲁棒性与泛化能力,可应用于图像编辑等应用场景。 展开更多
关键词 图像前景提取 显著性目标检测 alpha估计 混合损失函数 深监督 上下文引导
在线阅读 下载PDF
融合局部上下文的双图文档级关系抽取方法 被引量:3
20
作者 闻克妍 纪婉婷 宋宝燕 《小型微型计算机系统》 北大核心 2025年第3期535-541,共7页
文档级关系抽取是一项复杂的自然语言处理任务,旨在识别出文档中存在的实体,并预测实体之间的关系.相较于句子级关系抽取任务,文档级关系抽取面临更大的挑战,因为它需要考虑整个文档的语义信息和句子间的逻辑关系.针对这一任务,提出了... 文档级关系抽取是一项复杂的自然语言处理任务,旨在识别出文档中存在的实体,并预测实体之间的关系.相较于句子级关系抽取任务,文档级关系抽取面临更大的挑战,因为它需要考虑整个文档的语义信息和句子间的逻辑关系.针对这一任务,提出了一种融合局部上下文信息的双图推理方法(BRM)用于文档级关系抽取.该方法首先识别文档中的实体提及,并构造了一个提及级别的异构图来表示这些提及以及它们之间的关系.在获得提及级别的表示后,方法进一步构建了一个实体级别的推理图,通过聚合提及级别的信息来形成实体级别的表示,以判断实体之间的关系.该方法在文档级关系抽取公开数据集DocRED上进行了实验.实验结果表明,与现有的文档级关系抽取方法相比,该方法能够更准确地识别实体并预测它们之间的关系. 展开更多
关键词 文档级关系抽取 局部上下文 双图推理 数据集成
在线阅读 下载PDF
上一页 1 2 9 下一页 到第
使用帮助 返回顶部