期刊文献+
共找到135篇文章
< 1 2 7 >
每页显示 20 50 100
Feature-Based Aggregation and Deep Reinforcement Learning:A Survey and Some New Implementations 被引量:15
1
作者 Dimitri P.Bertsekas 《IEEE/CAA Journal of Automatica Sinica》 EI CSCD 2019年第1期1-31,共31页
In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinfor... In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinforcement learning schemes. We introduce features of the states of the original problem, and we formulate a smaller "aggregate" Markov decision problem, whose states relate to the features. We discuss properties and possible implementations of this type of aggregation, including a new approach to approximate policy iteration. In this approach the policy improvement operation combines feature-based aggregation with feature construction using deep neural networks or other calculations. We argue that the cost function of a policy may be approximated much more accurately by the nonlinear function of the features provided by aggregation, than by the linear function of the features provided by neural networkbased reinforcement learning, thereby potentially leading to more effective policy improvement. 展开更多
关键词 REINFORCEMENT learning dynamic programming Markovian DECISION problems aggregation feature-based ARCHITECTURES policy ITERATION DEEP neural networks rollout algorithms
在线阅读 下载PDF
MSL-Net:a lightweight apple leaf disease detection model based on multi-scale feature fusion
2
作者 YANG Kangyi YAN Chunman 《Optoelectronics Letters》 2025年第12期745-752,共8页
Aiming at the problem of low detection accuracy due to the different scale sizes of apple leaf disease spots and their similarity to the background,this paper proposes a multi-scale lightweight network(MSL-Net).Firstl... Aiming at the problem of low detection accuracy due to the different scale sizes of apple leaf disease spots and their similarity to the background,this paper proposes a multi-scale lightweight network(MSL-Net).Firstly,a multiplexed aggregated feature extraction network is proposed using residual bottleneck block(RES-Bottleneck)and middle partial-convolution(MP-Conv)to capture multi-scale spatial features and enhance focus on disease features for better differentiation between disease targets and background information.Secondly,a lightweight feature fusion network is designed using scale-fuse concatenation(SF-Cat)and triple-scale sequence feature fusion(TSSF)module to merge multi-scale feature maps comprehensively.Depthwise convolution(DWConv)and GhostNet lighten the network,while the cross stage partial bottleneck with 3 convolutions ghost-normalization attention module(C3-GN)reduces missed detections by suppressing irrelevant background information.Finally,soft non-maximum suppression(Soft-NMS)is used in the post-processing stage to improve the problem of misdetection of dense disease sites.The results show that the MSL-Net improves mean average precision at intersection over union of 0.5(mAP@0.5)by 2.0%over the baseline you only look once version 5s(YOLOv5s)and reduces parameters by 44%,reducing computation by 27%,outperforming other state-of-the-art(SOTA)models overall.This method also shows excellent performance compared to the latest research. 展开更多
关键词 enhance focus disease features background i multi scale feature fusion apple leaf disease spots residual bottleneck block res bottleneck multiplexed aggregated feature extraction network lightweight network apple leaf disease detection
原文传递
Enhancing Classroom Behavior Recognition with Lightweight Multi-Scale Feature Fusion
3
作者 Chuanchuan Wang Ahmad Sufril Azlan Mohamed +3 位作者 Xiao Yang Hao Zhang Xiang Li Mohd Halim Bin Mohd Noor 《Computers, Materials & Continua》 2025年第10期855-874,共20页
Classroom behavior recognition is a hot research topic,which plays a vital role in assessing and improving the quality of classroom teaching.However,existing classroom behavior recognition methods have challenges for ... Classroom behavior recognition is a hot research topic,which plays a vital role in assessing and improving the quality of classroom teaching.However,existing classroom behavior recognition methods have challenges for high recognition accuracy with datasets with problems such as scenes with blurred pictures,and inconsistent objects.To address this challenge,we proposed an effective,lightweight object detector method called the RFNet model(YOLO-FR).The YOLO-FR is a lightweight and effective model.Specifically,for efficient multi-scale feature extraction,effective feature pyramid shared convolutional(FPSC)was designed to improve the feature extract performance by leveraging convolutional layers with varying dilation rates from the input image in the backbone.Secondly,to address the problem of multi-scale variability in the scene,we design the Rep Ghost fusion Cross Stage Partial and Efficient Layer Aggregation Network(RGCSPELAN)to improve the network performance further and reduce the amount of computation and the number of parameters.In addition,by conducting experimental valuation on the SCB dataset3 and STBD-08 dataset.Experimental results indicate that,compared to the baseline model,the RFNet model has increased mean accuracy precision(mAP@50)from 69.6%to 71.0%on the SCB dataset3 and from 91.8%to 93.1%on the STBD-08 dataset.The RFNet approach has effectiveness precision at 68.6%,surpassing the baseline method(YOLOv11)at 3.3%and archieve the minimal size(4.9 M)on the SCB dataset3.Finally,comparing it with other algorithms,it accurately detects student behavior in complex classroom environments results confirmed that RFNet is well-suited for real-time and efficiently recognizing classroom behaviors. 展开更多
关键词 Classroom action recognition YOLO-FR feature pyramid shared convolutional rep ghost cross stage partial efficient layer aggregation network(RGCSPELAN)
在线阅读 下载PDF
An Expert System to Detect Political Arabic Articles Orientation Using CatBoost Classifier Boosted by Multi-Level Features
4
作者 Saad M.Darwish Abdul Rahman M.Sabri +1 位作者 Dhafar Hamed Abd Adel A.Elzoghabi 《Computer Systems Science & Engineering》 2024年第6期1595-1624,共30页
The number of blogs and other forms of opinionated online content has increased dramatically in recent years.Many fields,including academia and national security,place an emphasis on automated political article orient... The number of blogs and other forms of opinionated online content has increased dramatically in recent years.Many fields,including academia and national security,place an emphasis on automated political article orientation detection.Political articles(especially in the Arab world)are different from other articles due to their subjectivity,in which the author’s beliefs and political affiliation might have a significant influence on a political article.With categories representing the main political ideologies,this problem may be thought of as a subset of the text categorization(classification).In general,the performance of machine learning models for text classification is sensitive to hyperparameter settings.Furthermore,the feature vector used to represent a document must capture,to some extent,the complex semantics of natural language.To this end,this paper presents an intelligent system to detect political Arabic article orientation that adapts the categorical boosting(CatBoost)method combined with a multi-level feature concept.Extracting features at multiple levels can enhance the model’s ability to discriminate between different classes or patterns.Each level may capture different aspects of the input data,contributing to a more comprehensive representation.CatBoost,a robust and efficient gradient-boosting algorithm,is utilized to effectively learn and predict the complex relationships between these features and the political orientation labels associated with the articles.A dataset of political Arabic texts collected from diverse sources,including postings and articles,is used to assess the suggested technique.Conservative,reform,and revolutionary are the three subcategories of these opinions.The results of this study demonstrate that compared to other frequently used machine learning models for text classification,the CatBoost method using multi-level features performs better with an accuracy of 98.14%. 展开更多
关键词 Political articles orientation detection CatBoost classifier multi-level features context-based classification social networks machine learning stylometric features
在线阅读 下载PDF
MASA-Net:Multi-Aspect Channel-Spatial Attention Network With Cross-Layer Feature Aggregation for Accurate Fungi Species Identification
5
作者 Indranil Bera Rajesh Mukherjee Bidesh Chakraborty 《IET Cyber-Systems and Robotics》 2025年第4期81-94,共14页
Accurate identification of fungal species is essential for effective diagnosis and treatment.Traditional microscopy-based methods are often subjective and time-consuming.Deep learning has emerged as a promising tool i... Accurate identification of fungal species is essential for effective diagnosis and treatment.Traditional microscopy-based methods are often subjective and time-consuming.Deep learning has emerged as a promising tool in this domain.However,existing deep learning models often struggle to generalise in the presence of class imbalance and subtle morphological differences,which are common in fungal image datasets.This study proposes MASA-Net,a deep learning framework that combines a fine-tuned DenseNet201 backbone with a multi-aspect channel-spatial attention(MASA)module.The attention mechanism refines spatial and channel-wise features by capturing multi-scale spatial patterns and adaptively emphasising informative channels.This enhances the network's ability to focus on diagnostically relevant fungal structures while suppressing irrelevant features.The MASA-Net is evaluated on the DeFungi dataset and demonstrates superior performance in terms of accuracy,precision,recall and F1-score.It also outperforms established attention mechanisms such as squeeze-andexcitation networks(SE)and convolutional block attention module(CBAM)under identical conditions.These results highlight MASA-Net's robustness and effectiveness in addressing class imbalance and structural variability,offering a reliable solution for automated fungal species identification. 展开更多
关键词 attention convolutional neural network(CNN) cross-layer feature aggregation fungi Classification transfer learning
原文传递
Effective convolution mixed Transformer Siamese network for robust visual tracking
6
作者 Lin Chen Yungang Liu Yuan Wang 《Control Theory and Technology》 2025年第2期221-236,共16页
Siamese tracking algorithms usually take convolutional neural networks(CNNs)as feature extractors owing to their capability of extracting deep discriminative features.However,the convolution kernels in CNNs have limit... Siamese tracking algorithms usually take convolutional neural networks(CNNs)as feature extractors owing to their capability of extracting deep discriminative features.However,the convolution kernels in CNNs have limited receptive fields,making it difficult to capture global feature dependencies which is important for object detection,especially when the target undergoes large-scale variations or movement.In view of this,we develop a novel network called effective convolution mixed Transformer Siamese network(SiamCMT)for visual tracking,which integrates CNN-based and Transformer-based architectures to capture both local information and long-range dependencies.Specifically,we design a Transformer-based module named lightweight multi-head attention(LWMHA)which can be flexibly embedded into stage-wise CNNs and improve the network’s representation ability.Additionally,we introduce a stage-wise feature aggregation mechanism which integrates features learned from multiple stages.By leveraging both location and semantic information,this mechanism helps the SiamCMT to better locate and find the target.Moreover,to distinguish the contribution of different channels,a channel-wise attention mechanism is introduced to enhance the important channels and suppress the others.Extensive experiments on seven challenging benchmarks,i.e.,OTB2015,UAV123,GOT10K,LaSOT,DTB70,UAVTrack112_L,and VOT2018,demonstrate the effectiveness of the proposed algorithm.Specially,the proposed method outperforms the baseline by 3.5%and 3.1%in terms of precision and success rates with a real-time speed of 59.77 FPS on UAV123. 展开更多
关键词 Visual tracking Siamese network TRANSFORMER feature aggregation Channel-wise attention
原文传递
Efficient Image Deraining through a Stage-Wise Dual-Residual Network with Cross-Dimensional Spatial Attention
7
作者 Tiantian Wang Zhihua Hu 《Computer Modeling in Engineering & Sciences》 2025年第11期2357-2381,共25页
Rain streaks introduced by atmospheric precipitation significantly degrade image quality and impair the reliability of high-level vision tasks.We present a novel image deraining framework built on a three-stage dual-r... Rain streaks introduced by atmospheric precipitation significantly degrade image quality and impair the reliability of high-level vision tasks.We present a novel image deraining framework built on a three-stage dual-residual architecture that progressively restores rain-degraded content while preserving fine structural details.Each stage begins with a multi-scale feature extractor and a channel attention module that adaptively emphasizes informative representations for rain removal.The core restoration is achieved via enhanced dual-residual blocks,which stabilize training and mitigate feature degradation across layers.To further refine representations,we integrate crossdimensional spatial attention supervised by ground-truth guidance,ensuring that only high-quality features propagate to subsequent stages.Inter-stage feature fusion modules are employed to aggregate complementary information,reinforcing reconstruction continuity and consistency.Extensive experiments on five benchmark datasets(Rain100H,Rain100L,RainKITTI2012,RainKITTI2015,and JRSRD)demonstrate that our method establishes new state-of-the-art results in both fidelity and perceptual quality,effectively removing rain streaks while preserving natural textures and structural integrity. 展开更多
关键词 Image deraining stage-wise network deep learning feature aggregation image processing
在线阅读 下载PDF
VIFusion:低光场景下可见光与红外图像的互补融合模型
8
作者 张晓滨 牛燕皓 陈金广 《西安工程大学学报》 2026年第1期126-135,共10页
针对低光场景下可见光与红外图像融合算法存在时序信息丢失、特征图通道冗余、细节模糊等问题,本文基于Vision Transformer框架,提出了一种低光场景下可见光与红外图像的互补融合模型VIFusion。该模型通过包含的双时态特征聚合(dual tem... 针对低光场景下可见光与红外图像融合算法存在时序信息丢失、特征图通道冗余、细节模糊等问题,本文基于Vision Transformer框架,提出了一种低光场景下可见光与红外图像的互补融合模型VIFusion。该模型通过包含的双时态特征聚合(dual temporal feature aggregation,DTFA)模块、特征细化前馈网络(feature refinement feedforward network,FRFN)模块和空间通道注意力机制(spatial channel attention,SCA)模块提升了融合图像的质量和信息表达能力。其中,DTFA模块使用分组卷积保持特征空间完整性,然后进行时序对齐与融合,以增强时序一致性并减少信息损失。FRFN模块对提取的特征进行逐层优化,减少通道冗余。SCA模块通过自适应建模图像空间和通道关系,突出关键特征,提高信息表达能力、增强边缘、纹理等细节信息。实验结果表明:在LLVIP数据集上,VIFusion模型在客观指标(AG、CC、EN、SF、SSIM、VIF、MI)上优于传统方法和深度学习模型(如GTF、TarDAL、DenseFuse等)。在数据集TNO上的泛化实验中,生成的融合图像在细节保留和目标突出上也表现更佳。VIFusion模型为低光场景下的多模态图像融合提供了一种高效实用的解决方案。 展开更多
关键词 双时态特征聚合 特征细化前馈网络 空间通道注意力 图像融合
在线阅读 下载PDF
基于GraphSAGE的井震数据岩相预测
9
作者 陈国军 耿润田 +1 位作者 顾雅慧 于丽香 《计算机与数字工程》 2026年第1期281-286,共6页
为了充分利用三维空间的地震信息,提出一种基于图神经网络GraphSAGE算法的岩相预测方法,利用网络对数据的聚合能力,聚合目标点邻域内的地震数据特征,以固定的时窗提取多个相邻采样点的地震数据,建立其与岩相的非线性映射关系。实验表明... 为了充分利用三维空间的地震信息,提出一种基于图神经网络GraphSAGE算法的岩相预测方法,利用网络对数据的聚合能力,聚合目标点邻域内的地震数据特征,以固定的时窗提取多个相邻采样点的地震数据,建立其与岩相的非线性映射关系。实验表明:提出的方法可以很好地拟合三维地震数据特征,并具有良好的薄层岩相抗噪性。应用该方法在工区不同区域进行实验,岩相预测准确率80%以上,预测结果与测井岩相基本吻合,地震道剖面岩相图清晰连续。 展开更多
关键词 岩相预测 图神经网络 GraphSAGE 地震属性 平面特征聚合 时窗采样
在线阅读 下载PDF
基于链路聚合的图欺诈检测
10
作者 邱天 贾凌翔 +3 位作者 高杨 冯尊磊 高艺 宋明黎 《软件学报》 北大核心 2026年第2期860-874,共15页
随着信息技术发展,信息网络、人类社会与物理空间交互加深,信息空间风险外溢现象严峻.欺诈事件激增,欺诈检测成为重要研究领域.欺诈行为给社会带来了诸多负面影响,且逐渐呈现出智能化、产业化及高度隐蔽性等新兴特征,传统的专家规则与... 随着信息技术发展,信息网络、人类社会与物理空间交互加深,信息空间风险外溢现象严峻.欺诈事件激增,欺诈检测成为重要研究领域.欺诈行为给社会带来了诸多负面影响,且逐渐呈现出智能化、产业化及高度隐蔽性等新兴特征,传统的专家规则与深度图神经网络算法在应对上显得愈发局限.当前反欺诈算法多从节点自身与邻居节点的局部信息出发,或聚焦于用户个体,或分析节点与网络拓扑关系,或利用图嵌入技术学习节点表示,这些视角虽然能具备一定的欺诈检测能力,但是忽略了实体长程关联模式的关键作用,缺乏对于海量欺诈链路之间共性模式的挖掘,限制了全面的欺诈检测能力.针对以上欺诈检测算法的局限性,提出一种基于链路聚合的图欺诈检测模型PA-GNN(path aggregation graph neural network),包含不定长链路采样,位置关联的统一链路编码,链路信息交互聚合,以及聚合关联的欺诈检测.从节点出发的若干链路之间通过全局模式交互与相似度比对,挖掘欺诈链路之间的共性规律,从而更全面地揭示欺诈行为之间的关联模式,并通过链路聚合继而实现欺诈检测.在金融交易、社交网络和评论网络这3类欺诈场景下的多个数据集上的实验结果表明,所提方法的曲线下面积(AUC)和平均精度(AP)指标相较于最优基准模型均有显著提升.此外,该方法为欺诈检测任务挖掘了潜在的共性欺诈链路模式,驱动节点学习这些重要的模式并获得更具表现力的表示,具备一定的可解释性. 展开更多
关键词 图神经网络 欺诈检测 链路聚合 注意力机制 特征表示
在线阅读 下载PDF
基于部分卷积的残差特征聚合轻量超分辨率网络
11
作者 闫航 刘春龙 宋振峰 《现代电子技术》 北大核心 2026年第5期89-96,共8页
为了解决图像超分辨率重建模型普遍参数量大和计算过程复杂,对计算量和资源需求急剧增加的问题,文中提出一种基于部分卷积的残差特征聚合轻量超分辨率网络。该网络在部分卷积层的基础上减少模型冗余计算的同时也减少了参数量。在轻量化... 为了解决图像超分辨率重建模型普遍参数量大和计算过程复杂,对计算量和资源需求急剧增加的问题,文中提出一种基于部分卷积的残差特征聚合轻量超分辨率网络。该网络在部分卷积层的基础上减少模型冗余计算的同时也减少了参数量。在轻量化的前提下,引入残差特征聚合模块,同时关注局部与非局部特征信息,以增强网络对图像细节的全面捕捉,加速信息传递的同时提高网络泛化能力。实验结果表明,所提方法同NGswin和LKFN在公共基准测试集2倍、3倍、4倍缩放因子下的PSNR相比,分别平均提升0.28 dB、0.13 dB、0.08 dB和0.03 dB、0.02 dB、0.02 dB;参数量分别减少82%、81%、81%和38%、37%、36%;GFLOPs分别减少55%、58%、56%和6%、7%、11%。网络在轻量化的同时实现了重建图像质量的提高,减少了图像模糊程度,缓解了重建图像的伪影情况,充分证明了所提方法的高效性。 展开更多
关键词 轻量化模型 卷积神经网络 图像超分辨率重建 特征聚合 深度学习 自注意力机制
在线阅读 下载PDF
Detecting human-object interaction with multi-level pairwise feature network 被引量:4
12
作者 Hanchao Liu Tai-Jiang Mu Xiaolei Huan 《Computational Visual Media》 EI CSCD 2021年第2期229-239,共11页
Human–object interaction(HOI)detection is crucial for human-centric image understanding which aims to infer human,action,object triplets within an image.Recent studies often exploit visual features and the spatial co... Human–object interaction(HOI)detection is crucial for human-centric image understanding which aims to infer human,action,object triplets within an image.Recent studies often exploit visual features and the spatial configuration of a human–object pair in order to learn the action linking the human and object in the pair.We argue that such a paradigm of pairwise feature extraction and action inference can be applied not only at the whole human and object instance level,but also at the part level at which a body part interacts with an object,and at the semantic level by considering the semantic label of an object along with human appearance and human–object spatial configuration,to infer the action.We thus propose a multi-level pairwise feature network(PFNet)for detecting human–object interactions.The network consists of three parallel streams to characterize HOI utilizing pairwise features at the above three levels;the three streams are finally fused to give the action prediction.Extensive experiments show that our proposed PFNet outperforms other state-of-the-art methods on the VCOCO dataset and achieves comparable results to the state-of-the-art on the HICO-DET dataset. 展开更多
关键词 human–object interaction detection pairwise feature network deep learning multi-level object instance
原文传递
基于深度特征强化与路径聚合优化的目标检测 被引量:2
13
作者 王晓峰 黄俊俊 +1 位作者 谭文雅 沈紫璇 《计算机科学》 北大核心 2025年第11期184-195,共12页
在深度网络的前馈过程中,输入数据的特征信息会被抽象和压缩,导致部分对于目标检测关键的特征信息被弱化。基于YOLOv11n,提出了深度特征强化与路径聚合优化的目标检测方法。首先,设计全局-局部特征增强模块GLFEM(Global-Local Feature E... 在深度网络的前馈过程中,输入数据的特征信息会被抽象和压缩,导致部分对于目标检测关键的特征信息被弱化。基于YOLOv11n,提出了深度特征强化与路径聚合优化的目标检测方法。首先,设计全局-局部特征增强模块GLFEM(Global-Local Feature Enhancement Module),结合特征图局部特征与全局特征,强化深层网络特征的表达能力。然后,设计自适应特征增强模块AFEM(Adaptive Feature Enhancement Module),根据特征的可靠性动态增强深层网络的特征提取能力。最后,对路径聚合特征金字塔网络进行优化,融合了不同层次之间的特征信息,减少了层次之间的语义信息差。在VisDrone,NWPU VHR-10和TinyPerson这3个公共数据集上的实验结果表明,该方法的平均检测精度相较于当前先进的目标检测器均有所提升。在自建数据集AirportTiny上进行实验,该方法同样取得了不错的效果,具有良好的泛化能力。 展开更多
关键词 目标检测 深层网络 路径聚合 特征信息 特征强化
在线阅读 下载PDF
嵌入切片序列关联模式的前交叉韧带损伤分类 被引量:1
14
作者 刘英莉 茶寅秋 +1 位作者 黄一山 高明 《中国图象图形学报》 北大核心 2025年第4期1183-1194,共12页
目的膝关节前交叉韧带(anterior cruciate ligament,ACL)损伤的及时诊断是降低骨关节炎等风险重要途径。作为ACL损伤判别的常用影像学方法,三维磁共振成像(magnetic resonance imaging,MRI)相较于二维自然图像细节更繁杂,医生诊断所需... 目的膝关节前交叉韧带(anterior cruciate ligament,ACL)损伤的及时诊断是降低骨关节炎等风险重要途径。作为ACL损伤判别的常用影像学方法,三维磁共振成像(magnetic resonance imaging,MRI)相较于二维自然图像细节更繁杂,医生诊断所需时间长,智能辅助医疗是有益的。目前应用三维卷积神经网络(3D convolutional neural network,3D CNN)的ACL损伤分类算法存在计算成本大且数据使用不充分的问题,而基于二维卷积神经网络(2D convolutional neural network,2D CNN)的方法则忽略了第三维度的相关性和ACL的形态多样性。方法提出一种嵌入切片序列关联模式的网络(slice sequence association mode network,SSAMNet)。SSAMNet通过设计切片序列信息融合(slice sequence information fusion,SFS)模块从MRI数据的邻近、全量切片中并行学习序列性质,建立切片特征映射中的关联模式,整合切片信息。借助多级尺度特征自适应关注(multi-level scale feature adaptive attention,MSFAA)模块,对不同关联尺度权重再分配,以适应ACL区域形状和位置表征多变的特性。结果在MRNet数据集的实验中,模型的AUC(area under curve)值达到98.4%,相较于其他的ACL损伤分类算法性能最优,准确性及特异性指标也分别达到了91.4%和97%的最优值。在kneeMRI数据集上进行5折交叉验证实验后,SSAMNet的AUC平均值最高,ROC(receiver operating characteristic curve)曲线始终保持着平稳的趋势。可视化结果也表明提出的方法可以有效关注病变区域。结论针对ACL损伤分类任务,所提模型性能优异且具有鲁棒性,存在临床应用价值。 展开更多
关键词 磁共振成像(MRI) 3D图像分类 切片特征聚合 自适应尺度注意力 二维卷积神经网络(2D CNN)
原文传递
基于FACNNCN的高分遥感影像场景分类方法
15
作者 张婧 杨宇浩 +2 位作者 曹峰 张超 李德玉 《数据采集与处理》 北大核心 2025年第6期1637-1649,共13页
高分遥感影像场景分类旨在对复杂的地表场景影像进行精确认知,对于高分遥感影像的理解和信息提取具有重要的意义。本文提出了一种高分遥感影像场景方法,该方法基于特征聚合卷积神经网络(Feature aggregated convolution neural network,... 高分遥感影像场景分类旨在对复杂的地表场景影像进行精确认知,对于高分遥感影像的理解和信息提取具有重要的意义。本文提出了一种高分遥感影像场景方法,该方法基于特征聚合卷积神经网络(Feature aggregated convolution neural network,FACNN)和向量胶囊网络(Capsule network,CapsNet),即FACNNCN网络。通过增加聚合特征提升场景分类中影像特征的区分力和鲁棒性,并基于向量胶囊网络表征场景影像中地物与场景的空间关系,有效弥补了当前基于卷积神经网络的高分遥感影像场景分类方法中普遍存在的场景影像特征提取不充分、地物空间特征欠考虑的不足。本文提出的方法在2个公共高分遥感影像场景分类数据集(UC Merced Land⁃Use和NWPU⁃RESISC45)上进行了测试,实验结果表明该方法的分类精度优于相关的对比方法。 展开更多
关键词 高分遥感影像 场景分类 特征聚合 卷积神经网络 胶囊网络
在线阅读 下载PDF
随时间持续演化的流图神经网络
16
作者 郭虎升 张旭飞 +1 位作者 孙玉杰 王文剑 《计算机科学》 北大核心 2025年第8期118-126,共9页
流图在现实应用中广泛存在,其节点特征和结构特征会随时间推移而动态变化。尽管图神经网络在静态图节点分类中表现卓越,但其难以直接应用于流图,流图的持续演化会导致信息滞后和遗漏问题,所以模型难以准确提取流图特征。针对上述挑战,... 流图在现实应用中广泛存在,其节点特征和结构特征会随时间推移而动态变化。尽管图神经网络在静态图节点分类中表现卓越,但其难以直接应用于流图,流图的持续演化会导致信息滞后和遗漏问题,所以模型难以准确提取流图特征。针对上述挑战,提出了一种随时间持续演化的流图神经网络(Continuously Evolution Streaming Graph Neural Network,CESGNN),以解决流图节点分类问题。该方法首先通过持续更新的图卷积网络(Continuous Updates Graph Convolutional Network,CU-GCN)增量地更新参数,以适应流图节点特征的变化,缓解信息滞后问题,然后自适应扩展的图神经网络(Adaptive Deepening Graph Neural Network,AD-GNN)通过将聚合和更新操作解耦,以挖掘流图深层特征,从而缓解信息遗漏问题。CESGNN通过有机地融合原始特征、CU-GCN提取的浅层特征和AD-GNN提取的深层特征,获得更准确、全面的流图特征表示。实验结果表明,CESGNN模型对流图具有良好的适应性和稳定性,提高了流图节点分类的准确率。 展开更多
关键词 流图 图神经网络 增量更新 聚合与更新解耦 特征融合
在线阅读 下载PDF
融合注意力的特征聚合孪生网络视觉跟踪
17
作者 金静 牛品 翟凤文 《计算机工程与应用》 北大核心 2025年第12期166-176,共11页
目前以孪生网络为基础的目标跟踪算法,仍然存在网络浅层的特征中有价值的上下文信息无法合理利用的问题。针对这一问题,提出一种融合拆分注意力机制(split-attention,SA)的目标跟踪算法SiamMCFA(siamese multi-channel feature aggregat... 目前以孪生网络为基础的目标跟踪算法,仍然存在网络浅层的特征中有价值的上下文信息无法合理利用的问题。针对这一问题,提出一种融合拆分注意力机制(split-attention,SA)的目标跟踪算法SiamMCFA(siamese multi-channel feature aggregation module)。在骨干网络中引入拆分注意力机制,用来提取浅层特征中有价值的上下文信息,通过像素级互相关模块(pixel-wise cross correlation,PWCC)融合模板区域和搜索区域浅层和深层特征中的上下文信息,以增强模板区域和搜索区域的特征图之间的联系,从而提高跟踪器的鲁棒性。针对因尺度变化而容易导致目标丢失的问题,设计了一个多通道特征聚合模块(multi-channel feature aggregation module,MCFA),用于聚合目标不同区域的特征信息,使跟踪器尽可能地区分目标和语义背景,进一步提升跟踪准确性。最后,在OTB100、VOT2019、GOT10K和LaSOT四个数据集上进行了详尽的实验评估,结果显示,SiamMCFA与当前基于孪生网络的先进的跟踪器SiamCAR相比,其成功率(success rate)与精准度(precision)分别提高了2.26和2.83个百分点。与SiamIRCA相比成功率与精准度提高了0.3和0.9个百分点。 展开更多
关键词 目标跟踪 孪生网络 拆分注意力 像素级互相关 多通道特征聚合
在线阅读 下载PDF
面向遥感小目标检测的实例间特征聚合方法研究
18
作者 王海涛 艾晨 +1 位作者 谭福 高硕 《宇航学报》 北大核心 2025年第7期1467-1474,共8页
针对遥感图像小目标检测中特征缺失与定位精度低的问题,提出一种融合实例特征交互与自适应回归度量的检测框架。通过构建动态图结构的实例间特征聚合网络,利用高置信度实例引导弱目标特征增强,减少因下采样导致的漏检;同时设计分段平滑W... 针对遥感图像小目标检测中特征缺失与定位精度低的问题,提出一种融合实例特征交互与自适应回归度量的检测框架。通过构建动态图结构的实例间特征聚合网络,利用高置信度实例引导弱目标特征增强,减少因下采样导致的漏检;同时设计分段平滑Wasserstein损失,将边界框建模为2D高斯分布,结合一阶与二阶距离度量,优化多尺度定位精度。在AI-TOD v1/v2和DOTA v2数据集上的实验结果表明,该方法在小目标检测精度、特征增强能力和回归优化效果方面均取得显著提升,同时保持了较低的计算和参数开销。该方法为高分辨率遥感场景下的微小目标检测提供了轻量化解决方案。 展开更多
关键词 遥感小目标检测 特征聚合增强 图神经网络 Wasserstein距离 边界框回归
在线阅读 下载PDF
结合提案校准与分类优化的开放世界目标检测
19
作者 谢斌红 吴文丽 +1 位作者 张睿 张英俊 《计算机工程与设计》 北大核心 2025年第11期3216-3223,共8页
针对未知物体检测精度低和标签偏差两个问题,提出了一种结合提案校准与分类优化的开放世界目标检测框架。其中,协助提案帮助器通过基于对象的类无关属性和边缘信息生成对象的候选区域,辅助未知探测区域建议网络在无监督的情况下准确识... 针对未知物体检测精度低和标签偏差两个问题,提出了一种结合提案校准与分类优化的开放世界目标检测框架。其中,协助提案帮助器通过基于对象的类无关属性和边缘信息生成对象的候选区域,辅助未知探测区域建议网络在无监督的情况下准确识别未知物体提案。而类原型空间位置约束器模块包含提案特征聚合器和类原型分布约束器,前者对已知类物体分类,后者有效区分已知与未知类别,以此来解决未知物体误分类为已知类别的问题。在OWOD数据集上的广泛对比实验结果表明了该框架的有效性和优越性。 展开更多
关键词 开放世界目标检测 区域建议网络 未知物体检测 标签偏差 提案校准 分类优化 特征聚合
在线阅读 下载PDF
双先验引导的注意力特征聚合去雾生成对抗网络
20
作者 王燕 胡津源 +1 位作者 刘晶晶 陈燕燕 《计算机工程与科学》 北大核心 2025年第10期1841-1852,共12页
图像去雾是计算机视觉领域中一个具有挑战性的热点问题。现有的去雾方法通常使用单一的卷积神经网络(CNN)来解决问题,但此类方法缺乏细节恢复机制,并且在非均匀雾情况下去雾性能较差。为了解决上述2个问题,提出了一个双先验引导的注意... 图像去雾是计算机视觉领域中一个具有挑战性的热点问题。现有的去雾方法通常使用单一的卷积神经网络(CNN)来解决问题,但此类方法缺乏细节恢复机制,并且在非均匀雾情况下去雾性能较差。为了解决上述2个问题,提出了一个双先验引导的注意力特征聚合去雾生成对抗网络,暗通道先验和语义先验分别引导图像广义特征和纹理细节的恢复。其中,生成器采用参数共享编码器提取特征,添加了注意力特征聚合块(AFAB)对多尺度特征进行聚合增强,并通过解码多尺度特征恢复无雾图像,最后用多尺度判别器监督无雾图像的恢复。此外,考虑到图像中可能存在雾的不均匀分布,提出了坐标注意力残差块(CARB),它能自适应地分配权重,使网络关注图像的重要特征;同时,采用残差聚合的方式通过3个CARB构造了坐标注意力密集残差组(CARG),使得残差特征能被充分利用。实验结果表明,提出的网络在合成有雾图像数据集和现实有雾图像数据集上均表现优异。 展开更多
关键词 图像去雾 生成对抗网络 双先验引导 注意力特征聚合 参数共享编码器 坐标注意力
在线阅读 下载PDF
上一页 1 2 7 下一页 到第
使用帮助 返回顶部