期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Full Perception Head:Bridging the Gap Between Local and Global Features
1
作者 Jie Hua Zhongyuan Wang +3 位作者 Xin Tian Qin Zou Jinsheng Xiao Jiayi Ma 《IEEE/CAA Journal of Automatica Sinica》 2025年第7期1391-1406,共16页
Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image.Local features extracted by convolutions,etc.,capture finegrained details such as edges and te... Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image.Local features extracted by convolutions,etc.,capture finegrained details such as edges and textures,while global features extracted by full connection layers,etc.,represent the overall structure and long-range relationships within the image.These features are crucial for accurate object detection,yet most existing methods focus on aggregating local and global features,often overlooking the importance of medium-range dependencies.To address this gap,we propose a novel full perception module(FPModule),a simple yet effective feature extraction module designed to simultaneously capture local details,medium-range dependencies,and long-range dependencies.Building on this,we construct a full perception head(FP-Head)by cascading multiple FP-Modules,enabling the prediction layer to leverage the most informative features.Experimental results in the MS COCO dataset demonstrate that our approach significantly enhances object recognition and localization,achieving 2.7−5.7 APval gains when integrated into standard object detectors.Notably,the FP-Module is a universal solution that can be seamlessly incorporated into existing detectors to boost performance.The code will be released at https://github.com/Idcogroup/FP-Head. 展开更多
关键词 Feature aggregation full perception module medium-range dependencies object detection
在线阅读 下载PDF
A Survey of Adversarial Examples in Computer Vision:Attack,Defense,and Beyond
2
作者 XU Keyizhi LU Yajuan +1 位作者 WANG Zhongyuan LIANG Chao 《Wuhan University Journal of Natural Sciences》 2025年第1期1-20,共20页
Recent years have witnessed the ever-increasing performance of Deep Neural Networks(DNNs)in computer vision tasks.However,researchers have identified a potential vulnerability:carefully crafted adversarial examples ca... Recent years have witnessed the ever-increasing performance of Deep Neural Networks(DNNs)in computer vision tasks.However,researchers have identified a potential vulnerability:carefully crafted adversarial examples can easily mislead DNNs into incorrect behavior via the injection of imperceptible modification to the input data.In this survey,we focus on(1)adversarial attack algorithms to generate adversarial examples,(2)adversarial defense techniques to secure DNNs against adversarial examples,and(3)important problems in the realm of adversarial examples beyond attack and defense,including the theoretical explanations,trade-off issues and benign attacks in adversarial examples.Additionally,we draw a brief comparison between recently published surveys on adversarial examples,and identify the future directions for the research of adversarial examples,such as the generalization of methods and the understanding of transferability,that might be solutions to the open problems in this field. 展开更多
关键词 computer vision adversarial examples adversarial attack adversarial defense
原文传递
A lightweight distillation CNN-transformer architecture for remote sensing image super-resolution
3
作者 Yu Wang Zhenfeng Shao +5 位作者 Tao Lu Lifeng Liu Xiao Huang Jiaming Wang Kui Jiang Kangli Zeng 《International Journal of Digital Earth》 SCIE EI 2023年第1期3560-3579,共20页
Remote sensing images exhibit rich texture features and strong autocorrelation.Although the super-resolution(SR)method of remote sensing images based on convolutional neural networks(CNN)can capture rich local informa... Remote sensing images exhibit rich texture features and strong autocorrelation.Although the super-resolution(SR)method of remote sensing images based on convolutional neural networks(CNN)can capture rich local information,the limited perceptual field prevents it from establishing long-distance dependence on global information,leading to the low accuracy of remote sensing image reconstruction.Furthermore,it is difficult for existing SR methods to be deployed in mobile devices due to their large network parameters and high computational demand.In this study,we propose a lightweight distillation CNN-Transformer SR architecture,named DCTA,for remote sensing SR,addressing the aforementioned issues.Specifically,the proposed DCTA first extracts the coarse features through the coarse feature extraction layer and then learns the deep features of remote sensing at different scales by fusing the feature distillation extraction module of CNN and Transformer.In addition,we introduce the feature fusion module at the end of the feature distillation extraction module to control the information propagation,aiming to select the informative components for better feature fusion.The extracted low-resolution(LR)feature maps are reorganized through the up-sampling module to obtain high-resolution(HR)feature maps with high accuracy to generate highquality HR remote sensing images.The experiments comparing different methods demonstrate that the proposed approach performs well on multiple datasets,including NWPU-RESISC45,Draper,and UC Merced.This is achieved by balancing reconstruction performance and network complexity,resulting in both competitive subjective and objective results. 展开更多
关键词 SUPER-RESOLUTION remote sensing lightweight network CNN-Transformer
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部