期刊文献+
共找到483篇文章
< 1 2 25 >
每页显示 20 50 100
MEET:A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification With Zoom-Free Remote Sensing Imagery 被引量:1
1
作者 Yansheng Li Yuning Wu +9 位作者 Gong Cheng Chao Tao Bo Dang Yu Wang Jiahao Zhang Chuge Zhang Yiting Liu Xu Tang Jiayi Ma Yongjun Zhang 《IEEE/CAA Journal of Automatica Sinica》 2025年第5期1004-1023,共20页
Accurate fine-grained geospatial scene classification using remote sensing imagery is essential for a wide range of applications.However,existing approaches often rely on manually zooming remote sensing images at diff... Accurate fine-grained geospatial scene classification using remote sensing imagery is essential for a wide range of applications.However,existing approaches often rely on manually zooming remote sensing images at different scales to create typical scene samples.This approach fails to adequately support the fixed-resolution image interpretation requirements in real-world scenarios.To address this limitation,we introduce the million-scale fine-grained geospatial scene classification dataset(MEET),which contains over 1.03 million zoom-free remote sensing scene samples,manually annotated into 80 fine-grained categories.In MEET,each scene sample follows a scene-in-scene layout,where the central scene serves as the reference,and auxiliary scenes provide crucial spatial context for fine-grained classification.Moreover,to tackle the emerging challenge of scene-in-scene classification,we present the context-aware transformer(CAT),a model specifically designed for this task,which adaptively fuses spatial context to accurately classify the scene samples.CAT adaptively fuses spatial context to accurately classify the scene samples by learning attentional features that capture the relationships between the center and auxiliary scenes.Based on MEET,we establish a comprehensive benchmark for fine-grained geospatial scene classification,evaluating CAT against 11 competitive baselines.The results demonstrate that CAT significantly outperforms these baselines,achieving a 1.88%higher balanced accuracy(BA)with the Swin-Large backbone,and a notable 7.87%improvement with the Swin-Huge backbone.Further experiments validate the effectiveness of each module in CAT and show the practical applicability of CAT in the urban functional zone mapping.The source code and dataset will be publicly available at https://jerrywyn.github.io/project/MEET.html. 展开更多
关键词 Fine-grained geospatial scene classification(FGSC) million-scale dataset remote sensing imagery(RSI) scene-in-scene transformer
在线阅读 下载PDF
ERSNet:Lightweight Attention-Guided Network for Remote Sensing Scene Image Classification
2
作者 LIU Yunyu YUAN Jinpeng 《Journal of Geodesy and Geoinformation Science》 2025年第1期30-46,共17页
Remote sensing scene image classification is a prominent research area within remote sensing.Deep learningbased methods have been extensively utilized and have shown significant advancements in this field.Recent progr... Remote sensing scene image classification is a prominent research area within remote sensing.Deep learningbased methods have been extensively utilized and have shown significant advancements in this field.Recent progress in these methods primarily focuses on enhancing feature representation capabilities to improve performance.The challenge lies in the limited spatial resolution of small-sized remote sensing images,as well as image blurring and sparse data.These factors contribute to lower accuracy in current deep learning models.Additionally,deeper networks with attention-based modules require a substantial number of network parameters,leading to high computational costs and memory usage.In this article,we introduce ERSNet,a lightweight novel attention-guided network for remote sensing scene image classification.ERSNet is constructed using a deep separable convolutional network and incorporates an attention mechanism.It utilizes spatial attention,channel attention,and channel self-attention to enhance feature representation and accuracy,while also reducing computational complexity and memory usage.Experimental results indicate that,compared to existing state-of-the-art methods,ERSNet has a significantly lower parameter count of only 1.2 M and reduced Flops.It achieves the highest classification accuracy of 99.14%on the EuroSAT dataset,demonstrating its suitability for application on mobile terminal devices.Furthermore,experimental results from the UCMerced land use dataset and the Brazilian coffee scene also confirm the strong generalization ability of this method. 展开更多
关键词 deep learning remote sensing scene classification CNN
在线阅读 下载PDF
Shallow Convolutional Neural Networks for Acoustic Scene Classification 被引量:5
3
作者 LU Lu YANG Yuhong +2 位作者 JIANG Yuzhi AI Haojun TU Weiping 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2018年第2期178-184,共7页
Recently, deep neural networks, which include convolutional neural networks(CNNs), have been widely applied to acoustic scene classification(ASC). Motivated by the fact that some simplified CNNs have shown improve... Recently, deep neural networks, which include convolutional neural networks(CNNs), have been widely applied to acoustic scene classification(ASC). Motivated by the fact that some simplified CNNs have shown improvements over deep CNNs, such as Visual Geometry Group Net(VGG-Net), we have figured out how to simplify the VGG-Net style architecture to a shallow CNN with improved performance. Max pooling and batch normalization are also applied for better accuracy. With a series of controlled tests on detection and classification of acoustic scenes and events(DCASE) 2016 data sets, our shallow CNN achieves 6.7% improvement, and reduces time complexity to 5%, compared with the VGG-Net style CNN. 展开更多
关键词 acoustic scene classification convolutional neuralnetworks Mel-spectrogram
原文传递
Deep Scalogram Representations for Acoustic Scene Classification 被引量:5
4
作者 Zhao Ren Kun Qian +3 位作者 Zixing Zhang Vedhas Pandit Alice Baird Bjorn Schuller 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2018年第3期662-669,共8页
Spectrogram representations of acoustic scenes have achieved competitive performance for acoustic scene classification. Yet, the spectrogram alone does not take into account a substantial amount of time-frequency info... Spectrogram representations of acoustic scenes have achieved competitive performance for acoustic scene classification. Yet, the spectrogram alone does not take into account a substantial amount of time-frequency information. In this study, we present an approach for exploring the benefits of deep scalogram representations, extracted in segments from an audio stream. The approach presented firstly transforms the segmented acoustic scenes into bump and morse scalograms, as well as spectrograms; secondly, the spectrograms or scalograms are sent into pre-trained convolutional neural networks; thirdly,the features extracted from a subsequent fully connected layer are fed into(bidirectional) gated recurrent neural networks, which are followed by a single highway layer and a softmax layer;finally, predictions from these three systems are fused by a margin sampling value strategy. We then evaluate the proposed approach using the acoustic scene classification data set of 2017 IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events(DCASE). On the evaluation set, an accuracy of 64.0 % from bidirectional gated recurrent neural networks is obtained when fusing the spectrogram and the bump scalogram, which is an improvement on the 61.0 % baseline result provided by the DCASE 2017 organisers. This result shows that extracted bump scalograms are capable of improving the classification accuracy,when fusing with a spectrogram-based system. 展开更多
关键词 Acoustic scene classification(ASC) (bidirectional) gated recurrent neural networks((B) GRNNs) convolutional neural networks(CNNs) deep scalogram representation spectrogram representation
在线阅读 下载PDF
Semi-supervised remote sensing image scene classification with prototype-based consistency 被引量:2
5
作者 Yang LI Zhang LI +2 位作者 Zi WANG Kun WANG Qifeng YU 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2024年第2期459-470,共12页
Deep learning significantly improves the accuracy of remote sensing image scene classification,benefiting from the large-scale datasets.However,annotating the remote sensing images is time-consuming and even tough for... Deep learning significantly improves the accuracy of remote sensing image scene classification,benefiting from the large-scale datasets.However,annotating the remote sensing images is time-consuming and even tough for experts.Deep neural networks trained using a few labeled samples usually generalize less to new unseen images.In this paper,we propose a semi-supervised approach for remote sensing image scene classification based on the prototype-based consistency,by exploring massive unlabeled images.To this end,we,first,propose a feature enhancement module to extract discriminative features.This is achieved by focusing the model on the foreground areas.Then,the prototype-based classifier is introduced to the framework,which is used to acquire consistent feature representations.We conduct a series of experiments on NWPU-RESISC45 and Aerial Image Dataset(AID).Our method improves the State-Of-The-Art(SOTA)method on NWPU-RESISC45 from 92.03%to 93.08%and on AID from 94.25%to 95.24%in terms of accuracy. 展开更多
关键词 Semi-supervised learning Remote sensing scene classification Prototype network Deep learning
原文传递
Learning Multi-Modality Features for Scene Classification of High-Resolution Remote Sensing Images 被引量:1
6
作者 Feng’an Zhao Xiongmei Zhang +2 位作者 Xiaodong Mu Zhaoxiang Yi Zhou Yang 《Journal of Computer and Communications》 2018年第11期185-193,共9页
Scene classification of high-resolution remote sensing (HRRS) image is an important research topic and has been applied broadly in many fields. Deep learning method has shown its high potential to in this domain, owin... Scene classification of high-resolution remote sensing (HRRS) image is an important research topic and has been applied broadly in many fields. Deep learning method has shown its high potential to in this domain, owing to its powerful learning ability of characterizing complex patterns. However the deep learning methods omit some global and local information of the HRRS image. To this end, in this article we show efforts to adopt explicit global and local information to provide complementary information to deep models. Specifically, we use a patch based MS-CLBP method to acquire global and local representations, and then we consider a pretrained CNN model as a feature extractor and extract deep hierarchical features from full-connection layers. After fisher vector (FV) encoding, we obtain the holistic visual representation of the scene image. We view the scene classification as a reconstruction procedure and train several class-specific stack denoising autoencoders (SDAEs) of corresponding class, i.e., one SDAE per class, and classify the test image according to the reconstruction error. Experimental results show that our combination method outperforms the state-of-the-art deep learning classification methods without employing fine-tuning. 展开更多
关键词 FEATURE Fusion Multiple FEATURES scene classification STACK DENOISING Autoencoder
在线阅读 下载PDF
TP-MobNet: A Two-pass Mobile Network for Low-complexity Classification of Acoustic Scene 被引量:1
7
作者 Soonshin Seo Junseok Oh +3 位作者 Eunsoo Cho Hosung Park Gyujin Kim Ji-Hwan Kim 《Computers, Materials & Continua》 SCIE EI 2022年第11期3291-3303,共13页
Acoustic scene classification(ASC)is a method of recognizing and classifying environments that employ acoustic signals.Various ASC approaches based on deep learning have been developed,with convolutional neural networ... Acoustic scene classification(ASC)is a method of recognizing and classifying environments that employ acoustic signals.Various ASC approaches based on deep learning have been developed,with convolutional neural networks(CNNs)proving to be the most reliable and commonly utilized in ASC systems due to their suitability for constructing lightweight models.When using ASC systems in the real world,model complexity and device robustness are essential considerations.In this paper,we propose a two-pass mobile network for low-complexity classification of the acoustic scene,named TP-MobNet.With inverse residuals and linear bottlenecks,TPMobNet is based on MobileNetV2,and following mobile blocks,coordinate attention and two-pass fusion approaches are utilized.The log-range dependencies and precise position information in feature maps can be trained via coordinate attention.By capturing more diverse feature resolutions at the network’s end sides,two-pass fusions can also train generalization.Also,the model size is reduced by applying weight quantization to the trained model.By adding weight quantization to the trained model,the model size is also lowered.The TAU Urban Acoustic Scenes 2020 Mobile development set was used for all of the experiments.It has been confirmed that the proposed model,with a model size of 219.6 kB,achieves an accuracy of 73.94%. 展开更多
关键词 Acoustic scene classification LOW-COMPLEXITY device robustness two-pass mobile network coordinate attention weight quantization
在线阅读 下载PDF
Complex Traffic Scene Image Classification Based on Sparse Optimization Boundary Semantics Deep Learning 被引量:1
8
作者 ZHOU Xiwei WANG Huifeng +2 位作者 LI Saisai PENG Haonan WU Jianfeng 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2023年第2期150-162,共13页
With the rapid development of intelligent traffic information monitoring technology,accurate identification of vehicles,pedestrians and other objects on the road has become particularly important.Therefore,in order to... With the rapid development of intelligent traffic information monitoring technology,accurate identification of vehicles,pedestrians and other objects on the road has become particularly important.Therefore,in order to improve the recognition and classification accuracy of image objects in complex traffic scenes,this paper proposes a segmentation method of semantic redefine segmentation using image boundary region.First,we use the Seg Net semantic segmentation model to obtain the rough classification features of the vehicle road object,then use the simple linear iterative clustering(SLIC)algorithm to obtain the over segmented area of the image,which can determine the classification of each pixel in each super pixel area,and then optimize the target segmentation of the boundary and small areas in the vehicle road image.Finally,the edge recovery ability of condition random field(CRF)is used to refine the image boundary.The experimental results show that compared with FCN-8s and Seg Net,the pixel accuracy of the proposed algorithm in this paper improves by 2.33%and 0.57%,respectively.And compared with Unet,the algorithm in this paper performs better when dealing with multi-target segmentation. 展开更多
关键词 traffic scene SegNet image classification simple linear iterative clustering(SLIC) conditional random field boundary number
原文传递
Hybrid HRNet-Swin Transformer:Multi-Scale Feature Fusion for Aerial Segmentation and Classification
9
作者 Asaad Algarni Aysha Naseer +3 位作者 Mohammed Alshehri Yahya AlQahtani Abdulmonem Alshahrani Jeongmin Park 《Computers, Materials & Continua》 2025年第10期1981-1998,共18页
Remote sensing plays a pivotal role in environmental monitoring,disaster relief,and urban planning,where accurate scene classification of aerial images is essential.However,conventional convolutional neural networks(C... Remote sensing plays a pivotal role in environmental monitoring,disaster relief,and urban planning,where accurate scene classification of aerial images is essential.However,conventional convolutional neural networks(CNNs)struggle with long-range dependencies and preserving high-resolution features,limiting their effectiveness in complex aerial image analysis.To address these challenges,we propose a Hybrid HRNet-Swin Transformer model that synergizes the strengths of HRNet-W48 for high-resolution segmentation and the Swin Transformer for global feature extraction.This hybrid architecture ensures robust multi-scale feature fusion,capturing fine-grained details and broader contextual relationships in aerial imagery.Our methodology begins with preprocessing steps,including normalization,histogram equalization,and noise reduction,to enhance input data quality.The HRNet-W48 backbone maintains high-resolution feature maps throughout the network,enabling precise segmentation,while the Swin Transformer leverages hierarchical self-attention to model long-range dependencies efficiently.By integrating these components,our model achieves superior performance in segmentation and classification tasks compared to traditional CNNs and standalone transformer models.We evaluate our approach on two benchmark datasets:UC Merced and WHU-RS19.Experimental results demonstrate that the proposed hybrid model outperforms existing methods,achieving state-of-the-art accuracy while maintaining computational efficiency.Specifically,it excels in preserving fine spatial details and contextual understanding,critical for applications like land-use classification and disaster assessment. 展开更多
关键词 Remote sensing computer vision aerial imagery scene classification feature extraction TRANSFORMER
在线阅读 下载PDF
Natural Scene Classification Inspired by Visual Perception and Cognition Mechanisms
10
作者 ZHANG Rui 《重庆理工大学学报(自然科学)》 CAS 2011年第7期24-43,共20页
The process of human natural scene categorization consists of two correlated stages: visual perception and visual cognition of natural scenes.Inspired by this fact,we propose a biologically plausible approach for natu... The process of human natural scene categorization consists of two correlated stages: visual perception and visual cognition of natural scenes.Inspired by this fact,we propose a biologically plausible approach for natural scene image classification.This approach consists of one visual perception model and two visual cognition models.The visual perception model,composed of two steps,is used to extract discriminative features from natural scene images.In the first step,we mimic the oriented and bandpass properties of human primary visual cortex by a special complex wavelets transform,which can decompose a natural scene image into a series of 2D spatial structure signals.In the second step,a hybrid statistical feature extraction method is used to generate gist features from those 2D spatial structure signals.Then we design a cognitive feedback model to realize adaptive optimization for the visual perception model.At last,we build a multiple semantics based cognition model to imitate human cognitive mode in rapid natural scene categorization.Experiments on natural scene datasets show that the proposed method achieves high efficiency and accuracy for natural scene classification. 展开更多
关键词 natural scene classification visual perception model visual cognition model
在线阅读 下载PDF
Intelligent Deep Data Analytics Based Remote Sensing Scene Classification Model
11
作者 Ahmed Althobaiti Abdullah Alhumaidi Alotaibi +2 位作者 Sayed Abdel-Khalek Suliman A.Alsuhibany Romany F.Mansour 《Computers, Materials & Continua》 SCIE EI 2022年第7期1921-1938,共18页
Latest advancements in the integration of camera sensors paves a way for newUnmannedAerialVehicles(UAVs)applications such as analyzing geographical(spatial)variations of earth science in mitigating harmful environment... Latest advancements in the integration of camera sensors paves a way for newUnmannedAerialVehicles(UAVs)applications such as analyzing geographical(spatial)variations of earth science in mitigating harmful environmental impacts and climate change.UAVs have achieved significant attention as a remote sensing environment,which captures high-resolution images from different scenes such as land,forest fire,flooding threats,road collision,landslides,and so on to enhance data analysis and decision making.Dynamic scene classification has attracted much attention in the examination of earth data captured by UAVs.This paper proposes a new multi-modal fusion based earth data classification(MMF-EDC)model.The MMF-EDC technique aims to identify the patterns that exist in the earth data and classifies them into appropriate class labels.The MMF-EDC technique involves a fusion of histogram of gradients(HOG),local binary patterns(LBP),and residual network(ResNet)models.This fusion process integrates many feature vectors and an entropy based fusion process is carried out to enhance the classification performance.In addition,the quantum artificial flora optimization(QAFO)algorithm is applied as a hyperparameter optimization technique.The AFO algorithm is inspired by the reproduction and the migration of flora helps to decide the optimal parameters of the ResNet model namely learning rate,number of hidden layers,and their number of neurons.Besides,Variational Autoencoder(VAE)based classification model is applied to assign appropriate class labels for a useful set of feature vectors.The proposedMMF-EDCmodel has been tested using UCM and WHU-RS datasets.The proposed MMFEDC model attains exhibits promising classification results on the applied remote sensing images with the accuracy of 0.989 and 0.994 on the test UCM and WHU-RS dataset respectively. 展开更多
关键词 Remote sensing unmanned aerial vehicles deep learning artificial intelligence scene classification
在线阅读 下载PDF
Adaptive Binary Coding for Scene Classification Based on Convolutional Networks
12
作者 Shuai Wang Xianyi Chen 《Computers, Materials & Continua》 SCIE EI 2020年第12期2065-2077,共13页
With the rapid development of computer technology,millions of images are produced everyday by different sources.How to efficiently process these images and accurately discern the scene in them becomes an important but... With the rapid development of computer technology,millions of images are produced everyday by different sources.How to efficiently process these images and accurately discern the scene in them becomes an important but tough task.In this paper,we propose a novel supervised learning framework based on proposed adaptive binary coding for scene classification.Specifically,we first extract some high-level features of images under consideration based on available models trained on public datasets.Then,we further design a binary encoding method called one-hot encoding to make the feature representation more efficient.Benefiting from the proposed adaptive binary coding,our method is free of time to train or fine-tune the deep network and can effectively handle different applications.Experimental results on three public datasets,i.e.,UIUC sports event dataset,MIT Indoor dataset,and UC Merced dataset in terms of three different classifiers,demonstrate that our method is superior to the state-of-the-art methods with large margins. 展开更多
关键词 scene classification convolutional neural network one-hot encoding supervised feature training
在线阅读 下载PDF
A More Efficient Approach for Remote Sensing Image Classification 被引量:1
13
作者 Huaxiang Song 《Computers, Materials & Continua》 SCIE EI 2023年第3期5741-5756,共16页
Over the past decade,the significant growth of the convolutional neural network(CNN)based on deep learning(DL)approaches has greatly improved the machine learning(ML)algorithm’s performance on the semantic scene clas... Over the past decade,the significant growth of the convolutional neural network(CNN)based on deep learning(DL)approaches has greatly improved the machine learning(ML)algorithm’s performance on the semantic scene classification(SSC)of remote sensing images(RSI).However,the unbalanced attention to classification accuracy and efficiency has made the superiority of DL-based algorithms,e.g.,automation and simplicity,partially lost.Traditional ML strategies(e.g.,the handcrafted features or indicators)and accuracy-aimed strategies with a high trade-off(e.g.,the multi-stage CNNs and ensemble of multi-CNNs)are widely used without any training efficiency optimization involved,which may result in suboptimal performance.To address this problem,we propose a fast and simple training CNN framework(named FST-EfficientNet)for RSI-SSC based on an EfficientNetversion2 small(EfficientNetV2-S)CNN model.The whole algorithm flow is completely one-stage and end-to-end without any handcrafted features or discriminators introduced.In the implementation of training efficiency optimization,only several routine data augmentation tricks coupled with a fixed ratio of resolution or a gradually increasing resolution strategy are employed,so that the algorithm’s trade-off is very cheap.The performance evaluation shows that our FST-EfficientNet achieves new state-of-the-art(SOTA)records in the overall accuracy(OA)with about 0.8%to 2.7%ahead of all earlier methods on the Aerial Image Dataset(AID)and Northwestern Poly-technical University Remote Sensing Image Scene Classification 45 Dataset(NWPU-RESISC45D).Meanwhile,the results also demonstrate the importance and indispensability of training efficiency optimization strategies for RSI-SSC by DL.In fact,it is not necessary to gain better classification accuracy by completely relying on an excessive trade-off without efficiency.Ultimately,these findings are expected to contribute to the development of more efficient CNN-based approaches in RSI-SSC. 展开更多
关键词 FST-EfficientNet efficient approach scene classification remote sensing deep learning
在线阅读 下载PDF
AV-FDTI:Audio-visual fusion for drone threat identification 被引量:1
14
作者 Yizhuo Yang Shenghai Yuan +5 位作者 Jianfei Yang Thien Hoang Nguyen Muqing Cao Thien-Minh Nguyen Han Wang Lihua Xie 《Journal of Automation and Intelligence》 2024年第3期144-151,共8页
In response to the evolving challenges posed by small unmanned aerial vehicles(UAVs),which have the potential to transport harmful payloads or cause significant damage,we present AV-FDTI,an innovative Audio-Visual Fus... In response to the evolving challenges posed by small unmanned aerial vehicles(UAVs),which have the potential to transport harmful payloads or cause significant damage,we present AV-FDTI,an innovative Audio-Visual Fusion system designed for Drone Threat Identification.AV-FDTI leverages the fusion of audio and omnidirectional camera feature inputs,providing a comprehensive solution to enhance the precision and resilience of drone classification and 3D localization.Specifically,AV-FDTI employs a CRNN network to capture vital temporal dynamics within the audio domain and utilizes a pretrained ResNet50 model for image feature extraction.Furthermore,we adopt a visual information entropy and cross-attention-based mechanism to enhance the fusion of visual and audio data.Notably,our system is trained based on automated Leica tracking annotations,offering accurate ground truth data with millimeter-level accuracy.Comprehensive comparative evaluations demonstrate the superiority of our solution over the existing systems.In our commitment to advancing this field,we will release this work as open-source code and wearable AV-FDTI design,contributing valuable resources to the research community. 展开更多
关键词 audio-visual fusion Anti-UAV Multi-modal fusion classification 3D localization Self-attention
在线阅读 下载PDF
HybridLSTM:An Innovative Method for Road Scene Categorization Employing Hybrid Features
15
作者 Sanjay P.Pande Sarika Khandelwal +4 位作者 Ganesh K.Yenurkar Rakhi D.Wajgi Vincent O.Nyangaresi Pratik R.Hajare Poonam T.Agarkar 《Computers, Materials & Continua》 2025年第9期5937-5975,共39页
Recognizing road scene context from a single image remains a critical challenge for intelligent autonomous driving systems,particularly in dynamic and unstructured environments.While recent advancements in deep learni... Recognizing road scene context from a single image remains a critical challenge for intelligent autonomous driving systems,particularly in dynamic and unstructured environments.While recent advancements in deep learning have significantly enhanced road scene classification,simultaneously achieving high accuracy,computational efficiency,and adaptability across diverse conditions continues to be difficult.To address these challenges,this study proposes HybridLSTM,a novel and efficient framework that integrates deep learning-based,object-based,and handcrafted feature extraction methods within a unified architecture.HybridLSTM is designed to classify four distinct road scene categories—crosswalk(CW),highway(HW),overpass/tunnel(OP/T),and parking(P)—by leveraging multiple publicly available datasets,including Places-365,BDD100K,LabelMe,and KITTI,thereby promoting domain generalization.The framework fuses object-level features extracted using YOLOv5 and VGG19,scene-level global representations obtained from a modified VGG19,and fine-grained texture features captured through eight handcrafted descriptors.This hybrid feature fusion enables the model to capture both semantic context and low-level visual cues,which are critical for robust scene understanding.To model spatial arrangements and latent sequential dependencies present even in static imagery,the combined features are processed through a Long Short-Term Memory(LSTM)network,allowing the extraction of discriminative patterns across heterogeneous feature spaces.Extensive experiments conducted on 2725 annotated road scene images,with an 80:20 training-to-testing split,validate the effectiveness of the proposed model.HybridLSTM achieves a classification accuracy of 96.3%,a precision of 95.8%,a recall of 96.1%,and an F1-score of 96.0%,outperforming several existing state-of-the-art methods.These results demonstrate the robustness,scalability,and generalization capability of HybridLSTM across varying environments and scene complexities.Moreover,the framework is optimized to balance classification performance with computational efficiency,making it highly suitable for real-time deployment in embedded autonomous driving systems.Future work will focus on extending the model to multi-class detection within a single frame and optimizing it further for edge-device deployments to reduce computational overhead in practical applications. 展开更多
关键词 HybridLSTM autonomous vehicles road scene classification critical requirement global features handcrafted features
在线阅读 下载PDF
基于遥感图像场景分类的频域量化对抗攻击
16
作者 王熠 李智 +3 位作者 张丽 石雪丽 刘登波 卢妤 《计算机工程》 北大核心 2026年第1期266-281,共16页
深度神经网络在遥感图像的场景分类任务中取得巨大成功。然而,由于对抗样本具有较强的可迁移性,基于遥感图像的场景分类网络的脆弱性不容忽视。为了增强遥感图像场景分类网络的鲁棒性,确保其在各种环境和条件下的可靠性和安全性,有效提... 深度神经网络在遥感图像的场景分类任务中取得巨大成功。然而,由于对抗样本具有较强的可迁移性,基于遥感图像的场景分类网络的脆弱性不容忽视。为了增强遥感图像场景分类网络的鲁棒性,确保其在各种环境和条件下的可靠性和安全性,有效提高其实际应用价值,提出一种频域的量化对抗攻击(FDQ)方法。首先,将输入图像进行离散余弦变换(DCT),在频域中利用量化筛选器有效捕捉使图像正确分类的关键特征在频域中的突出区域;然后,提出一个基于类的注意力损失,使得量化筛选器逐渐丢失这些使图像正确分类的关键特征,模型的注意力逐渐偏离与原始类别毫不相干的特征和区域。所提方法利用模型的注意力分布实现特征层级的黑盒攻击,通过找到不同网络中的共同防御漏洞,实现针对遥感图像生成且具有通用性的对抗样本。实验结果表明,FDQ方法可在遥感图像场景分类任务中成功攻击大多数最先进的深度神经网络,与目前最先进的基于遥感图像场景分类任务的攻击方法相比,FDQ在基准数据集UCM和AID上基于RegNetX-400MF架构的攻击成功率分别提高了35.43%和23.63%。实验表明FDQ具有良好的攻击性和可迁移性,很难被防御系统抵御。 展开更多
关键词 对抗攻击 对抗样本 深度神经网络 遥感图像 场景分类 黑盒攻击
在线阅读 下载PDF
基于多尺度注意力机制的遥感图像复杂场景分类方法研究
17
作者 赵君君 《中国高新科技》 2026年第3期145-146,149,共3页
文章针对遥感图像中复杂场景类别多样、目标尺度差异显著、语义信息提取困难等问题,提出了一种基于多尺度注意力机制的遥感图像复杂场景分类方法。该方法首先采用多尺度卷积神经网络提取不同尺度的空间特征信息,有效增强模型对不同目标... 文章针对遥感图像中复杂场景类别多样、目标尺度差异显著、语义信息提取困难等问题,提出了一种基于多尺度注意力机制的遥感图像复杂场景分类方法。该方法首先采用多尺度卷积神经网络提取不同尺度的空间特征信息,有效增强模型对不同目标尺寸的感知能力;随后,引入注意力机制对多尺度特征进行加权融合,突出关键信息,抑制冗余干扰;最后,利用全连接分类器完成图像场景的分类判断。实验在多个遥感图像公开数据集上进行验证。结果表明,所提方法在准确率、召回率及F1分数等指标上均优于现有主流方法,具有较强的鲁棒性和应用前景。 展开更多
关键词 遥感图像 场景分类 多尺度特征 注意力机制
在线阅读 下载PDF
基于多场景动态配置的智能体路由系统
18
作者 刘航 刘为 +2 位作者 金云智 甘永年 郭龙 《信息与电脑》 2026年第2期191-194,共4页
随着大模型在全球的普及应用,各行业的智能问答系统中都引入了不同领域的智能体,而用户往往需要统一的问答系统。因此,文章提出一种智能体路由方法。该方法基于贝叶斯场景分类和智能体命名模糊匹配,自动配置智能体运行环境并调用相应领... 随着大模型在全球的普及应用,各行业的智能问答系统中都引入了不同领域的智能体,而用户往往需要统一的问答系统。因此,文章提出一种智能体路由方法。该方法基于贝叶斯场景分类和智能体命名模糊匹配,自动配置智能体运行环境并调用相应领域的智能体。结果表明,在动态配置智能体运行环境的条件下,问答系统具有较高的反馈F1值。 展开更多
关键词 大语言模型 场景分类 动态配置 智能体运行环境 智能体路由
在线阅读 下载PDF
预训练模型范式迁移视角下的遥感影像技术分析
19
作者 李冰 侯锐 +3 位作者 杨晓艳 石涛 孟令通 赵琛浩 《计算机技术与发展》 2026年第1期212-220,211,共10页
遥感影像分析作为地理信息科学的核心领域,经历了从专家分析到半自动分析,再到融合多维度、多层次的高分辨率遥感图像自动化分析技术的阶段式跃迁。在计算机视觉与深度学习的驱动下,遥感图像分析领域的研究呈现方法多样化与领域专精化... 遥感影像分析作为地理信息科学的核心领域,经历了从专家分析到半自动分析,再到融合多维度、多层次的高分辨率遥感图像自动化分析技术的阶段式跃迁。在计算机视觉与深度学习的驱动下,遥感图像分析领域的研究呈现方法多样化与领域专精化趋势。该文通过对58篇高水平文献开展系统性荟萃分析,聚焦场景分类、图像检索与分割三大方向,从特征提取、语义分析等维度对比核心数据集、方法差异及评价体系。研究发现:当前研究存在多模态数据融合效率不足、任务集成度有限等瓶颈,提出构建开源共享平台、优化大模型驱动下的特征提取精度、强化多任务协同框架等发展路径。该文不仅填补了GIS领域遥感影像荟萃分析的空白,更为技术迭代提供理论依据与优化方向。 展开更多
关键词 遥感影像分析 场景分类 特征提取 语义分析 图像检索 目标分割 荟萃分析
在线阅读 下载PDF
面向复杂自然场景的遥感地学分区智能解译框架及初探 被引量:2
20
作者 王志华 杨晓梅 +4 位作者 张俊瑶 刘晓亮 李连发 董文 贺伟 《地球信息科学学报》 北大核心 2025年第2期305-330,共26页
【目的】当下,面向多圈层耦合、人类干扰强烈的复杂自然场景遥感智能解译在地学研究和实际业务中常存在不好用的问题。为此,本文从遥感地学认知原理角度出发,在明晰遥感智能解译的使命是依托遥感大数据更好地辅助建立数字地球之后,认为... 【目的】当下,面向多圈层耦合、人类干扰强烈的复杂自然场景遥感智能解译在地学研究和实际业务中常存在不好用的问题。为此,本文从遥感地学认知原理角度出发,在明晰遥感智能解译的使命是依托遥感大数据更好地辅助建立数字地球之后,认为达成一致的知识表征模型是解决问题的关键,进而提出遥感解译与地学认知应该耦合为一个系统,以实现“数据获取知识”与“知识引导数据”的双向驱动。【分析】在此基础上,提出以遥感地学分区为纽带的智能解译框架,以打通已有地学知识向遥感智能解译过程的输入与引导,增加解译结果与已有地学知识体系的匹配度。该框架主要依靠定量化的场景复杂性度量和地理分区知识耦合,形成面向遥感智能解译的地学分区方法以及分区样本抽样与规范,从而实现面向大区域的知识耦合下分区解译策略。【展望】通过复杂度与优化抽样实验、影像分区分割尺度优选、耕地类型细分等实验,初步揭示了本框架思路在优选样本、影像分割、耕地精细类型识别等遥感智能解译多方面均存在巨大潜力。 展开更多
关键词 遥感大数据 数字地球 遥感智能解译 信息提取 地理分区/区划 土地利用/覆被分类 复杂自然场景 场景分类 地学知识图谱
原文传递
上一页 1 2 25 下一页 到第
使用帮助 返回顶部