期刊文献+
共找到5,585篇文章
< 1 2 250 >
每页显示 20 50 100
Spectral matching algorithm based on nonsubsampled contourlet transform and scale-invariant feature transform 被引量:4
1
作者 Dong Liang Pu Yan +2 位作者 Ming Zhu Yizheng Fan Kui Wang 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2012年第3期453-459,共7页
A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low freq... A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy. 展开更多
关键词 point pattern matching nonsubsampled contourlet transform scale-invariant feature transform spectral algorithm.
在线阅读 下载PDF
Algorithm Based on Morphological Component Analysis and Scale-Invariant Feature Transform for Image Registration 被引量:1
2
作者 王刚 李京娜 +3 位作者 苏庆堂 张小峰 吕高焕 王洪刚 《Journal of Shanghai Jiaotong university(Science)》 EI 2017年第1期99-106,共8页
In this paper, we proposed a registration method by combining the morphological component analysis(MCA) and scale-invariant feature transform(SIFT) algorithm. This method uses the perception dictionaries,and combines ... In this paper, we proposed a registration method by combining the morphological component analysis(MCA) and scale-invariant feature transform(SIFT) algorithm. This method uses the perception dictionaries,and combines the Basis-Pursuit algorithm and the Total-Variation regularization scheme to extract the cartoon part containing basic geometrical information from the original image, and is stable and unsusceptible to noise interference. Then a smaller number of the distinctive key points will be obtained by using the SIFT algorithm based on the cartoon part of the original image. Matching the key points by the constrained Euclidean distance,we will obtain a more correct and robust matching result. The experimental results show that the geometrical transform parameters inferred by the matched key points based on MCA+SIFT registration method are more exact than the ones based on the direct SIFT algorithm. 展开更多
关键词 image registration morphological component analysis (MCA) scale-invariant feature transform (SIFT) key point matching TN 911 A
原文传递
Mosaic of the Curved Human Retinal Images Based on the Scale-Invariant Feature Transform
3
作者 LI Ju-peng CHEN Hou-jin +1 位作者 ZHANG Xin-yuan YAO Chang 《Chinese Journal of Biomedical Engineering(English Edition)》 2008年第2期71-78,共8页
To meet the needs in the fundus examination,including outlook widening,pathology tracking,etc.,this paper describes a robust feature-based method for fully-automatic mosaic of the curved human retinal images photograp... To meet the needs in the fundus examination,including outlook widening,pathology tracking,etc.,this paper describes a robust feature-based method for fully-automatic mosaic of the curved human retinal images photographed by a fundus microscope. The kernel of this new algorithm is the scale-,rotation-and illumination-invariant interest point detector & feature descriptor-Scale-Invariant Feature Transform. When matched interest points according to second-nearest-neighbor strategy,the parameters of the model are estimated using the correct matches of the interest points,extracted by a new inlier identification scheme based on Sampson distance from putative sets. In order to preserve image features,bilinear warping and multi-band blending techniques are used to create panoramic retinal images. Experiments show that the proposed method works well with rejection error in 0.3 pixels,even for those cases where the retinal images without discernable vascular structure in contrast to the state-of-the-art algorithms. 展开更多
关键词 images mosaic retinal image scale-invariant feature transform inlier identification
在线阅读 下载PDF
Hybrid HRNet-Swin Transformer:Multi-Scale Feature Fusion for Aerial Segmentation and Classification
4
作者 Asaad Algarni Aysha Naseer +3 位作者 Mohammed Alshehri Yahya AlQahtani Abdulmonem Alshahrani Jeongmin Park 《Computers, Materials & Continua》 2025年第10期1981-1998,共18页
Remote sensing plays a pivotal role in environmental monitoring,disaster relief,and urban planning,where accurate scene classification of aerial images is essential.However,conventional convolutional neural networks(C... Remote sensing plays a pivotal role in environmental monitoring,disaster relief,and urban planning,where accurate scene classification of aerial images is essential.However,conventional convolutional neural networks(CNNs)struggle with long-range dependencies and preserving high-resolution features,limiting their effectiveness in complex aerial image analysis.To address these challenges,we propose a Hybrid HRNet-Swin Transformer model that synergizes the strengths of HRNet-W48 for high-resolution segmentation and the Swin Transformer for global feature extraction.This hybrid architecture ensures robust multi-scale feature fusion,capturing fine-grained details and broader contextual relationships in aerial imagery.Our methodology begins with preprocessing steps,including normalization,histogram equalization,and noise reduction,to enhance input data quality.The HRNet-W48 backbone maintains high-resolution feature maps throughout the network,enabling precise segmentation,while the Swin Transformer leverages hierarchical self-attention to model long-range dependencies efficiently.By integrating these components,our model achieves superior performance in segmentation and classification tasks compared to traditional CNNs and standalone transformer models.We evaluate our approach on two benchmark datasets:UC Merced and WHU-RS19.Experimental results demonstrate that the proposed hybrid model outperforms existing methods,achieving state-of-the-art accuracy while maintaining computational efficiency.Specifically,it excels in preserving fine spatial details and contextual understanding,critical for applications like land-use classification and disaster assessment. 展开更多
关键词 Remote sensing computer vision aerial imagery scene classification feature extraction transformER
在线阅读 下载PDF
A Generative Image Steganography Based on Disentangled Attribute Feature Transformation and Invertible Mapping Rule
5
作者 Xiang Zhang Shenyan Han +1 位作者 Wenbin Huang Daoyong Fu 《Computers, Materials & Continua》 2025年第4期1149-1171,共23页
Generative image steganography is a technique that directly generates stego images from secret infor-mation.Unlike traditional methods,it theoretically resists steganalysis because there is no cover image.Currently,th... Generative image steganography is a technique that directly generates stego images from secret infor-mation.Unlike traditional methods,it theoretically resists steganalysis because there is no cover image.Currently,the existing generative image steganography methods generally have good steganography performance,but there is still potential room for enhancing both the quality of stego images and the accuracy of secret information extraction.Therefore,this paper proposes a generative image steganography algorithm based on attribute feature transformation and invertible mapping rule.Firstly,the reference image is disentangled by a content and an attribute encoder to obtain content features and attribute features,respectively.Then,a mean mapping rule is introduced to map the binary secret information into a noise vector,conforming to the distribution of attribute features.This noise vector is input into the generator to produce the attribute transformed stego image with the content feature of the reference image.Additionally,we design an adversarial loss,a reconstruction loss,and an image diversity loss to train the proposed model.Experimental results demonstrate that the stego images generated by the proposed method are of high quality,with an average extraction accuracy of 99.4%for the hidden information.Furthermore,since the stego image has a uniform distribution similar to the attribute-transformed image without secret information,it effectively resists both subjective and objective steganalysis. 展开更多
关键词 Image information hiding generative information hiding disentangled attribute feature transformation invertible mapping rule steganalysis resistance
在线阅读 下载PDF
Exploring High Dimensional Feature Space With Channel-Spatial Nonlinear Transforms for Learned Image Compression
6
作者 Wen Tan Fanyang Meng +2 位作者 Chao Li Youneng Bao Yongsheng Liang 《CAAI Transactions on Intelligence Technology》 2025年第4期1235-1253,共19页
Nonlinear transforms have significantly advanced learned image compression(LIC),particularly using residual blocks.This transform enhances the nonlinear expression ability and obtain compact feature representation by ... Nonlinear transforms have significantly advanced learned image compression(LIC),particularly using residual blocks.This transform enhances the nonlinear expression ability and obtain compact feature representation by enlarging the receptive field,which indicates how the convolution process extracts features in a high dimensional feature space.However,its functionality is restricted to the spatial dimension and network depth,limiting further improvements in network performance due to insufficient information interaction and representation.Crucially,the potential of high dimensional feature space in the channel dimension and the exploration of network width/resolution remain largely untapped.In this paper,we consider nonlinear transforms from the perspective of feature space,defining high-dimensional feature spaces in different dimensions and investigating the specific effects.Firstly,we introduce the dimension increasing and decreasing transforms in both channel and spatial dimensions to obtain high dimensional feature space and achieve better feature extraction.Secondly,we design a channel-spatial fusion residual transform(CSR),which incorporates multi-dimensional transforms for a more effective representation.Furthermore,we simplify the proposed fusion transform to obtain a slim architecture(CSR-sm),balancing network complexity and compression performance.Finally,we build the overall network with stacked CSR transforms to achieve better compression and reconstruction.Experimental results demonstrate that the proposed method can achieve superior ratedistortion performance compared to the existing LIC methods and traditional codecs.Specifically,our proposed method achieves 9.38%BD-rate reduction over VVC on Kodak dataset. 展开更多
关键词 high dimensional feature space learned image compression nonlinear transform the dimension increase and decrease
在线阅读 下载PDF
Fast uniform content-based satellite image registration using the scale-invariant feature transform descriptor 被引量:3
7
作者 Hamed BOZORGI Ali JAFARI 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2017年第8期1108-1116,共9页
Content-based satellite image registration is a difficult issue in the fields of remote sensing and image processing. The difficulty is more significant in the case of matching multisource remote sensing images which ... Content-based satellite image registration is a difficult issue in the fields of remote sensing and image processing. The difficulty is more significant in the case of matching multisource remote sensing images which suffer from illumination, rotation, and source differences. The scale-invariant feature transform (SIFT) algorithm has been used successfully in satellite image registration problems. Also, many researchers have applied a local SIFT descriptor to improve the image retrieval process. Despite its robustness, this algorithm has some difficulties with the quality and quantity of the extracted local feature points in multisource remote sensing. Furthermore, high dimensionality of the local features extracted by SIFT results in time-consuming computational processes alongside high storage requirements for saving the relevant information, which are important factors in content-based image retrieval (CBIR) applications. In this paper, a novel method is introduced to transform the local SIFT features to global features for multisource remote sensing. The quality and quantity of SIFT local features have been enhanced by applying contrast equalization on images in a pre-processing stage. Considering the local features of each image in the reference database as a separate class, linear discriminant analysis (LDA) is used to transform the local features to global features while reducing di- mensionality of the feature space. This will also significantly reduce the computational time and storage required. Applying the trained kernel on verification data and mapping them showed a successful retrieval rate of 91.67% for test feature points. 展开更多
关键词 Content-based image retrieval feature point distribution Image registration Linear discriminant analysis REMOTESENSING scale-invariant feature transform
原文传递
Digital watermarking algorithm based on scale-invariant feature regions in non-subsampled contourlet transform domain 被引量:8
8
作者 Jian Zhao Na Zhang +1 位作者 Jian Jia Huanwei Wang 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2015年第6期1310-1315,共6页
Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy... Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy sub-band after NSCT is selected to embed watermark. The watermark is embedded into scaleinvariant feature transform (SIFT) regions. During embedding, the initial region is divided into some cirque sub-regions with the same area, and each watermark bit is embedded into one sub-region. Extensive simulation results and comparisons show that the algorithm gets a good trade-off of invisibility, robustness and capacity, thus obtaining good quality of the image while being able to effectively resist common image processing, and geometric and combo attacks, and normalized similarity is almost all reached. 展开更多
关键词 multi-scale geometric analysis (MGA) non-subsampled contourlet transform (NSCT) scale-invariant featureregion.
在线阅读 下载PDF
Point Cloud Classification Using Content-Based Transformer via Clustering in Feature Space 被引量:6
9
作者 Yahui Liu Bin Tian +2 位作者 Yisheng Lv Lingxi Li Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第1期231-239,共9页
Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention,but ignore their content and fail to est... Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention,but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space(content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency information in each branch separately. Extensive experiments show that our PointConT model achieves a remarkable performance on point cloud shape classification. Especially, our method exhibits 90.3% Top-1 accuracy on the hardest setting of ScanObjectN N. Source code of this paper is available at https://github.com/yahuiliu99/PointC onT. 展开更多
关键词 Content-based transformer deep learning feature aggregator local attention point cloud classification
在线阅读 下载PDF
Weak Fault Feature Extraction of the Rotating Machinery Using Flexible Analytic Wavelet Transform and Nonlinear Quantum Permutation Entropy 被引量:1
10
作者 Lili Bai Wenhui Li +3 位作者 He Ren Feng Li TaoYan Lirong Chen 《Computers, Materials & Continua》 SCIE EI 2024年第6期4513-4531,共19页
Addressing the challenges posed by the nonlinear and non-stationary vibrations in rotating machinery,where weak fault characteristic signals hinder accurate fault state representation,we propose a novel feature extrac... Addressing the challenges posed by the nonlinear and non-stationary vibrations in rotating machinery,where weak fault characteristic signals hinder accurate fault state representation,we propose a novel feature extraction method that combines the Flexible Analytic Wavelet Transform(FAWT)with Nonlinear Quantum Permutation Entropy.FAWT,leveraging fractional orders and arbitrary scaling and translation factors,exhibits superior translational invariance and adjustable fundamental oscillatory characteristics.This flexibility enables FAWT to provide well-suited wavelet shapes,effectively matching subtle fault components and avoiding performance degradation associated with fixed frequency partitioning and low-oscillation bases in detecting weak faults.In our approach,gearbox vibration signals undergo FAWT to obtain sub-bands.Quantum theory is then introduced into permutation entropy to propose Nonlinear Quantum Permutation Entropy,a feature that more accurately characterizes the operational state of vibration simulation signals.The nonlinear quantum permutation entropy extracted from sub-bands is utilized to characterize the operating state of rotating machinery.A comprehensive analysis of vibration signals from rolling bearings and gearboxes validates the feasibility of the proposed method.Comparative assessments with parameters derived from traditional permutation entropy,sample entropy,wavelet transform(WT),and empirical mode decomposition(EMD)underscore the superior effectiveness of this approach in fault detection and classification for rotating machinery. 展开更多
关键词 Rotating machinery quantum theory nonlinear quantum permutation entropy Flexible Analytic Wavelet transform(FAWT) feature extraction
在线阅读 下载PDF
Person Re-Identification Based on Spatial Feature Learning and Multi-Granularity Feature Fusion
11
作者 DIAO Zijian CAO Shuai +4 位作者 LI Wenwei LIANG Jianan WEN Guilin HUANG Weici ZHANG Shouming 《Journal of Shanghai Jiaotong university(Science)》 2025年第2期363-374,共12页
In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestri... In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestrian re-identification tasks,a person re-identification method combining spatial feature learning and multi-granularity feature fusion was proposed.First,an attention spatial transformation network(A-STN)is proposed to learn spatial features and solve the problem of misalignment of pedestrian spatial features.Then the network was divided into a global branch,a local coarse-grained fusion branch,and a local fine-grained fusion branch to extract pedestrian global features,coarse-grained fusion features,and fine-grained fusion features,respectively.Among them,the global branch enriches the global features by fusing different pooling features.The local coarse-grained fusion branch uses an overlay pooling to enhance each local feature while learning the correlation relationship between multi-granularity features.The local fine-grained fusion branch uses a differential pooling to obtain the differential features that were fused with global features to learn the relationship between pedestrian local features and pedestrian global features.Finally,the proposed method was compared on three public datasets:Market1501,DukeMTMC-ReID and CUHK03.The experimental results were better than those of the comparative methods,which verifies the effectiveness of the proposed method. 展开更多
关键词 pedestrian re-identification spatial features attention spatial transformation network multi-branch network relation features
原文传递
Double Self-Attention Based Fully Connected Feature Pyramid Network for Field Crop Pest Detection
12
作者 Zijun Gao Zheyi Li +2 位作者 Chunqi Zhang Ying Wang Jingwen Su 《Computers, Materials & Continua》 2025年第6期4353-4371,共19页
Pest detection techniques are helpful in reducing the frequency and scale of pest outbreaks;however,their application in the actual agricultural production process is still challenging owing to the problems of intersp... Pest detection techniques are helpful in reducing the frequency and scale of pest outbreaks;however,their application in the actual agricultural production process is still challenging owing to the problems of interspecies similarity,multi-scale,and background complexity of pests.To address these problems,this study proposes an FD-YOLO pest target detection model.The FD-YOLO model uses a Fully Connected Feature Pyramid Network(FC-FPN)instead of a PANet in the neck,which can adaptively fuse multi-scale information so that the model can retain small-scale target features in the deep layer,enhance large-scale target features in the shallow layer,and enhance the multiplexing of effective features.A dual self-attention module(DSA)is then embedded in the C3 module of the neck,which captures the dependencies between the information in both spatial and channel dimensions,effectively enhancing global features.We selected 16 types of pests that widely damage field crops in the IP102 pest dataset,which were used as our dataset after data supplementation and enhancement.The experimental results showed that FD-YOLO’s mAP@0.5 improved by 6.8%compared to YOLOv5,reaching 82.6%and 19.1%–5%better than other state-of-the-art models.This method provides an effective new approach for detecting similar or multiscale pests in field crops. 展开更多
关键词 Pest detection YOLOv5 feature pyramid network transformer attention module
在线阅读 下载PDF
TransSSA: Invariant Cue Perceptual Feature Focused Learning for Dynamic Fruit Target Detection
13
作者 Jianyin Tang Zhenglin Yu Changshun Shao 《Computers, Materials & Continua》 2025年第5期2829-2850,共22页
In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challe... In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challenges:firstly,the dynamic nature of the automatic picking process requires fruit target detection algorithms to adapt to multi-view characteristics,ensuring effective recognition of the same fruit from different perspectives.Secondly,fruits in natural environments often suffer from interference factors such as overlapping,occlusion,and illumination fluctuations,which increase the difficulty of image capture and recognition.To address these challenges,this study conducted an in-depth analysis of the key features in fruit recognition and discovered that the stem,body,and base serve as constant and core information in fruit identification,exhibiting long-term dependent semantic relationships during the recognition process.These invariant features provide a stable foundation for dynamic fruit recognition,contributing to improved recognition accuracy and robustness.Specifically,the morphology and position of the stem,body,and base are relatively fixed,and the effective extraction of these features plays a crucial role in fruit recognition.This paper proposes a novel model,TransSSA,and designs two innovative modules to effectively extract fruit image features.The Self-Attention Core Feature Extraction(SAF)module integrates YOLOV8 and Swin Transformer as backbone networks and introduces the Shuffle Attention self-attention mechanism,significantly enhancing the ability to extract core features.This module focuses on constant features such as the stem,body,and base,ensuring accurate fruit recognition in different environments.On the other hand,the Squeeze and Excitation Aggregation(SAE)module combines the network’s ability to capture channel patterns with global knowledge,further optimizing the extraction of effective features.Additionally,to improve detection accuracy,this studymodifies the regression loss function to EIOU.To validate the effectiveness of the TransSSA model,this study conducted extensive visualization analysis to support the interpretability of the SAF and SAE modules.Experimental results demonstrate that TransSSA achieves a performance of 91.3%on a tomato dataset,fully proving its innovative capabilities.Through this research,we provide amore effective solution for using fruit harvesting robots in complex environments. 展开更多
关键词 Fruit recognition invariant features TransSSA model swin transformer self-attention mechanism
在线阅读 下载PDF
Oversampling-Enhanced Feature Fusion-Based Hybrid ViT-1DCNN Model for Ransomware Cyber Attack Detection
14
作者 Muhammad Armghan Latif Zohaib Mushtaq +4 位作者 Saifur Rahman Saad Arif Salim Nasar Faraj Mursal Muhammad Irfan Haris Aziz 《Computer Modeling in Engineering & Sciences》 2025年第2期1667-1695,共29页
Ransomware attacks pose a significant threat to critical infrastructures,demanding robust detection mechanisms.This study introduces a hybrid model that combines vision transformer(ViT)and one-dimensional convolutiona... Ransomware attacks pose a significant threat to critical infrastructures,demanding robust detection mechanisms.This study introduces a hybrid model that combines vision transformer(ViT)and one-dimensional convolutional neural network(1DCNN)architectures to enhance ransomware detection capabilities.Addressing common challenges in ransomware detection,particularly dataset class imbalance,the synthetic minority oversampling technique(SMOTE)is employed to generate synthetic samples for minority class,thereby improving detection accuracy.The integration of ViT and 1DCNN through feature fusion enables the model to capture both global contextual and local sequential features,resulting in comprehensive ransomware classification.Tested on the UNSW-NB15 dataset,the proposed ViT-1DCNN model achieved 98%detection accuracy with precision,recall,and F1-score metrics surpassing conventional methods.This approach not only reduces false positives and negatives but also offers scalability and robustness for real-world cybersecurity applications.The results demonstrate the model’s potential as an effective tool for proactive ransomware detection,especially in environments where evolving threats require adaptable and high-accuracy solutions. 展开更多
关键词 Ransomware attacks CYBERSECURITY vision transformer convolutional neural network feature fusion ENCRYPTION threat detection
在线阅读 下载PDF
Triple-path feature transform network for ring-array photoacoustic tomography image reconstruction
15
作者 Lingyu Ma Zezheng Qin +1 位作者 Yiming Ma Mingjian Sun 《Journal of Innovative Optical Health Sciences》 SCIE EI CSCD 2024年第3期23-40,共18页
Photoacoustic imaging(PAI)is a noninvasive emerging imaging method based on the photoacoustic effect,which provides necessary assistance for medical diagnosis.It has the characteristics of large imaging depth and high... Photoacoustic imaging(PAI)is a noninvasive emerging imaging method based on the photoacoustic effect,which provides necessary assistance for medical diagnosis.It has the characteristics of large imaging depth and high contrast.However,limited by the equipment cost and reconstruction time requirements,the existing PAI systems distributed with annular array transducers are difficult to take into account both the image quality and the imaging speed.In this paper,a triple-path feature transform network(TFT-Net)for ring-array photoacoustic tomography is proposed to enhance the imaging quality from limited-view and sparse measurement data.Specifically,the network combines the raw photoacoustic pressure signals and conventional linear reconstruction images as input data,and takes the photoacoustic physical model as a prior information to guide the reconstruction process.In addition,to enhance the ability of extracting signal features,the residual block and squeeze and excitation block are introduced into the TFT-Net.For further efficient reconstruction,the final output of photoacoustic signals uses‘filter-then-upsample’operation with a pixel-shuffle multiplexer and a max out module.Experiment results on simulated and in-vivo data demonstrate that the constructed TFT-Net can restore the target boundary clearly,reduce background noise,and realize fast and high-quality photoacoustic image reconstruction of limited view with sparse sampling. 展开更多
关键词 Deep learning feature transformation image reconstruction limited-view measurement photoacoustic tomography.
原文传递
Olive Leaf Disease Detection via Wavelet Transform and Feature Fusion of Pre-Trained Deep Learning Models
16
作者 Mahmood A.Mahmood Khalaf Alsalem 《Computers, Materials & Continua》 SCIE EI 2024年第3期3431-3448,共18页
Olive trees are susceptible to a variety of diseases that can cause significant crop damage and economic losses.Early detection of these diseases is essential for effective management.We propose a novel transformed wa... Olive trees are susceptible to a variety of diseases that can cause significant crop damage and economic losses.Early detection of these diseases is essential for effective management.We propose a novel transformed wavelet,feature-fused,pre-trained deep learning model for detecting olive leaf diseases.The proposed model combines wavelet transforms with pre-trained deep-learning models to extract discriminative features from olive leaf images.The model has four main phases:preprocessing using data augmentation,three-level wavelet transformation,learning using pre-trained deep learning models,and a fused deep learning model.In the preprocessing phase,the image dataset is augmented using techniques such as resizing,rescaling,flipping,rotation,zooming,and contrasting.In wavelet transformation,the augmented images are decomposed into three frequency levels.Three pre-trained deep learning models,EfficientNet-B7,DenseNet-201,and ResNet-152-V2,are used in the learning phase.The models were trained using the approximate images of the third-level sub-band of the wavelet transform.In the fused phase,the fused model consists of a merge layer,three dense layers,and two dropout layers.The proposed model was evaluated using a dataset of images of healthy and infected olive leaves.It achieved an accuracy of 99.72%in the diagnosis of olive leaf diseases,which exceeds the accuracy of other methods reported in the literature.This finding suggests that our proposed method is a promising tool for the early detection of olive leaf diseases. 展开更多
关键词 Olive leaf diseases wavelet transform deep learning feature fusion
在线阅读 下载PDF
基于Transformer-FNN和无人机高光谱遥感技术的棉花黄萎病危害等级分类研究 被引量:1
17
作者 廖娟 梁业雄 +7 位作者 姜锐 邢赫 何欣颖 王辉 曾浩求 何松炜 唐赛欧 罗锡文 《农业机械学报》 北大核心 2025年第2期240-251,共12页
针对目前使用无人机识别棉花黄萎病危害等级时,光谱数据冗余度高和传统机器学习模型识别精度不足等问题,采用无人机搭载Nano-Hyperspec高光谱成像仪采集棉田高光谱图像,通过探究棉花冠层对不同黄萎病危害等级的光谱响应特征,利用最优植... 针对目前使用无人机识别棉花黄萎病危害等级时,光谱数据冗余度高和传统机器学习模型识别精度不足等问题,采用无人机搭载Nano-Hyperspec高光谱成像仪采集棉田高光谱图像,通过探究棉花冠层对不同黄萎病危害等级的光谱响应特征,利用最优植被指数组合建立一种适用于黄萎病危害等级分类的监测模型,实现棉花黄萎病危害等级的精准分类。首先,利用最小冗余最大相关算法(Minimum redundancy maximum relevance,mRMR)对17种潜在的植被指数和270个光谱波段进行特征重要性排序,将mRMR筛选得到的特征,通过逐步递增分组的方式输入至极限梯度提升模型(eXtreme gradient boosting,XGBoost),确定与黄萎病危害等级相关性最高的植被指数和光谱特征波段。然后,基于Transformer架构和前馈神经网络(Feedforward neural network,FNN)构建Transformer-FNN棉花黄萎病危害等级分类模型,将植被指数与光谱特征波段输入Transformer-FNN模型进行分类识别,对比了植被指数与光谱特征波段对棉花黄萎病危害等级分类识别的准确性。最后,利用后向传播神经网络(Back propagation neural network,BPNN)、Transformer和支持向量机(Support vector machine,SVM)构建棉花黄萎病危害等级分类模型,并对这4种分类模型进行精度验证与对比分析。结果表明:棉花黄萎病等级分类的最优植被指数组合为MSR和TVI,最优特征波段组合为430、439、488、566、697、722、742、764、769、782、822、831、858、873、878、893、909、985 nm。基于Transformer-FNN模型,植被指数对黄萎病危害等级的总体分类精度为95.6%,较光谱特征波段的总体分类精度89.4%提高6.2个百分点。基于植被指数,Transformer-FNN模型对黄萎病危害等级的分类识别率比BPNN模型提高11.2个百分点,比Transformer模型提高17.2个百分点,比SVM模型提高30.8个百分点。研究提出了一种通过植被指数进行棉花黄萎病高精度监测方法,可为大面积棉花黄萎病精确监测提供有效措施。 展开更多
关键词 棉花黄萎病 transformer-FNN 特征组合 mRMR-XGBoost 高光谱遥感 植被指数
在线阅读 下载PDF
多变量时序标记Transformer及其在电潜泵故障诊断中的应用 被引量:2
18
作者 李康 李爽 +2 位作者 高小永 李强 张来斌 《控制与决策》 北大核心 2025年第4期1145-1153,共9页
电潜泵故障诊断对于确保安全可靠采油至关重要,但是,电潜泵数据呈现出的多变量、非线性和动态变化等复杂特性为该任务带来了严峻挑战.近年来,深度学习在复杂数据特征提取方面表现出的强大能力催生了一系列基于神经网络的电潜泵故障诊断... 电潜泵故障诊断对于确保安全可靠采油至关重要,但是,电潜泵数据呈现出的多变量、非线性和动态变化等复杂特性为该任务带来了严峻挑战.近年来,深度学习在复杂数据特征提取方面表现出的强大能力催生了一系列基于神经网络的电潜泵故障诊断方法.然而,多数方法忽略了电潜泵数据的动态特性以及长时依赖特征提取困难的问题.针对上述问题,提出一种多变量时序标记Transformer神经网络来实现电潜泵故障诊断.该模型设计新的多变量时间序列标记策略,继承引入多头注意力机制和残差连接的传统Transformer神经网络编码器在长时依赖特征提取方面的优势,用前向神经网络替代传统Transformer神经网络解码器来简化模型复杂度.通过对油田现场故障数据分析,验证所提出方法的有效性.实验结果表明,所提出方法实现了10类电潜泵故障的精确诊断,相比于流行的深度学习方法诊断性能更优. 展开更多
关键词 电潜泵 transformer神经网络 深度学习 特征提取 故障诊断 多变量时序标记
原文传递
基于时序二维变换和多尺度Transformer的电能质量扰动分类方法 被引量:1
19
作者 王守相 李慧强 +3 位作者 赵倩宇 郭陆阳 王同勋 王洋 《电力系统自动化》 北大核心 2025年第7期198-207,共10页
随着新能源渗透率的不断提高,电网面临的电能质量扰动(PQD)问题变得更加复杂,基于一维PQD信号的传统分类方法难以同时提取并辨识周期性与趋势性扰动。针对此问题,提出了一种基于时序二维变换和多尺度Transformer的PQD分类方法。首先,利... 随着新能源渗透率的不断提高,电网面临的电能质量扰动(PQD)问题变得更加复杂,基于一维PQD信号的传统分类方法难以同时提取并辨识周期性与趋势性扰动。针对此问题,提出了一种基于时序二维变换和多尺度Transformer的PQD分类方法。首先,利用时序二维变换将一维PQD时间序列转换为一组基于多个周期的二维张量,以实现在二维空间中深入挖掘PQD信号中所包含的特征信息。然后,通过多尺度Transformer编码器模块提取PQD信号的多尺度特征图,利用多尺度Transformer解码器模块对多尺度特征图进行拼接和特征融合,有效合并在不同尺度上提取的特征图。最后,通过全连接层和Softmax分类器完成PQD分类任务。为验证所提方法的有效性,建立了含24种PQD的数据集对模型进行测试,结果表明所提方法对PQD信号具有较高的分类准确率和噪声鲁棒性。 展开更多
关键词 电能质量 扰动 分类 时序二维变换 多尺度transformer 特征提取 特征融合
在线阅读 下载PDF
基于改进Swin Transformer的人脸活体检测 被引量:2
20
作者 王旭光 卜辰宇 时泽宇 《中国测试》 北大核心 2025年第6期31-39,共9页
随着人脸识别技术的发展,人脸活体检测作为人脸识别系统的安全保障变得更加重要。但当前主流的人脸活体检测模型仅针对特定的检测场景及欺诈攻击方式,面对未知攻击的鲁棒性和泛化能力较差。为此,该文提出一种改进的Swin Transformer模型... 随着人脸识别技术的发展,人脸活体检测作为人脸识别系统的安全保障变得更加重要。但当前主流的人脸活体检测模型仅针对特定的检测场景及欺诈攻击方式,面对未知攻击的鲁棒性和泛化能力较差。为此,该文提出一种改进的Swin Transformer模型,即CDCSwin-T(central difference convolution Swin Transformer)模型。该模型以Swin Transformer为主干,利用其滑动窗口注意力机制提取人脸全局信息,同时引入中心差分卷积(central difference convolution,CDC)模块提取人脸局部信息,加强主干模型捕获真假人脸差异的能力,从而增强其面对未知攻击的鲁棒性;另外在主干模型中引入瓶颈注意力模块,引导模型关注人脸关键信息,加速模型训练;最终将主干模型不同阶段的多尺度信息进行自适应融合,进一步提升该文模型的泛化能力。CDCSwin-T模型在OULU-NPU数据集4个协议上的平均分类错误率(ACER)分别为0.2%,1.1%,(1.1±0.6)%,(2.8±1.4)%,在CASIA-MFSD和REPLAYATTACK数据集跨库测试上的半错误率(HTER)分别为14.1%,22.9%,均优于当前的主流模型,表明其面对未知攻击的鲁棒性和泛化能力均有所提升。 展开更多
关键词 人脸活体检测 Swin transformer 瓶颈注意力模块 特征融合
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部