A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low freq...A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy.展开更多
A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segme...A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segment lung fields from chest radiographs. The modified SIFT local descriptor, more distinctive than the general intensity and gradient features, is used to characterize the image features in the vicinity of each pixel at each resolution level during the segmentation optimization procedure. Experimental results show that the proposed method is more robust and accurate than the original ASMs in terms of an average overlap percentage and average contour distance in segmenting the lung fields from an available public database.展开更多
In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant featur...In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant feature transform (SIFT). First,the distance between SIFT features is included in the equations of the proximity matrix to measure the similarity between two feature points; then the normalized cross correlation (NCC) used in Scott s method,which has been modified with adaptive scale and orientation,...展开更多
On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits o...On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits of using an SIFT algorithm for target classification are discussed.Secondly,the scales of SIFT descriptors are sorted by descending as SIFT-SS,which is sent to a support vector machine(SVM) with radial based function(RBF) kernel in order to train SVM classifier,which will be used for achieving target classification.Experimental results indicate that the SIFT-SS algorithm is efficient for target classification and can obtain a higher recognition rate than affine moment invariants(AMI) and multi-scale auto-convolution(MSA) in some complex situations,such as the situation with the existence of noises and occlusions.Moreover,the computational time of SIFT-SS is shorter than MSA and longer than AMI.展开更多
In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challe...In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challenges:firstly,the dynamic nature of the automatic picking process requires fruit target detection algorithms to adapt to multi-view characteristics,ensuring effective recognition of the same fruit from different perspectives.Secondly,fruits in natural environments often suffer from interference factors such as overlapping,occlusion,and illumination fluctuations,which increase the difficulty of image capture and recognition.To address these challenges,this study conducted an in-depth analysis of the key features in fruit recognition and discovered that the stem,body,and base serve as constant and core information in fruit identification,exhibiting long-term dependent semantic relationships during the recognition process.These invariant features provide a stable foundation for dynamic fruit recognition,contributing to improved recognition accuracy and robustness.Specifically,the morphology and position of the stem,body,and base are relatively fixed,and the effective extraction of these features plays a crucial role in fruit recognition.This paper proposes a novel model,TransSSA,and designs two innovative modules to effectively extract fruit image features.The Self-Attention Core Feature Extraction(SAF)module integrates YOLOV8 and Swin Transformer as backbone networks and introduces the Shuffle Attention self-attention mechanism,significantly enhancing the ability to extract core features.This module focuses on constant features such as the stem,body,and base,ensuring accurate fruit recognition in different environments.On the other hand,the Squeeze and Excitation Aggregation(SAE)module combines the network’s ability to capture channel patterns with global knowledge,further optimizing the extraction of effective features.Additionally,to improve detection accuracy,this studymodifies the regression loss function to EIOU.To validate the effectiveness of the TransSSA model,this study conducted extensive visualization analysis to support the interpretability of the SAF and SAE modules.Experimental results demonstrate that TransSSA achieves a performance of 91.3%on a tomato dataset,fully proving its innovative capabilities.Through this research,we provide amore effective solution for using fruit harvesting robots in complex environments.展开更多
Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (...Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals.展开更多
针对现有微表情识别方法存在多尺度特征提取能力不足、区域协同关系建模不充分及计算复杂度较高等缺点,提出结合空间通道特征与图注意力的分层Transformer微表情识别方法(Hierarchical Transformer for Micro-Expression Recognition wi...针对现有微表情识别方法存在多尺度特征提取能力不足、区域协同关系建模不充分及计算复杂度较高等缺点,提出结合空间通道特征与图注意力的分层Transformer微表情识别方法(Hierarchical Transformer for Micro-Expression Recognition with Spatial-Channel Features and Graph Attention,HT-SCGA).首先,设计多尺度动态窗口模块,通过自适应窗口扩展实现从局部到全局的特征层次化提取.然后,设计双域特征关联模块,在空间维度与通道维度建模细粒度依赖关系,有效提升特征表达能力并降低计算复杂度.最后,构建图注意力聚合模块,显式建模面部关键区域间的语义依赖,增强面部动作单元的联动特征.在多个数据集上的实验表明,HT-SCGA性能较优,由此表明其在微表情识别任务中的有效性与高效性.展开更多
基金supported by the National Natural Science Foundation of China (6117212711071002)+1 种基金the Specialized Research Fund for the Doctoral Program of Higher Education (20113401110006)the Innovative Research Team of 211 Project in Anhui University (KJTD007A)
文摘A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy.
基金The National Natural Science Foundation of China(No60271033)
文摘A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segment lung fields from chest radiographs. The modified SIFT local descriptor, more distinctive than the general intensity and gradient features, is used to characterize the image features in the vicinity of each pixel at each resolution level during the segmentation optimization procedure. Experimental results show that the proposed method is more robust and accurate than the original ASMs in terms of an average overlap percentage and average contour distance in segmenting the lung fields from an available public database.
基金National High-tech Research and Development Program (2007AA01Z314)National Natural Science Foundation of China (60873085)
文摘In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant feature transform (SIFT). First,the distance between SIFT features is included in the equations of the proximity matrix to measure the similarity between two feature points; then the normalized cross correlation (NCC) used in Scott s method,which has been modified with adaptive scale and orientation,...
基金supported by the National High Technology Research and Development Program (863 Program) (2010AA7080302)
文摘On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits of using an SIFT algorithm for target classification are discussed.Secondly,the scales of SIFT descriptors are sorted by descending as SIFT-SS,which is sent to a support vector machine(SVM) with radial based function(RBF) kernel in order to train SVM classifier,which will be used for achieving target classification.Experimental results indicate that the SIFT-SS algorithm is efficient for target classification and can obtain a higher recognition rate than affine moment invariants(AMI) and multi-scale auto-convolution(MSA) in some complex situations,such as the situation with the existence of noises and occlusions.Moreover,the computational time of SIFT-SS is shorter than MSA and longer than AMI.
基金supported in part by the Basic Research Project of Science and Technology Department of Jilin Province,China(Grant No.202002044JC).
文摘In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challenges:firstly,the dynamic nature of the automatic picking process requires fruit target detection algorithms to adapt to multi-view characteristics,ensuring effective recognition of the same fruit from different perspectives.Secondly,fruits in natural environments often suffer from interference factors such as overlapping,occlusion,and illumination fluctuations,which increase the difficulty of image capture and recognition.To address these challenges,this study conducted an in-depth analysis of the key features in fruit recognition and discovered that the stem,body,and base serve as constant and core information in fruit identification,exhibiting long-term dependent semantic relationships during the recognition process.These invariant features provide a stable foundation for dynamic fruit recognition,contributing to improved recognition accuracy and robustness.Specifically,the morphology and position of the stem,body,and base are relatively fixed,and the effective extraction of these features plays a crucial role in fruit recognition.This paper proposes a novel model,TransSSA,and designs two innovative modules to effectively extract fruit image features.The Self-Attention Core Feature Extraction(SAF)module integrates YOLOV8 and Swin Transformer as backbone networks and introduces the Shuffle Attention self-attention mechanism,significantly enhancing the ability to extract core features.This module focuses on constant features such as the stem,body,and base,ensuring accurate fruit recognition in different environments.On the other hand,the Squeeze and Excitation Aggregation(SAE)module combines the network’s ability to capture channel patterns with global knowledge,further optimizing the extraction of effective features.Additionally,to improve detection accuracy,this studymodifies the regression loss function to EIOU.To validate the effectiveness of the TransSSA model,this study conducted extensive visualization analysis to support the interpretability of the SAF and SAE modules.Experimental results demonstrate that TransSSA achieves a performance of 91.3%on a tomato dataset,fully proving its innovative capabilities.Through this research,we provide amore effective solution for using fruit harvesting robots in complex environments.
文摘Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals.
文摘针对现有微表情识别方法存在多尺度特征提取能力不足、区域协同关系建模不充分及计算复杂度较高等缺点,提出结合空间通道特征与图注意力的分层Transformer微表情识别方法(Hierarchical Transformer for Micro-Expression Recognition with Spatial-Channel Features and Graph Attention,HT-SCGA).首先,设计多尺度动态窗口模块,通过自适应窗口扩展实现从局部到全局的特征层次化提取.然后,设计双域特征关联模块,在空间维度与通道维度建模细粒度依赖关系,有效提升特征表达能力并降低计算复杂度.最后,构建图注意力聚合模块,显式建模面部关键区域间的语义依赖,增强面部动作单元的联动特征.在多个数据集上的实验表明,HT-SCGA性能较优,由此表明其在微表情识别任务中的有效性与高效性.