A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low freq...A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy.展开更多
A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segme...A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segment lung fields from chest radiographs. The modified SIFT local descriptor, more distinctive than the general intensity and gradient features, is used to characterize the image features in the vicinity of each pixel at each resolution level during the segmentation optimization procedure. Experimental results show that the proposed method is more robust and accurate than the original ASMs in terms of an average overlap percentage and average contour distance in segmenting the lung fields from an available public database.展开更多
In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant featur...In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant feature transform (SIFT). First,the distance between SIFT features is included in the equations of the proximity matrix to measure the similarity between two feature points; then the normalized cross correlation (NCC) used in Scott s method,which has been modified with adaptive scale and orientation,...展开更多
On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits o...On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits of using an SIFT algorithm for target classification are discussed.Secondly,the scales of SIFT descriptors are sorted by descending as SIFT-SS,which is sent to a support vector machine(SVM) with radial based function(RBF) kernel in order to train SVM classifier,which will be used for achieving target classification.Experimental results indicate that the SIFT-SS algorithm is efficient for target classification and can obtain a higher recognition rate than affine moment invariants(AMI) and multi-scale auto-convolution(MSA) in some complex situations,such as the situation with the existence of noises and occlusions.Moreover,the computational time of SIFT-SS is shorter than MSA and longer than AMI.展开更多
In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challe...In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challenges:firstly,the dynamic nature of the automatic picking process requires fruit target detection algorithms to adapt to multi-view characteristics,ensuring effective recognition of the same fruit from different perspectives.Secondly,fruits in natural environments often suffer from interference factors such as overlapping,occlusion,and illumination fluctuations,which increase the difficulty of image capture and recognition.To address these challenges,this study conducted an in-depth analysis of the key features in fruit recognition and discovered that the stem,body,and base serve as constant and core information in fruit identification,exhibiting long-term dependent semantic relationships during the recognition process.These invariant features provide a stable foundation for dynamic fruit recognition,contributing to improved recognition accuracy and robustness.Specifically,the morphology and position of the stem,body,and base are relatively fixed,and the effective extraction of these features plays a crucial role in fruit recognition.This paper proposes a novel model,TransSSA,and designs two innovative modules to effectively extract fruit image features.The Self-Attention Core Feature Extraction(SAF)module integrates YOLOV8 and Swin Transformer as backbone networks and introduces the Shuffle Attention self-attention mechanism,significantly enhancing the ability to extract core features.This module focuses on constant features such as the stem,body,and base,ensuring accurate fruit recognition in different environments.On the other hand,the Squeeze and Excitation Aggregation(SAE)module combines the network’s ability to capture channel patterns with global knowledge,further optimizing the extraction of effective features.Additionally,to improve detection accuracy,this studymodifies the regression loss function to EIOU.To validate the effectiveness of the TransSSA model,this study conducted extensive visualization analysis to support the interpretability of the SAF and SAE modules.Experimental results demonstrate that TransSSA achieves a performance of 91.3%on a tomato dataset,fully proving its innovative capabilities.Through this research,we provide amore effective solution for using fruit harvesting robots in complex environments.展开更多
Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (...Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals.展开更多
Discussions regarding the functional transformation of agricultural utilization and the mechanisms that underlie these changes within the Three Gorges Reservoir Area(TGRA)reflect variati ons in the relati on ship betw...Discussions regarding the functional transformation of agricultural utilization and the mechanisms that underlie these changes within the Three Gorges Reservoir Area(TGRA)reflect variati ons in the relati on ship betwee n people and their environme nt in China's central and wester ns part,an area of mountains and reservoirs.A clear understa nding of these changes also provides the scientific basis for the development of multi-functional agriculture in typical mountainous areas.Five counties were selected for analysis in this study from the hinterland of the TGRA;we analyzed changes in farmland scaling and corresponding under?lying mechanisms by defining the concepts of“Scaling Farmland”(SF)and by using the software packages ArcGIS10.2,SPSS,and Geographical Detectors.The results of this analysis show that sources of increased SF have mainly comprised cultivated and shrub land.In deed,with the excepti on of some alpine off-season vegetables,SF growth has mainly occurred in low altitude areas and in places where the slope is less than 30°.We also show that spatial changes in various SF types have also been substantially different,but in all cases are closely related to road and township administrative centers.Natural factors at the patch level,including elevation and slope,have contributed significantly to SF,while at the township level,underlying socioeconomic and humanistic factors have tended to include road traffic and agricultural population density.In contrast,at the region al level,underlying driving forces within each have tended to be more significant than overall study area scale.We show that while changes in,and the development of,SF have been driven by numerous factors,agri?cultural policies have always been amongst the most important.The results clearly elucidate general land use transformation patter ns within the mountain regi ons of western China.展开更多
Image registration is an indispensable component in multi-source remote sensing image processing. In this paper, we put forward a remote sensing image registration method by including an improved multi-scale and multi...Image registration is an indispensable component in multi-source remote sensing image processing. In this paper, we put forward a remote sensing image registration method by including an improved multi-scale and multi-direction Harris algorithm and a novel compound feature. Multi-scale circle Gaussian combined invariant moments and multi-direction gray level co-occurrence matrix are extracted as features for image matching. The proposed algorithm is evaluated on numerous multi-source remote sensor images with noise and illumination changes. Extensive experimental studies prove that our proposed method is capable of receiving stable and even distribution of key points as well as obtaining robust and accurate correspondence matches. It is a promising scheme in multi-source remote sensing image registration.展开更多
Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combin...Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios.展开更多
Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D mes...Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D meshes. After preprocessing, shape index extrema on the 3D facial surface are selected as keypoints in the difference scale space and the unstable keypoints are removed after two screening steps. Then, a local coordinate system for each keypoint is established by principal component analysis(PCA).Next, two local geometric features are extracted around each keypoint through the local coordinate system. Additionally, the features are augmented by the symmetrization according to the approximate left-right symmetry in human face. The proposed method is evaluated on the Bosphorus, BU-3DFE, and Gavab databases, respectively. Good results are achieved on these three datasets. As a result, the proposed method proves robust to facial expression variations, partial external occlusions and large pose changes.展开更多
将Bag of Features模型结合OpenCV开源图像库提取害虫图像的特征,然后用Kmedoids算法对其进行聚类,生成关键字,最后用AdaBoosting算法构建分类器,实验采用Pascal Voc图像库中的数据进行训练和测试,实验表明,该算法分类精度高、特征提取...将Bag of Features模型结合OpenCV开源图像库提取害虫图像的特征,然后用Kmedoids算法对其进行聚类,生成关键字,最后用AdaBoosting算法构建分类器,实验采用Pascal Voc图像库中的数据进行训练和测试,实验表明,该算法分类精度高、特征提取速度和分类速度也比较快。展开更多
基金supported by the National Natural Science Foundation of China (6117212711071002)+1 种基金the Specialized Research Fund for the Doctoral Program of Higher Education (20113401110006)the Innovative Research Team of 211 Project in Anhui University (KJTD007A)
文摘A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy.
基金The National Natural Science Foundation of China(No60271033)
文摘A new active shape models (ASMs) was presented, which is driven by scale invariant feature transform (SIFT) local descriptor instead of normalizing first order derivative profiles in the original formulation, to segment lung fields from chest radiographs. The modified SIFT local descriptor, more distinctive than the general intensity and gradient features, is used to characterize the image features in the vicinity of each pixel at each resolution level during the segmentation optimization procedure. Experimental results show that the proposed method is more robust and accurate than the original ASMs in terms of an average overlap percentage and average contour distance in segmenting the lung fields from an available public database.
基金National High-tech Research and Development Program (2007AA01Z314)National Natural Science Foundation of China (60873085)
文摘In order to obtain a large number of correct matches with high accuracy,this article proposes a robust wide baseline point matching method,which is based on Scott s proximity matrix and uses the scale invariant feature transform (SIFT). First,the distance between SIFT features is included in the equations of the proximity matrix to measure the similarity between two feature points; then the normalized cross correlation (NCC) used in Scott s method,which has been modified with adaptive scale and orientation,...
基金supported by the National High Technology Research and Development Program (863 Program) (2010AA7080302)
文摘On the basis of scale invariant feature transform(SIFT) descriptors,a novel kind of local invariants based on SIFT sequence scale(SIFT-SS) is proposed and applied to target classification.First of all,the merits of using an SIFT algorithm for target classification are discussed.Secondly,the scales of SIFT descriptors are sorted by descending as SIFT-SS,which is sent to a support vector machine(SVM) with radial based function(RBF) kernel in order to train SVM classifier,which will be used for achieving target classification.Experimental results indicate that the SIFT-SS algorithm is efficient for target classification and can obtain a higher recognition rate than affine moment invariants(AMI) and multi-scale auto-convolution(MSA) in some complex situations,such as the situation with the existence of noises and occlusions.Moreover,the computational time of SIFT-SS is shorter than MSA and longer than AMI.
基金supported in part by the Basic Research Project of Science and Technology Department of Jilin Province,China(Grant No.202002044JC).
文摘In the field of automated fruit harvesting,precise and efficient fruit target recognition and localization play a pivotal role in enhancing the efficiency of harvesting robots.However,this domain faces two core challenges:firstly,the dynamic nature of the automatic picking process requires fruit target detection algorithms to adapt to multi-view characteristics,ensuring effective recognition of the same fruit from different perspectives.Secondly,fruits in natural environments often suffer from interference factors such as overlapping,occlusion,and illumination fluctuations,which increase the difficulty of image capture and recognition.To address these challenges,this study conducted an in-depth analysis of the key features in fruit recognition and discovered that the stem,body,and base serve as constant and core information in fruit identification,exhibiting long-term dependent semantic relationships during the recognition process.These invariant features provide a stable foundation for dynamic fruit recognition,contributing to improved recognition accuracy and robustness.Specifically,the morphology and position of the stem,body,and base are relatively fixed,and the effective extraction of these features plays a crucial role in fruit recognition.This paper proposes a novel model,TransSSA,and designs two innovative modules to effectively extract fruit image features.The Self-Attention Core Feature Extraction(SAF)module integrates YOLOV8 and Swin Transformer as backbone networks and introduces the Shuffle Attention self-attention mechanism,significantly enhancing the ability to extract core features.This module focuses on constant features such as the stem,body,and base,ensuring accurate fruit recognition in different environments.On the other hand,the Squeeze and Excitation Aggregation(SAE)module combines the network’s ability to capture channel patterns with global knowledge,further optimizing the extraction of effective features.Additionally,to improve detection accuracy,this studymodifies the regression loss function to EIOU.To validate the effectiveness of the TransSSA model,this study conducted extensive visualization analysis to support the interpretability of the SAF and SAE modules.Experimental results demonstrate that TransSSA achieves a performance of 91.3%on a tomato dataset,fully proving its innovative capabilities.Through this research,we provide amore effective solution for using fruit harvesting robots in complex environments.
文摘Feature extraction of signals plays an important role in classification problems because of data dimension reduction property and potential improvement of a classification accuracy rate. Principal component analysis (PCA), wavelets transform or Fourier transform methods are often used for feature extraction. In this paper, we propose a multi-scale PCA, which combines discrete wavelet transform, and PCA for feature extraction of signals in both the spatial and temporal domains. Our study shows that the multi-scale PCA combined with the proposed new classification methods leads to high classification accuracy for the considered signals.
基金Key Basic Science and Cutting Edge Technology Research Plan of Chongqing,No.cstc2015jcyjBX0128National Natural Science Foundation of China,No.41261045Chongqing Normal University Graduate Student Research Innovation Project,No.YKC18033.
文摘Discussions regarding the functional transformation of agricultural utilization and the mechanisms that underlie these changes within the Three Gorges Reservoir Area(TGRA)reflect variati ons in the relati on ship betwee n people and their environme nt in China's central and wester ns part,an area of mountains and reservoirs.A clear understa nding of these changes also provides the scientific basis for the development of multi-functional agriculture in typical mountainous areas.Five counties were selected for analysis in this study from the hinterland of the TGRA;we analyzed changes in farmland scaling and corresponding under?lying mechanisms by defining the concepts of“Scaling Farmland”(SF)and by using the software packages ArcGIS10.2,SPSS,and Geographical Detectors.The results of this analysis show that sources of increased SF have mainly comprised cultivated and shrub land.In deed,with the excepti on of some alpine off-season vegetables,SF growth has mainly occurred in low altitude areas and in places where the slope is less than 30°.We also show that spatial changes in various SF types have also been substantially different,but in all cases are closely related to road and township administrative centers.Natural factors at the patch level,including elevation and slope,have contributed significantly to SF,while at the township level,underlying socioeconomic and humanistic factors have tended to include road traffic and agricultural population density.In contrast,at the region al level,underlying driving forces within each have tended to be more significant than overall study area scale.We show that while changes in,and the development of,SF have been driven by numerous factors,agri?cultural policies have always been amongst the most important.The results clearly elucidate general land use transformation patter ns within the mountain regi ons of western China.
基金supported by National Nature Science Foundation of China (Nos. 61462046 and 61762052)Natural Science Foundation of Jiangxi Province (Nos. 20161BAB202049 and 20161BAB204172)+2 种基金the Bidding Project of the Key Laboratory of Watershed Ecology and Geographical Environment Monitoring, NASG (Nos. WE2016003, WE2016013 and WE2016015)the Science and Technology Research Projects of Jiangxi Province Education Department (Nos. GJJ160741, GJJ170632 and GJJ170633)the Art Planning Project of Jiangxi Province (Nos. YG2016250 and YG2017381)
文摘Image registration is an indispensable component in multi-source remote sensing image processing. In this paper, we put forward a remote sensing image registration method by including an improved multi-scale and multi-direction Harris algorithm and a novel compound feature. Multi-scale circle Gaussian combined invariant moments and multi-direction gray level co-occurrence matrix are extracted as features for image matching. The proposed algorithm is evaluated on numerous multi-source remote sensor images with noise and illumination changes. Extensive experimental studies prove that our proposed method is capable of receiving stable and even distribution of key points as well as obtaining robust and accurate correspondence matches. It is a promising scheme in multi-source remote sensing image registration.
基金supported by National Natural Science Foundation of China(No.61103123)Scientific Research Foundation for the Returned Overseas Chinese Scholars,State Education Ministry
文摘Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios.
基金Project(XDA06020300)supported by the"Strategic Priority Research Program"of the Chinese Academy of SciencesProject(12511501700)supported by the Research on the Key Technology of Internet of Things for Urban Community Safety Based on Video Sensor networks
文摘Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D meshes. After preprocessing, shape index extrema on the 3D facial surface are selected as keypoints in the difference scale space and the unstable keypoints are removed after two screening steps. Then, a local coordinate system for each keypoint is established by principal component analysis(PCA).Next, two local geometric features are extracted around each keypoint through the local coordinate system. Additionally, the features are augmented by the symmetrization according to the approximate left-right symmetry in human face. The proposed method is evaluated on the Bosphorus, BU-3DFE, and Gavab databases, respectively. Good results are achieved on these three datasets. As a result, the proposed method proves robust to facial expression variations, partial external occlusions and large pose changes.