Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy...Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy sub-band after NSCT is selected to embed watermark. The watermark is embedded into scaleinvariant feature transform (SIFT) regions. During embedding, the initial region is divided into some cirque sub-regions with the same area, and each watermark bit is embedded into one sub-region. Extensive simulation results and comparisons show that the algorithm gets a good trade-off of invisibility, robustness and capacity, thus obtaining good quality of the image while being able to effectively resist common image processing, and geometric and combo attacks, and normalized similarity is almost all reached.展开更多
A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low freq...A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy.展开更多
In this paper, we proposed a registration method by combining the morphological component analysis(MCA) and scale-invariant feature transform(SIFT) algorithm. This method uses the perception dictionaries,and combines ...In this paper, we proposed a registration method by combining the morphological component analysis(MCA) and scale-invariant feature transform(SIFT) algorithm. This method uses the perception dictionaries,and combines the Basis-Pursuit algorithm and the Total-Variation regularization scheme to extract the cartoon part containing basic geometrical information from the original image, and is stable and unsusceptible to noise interference. Then a smaller number of the distinctive key points will be obtained by using the SIFT algorithm based on the cartoon part of the original image. Matching the key points by the constrained Euclidean distance,we will obtain a more correct and robust matching result. The experimental results show that the geometrical transform parameters inferred by the matched key points based on MCA+SIFT registration method are more exact than the ones based on the direct SIFT algorithm.展开更多
To meet the needs in the fundus examination,including outlook widening,pathology tracking,etc.,this paper describes a robust feature-based method for fully-automatic mosaic of the curved human retinal images photograp...To meet the needs in the fundus examination,including outlook widening,pathology tracking,etc.,this paper describes a robust feature-based method for fully-automatic mosaic of the curved human retinal images photographed by a fundus microscope. The kernel of this new algorithm is the scale-,rotation-and illumination-invariant interest point detector & feature descriptor-Scale-Invariant Feature Transform. When matched interest points according to second-nearest-neighbor strategy,the parameters of the model are estimated using the correct matches of the interest points,extracted by a new inlier identification scheme based on Sampson distance from putative sets. In order to preserve image features,bilinear warping and multi-band blending techniques are used to create panoramic retinal images. Experiments show that the proposed method works well with rejection error in 0.3 pixels,even for those cases where the retinal images without discernable vascular structure in contrast to the state-of-the-art algorithms.展开更多
Phase space can be constructed for N equal and distinguishable binary subsystems which are correlated in a scale-invariant manner. In the paper, correlation coefficient and reduced probability are introduced to charac...Phase space can be constructed for N equal and distinguishable binary subsystems which are correlated in a scale-invariant manner. In the paper, correlation coefficient and reduced probability are introduced to characterize the scale-invariant correlated binary subsystems. Probabilistic sets for the correlated binary subsystems satisfy Leibnitz triangle rule in the sense that the marginal probabilities of N-system are equal to the joint probabilities of the (N - 1)-system. For entropic index q ≠ 1, nonextensive entropy Sq is shown to be additive in the scale-invariant occupation of phase space.展开更多
Content-based satellite image registration is a difficult issue in the fields of remote sensing and image processing. The difficulty is more significant in the case of matching multisource remote sensing images which ...Content-based satellite image registration is a difficult issue in the fields of remote sensing and image processing. The difficulty is more significant in the case of matching multisource remote sensing images which suffer from illumination, rotation, and source differences. The scale-invariant feature transform (SIFT) algorithm has been used successfully in satellite image registration problems. Also, many researchers have applied a local SIFT descriptor to improve the image retrieval process. Despite its robustness, this algorithm has some difficulties with the quality and quantity of the extracted local feature points in multisource remote sensing. Furthermore, high dimensionality of the local features extracted by SIFT results in time-consuming computational processes alongside high storage requirements for saving the relevant information, which are important factors in content-based image retrieval (CBIR) applications. In this paper, a novel method is introduced to transform the local SIFT features to global features for multisource remote sensing. The quality and quantity of SIFT local features have been enhanced by applying contrast equalization on images in a pre-processing stage. Considering the local features of each image in the reference database as a separate class, linear discriminant analysis (LDA) is used to transform the local features to global features while reducing di- mensionality of the feature space. This will also significantly reduce the computational time and storage required. Applying the trained kernel on verification data and mapping them showed a successful retrieval rate of 91.67% for test feature points.展开更多
Fractional diffusion equation for transport phenomena in fractal media is an integro-partial differential equation. For solving the problem of this kind of equation the invariants of the group of scaling transformatio...Fractional diffusion equation for transport phenomena in fractal media is an integro-partial differential equation. For solving the problem of this kind of equation the invariants of the group of scaling transformations are given and then used for deriving the integro-ordinary differential equation for the scale-invariant solution. With the help of Mellin transform the scale-invariant solution is obtained in terms of Fox functions. And the series and asymptotic representations for the solution are considered.展开更多
In this article,a robust,effective,and scale-invariant weighted compact nonlinear scheme(WCNS)is proposed by introducing descaling techniques to the nonlinear weights of the WCNS-Z/D schemes.The new scheme achieves an...In this article,a robust,effective,and scale-invariant weighted compact nonlinear scheme(WCNS)is proposed by introducing descaling techniques to the nonlinear weights of the WCNS-Z/D schemes.The new scheme achieves an essentially non-oscillatory approximation of a discontinuous function(ENO-property),a scaleinvariant property with an arbitrary scale of a function(Si-property),and an optimal order of accuracy with smooth function regardless of the critical point(Cp-property).The classical WCNS-Z/D schemes do not satisfy Si-property intrinsically,which is caused by a loss of sub-stencils’adaptivity in the nonlinear interpolation of a discontinuous function when scaled by a small scale factor.A new nonlinear weight is devised by using an average of the function values and the descaling function,providing the new WCNS schemes(WCNS-Zm/Dm)with many attractive properties.The ENO-property,Si-property and Cp-property of the new WCNS schemes are validated numerically.Results show that the WCNS-Zm/Dm schemes satisfy the ENO-property and Si-property,while only the WCNS-Dm scheme satisfies the Cp-property.In addition,the Gaussian wave problem is solved by using successively refined grids to verify that the optimal order of accuracy of the new schemes can be achieved.Several one-dimensional shock tube problems,and two-dimensional double Mach reflection(DMR)problem and the Riemann IVP problem are simulated to illustrate the ENOproperty and Si-property of the scale-invariant WCNS-Zm/Dm schemes.展开更多
Natural earthquakes and micro-seismicity resulting from hydraulic fracturing or other engineering practices display distinctively different spatial-temporal features,like mixed burst-and swarm-like features or predomi...Natural earthquakes and micro-seismicity resulting from hydraulic fracturing or other engineering practices display distinctively different spatial-temporal features,like mixed burst-and swarm-like features or predominantly swarm-like features.The mechanism(s)contributing to such observations can be diverse.We present the inspections on the dynamic formation process of the single swarm-like tree in laboratory acoustic emission(AE)catalogs.Such largest swarm-like trees can contain>97%AE events from the entire catalog within a test;and all catalogs under investigation display scale-invariance features.The formation of the largest swarm-like tree correlates with the rock fracture process analogue of the source pervasive process,where its AE releases exhibit significant spatial well-organization.Comparison to other laboratory catalogs under different laboratory settings helps us identify the spatial continuity of the rock fracture process as the primary factor in forming the largest swarm-like trees at laboratory scale.The stress transfer process is involved in the rock fracture process for the tests having pre-existing spatial discontinuity.Artificial perturbations on the spatial information induced by the stress transfer process further confirm that stress transfer also serves to shift the pure swarm-like catalog into a mixed burst-and swarm-like catalog.These laboratory observations may provide inspirational insights for understanding the field-scale mechanism(s)shaping the spatial-temporal energy release features.展开更多
The scale-invariance behavior has been widely observed in English or other phonetic language texts. In the present study, we examine whether the semantic language, Chinese can also show this behavior. Typically, the s...The scale-invariance behavior has been widely observed in English or other phonetic language texts. In the present study, we examine whether the semantic language, Chinese can also show this behavior. Typically, the scale-invariance behavior is examined in the series of character intervals for the four great Chinese novels by a method of detrended fluctuation analysis. We observe that the scale-invariance behavior characterized by a scaling exponent around 0.60 exists in each novel. Moreover, we divide each novel into three parts with equal number of chapters, and we also observe the existence of scale-invariance in the interval series for each part. Interestingly, we find that there is evident difference in the scaling exponents between the first(or second) part and the third part in the novel of A dream of red mansions, and the difference between parts is not evident for the other three novels. Our observation suggests that there are two writing styles in A dream of red mansions, which are consistent with current prevailing view that the first 80 chapters and the last 40 chapters were accomplished by Xueqin Cao and E Gao, respectively. Our method may shed light on the identification of writing styles in written texts.展开更多
Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combin...Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios.展开更多
This paper presents a pure vision based technique for 3D reconstruction of planet terrain. The reconstruction accuracy depends ultimately on an optimization technique known as 'bundle adjustment'. In vision te...This paper presents a pure vision based technique for 3D reconstruction of planet terrain. The reconstruction accuracy depends ultimately on an optimization technique known as 'bundle adjustment'. In vision techniques, the translation is only known up to a scale factor, and a single scale factor is assumed for the whole sequence of images if only one camera is used. If an extra camera is available, stereo vision based reconstruction can be obtained by binocular views. If the baseline of the stereo setup is known, the scale factor problem is solved. We found that direct application of classical bundle adjustment on the constraints inherent between the binocular views has not been tested. Our method incorporated this constraint into the conventional bundle adjustment method. This special binocular bundle adjustment has been performed on image sequences similar to planet terrain circumstances. Experimental results show that our special method enhances not only the localization accuracy, but also the terrain mapping quality.展开更多
The global context(GC) descriptor is improved for describing interest regions,uses gradient orientation for binning,and thus provides more robust invariance for geometric and photometric transformations.The performanc...The global context(GC) descriptor is improved for describing interest regions,uses gradient orientation for binning,and thus provides more robust invariance for geometric and photometric transformations.The performance of the improved GC(IGC) to image matching is studied through extensive experiments on the Oxford A?ne dataset.Empirical results indicate that the proposed IGC yields quite stable and robust results,signi?cantly outperforms the original GC,and also can outperform the classical scale-invariant feature transform(SIFT) in most of the test cases.By integrating the IGC to the SIFT,the resulting of hybrid SIFT+IGC performs best over all other single descriptors in these experimental evaluations with various geometric transformations.展开更多
This paper describes a person identifcation method for a mobile robot which performs specifc person following under dynamic complicated environments like a school canteen where many persons exist.We propose a distance...This paper describes a person identifcation method for a mobile robot which performs specifc person following under dynamic complicated environments like a school canteen where many persons exist.We propose a distance-dependent appearance model which is based on scale-invariant feature transform(SIFT) feature.SIFT is a powerful image feature that is invariant to scale and rotation in the image plane and also robust to changes of lighting condition.However,the feature is weak against afne transformations and the identifcation power will thus be degraded when the pose of a person changes largely.We therefore use a set of images taken from various directions to cope with pose changes.Moreover,the number of SIFT feature matches between the model and an input image will decrease as the person becomes farther away from the camera.Therefore,we also use a distance-dependent threshold.The person following experiment was conducted using an actual mobile robot,and the quality assessment of person identifcation was performed.展开更多
Since the outbreak of Coronavirus Disease 2019(COVID-19),people are recommended to wear facial masks to limit the spread of the virus.Under the circumstances,traditional face recognition technologies cannot achieve sa...Since the outbreak of Coronavirus Disease 2019(COVID-19),people are recommended to wear facial masks to limit the spread of the virus.Under the circumstances,traditional face recognition technologies cannot achieve satisfactory results.In this paper,we propose a face recognition algorithm that combines the traditional features and deep features of masked faces.For traditional features,we extract Local Binary Pattern(LBP),Scale-Invariant Feature Transform(SIFT)and Histogram of Oriented Gradient(HOG)features from the periocular region,and use the Support Vector Machines(SVM)classifier to perform personal identification.We also propose an improved Convolutional Neural Network(CNN)model Angular Visual Geometry Group Network(A-VGG)to learn deep features.Then we use the decision-level fusion to combine the four features.Comprehensive experiments were carried out on databases of real masked faces and simulated masked faces,including frontal and side faces taken at different angles.Images with motion blur were also tested to evaluate the robustness of the algorithm.Besides,the experiment of matching a masked face with the corresponding full face is accomplished.The experimental results show that the proposed algorithm has state-of-the-art performance in masked face recognition,and the periocular region has rich biological features and high discrimination.展开更多
This paper studies the evolution of crescent-shaped dune under the influence of injected flux. A scaling law and a wind tunnel experiment are carried out for comparison. The experiment incorporates a novel image proce...This paper studies the evolution of crescent-shaped dune under the influence of injected flux. A scaling law and a wind tunnel experiment are carried out for comparison. The experiment incorporates a novel image processing algorithm to recover the evolutionary process. The theoretical and experimental results agree well in the middle stage of dune evolution, but deviate from each other in the initial and final stages, suggesting that the crescent-shaped dune evolution is intrinsically scale-variant and that the crescent shape breaks down under unsaturated condition.展开更多
基金supported by the National Natural Science Foundation of China(61379010)the Natural Science Basic Research Plan in Shaanxi Province of China(2015JM6293)
文摘Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy sub-band after NSCT is selected to embed watermark. The watermark is embedded into scaleinvariant feature transform (SIFT) regions. During embedding, the initial region is divided into some cirque sub-regions with the same area, and each watermark bit is embedded into one sub-region. Extensive simulation results and comparisons show that the algorithm gets a good trade-off of invisibility, robustness and capacity, thus obtaining good quality of the image while being able to effectively resist common image processing, and geometric and combo attacks, and normalized similarity is almost all reached.
基金supported by the National Natural Science Foundation of China (6117212711071002)+1 种基金the Specialized Research Fund for the Doctoral Program of Higher Education (20113401110006)the Innovative Research Team of 211 Project in Anhui University (KJTD007A)
文摘A new spectral matching algorithm is proposed by us- ing nonsubsampled contourlet transform and scale-invariant fea- ture transform. The nonsubsampled contourlet transform is used to decompose an image into a low frequency image and several high frequency images, and the scale-invariant feature transform is employed to extract feature points from the low frequency im- age. A proximity matrix is constructed for the feature points of two related images. By singular value decomposition of the proximity matrix, a matching matrix (or matching result) reflecting the match- ing degree among feature points is obtained. Experimental results indicate that the proposed algorithm can reduce time complexity and possess a higher accuracy.
基金the National Science Foundation of China(No.61471185)the Natural Science Foundation of Shandong Province(No.ZR2016FM21)+1 种基金Shandong Province Science and Technology Plan Project(No.2015GSF116001)Yantai City Key Research and Development Plan Project(Nos.2014ZH157 and2016ZH057)
文摘In this paper, we proposed a registration method by combining the morphological component analysis(MCA) and scale-invariant feature transform(SIFT) algorithm. This method uses the perception dictionaries,and combines the Basis-Pursuit algorithm and the Total-Variation regularization scheme to extract the cartoon part containing basic geometrical information from the original image, and is stable and unsusceptible to noise interference. Then a smaller number of the distinctive key points will be obtained by using the SIFT algorithm based on the cartoon part of the original image. Matching the key points by the constrained Euclidean distance,we will obtain a more correct and robust matching result. The experimental results show that the geometrical transform parameters inferred by the matched key points based on MCA+SIFT registration method are more exact than the ones based on the direct SIFT algorithm.
基金Program for NewCentury Excellent Talents in UniversityGrant number:50051+1 种基金The Key Project for Technology Research of Ministry Education of ChinaCrant number:106030
文摘To meet the needs in the fundus examination,including outlook widening,pathology tracking,etc.,this paper describes a robust feature-based method for fully-automatic mosaic of the curved human retinal images photographed by a fundus microscope. The kernel of this new algorithm is the scale-,rotation-and illumination-invariant interest point detector & feature descriptor-Scale-Invariant Feature Transform. When matched interest points according to second-nearest-neighbor strategy,the parameters of the model are estimated using the correct matches of the interest points,extracted by a new inlier identification scheme based on Sampson distance from putative sets. In order to preserve image features,bilinear warping and multi-band blending techniques are used to create panoramic retinal images. Experiments show that the proposed method works well with rejection error in 0.3 pixels,even for those cases where the retinal images without discernable vascular structure in contrast to the state-of-the-art algorithms.
基金the National Natural Science Foundation of China(No.60474069)
文摘Phase space can be constructed for N equal and distinguishable binary subsystems which are correlated in a scale-invariant manner. In the paper, correlation coefficient and reduced probability are introduced to characterize the scale-invariant correlated binary subsystems. Probabilistic sets for the correlated binary subsystems satisfy Leibnitz triangle rule in the sense that the marginal probabilities of N-system are equal to the joint probabilities of the (N - 1)-system. For entropic index q ≠ 1, nonextensive entropy Sq is shown to be additive in the scale-invariant occupation of phase space.
文摘Content-based satellite image registration is a difficult issue in the fields of remote sensing and image processing. The difficulty is more significant in the case of matching multisource remote sensing images which suffer from illumination, rotation, and source differences. The scale-invariant feature transform (SIFT) algorithm has been used successfully in satellite image registration problems. Also, many researchers have applied a local SIFT descriptor to improve the image retrieval process. Despite its robustness, this algorithm has some difficulties with the quality and quantity of the extracted local feature points in multisource remote sensing. Furthermore, high dimensionality of the local features extracted by SIFT results in time-consuming computational processes alongside high storage requirements for saving the relevant information, which are important factors in content-based image retrieval (CBIR) applications. In this paper, a novel method is introduced to transform the local SIFT features to global features for multisource remote sensing. The quality and quantity of SIFT local features have been enhanced by applying contrast equalization on images in a pre-processing stage. Considering the local features of each image in the reference database as a separate class, linear discriminant analysis (LDA) is used to transform the local features to global features while reducing di- mensionality of the feature space. This will also significantly reduce the computational time and storage required. Applying the trained kernel on verification data and mapping them showed a successful retrieval rate of 91.67% for test feature points.
基金Supported by the National Natural Science Foundation of China (10461005)the Department Fund of Science and Technology in Tianjin Higher Education Institutions (20050404).
文摘Fractional diffusion equation for transport phenomena in fractal media is an integro-partial differential equation. For solving the problem of this kind of equation the invariants of the group of scaling transformations are given and then used for deriving the integro-ordinary differential equation for the scale-invariant solution. With the help of Mellin transform the scale-invariant solution is obtained in terms of Fox functions. And the series and asymptotic representations for the solution are considered.
基金supported by the Hunan Provincial Natural Science Foundation of China(No.2022JJ40539)National Natural Science Foundation of China(No.11972370)National Key Project(No.GJXM92579).
文摘In this article,a robust,effective,and scale-invariant weighted compact nonlinear scheme(WCNS)is proposed by introducing descaling techniques to the nonlinear weights of the WCNS-Z/D schemes.The new scheme achieves an essentially non-oscillatory approximation of a discontinuous function(ENO-property),a scaleinvariant property with an arbitrary scale of a function(Si-property),and an optimal order of accuracy with smooth function regardless of the critical point(Cp-property).The classical WCNS-Z/D schemes do not satisfy Si-property intrinsically,which is caused by a loss of sub-stencils’adaptivity in the nonlinear interpolation of a discontinuous function when scaled by a small scale factor.A new nonlinear weight is devised by using an average of the function values and the descaling function,providing the new WCNS schemes(WCNS-Zm/Dm)with many attractive properties.The ENO-property,Si-property and Cp-property of the new WCNS schemes are validated numerically.Results show that the WCNS-Zm/Dm schemes satisfy the ENO-property and Si-property,while only the WCNS-Dm scheme satisfies the Cp-property.In addition,the Gaussian wave problem is solved by using successively refined grids to verify that the optimal order of accuracy of the new schemes can be achieved.Several one-dimensional shock tube problems,and two-dimensional double Mach reflection(DMR)problem and the Riemann IVP problem are simulated to illustrate the ENOproperty and Si-property of the scale-invariant WCNS-Zm/Dm schemes.
文摘Natural earthquakes and micro-seismicity resulting from hydraulic fracturing or other engineering practices display distinctively different spatial-temporal features,like mixed burst-and swarm-like features or predominantly swarm-like features.The mechanism(s)contributing to such observations can be diverse.We present the inspections on the dynamic formation process of the single swarm-like tree in laboratory acoustic emission(AE)catalogs.Such largest swarm-like trees can contain>97%AE events from the entire catalog within a test;and all catalogs under investigation display scale-invariance features.The formation of the largest swarm-like tree correlates with the rock fracture process analogue of the source pervasive process,where its AE releases exhibit significant spatial well-organization.Comparison to other laboratory catalogs under different laboratory settings helps us identify the spatial continuity of the rock fracture process as the primary factor in forming the largest swarm-like trees at laboratory scale.The stress transfer process is involved in the rock fracture process for the tests having pre-existing spatial discontinuity.Artificial perturbations on the spatial information induced by the stress transfer process further confirm that stress transfer also serves to shift the pure swarm-like catalog into a mixed burst-and swarm-like catalog.These laboratory observations may provide inspirational insights for understanding the field-scale mechanism(s)shaping the spatial-temporal energy release features.
基金Supported by the Innovation and Entrepreneurship Program of Shanghai University of Science and Technology under Grant No.XJ10252127National Natural Science Foundation of China under Grant Nos.11875042 and 11505114
文摘The scale-invariance behavior has been widely observed in English or other phonetic language texts. In the present study, we examine whether the semantic language, Chinese can also show this behavior. Typically, the scale-invariance behavior is examined in the series of character intervals for the four great Chinese novels by a method of detrended fluctuation analysis. We observe that the scale-invariance behavior characterized by a scaling exponent around 0.60 exists in each novel. Moreover, we divide each novel into three parts with equal number of chapters, and we also observe the existence of scale-invariance in the interval series for each part. Interestingly, we find that there is evident difference in the scaling exponents between the first(or second) part and the third part in the novel of A dream of red mansions, and the difference between parts is not evident for the other three novels. Our observation suggests that there are two writing styles in A dream of red mansions, which are consistent with current prevailing view that the first 80 chapters and the last 40 chapters were accomplished by Xueqin Cao and E Gao, respectively. Our method may shed light on the identification of writing styles in written texts.
基金supported by National Natural Science Foundation of China(No.61103123)Scientific Research Foundation for the Returned Overseas Chinese Scholars,State Education Ministry
文摘Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios.
基金the National Natural Science Foundation of China (Nos. 60505017 and 60534070)the Science Planning Project of Zhejiang Province, China (No. 2005C14008)
文摘This paper presents a pure vision based technique for 3D reconstruction of planet terrain. The reconstruction accuracy depends ultimately on an optimization technique known as 'bundle adjustment'. In vision techniques, the translation is only known up to a scale factor, and a single scale factor is assumed for the whole sequence of images if only one camera is used. If an extra camera is available, stereo vision based reconstruction can be obtained by binocular views. If the baseline of the stereo setup is known, the scale factor problem is solved. We found that direct application of classical bundle adjustment on the constraints inherent between the binocular views has not been tested. Our method incorporated this constraint into the conventional bundle adjustment method. This special binocular bundle adjustment has been performed on image sequences similar to planet terrain circumstances. Experimental results show that our special method enhances not only the localization accuracy, but also the terrain mapping quality.
基金the National Natural Science Foundation of China(Nos.60970109 and 61170228)
文摘The global context(GC) descriptor is improved for describing interest regions,uses gradient orientation for binning,and thus provides more robust invariance for geometric and photometric transformations.The performance of the improved GC(IGC) to image matching is studied through extensive experiments on the Oxford A?ne dataset.Empirical results indicate that the proposed IGC yields quite stable and robust results,signi?cantly outperforms the original GC,and also can outperform the classical scale-invariant feature transform(SIFT) in most of the test cases.By integrating the IGC to the SIFT,the resulting of hybrid SIFT+IGC performs best over all other single descriptors in these experimental evaluations with various geometric transformations.
基金supported by JSPS KAKENHI (No.23700203) and NEDO Intelligent RT Software Project
文摘This paper describes a person identifcation method for a mobile robot which performs specifc person following under dynamic complicated environments like a school canteen where many persons exist.We propose a distance-dependent appearance model which is based on scale-invariant feature transform(SIFT) feature.SIFT is a powerful image feature that is invariant to scale and rotation in the image plane and also robust to changes of lighting condition.However,the feature is weak against afne transformations and the identifcation power will thus be degraded when the pose of a person changes largely.We therefore use a set of images taken from various directions to cope with pose changes.Moreover,the number of SIFT feature matches between the model and an input image will decrease as the person becomes farther away from the camera.Therefore,we also use a distance-dependent threshold.The person following experiment was conducted using an actual mobile robot,and the quality assessment of person identifcation was performed.
基金Supported by the Postgraduate Research and Practice Innovation Program of Nanjing University of Aeronautics and Astronautics(XCXJH20220318)。
文摘Since the outbreak of Coronavirus Disease 2019(COVID-19),people are recommended to wear facial masks to limit the spread of the virus.Under the circumstances,traditional face recognition technologies cannot achieve satisfactory results.In this paper,we propose a face recognition algorithm that combines the traditional features and deep features of masked faces.For traditional features,we extract Local Binary Pattern(LBP),Scale-Invariant Feature Transform(SIFT)and Histogram of Oriented Gradient(HOG)features from the periocular region,and use the Support Vector Machines(SVM)classifier to perform personal identification.We also propose an improved Convolutional Neural Network(CNN)model Angular Visual Geometry Group Network(A-VGG)to learn deep features.Then we use the decision-level fusion to combine the four features.Comprehensive experiments were carried out on databases of real masked faces and simulated masked faces,including frontal and side faces taken at different angles.Images with motion blur were also tested to evaluate the robustness of the algorithm.Besides,the experiment of matching a masked face with the corresponding full face is accomplished.The experimental results show that the proposed algorithm has state-of-the-art performance in masked face recognition,and the periocular region has rich biological features and high discrimination.
基金funded by the National Natural Science Foundation of China(11402190)the China Postdoctoral Science Foundation(2014M552443)the Natural Science Foundation of Shaanxi Province(2013JQ2001)
文摘This paper studies the evolution of crescent-shaped dune under the influence of injected flux. A scaling law and a wind tunnel experiment are carried out for comparison. The experiment incorporates a novel image processing algorithm to recover the evolutionary process. The theoretical and experimental results agree well in the middle stage of dune evolution, but deviate from each other in the initial and final stages, suggesting that the crescent-shaped dune evolution is intrinsically scale-variant and that the crescent shape breaks down under unsaturated condition.