The information of expression texture extracted by the completed local ternary patterns(CLTP) method is not accurate enough, which may cause low recognition rate. Therefore, an improved completed local ternary pattern...The information of expression texture extracted by the completed local ternary patterns(CLTP) method is not accurate enough, which may cause low recognition rate. Therefore, an improved completed local ternary patterns(ICLTP) is proposed here. Firstly, the Scharr operator is used to calculate gradient magnitudes of images to enhance the detail of texture, which is beneficial to obtaining more accurate expression features. Secondly, two different neighborhoods of CLTP features are combined to obtain much information of facial expression. Finally, K nearest neighbor(KNN) and sparse representation classifier(SRC) are combined for classification and a 10-fold cross-validation method is tested in the JAFFE and CK+ databases. The results show that the ICLTP method can improve the recognition rate of facial expression and reduce the confusion between various expressions. Especially, the misrecognition rate of other six expressions recognized as neutral is reduced in the 7-class expression recognition.展开更多
With the increasing popularity of high-resolution remote sensing images,the remote sensing image retrieval(RSIR)has always been a topic of major issue.A combined,global non-subsampled shearlet transform(NSST)-domain s...With the increasing popularity of high-resolution remote sensing images,the remote sensing image retrieval(RSIR)has always been a topic of major issue.A combined,global non-subsampled shearlet transform(NSST)-domain statistical features(NSSTds)and local three dimensional local ternary pattern(3D-LTP)features,is proposed for high-resolution remote sensing images.We model the NSST image coefficients of detail subbands using 2-state laplacian mixture(LM)distribution and its three parameters are estimated using Expectation-Maximization(EM)algorithm.We also calculate the statistical parameters such as subband kurtosis and skewness from detail subbands along with mean and standard deviation calculated from approximation subband,and concatenate all of them with the 2-state LM parameters to describe the global features of the image.The various properties of NSST such as multiscale,localization and flexible directional sensitivity make it a suitable choice to provide an effective approximation of an image.In order to extract the dense local features,a new 3D-LTP is proposed where dimension reduction is performed via selection of‘uniform’patterns.The 3D-LTP is calculated from spatial RGB planes of the input image.The proposed inter-channel 3D-LTP not only exploits the local texture information but the color information is captured too.Finally,a fused feature representation(NSSTds-3DLTP)is proposed using new global(NSSTds)and local(3D-LTP)features to enhance the discriminativeness of features.The retrieval performance of proposed NSSTds-3DLTP features are tested on three challenging remote sensing image datasets such as WHU-RS19,Aerial Image Dataset(AID)and PatternNet in terms of mean average precision(MAP),average normalized modified retrieval rank(ANMRR)and precision-recall(P-R)graph.The experimental results are encouraging and the NSSTds-3DLTP features leads to superior retrieval performance compared to many well known existing descriptors such as Gabor RGB,Granulometry,local binary pattern(LBP),Fisher vector(FV),vector of locally aggregated descriptors(VLAD)and median robust extended local binary pattern(MRELBP).For WHU-RS19 dataset,in terms of{MAP,ANMRR},the NSSTds-3DLTP improves upon Gabor RGB,Granulometry,LBP,FV,VLAD and MRELBP descriptors by{41.93%,20.87%},{92.30%,32.68%},{86.14%,31.97%},{18.18%,15.22%},{8.96%,19.60%}and{15.60%,13.26%},respectively.For AID,in terms of{MAP,ANMRR},the NSSTds-3DLTP improves upon Gabor RGB,Granulometry,LBP,FV,VLAD and MRELBP descriptors by{152.60%,22.06%},{226.65%,25.08%},{185.03%,23.33%},{80.06%,12.16%},{50.58%,10.49%}and{62.34%,3.24%},respectively.For PatternNet,the NSSTds-3DLTP respectively improves upon Gabor RGB,Granulometry,LBP,FV,VLAD and MRELBP descriptors by{32.79%,10.34%},{141.30%,24.72%},{17.47%,10.34%},{83.20%,19.07%},{21.56%,3.60%},and{19.30%,0.48%}in terms of{MAP,ANMRR}.The moderate dimensionality of simple NSSTds-3DLTP allows the system to run in real-time.展开更多
针对视觉结构类似导致的文种相似性问题,基于局部三值模式的相邻共生矩阵(co-occurrence of adjacent local ternary patterns,CoALTP)提出一种具有判别性和鲁棒性的局部三值模式的相邻共生矩阵(discriminant and robust co-occurrence ...针对视觉结构类似导致的文种相似性问题,基于局部三值模式的相邻共生矩阵(co-occurrence of adjacent local ternary patterns,CoALTP)提出一种具有判别性和鲁棒性的局部三值模式的相邻共生矩阵(discriminant and robust co-occurrence of adjacent local ternary patterns,DRCoALTP)方法,用于获取图像纹理。计算文档图像的相邻稀疏局部三值模式(adjacent sparse local ternary patterns,ASLTP),将采样点数量设定为8,以便获得详细的局部纹理,设计出一种基于自适应中值滤波思想的半自适应阈值方法,用于提取灰度图像中心像素周边对角邻域像素的编码值。ASLTP在邻域像素位置存放稀疏局部三值模式(local ternary patterns,LTP)的值,提取灰度共生矩阵(gray-level co-occurrence matrix,GLCM),从4个方向统计使用ASLTP后灰度图像像素之间的频率关系。该算法在阿拉伯文、俄文、简体中文、哈萨克文、藏文、蒙古文、土耳其文、维吾尔文、英文、吉尔吉斯斯坦文和塔吉克斯坦文11个文种的自建印刷体文档图像数据集中验证。试验结果表明,相较于基线和先进的纹理方法,改进后的方法更具判别性,平均识别准确率为99.14%。为改善CoALTP方法可能产生低效分类特征的问题,提出半自适应阈值方法,有效提高识别率并抑制噪声。此外,针对算法产生的高维特征,采用基于均方差的特征选择方法,通过支持向量机(support vector machine,SVM)分类器特征选择后,识别速度提高284%,对11个文种的平均识别准确率达99.44%。展开更多
Finger Knuckle Print biometric plays a vital role in establishing security for real-time environments. The success of human authentication depends on high speed and accuracy. This paper proposed an integrated approach...Finger Knuckle Print biometric plays a vital role in establishing security for real-time environments. The success of human authentication depends on high speed and accuracy. This paper proposed an integrated approach of personal authentication using texture based Finger Knuckle Print (FKP) recognition in multiresolution domain. FKP images are rich in texture patterns. Recently, many texture patterns are proposed for biometric feature extraction. Hence, it is essential to review whether Local Binary Patterns or its variants perform well for FKP recognition. In this paper, Local Directional Pattern (LDP), Local Derivative Ternary Pattern (LDTP) and Local Texture Description Framework based Modified Local Directional Pattern (LTDF_MLDN) based feature extraction in multiresolution domain are experimented with Nearest Neighbor and Extreme Learning Machine (ELM) Classifier for FKP recognition. Experiments were conducted on PolYU database. The result shows that LDTP in Contourlet domain achieves a promising performance. It also proves that Soft classifier performs better than the hard classifier.展开更多
Retinal Optical Coherence Tomography (OCT) images, a non-invasive imaging technique, have become a standard retinal disease detection tool. Due to disease, there are morphological and textural changes in the layers of...Retinal Optical Coherence Tomography (OCT) images, a non-invasive imaging technique, have become a standard retinal disease detection tool. Due to disease, there are morphological and textural changes in the layers of the retina. Classifying OCT images is challenging, as the morphological manifestations of different diseases may be similar. The OCT images capture the reflectivity characteristics of the retinal tissues. Retinal diseases change the reflectivity property of retinal tissues, resulting in texture variations in OCT images. We propose a hybrid approach to OCT image classification in which the Convolution Neural Network (CNN) model is trained using Multiple Neighborhood Local Ternary Pattern (MNLTP) texture descriptors of the OCT images dataset for a robust disease prediction system. Parallel deep CNN (PDCNN) is proposed to improve feature representation and generalizability. The MNLTP-PDCNN model is tested on two publicly available datasets. The parameter values Accuracy, Precision, Recall, and F1-Score are calculated. The best accuracy obtained specifying the model’s overall performance is 93.98% and 99% for the NEH and OCT2017 datasets, respectively. With the proposed architecture, comparable performance is obtained with a subset of the original OCT2017 data set and a comparatively smaller number of trainable parameters (1.6 million, 1.8 million, and 2.3 million for a single CNN branch, two parallel CNN branches, and three parallel network branches, respectively), compared to off-the-shelf CNN models. Hence, the proposed approach is suitable for real-time OCT image classification systems with fast training of the CNN model and reduced memory requirement for computations.展开更多
基金supported by the National Natural Science Foundation of China(No.51604056)the Chongqing Science and Technology Commission(No.cstc2015jcyjBX0066)
文摘The information of expression texture extracted by the completed local ternary patterns(CLTP) method is not accurate enough, which may cause low recognition rate. Therefore, an improved completed local ternary patterns(ICLTP) is proposed here. Firstly, the Scharr operator is used to calculate gradient magnitudes of images to enhance the detail of texture, which is beneficial to obtaining more accurate expression features. Secondly, two different neighborhoods of CLTP features are combined to obtain much information of facial expression. Finally, K nearest neighbor(KNN) and sparse representation classifier(SRC) are combined for classification and a 10-fold cross-validation method is tested in the JAFFE and CK+ databases. The results show that the ICLTP method can improve the recognition rate of facial expression and reduce the confusion between various expressions. Especially, the misrecognition rate of other six expressions recognized as neutral is reduced in the 7-class expression recognition.
文摘With the increasing popularity of high-resolution remote sensing images,the remote sensing image retrieval(RSIR)has always been a topic of major issue.A combined,global non-subsampled shearlet transform(NSST)-domain statistical features(NSSTds)and local three dimensional local ternary pattern(3D-LTP)features,is proposed for high-resolution remote sensing images.We model the NSST image coefficients of detail subbands using 2-state laplacian mixture(LM)distribution and its three parameters are estimated using Expectation-Maximization(EM)algorithm.We also calculate the statistical parameters such as subband kurtosis and skewness from detail subbands along with mean and standard deviation calculated from approximation subband,and concatenate all of them with the 2-state LM parameters to describe the global features of the image.The various properties of NSST such as multiscale,localization and flexible directional sensitivity make it a suitable choice to provide an effective approximation of an image.In order to extract the dense local features,a new 3D-LTP is proposed where dimension reduction is performed via selection of‘uniform’patterns.The 3D-LTP is calculated from spatial RGB planes of the input image.The proposed inter-channel 3D-LTP not only exploits the local texture information but the color information is captured too.Finally,a fused feature representation(NSSTds-3DLTP)is proposed using new global(NSSTds)and local(3D-LTP)features to enhance the discriminativeness of features.The retrieval performance of proposed NSSTds-3DLTP features are tested on three challenging remote sensing image datasets such as WHU-RS19,Aerial Image Dataset(AID)and PatternNet in terms of mean average precision(MAP),average normalized modified retrieval rank(ANMRR)and precision-recall(P-R)graph.The experimental results are encouraging and the NSSTds-3DLTP features leads to superior retrieval performance compared to many well known existing descriptors such as Gabor RGB,Granulometry,local binary pattern(LBP),Fisher vector(FV),vector of locally aggregated descriptors(VLAD)and median robust extended local binary pattern(MRELBP).For WHU-RS19 dataset,in terms of{MAP,ANMRR},the NSSTds-3DLTP improves upon Gabor RGB,Granulometry,LBP,FV,VLAD and MRELBP descriptors by{41.93%,20.87%},{92.30%,32.68%},{86.14%,31.97%},{18.18%,15.22%},{8.96%,19.60%}and{15.60%,13.26%},respectively.For AID,in terms of{MAP,ANMRR},the NSSTds-3DLTP improves upon Gabor RGB,Granulometry,LBP,FV,VLAD and MRELBP descriptors by{152.60%,22.06%},{226.65%,25.08%},{185.03%,23.33%},{80.06%,12.16%},{50.58%,10.49%}and{62.34%,3.24%},respectively.For PatternNet,the NSSTds-3DLTP respectively improves upon Gabor RGB,Granulometry,LBP,FV,VLAD and MRELBP descriptors by{32.79%,10.34%},{141.30%,24.72%},{17.47%,10.34%},{83.20%,19.07%},{21.56%,3.60%},and{19.30%,0.48%}in terms of{MAP,ANMRR}.The moderate dimensionality of simple NSSTds-3DLTP allows the system to run in real-time.
文摘针对视觉结构类似导致的文种相似性问题,基于局部三值模式的相邻共生矩阵(co-occurrence of adjacent local ternary patterns,CoALTP)提出一种具有判别性和鲁棒性的局部三值模式的相邻共生矩阵(discriminant and robust co-occurrence of adjacent local ternary patterns,DRCoALTP)方法,用于获取图像纹理。计算文档图像的相邻稀疏局部三值模式(adjacent sparse local ternary patterns,ASLTP),将采样点数量设定为8,以便获得详细的局部纹理,设计出一种基于自适应中值滤波思想的半自适应阈值方法,用于提取灰度图像中心像素周边对角邻域像素的编码值。ASLTP在邻域像素位置存放稀疏局部三值模式(local ternary patterns,LTP)的值,提取灰度共生矩阵(gray-level co-occurrence matrix,GLCM),从4个方向统计使用ASLTP后灰度图像像素之间的频率关系。该算法在阿拉伯文、俄文、简体中文、哈萨克文、藏文、蒙古文、土耳其文、维吾尔文、英文、吉尔吉斯斯坦文和塔吉克斯坦文11个文种的自建印刷体文档图像数据集中验证。试验结果表明,相较于基线和先进的纹理方法,改进后的方法更具判别性,平均识别准确率为99.14%。为改善CoALTP方法可能产生低效分类特征的问题,提出半自适应阈值方法,有效提高识别率并抑制噪声。此外,针对算法产生的高维特征,采用基于均方差的特征选择方法,通过支持向量机(support vector machine,SVM)分类器特征选择后,识别速度提高284%,对11个文种的平均识别准确率达99.44%。
文摘Finger Knuckle Print biometric plays a vital role in establishing security for real-time environments. The success of human authentication depends on high speed and accuracy. This paper proposed an integrated approach of personal authentication using texture based Finger Knuckle Print (FKP) recognition in multiresolution domain. FKP images are rich in texture patterns. Recently, many texture patterns are proposed for biometric feature extraction. Hence, it is essential to review whether Local Binary Patterns or its variants perform well for FKP recognition. In this paper, Local Directional Pattern (LDP), Local Derivative Ternary Pattern (LDTP) and Local Texture Description Framework based Modified Local Directional Pattern (LTDF_MLDN) based feature extraction in multiresolution domain are experimented with Nearest Neighbor and Extreme Learning Machine (ELM) Classifier for FKP recognition. Experiments were conducted on PolYU database. The result shows that LDTP in Contourlet domain achieves a promising performance. It also proves that Soft classifier performs better than the hard classifier.
基金Deanship of Research and Graduate Studies at King Khalid University funded this work through Large Research Project under grant number RGP2/54/45.
文摘Retinal Optical Coherence Tomography (OCT) images, a non-invasive imaging technique, have become a standard retinal disease detection tool. Due to disease, there are morphological and textural changes in the layers of the retina. Classifying OCT images is challenging, as the morphological manifestations of different diseases may be similar. The OCT images capture the reflectivity characteristics of the retinal tissues. Retinal diseases change the reflectivity property of retinal tissues, resulting in texture variations in OCT images. We propose a hybrid approach to OCT image classification in which the Convolution Neural Network (CNN) model is trained using Multiple Neighborhood Local Ternary Pattern (MNLTP) texture descriptors of the OCT images dataset for a robust disease prediction system. Parallel deep CNN (PDCNN) is proposed to improve feature representation and generalizability. The MNLTP-PDCNN model is tested on two publicly available datasets. The parameter values Accuracy, Precision, Recall, and F1-Score are calculated. The best accuracy obtained specifying the model’s overall performance is 93.98% and 99% for the NEH and OCT2017 datasets, respectively. With the proposed architecture, comparable performance is obtained with a subset of the original OCT2017 data set and a comparatively smaller number of trainable parameters (1.6 million, 1.8 million, and 2.3 million for a single CNN branch, two parallel CNN branches, and three parallel network branches, respectively), compared to off-the-shelf CNN models. Hence, the proposed approach is suitable for real-time OCT image classification systems with fast training of the CNN model and reduced memory requirement for computations.