Image retrieval for food ingredients is important work,tremendously tiring,uninteresting,and expensive.Computer vision systems have extraordinary advancements in image retrieval with CNNs skills.But it is not feasible...Image retrieval for food ingredients is important work,tremendously tiring,uninteresting,and expensive.Computer vision systems have extraordinary advancements in image retrieval with CNNs skills.But it is not feasible for small-size food datasets using convolutional neural networks directly.In this study,a novel image retrieval approach is presented for small and medium-scale food datasets,which both augments images utilizing image transformation techniques to enlarge the size of datasets,and promotes the average accuracy of food recognition with state-of-the-art deep learning technologies.First,typical image transformation techniques are used to augment food images.Then transfer learning technology based on deep learning is applied to extract image features.Finally,a food recognition algorithm is leveraged on extracted deepfeature vectors.The presented image-retrieval architecture is analyzed based on a smallscale food dataset which is composed of forty-one categories of food ingredients and one hundred pictures for each category.Extensive experimental results demonstrate the advantages of image-augmentation architecture for small and medium datasets using deep learning.The novel approach combines image augmentation,ResNet feature vectors,and SMO classification,and shows its superiority for food detection of small/medium-scale datasets with comprehensive experiments.展开更多
Offline signature verification(OfSV)is essential in preventing the falsification of documents.Deep learning(DL)based OfSVs require a high number of signature images to attain acceptable performance.However,a limited n...Offline signature verification(OfSV)is essential in preventing the falsification of documents.Deep learning(DL)based OfSVs require a high number of signature images to attain acceptable performance.However,a limited number of signature samples are available to train these models in a real-world scenario.Several researchers have proposed models to augment new signature images by applying various transformations.Others,on the other hand,have used human neuromotor and cognitive-inspired augmentation models to address the demand for more signature samples.Hence,augmenting a sufficient number of signatures with variations is still a challenging task.This study proposed OffSig-SinGAN:a deep learning-based image augmentation model to address the limited number of signatures problem on offline signature verification.The proposed model is capable of augmenting better quality signatures with diversity from a single signature image only.It is empirically evaluated on widely used public datasets;GPDSsyntheticSignature.The quality of augmented signature images is assessed using four metrics like pixel-by-pixel difference,peak signal-to-noise ratio(PSNR),structural similarity index measure(SSIM),and frechet inception distance(FID).Furthermore,various experiments were organised to evaluate the proposed image augmentation model’s performance on selected DL-based OfSV systems and to prove whether it helped to improve the verification accuracy rate.Experiment results showed that the proposed augmentation model performed better on the GPDSsyntheticSignature dataset than other augmentation methods.The improved verification accuracy rate of the selected DL-based OfSV system proved the effectiveness of the proposed augmentation model.展开更多
The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and hist...The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and historical context,DL offers a powerful complement by enhancing the speed,objectivity,and precision of the classification process.This research explores the significance of image data augmentation techniques in optimizing the performance of convolutional neural networks(CNNs)for geological image analysis,particularly in the classification of igneous,metamorphic,and sedimentary rock types from rock thin section(RTS)images.This study primarily focuses on classic image augmentation techniques and evaluates their impact on model accuracy and precision.Results demonstrate that augmentation techniques like Equalize significantly enhance the model's classification capabilities,achieving an F1-Score of 0.9869 for igneous rocks,0.9884 for metamorphic rocks,and 0.9929 for sedimentary rocks,representing improvements compared to the baseline original results.Moreover,the weighted average F1-Score across all classes and techniques is 0.9886,indicating an enhancement.Conversely,methods like Distort lead to decreased accuracy and F1-Score,with an F1-Score of 0.949 for igneous rocks,0.954 for metamorphic rocks,and 0.9416 for sedimentary rocks,exacerbating the performance compared to the baseline.The study underscores the practicality of image data augmentation in geological image classification and advocates for the adoption of DL methods in this domain for automation and improved results.The findings of this study can benefit various fields,including remote sensing,mineral exploration,and environmental monitoring,by enhancing the accuracy of geological image analysis both for scientific research and industrial applications.展开更多
Diabetic Retinopathy is a disease,which happens due to abnormal growth of blood vessels that causes spots on the vision and vision loss.Various techniques are applied to identify the disease in the early stage with di...Diabetic Retinopathy is a disease,which happens due to abnormal growth of blood vessels that causes spots on the vision and vision loss.Various techniques are applied to identify the disease in the early stage with different methods and parameters.Machine Learning(ML)techniques are used for analyz-ing the images andfinding out the location of the disease.The restriction of the ML is a dataset size,which is used for model evaluation.This problem has been overcome by using an augmentation method by generating larger datasets with multidimensional features.Existing models are using only one augmentation tech-nique,which produces limited features of dataset and also lacks in the association of those data during DR detection,so multilevel augmentation is proposed for analysis.The proposed method performs in two phases namely integrated aug-mentation model and dataset correlation(i.e.relationships).It eliminates overfit-ting problem by considering relevant dataset.This method is used for solving the Diabetic Retinopathy problem with a thin vessel identification using the UNET model.UNET based image segmentation achieves 98.3%accuracy when com-pared to RV-GAN and different UNET models with high detection rate.展开更多
Augmented solar images were used to research the adaptability of four representative image extraction and matching algorithms in space weather domain.These include the scale-invariant feature transform algorithm,speed...Augmented solar images were used to research the adaptability of four representative image extraction and matching algorithms in space weather domain.These include the scale-invariant feature transform algorithm,speeded-up robust features algorithm,binary robust invariant scalable keypoints algorithm,and oriented fast and rotated brief algorithm.The performance of these algorithms was estimated in terms of matching accuracy,feature point richness,and running time.The experiment result showed that no algorithm achieved high accuracy while keeping low running time,and all algorithms are not suitable for image feature extraction and matching of augmented solar images.To solve this problem,an improved method was proposed by using two-frame matching to utilize the accuracy advantage of the scale-invariant feature transform algorithm and the speed advantage of the oriented fast and rotated brief algorithm.Furthermore,our method and the four representative algorithms were applied to augmented solar images.Our application experiments proved that our method achieved a similar high recognition rate to the scale-invariant feature transform algorithm which is significantly higher than other algorithms.Our method also obtained a similar low running time to the oriented fast and rotated brief algorithm,which is significantly lower than other algorithms.展开更多
Dog breed identification is essential for many reasons,particularly for understanding individual breeds′conditions,health concerns,interaction behavior,and natural instinct.This paper presents a solution for identify...Dog breed identification is essential for many reasons,particularly for understanding individual breeds′conditions,health concerns,interaction behavior,and natural instinct.This paper presents a solution for identifying dog breeds using their images of their faces.The proposed method applies a deep learning based approach in order to recognize their breeds.The method begins with a transfer learning by retraining existing pretrained convolutional neural networks(CNNs)on the public dog breed dataset.Then,the image augmentation with various settings is also applied on the training dataset,in order to improve the classification performance.The proposed method is evaluated using three different CNNs with various augmentation settings and comprehensive experimental comparisons.The proposed model achieves a promising accuracy of 89.92%on the published dataset with 133 dog breeds.展开更多
文摘Image retrieval for food ingredients is important work,tremendously tiring,uninteresting,and expensive.Computer vision systems have extraordinary advancements in image retrieval with CNNs skills.But it is not feasible for small-size food datasets using convolutional neural networks directly.In this study,a novel image retrieval approach is presented for small and medium-scale food datasets,which both augments images utilizing image transformation techniques to enlarge the size of datasets,and promotes the average accuracy of food recognition with state-of-the-art deep learning technologies.First,typical image transformation techniques are used to augment food images.Then transfer learning technology based on deep learning is applied to extract image features.Finally,a food recognition algorithm is leveraged on extracted deepfeature vectors.The presented image-retrieval architecture is analyzed based on a smallscale food dataset which is composed of forty-one categories of food ingredients and one hundred pictures for each category.Extensive experimental results demonstrate the advantages of image-augmentation architecture for small and medium datasets using deep learning.The novel approach combines image augmentation,ResNet feature vectors,and SMO classification,and shows its superiority for food detection of small/medium-scale datasets with comprehensive experiments.
文摘Offline signature verification(OfSV)is essential in preventing the falsification of documents.Deep learning(DL)based OfSVs require a high number of signature images to attain acceptable performance.However,a limited number of signature samples are available to train these models in a real-world scenario.Several researchers have proposed models to augment new signature images by applying various transformations.Others,on the other hand,have used human neuromotor and cognitive-inspired augmentation models to address the demand for more signature samples.Hence,augmenting a sufficient number of signatures with variations is still a challenging task.This study proposed OffSig-SinGAN:a deep learning-based image augmentation model to address the limited number of signatures problem on offline signature verification.The proposed model is capable of augmenting better quality signatures with diversity from a single signature image only.It is empirically evaluated on widely used public datasets;GPDSsyntheticSignature.The quality of augmented signature images is assessed using four metrics like pixel-by-pixel difference,peak signal-to-noise ratio(PSNR),structural similarity index measure(SSIM),and frechet inception distance(FID).Furthermore,various experiments were organised to evaluate the proposed image augmentation model’s performance on selected DL-based OfSV systems and to prove whether it helped to improve the verification accuracy rate.Experiment results showed that the proposed augmentation model performed better on the GPDSsyntheticSignature dataset than other augmentation methods.The improved verification accuracy rate of the selected DL-based OfSV system proved the effectiveness of the proposed augmentation model.
文摘The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and historical context,DL offers a powerful complement by enhancing the speed,objectivity,and precision of the classification process.This research explores the significance of image data augmentation techniques in optimizing the performance of convolutional neural networks(CNNs)for geological image analysis,particularly in the classification of igneous,metamorphic,and sedimentary rock types from rock thin section(RTS)images.This study primarily focuses on classic image augmentation techniques and evaluates their impact on model accuracy and precision.Results demonstrate that augmentation techniques like Equalize significantly enhance the model's classification capabilities,achieving an F1-Score of 0.9869 for igneous rocks,0.9884 for metamorphic rocks,and 0.9929 for sedimentary rocks,representing improvements compared to the baseline original results.Moreover,the weighted average F1-Score across all classes and techniques is 0.9886,indicating an enhancement.Conversely,methods like Distort lead to decreased accuracy and F1-Score,with an F1-Score of 0.949 for igneous rocks,0.954 for metamorphic rocks,and 0.9416 for sedimentary rocks,exacerbating the performance compared to the baseline.The study underscores the practicality of image data augmentation in geological image classification and advocates for the adoption of DL methods in this domain for automation and improved results.The findings of this study can benefit various fields,including remote sensing,mineral exploration,and environmental monitoring,by enhancing the accuracy of geological image analysis both for scientific research and industrial applications.
文摘Diabetic Retinopathy is a disease,which happens due to abnormal growth of blood vessels that causes spots on the vision and vision loss.Various techniques are applied to identify the disease in the early stage with different methods and parameters.Machine Learning(ML)techniques are used for analyz-ing the images andfinding out the location of the disease.The restriction of the ML is a dataset size,which is used for model evaluation.This problem has been overcome by using an augmentation method by generating larger datasets with multidimensional features.Existing models are using only one augmentation tech-nique,which produces limited features of dataset and also lacks in the association of those data during DR detection,so multilevel augmentation is proposed for analysis.The proposed method performs in two phases namely integrated aug-mentation model and dataset correlation(i.e.relationships).It eliminates overfit-ting problem by considering relevant dataset.This method is used for solving the Diabetic Retinopathy problem with a thin vessel identification using the UNET model.UNET based image segmentation achieves 98.3%accuracy when com-pared to RV-GAN and different UNET models with high detection rate.
基金Supported by the Key Research Program of the Chinese Academy of Sciences(ZDRE-KT-2021-3)。
文摘Augmented solar images were used to research the adaptability of four representative image extraction and matching algorithms in space weather domain.These include the scale-invariant feature transform algorithm,speeded-up robust features algorithm,binary robust invariant scalable keypoints algorithm,and oriented fast and rotated brief algorithm.The performance of these algorithms was estimated in terms of matching accuracy,feature point richness,and running time.The experiment result showed that no algorithm achieved high accuracy while keeping low running time,and all algorithms are not suitable for image feature extraction and matching of augmented solar images.To solve this problem,an improved method was proposed by using two-frame matching to utilize the accuracy advantage of the scale-invariant feature transform algorithm and the speed advantage of the oriented fast and rotated brief algorithm.Furthermore,our method and the four representative algorithms were applied to augmented solar images.Our application experiments proved that our method achieved a similar high recognition rate to the scale-invariant feature transform algorithm which is significantly higher than other algorithms.Our method also obtained a similar low running time to the oriented fast and rotated brief algorithm,which is significantly lower than other algorithms.
基金the Royal Golden Jubilee(RGJ)Ph.D.Programme under the Thailand Research Fund(No.PHD/0053/2561)。
文摘Dog breed identification is essential for many reasons,particularly for understanding individual breeds′conditions,health concerns,interaction behavior,and natural instinct.This paper presents a solution for identifying dog breeds using their images of their faces.The proposed method applies a deep learning based approach in order to recognize their breeds.The method begins with a transfer learning by retraining existing pretrained convolutional neural networks(CNNs)on the public dog breed dataset.Then,the image augmentation with various settings is also applied on the training dataset,in order to improve the classification performance.The proposed method is evaluated using three different CNNs with various augmentation settings and comprehensive experimental comparisons.The proposed model achieves a promising accuracy of 89.92%on the published dataset with 133 dog breeds.