The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-genera...The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.展开更多
With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object si...With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object significant challenges have been presented in accurately segmenting melanomas in dermoscopic images due to the objects that could interfere human observations,such as bubbles and scales.To address these challenges,we propose a dual U-Net network framework for skin melanoma segmentation.In our proposed architecture,we introduce several innovative components that aim to enhance the performance and capabilities of the traditional U-Net.First,we establish a novel framework that links two simplified U-Nets,enabling more comprehensive information exchange and feature integration throughout the network.Second,after cascading the second U-Net,we introduce a skip connection between the decoder and encoder networks,and incorporate a modified receptive field block(MRFB),which is designed to capture multi-scale spatial information.Third,to further enhance the feature representation capabilities,we add a multi-path convolution block attention module(MCBAM)to the first two layers of the first U-Net encoding,and integrate a new squeeze-and-excitation(SE)mechanism with residual connections in the second U-Net.To illustrate the performance of our proposed model,we conducted comprehensive experiments on widely recognized skin datasets.On the ISIC-2017 dataset,the IoU value of our proposed model increased from 0.6406 to 0.6819 and the Dice coefficient increased from 0.7625 to 0.8023.On the ISIC-2018 dataset,the IoU value of proposed model also improved from 0.7138 to 0.7709,while the Dice coefficient increased from 0.8285 to 0.8665.Furthermore,the generalization experiments conducted on the jaw cyst dataset from Quzhou People’s Hospital further verified the outstanding segmentation performance of the proposed model.These findings collectively affirm the potential of our approach as a valuable tool in supporting clinical decision-making in the field of skin cancer detection,as well as advancing research in medical image analysis.展开更多
Spatiotemporal forecasting of surface soil moisture(SSM)is recognized as a critical scientific issue in precision agricultural irrigation,regional drought monitoring,and early warning systems for extreme precipitation...Spatiotemporal forecasting of surface soil moisture(SSM)is recognized as a critical scientific issue in precision agricultural irrigation,regional drought monitoring,and early warning systems for extreme precipitation.However,long-term forecasting continues to pose formidable challenges because of the complexity observed across both the spatial and temporal scales.In this study,we used a daily SSM dataset at a 0.05°×0.05°spatial resolution over the Qilian Mountains,China and proposed a hybrid Convolutional Long Short-Term Memory(ConvLSTM)-Nudging model,which combined deep neural networks with data assimilation to increase the accuracy of long-term SSM forecasting.We trained and evaluated the SSM predictive performance of four models(Convolutional Neural Network(CNN),Long Short-Term Memory(LSTM),ConvLSTM,and ConvLSTM with Squeeze-and-Excitation(SE)attention mechanism(ConvLSTM-SE))in both short-term and long-term scenarios.The results showed that all the models perform well under short-term predictions,but the accuracy decrease substantially in long-term predictions.Therefore,we integrated Nudging technique during the long-term prediction phase to assimilate observational information and rectify model biases.Comprehensive evaluations demonstrate that Nudging significantly improves all the models,with ConvLSTM-Nudging achieving the best performance under the 200-d forecasting scenario.Relative to those of the best-performing ConvLSTM model for long-term forecasts,when observation noiseδ=0.00 and observation fraction obs=50.0%,the coefficient of determination(R2)of ConvLSTM-Nudging increases by approximately 82.1%,while its mean absolute error(MAE)and root mean squared error(RMSE)decrease by approximately 84.8%and 77.3%,respectively;the average Pearson correlation coefficient(r)improves by approximately 23.6%,and Bias is reduced by 98.1%.These results demonstrated that although pure deep learning models achieve high accuracy in the short-term predictions,they are prone to error accumulation and systematic drift in long-term autoregressive predictions.Integrating data assimilation with deep learning and continuously correcting the state through observation can effectively suppress long-term biases,thereby achieving robust long-term SSM forecasting.展开更多
基金the National Natural Science Foundation of China(No.61976080)the Academic Degrees&Graduate Education Reform Project of Henan Province(No.2021SJGLX195Y)+1 种基金the Teaching Reform Research and Practice Project of Henan Undergraduate Universities(No.2022SYJXLX008)the Key Project on Research and Practice of Henan University Graduate Education and Teaching Reform(No.YJSJG2023XJ006)。
文摘The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.
基金funded by Zhejiang Basic Public Welfare Research Project,grant number LZY24E060001supported by Guangzhou Development Zone Science and Technology(2021GH10,2020GH10,2023GH02)+1 种基金the University of Macao(MYRG2022-00271-FST)the Science and Technology Development Fund(FDCT)of Macao(0032/2022/A).
文摘With the continuous development of artificial intelligence and machine learning techniques,there have been effective methods supporting the work of dermatologist in the field of skin cancer detection.However,object significant challenges have been presented in accurately segmenting melanomas in dermoscopic images due to the objects that could interfere human observations,such as bubbles and scales.To address these challenges,we propose a dual U-Net network framework for skin melanoma segmentation.In our proposed architecture,we introduce several innovative components that aim to enhance the performance and capabilities of the traditional U-Net.First,we establish a novel framework that links two simplified U-Nets,enabling more comprehensive information exchange and feature integration throughout the network.Second,after cascading the second U-Net,we introduce a skip connection between the decoder and encoder networks,and incorporate a modified receptive field block(MRFB),which is designed to capture multi-scale spatial information.Third,to further enhance the feature representation capabilities,we add a multi-path convolution block attention module(MCBAM)to the first two layers of the first U-Net encoding,and integrate a new squeeze-and-excitation(SE)mechanism with residual connections in the second U-Net.To illustrate the performance of our proposed model,we conducted comprehensive experiments on widely recognized skin datasets.On the ISIC-2017 dataset,the IoU value of our proposed model increased from 0.6406 to 0.6819 and the Dice coefficient increased from 0.7625 to 0.8023.On the ISIC-2018 dataset,the IoU value of proposed model also improved from 0.7138 to 0.7709,while the Dice coefficient increased from 0.8285 to 0.8665.Furthermore,the generalization experiments conducted on the jaw cyst dataset from Quzhou People’s Hospital further verified the outstanding segmentation performance of the proposed model.These findings collectively affirm the potential of our approach as a valuable tool in supporting clinical decision-making in the field of skin cancer detection,as well as advancing research in medical image analysis.
基金funded by the National Natural Science Foundation of China(42461053)the Department of Education of Gansu Province:Higher Education Innovation Fund Project(2023B-064)+1 种基金the Youth Doctoral Fund Project(2024QB-014)the Natural Science Foundation of Gansu Province(25JRRA012).
文摘Spatiotemporal forecasting of surface soil moisture(SSM)is recognized as a critical scientific issue in precision agricultural irrigation,regional drought monitoring,and early warning systems for extreme precipitation.However,long-term forecasting continues to pose formidable challenges because of the complexity observed across both the spatial and temporal scales.In this study,we used a daily SSM dataset at a 0.05°×0.05°spatial resolution over the Qilian Mountains,China and proposed a hybrid Convolutional Long Short-Term Memory(ConvLSTM)-Nudging model,which combined deep neural networks with data assimilation to increase the accuracy of long-term SSM forecasting.We trained and evaluated the SSM predictive performance of four models(Convolutional Neural Network(CNN),Long Short-Term Memory(LSTM),ConvLSTM,and ConvLSTM with Squeeze-and-Excitation(SE)attention mechanism(ConvLSTM-SE))in both short-term and long-term scenarios.The results showed that all the models perform well under short-term predictions,but the accuracy decrease substantially in long-term predictions.Therefore,we integrated Nudging technique during the long-term prediction phase to assimilate observational information and rectify model biases.Comprehensive evaluations demonstrate that Nudging significantly improves all the models,with ConvLSTM-Nudging achieving the best performance under the 200-d forecasting scenario.Relative to those of the best-performing ConvLSTM model for long-term forecasts,when observation noiseδ=0.00 and observation fraction obs=50.0%,the coefficient of determination(R2)of ConvLSTM-Nudging increases by approximately 82.1%,while its mean absolute error(MAE)and root mean squared error(RMSE)decrease by approximately 84.8%and 77.3%,respectively;the average Pearson correlation coefficient(r)improves by approximately 23.6%,and Bias is reduced by 98.1%.These results demonstrated that although pure deep learning models achieve high accuracy in the short-term predictions,they are prone to error accumulation and systematic drift in long-term autoregressive predictions.Integrating data assimilation with deep learning and continuously correcting the state through observation can effectively suppress long-term biases,thereby achieving robust long-term SSM forecasting.