Low-light image enhancement is one of the most active research areas in the field of computer vision in recent years.In the low-light image enhancement process,loss of image details and increase in noise occur inevita...Low-light image enhancement is one of the most active research areas in the field of computer vision in recent years.In the low-light image enhancement process,loss of image details and increase in noise occur inevitably,influencing the quality of enhanced images.To alleviate this problem,a low-light image enhancement model called RetinexNet model based on Retinex theory was proposed in this study.The model was composed of an image decomposition module and a brightness enhancement module.In the decomposition module,a convolutional block attention module(CBAM)was incorporated to enhance feature representation capacity of the network,focusing on crucial features and suppressing irrelevant ones.A multifeature fusion denoising module was designed within the brightness enhancement module,circumventing the issue of feature loss during downsampling.The proposed model outperforms the existing algorithms in terms of PSNR and SSIM metrics on the publicly available datasets LOL and MIT-Adobe FiveK,as well as gives superior results in terms of NIQE metrics on the publicly available dataset LIME.展开更多
Foundation models(FMs)have rapidly evolved and have achieved signicant accomplishments in computer vision tasks.Specically,the prompt mechanism conveniently allows users to integrate image prior information into the m...Foundation models(FMs)have rapidly evolved and have achieved signicant accomplishments in computer vision tasks.Specically,the prompt mechanism conveniently allows users to integrate image prior information into the model,making it possible to apply models without any training.Therefore,we proposed a workflow based on foundation models and zero training to solve the tasks of photoacoustic(PA)image processing.We employed the Segment Anything Model(SAM)by setting simple prompts and integrating the model's outputs with prior knowledge of the imaged objects to accomplish various tasks,including:(1)removing the skin signal in three-dimensional PA image rendering;(2)dual speed-of-sound reconstruction,and(3)segmentation ofnger blood vessels.Through these demonstrations,we have concluded that FMs can be directly applied in PA imaging without the requirement for network design and training.This potentially allows for a hands-on,convenient approach to achieving efficient and accurate segmentation of PA images.This paper serves as a comprehensive tutorial,facilitating the mastery of the technique through the provision of code and sample datasets.展开更多
Traditional steganography conceals information by modifying cover data,but steganalysis tools easily detect such alterations.While deep learning-based steganography often involves high training costs and complex deplo...Traditional steganography conceals information by modifying cover data,but steganalysis tools easily detect such alterations.While deep learning-based steganography often involves high training costs and complex deployment.Diffusion model-based methods face security vulnerabilities,particularly due to potential information leakage during generation.We propose a fixed neural network image steganography framework based on secure diffu-sion models to address these challenges.Unlike conventional approaches,our method minimizes cover modifications through neural network optimization,achieving superior steganographic performance in human visual perception and computer vision analyses.The cover images are generated in an anime style using state-of-the-art diffusion models,ensuring the transmitted images appear more natural.This study introduces fixed neural network technology that allows senders to transmit only minimal critical information alongside stego-images.Recipients can accurately reconstruct secret images using this compact data,significantly reducing transmission overhead compared to conventional deep steganography.Furthermore,our framework innovatively integrates ElGamal,a cryptographic algorithm,to protect critical information during transmission,enhancing overall system security and ensuring end-to-end information protection.This dual optimization of payload reduction and cryptographic reinforcement establishes a new paradigm for secure and efficient image steganography.展开更多
In Hyperspectral Imaging(HSI),the detrimental influence of noise and distortions on data quality is profound,which has severely affected the following-on analytics and decisionmaking such as land mapping.This study pr...In Hyperspectral Imaging(HSI),the detrimental influence of noise and distortions on data quality is profound,which has severely affected the following-on analytics and decisionmaking such as land mapping.This study presents an innovative framework for assessing HSI band quality and reconstructing the low-quality bands,based on the Prophet model.By introducing a comprehensive quality metric to start,the authors approach factors in both spatial and spectral characteristics across local and global scales.This metric effectively captures the intricate noise and distortions inherent in the HSI data.Subsequently,the authors employ the Prophet model to forecast information within the low-quality bands,leveraging insights from neighbouring high-quality bands.To validate the effectiveness of the authors’proposed model,extensive experiments on three publicly available uncorrected datasets are conducted.In a head-to-head comparison,the framework against six state-ofthe-art band reconstruction algorithms including three spectral methods,two spatialspectral methods and one deep learning method is benchmarked.The authors’experiments also delve into strategies for band selection based on quality metrics and the quality evaluation of the reconstructed bands.In addition,the authors assess the classification accuracy utilising these reconstructed bands.In various experiments,the results consistently affirm the efficacy of the authors’method in HSI quality assessment and band reconstruction.Notably,the authors’approach obviates the need for manually prefiltering of noisy bands.This comprehensive framework holds promise in addressing HSI data quality concerns whilst enhancing the overall utility of HSI.展开更多
This study introduces a novel method for reconstructing the 3D model of aluminum foam using cross-sectional sequence images.Combining precision milling and image acquisition,high-qual-ity cross-sectional images are ob...This study introduces a novel method for reconstructing the 3D model of aluminum foam using cross-sectional sequence images.Combining precision milling and image acquisition,high-qual-ity cross-sectional images are obtained.Pore structures are segmented by the U-shaped network(U-Net)neural network integrated with the Canny edge detection operator,ensuring accurate pore delineation and edge extraction.The trained U-Net achieves 98.55%accuracy.The 2D data are superimposed and processed into 3D point clouds,enabling reconstruction of the pore structure and aluminum skeleton.Analysis of pore 01 shows the cross-sectional area initially increases,and then decreases with milling depth,with a uniform point distribution of 40 per layer.The reconstructed model exhibits a porosity of 77.5%,with section overlap rates between the 2D pore segmentation and the reconstructed model exceeding 96%,confirming high fidelity.Equivalent sphere diameters decrease with size,averaging 1.95 mm.Compression simulations reveal that the stress-strain curve of the 3D reconstruction model of aluminum foam exhibits fluctuations,and the stresses in the reconstruction model concentrate on thin cell walls,leading to localized deformations.This method accurately restores the aluminum foam’s complex internal structure,improving reconstruction preci-sion and simulation reliability.The approach offers a cost-efficient,high-precision technique for optimizing material performance in engineering applications.展开更多
Accurately predicting geomagnetic field is of great significance for space environment monitoring and space weather forecasting worldwide.This paper proposes a vision Transformer(ViT)hybrid model that leverages aurora...Accurately predicting geomagnetic field is of great significance for space environment monitoring and space weather forecasting worldwide.This paper proposes a vision Transformer(ViT)hybrid model that leverages aurora images to predict local geomagnetic station component,breaking the spatial limitations of geomagnetic stations.Our method utilizes the ViT backbone model in combination with convolutional networks to capture both the large-scale spatial correlation and distinct local feature correlation between aurora images and geomagnetic station data.Essentially,the model comprises a visual geometry group(VGG)image feature extraction network,a ViT-based encoder network,and a regression prediction network.Our experimental findings indicate that global features of aurora images play a more substantial role in predicting geomagnetic data than local features.Specifically,the hybrid model achieves a 39.1%reduction in root mean square error compared to the VGG model,a 29.5%reduction compared to the ViT model and a 35.3%reduction relative to the residual network(ResNet)model.Moreover,the fitting accuracy of the model surpasses that of the VGG,ViT,and ResNet models by 2.14%1.58%,and 4.1%,respectively.展开更多
The internal structures of cells as the basic units of life are a major wonder of the microscopic world.Cellular images provide an intriguing window to help explore and understand the composition and function of these...The internal structures of cells as the basic units of life are a major wonder of the microscopic world.Cellular images provide an intriguing window to help explore and understand the composition and function of these structures.Scientific imagery combined with artistic expression can further expand the potential of imaging in educational dissemination and interdisciplinary applications.展开更多
Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding ...Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding phase.This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder(LMSFE-SAM)to address the issues above.Firstly,based on the SAM,a local multi-scale feature encoder is introduced to improve the representation of features within local receptive field,thereby supplying the Vision Transformer(ViT)branch in SAM with enriched local multi-scale contextual information.At the same time,a multiaxial Hadamard product module(MHPM)is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference.Subsequently,a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM.Finally,to obtain smaller input image size and to mitigate overlapping in patch embeddings,the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels,and a multidimensional information adaptation component is developed,which includes feature adapters,position adapters,and channel-spatial adapters.This component effectively integrates the information from small-sized medical images into SAM,enhancing its suitability for clinical deployment.The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on BUSI,DDTI,and TN3K datasets compared to eight other representative image segmentation models.This significantly enhances the performance of the SAM on medical images,providing clinicians with a powerful tool in clinical diagnosis.展开更多
Panoramic images, offering a 360-degree view, are essential in virtual reality(VR) and augmented reality(AR), enhancing realism with high-quality textures. However, acquiring complete and high-quality panoramic textur...Panoramic images, offering a 360-degree view, are essential in virtual reality(VR) and augmented reality(AR), enhancing realism with high-quality textures. However, acquiring complete and high-quality panoramic textures is challenging. This paper introduces a method using generative adversarial networks(GANs) and the contrastive language-image pretraining(CLIP) model to restore and control texture in panoramic images. The GAN model captures complex structures and maintains consistency, while CLIP enables fine-grained texture control via semantic text-image associations. GAN inversion optimizes latent codes for precise texture details. The resulting low dynamic range(LDR) images are converted to high dynamic range(HDR) using the Blender engine for seamless texture blending. Experimental results demonstrate the effectiveness and flexibility of this method in panoramic texture restoration and generation.展开更多
A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec-...A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec- ond step, the discriminant-based methods or clustering-based methods are performed on the reformed distribution. It is focused on the typical clustering methods-Gaussian mixture model (GMM) and its variant to demonstrate the feasibility of the framework. Due to the independence of the first step in its second step, it can be integrated into the pixel-based and the histogram-based methods to improve their segmentation quality. The experiments on artificial and real images show that the framework can achieve effective and robust segmentation results.展开更多
During the unmanned aerial vehicles (UAV) reconnaissance missions in the middle-low troposphere, the reconnaissance images are blurred and degraded due to the scattering process of aerosol under fog, haze and other ...During the unmanned aerial vehicles (UAV) reconnaissance missions in the middle-low troposphere, the reconnaissance images are blurred and degraded due to the scattering process of aerosol under fog, haze and other weather conditions, which reduce the image contrast and color fidelity. Considering the characteristics of UAV itself, this paper proposes a new algorithm for dehazing UAV reconnaissance images based on layered scattering model. The algorithm starts with the atmosphere scattering model, using the imaging distance, squint angle and other metadata acquired by the UAV. Based on the original model, a layered scattering model for dehazing is proposed. Considering the relationship between wave-length and extinction coefficient, the airlight intensity and extinction coefficient are calculated in the model. Finally, the restored images are obtained. In addition, a classification method based on Bayesian classification is used for classifica- tion of haze concentration of the image, avoiding the trouble of manual working. Then we evaluate the haze removal results according to both the subjective and objective criteria. The experimental results show that compared with the origin image, the comprehensive index of the image restored by our method increases by 282.84%, which proves that our method can obtain excellent dehazing effect.展开更多
Based on the accurate analysis of cucumber disease images, the low level feature of images was effectively extracted, and Gaussian Mixture Model (GMM) for 8 common cucumber diseases was built. The parameters of GMM ...Based on the accurate analysis of cucumber disease images, the low level feature of images was effectively extracted, and Gaussian Mixture Model (GMM) for 8 common cucumber diseases was built. The parameters of GMM were estimated by the algorithm of expectation maximum (EM) to accurately charac- terize the feature distribution of 8 cucumber diseases, thus increased the correct identification of cucumber diseases and accurate grasping of damage conditions, and provided basis for achievement of real-time and accurate prediction of cucumber diseases.展开更多
Fused deposition modelling(FDM), a widely used rapid prototyping process, is a promising technique in manufacturing engineering. In this work, a method for characterizing elastic constants of FDM-fabricated materials ...Fused deposition modelling(FDM), a widely used rapid prototyping process, is a promising technique in manufacturing engineering. In this work, a method for characterizing elastic constants of FDM-fabricated materials is proposed. First of all, according to the manufacturing process of FDM, orthotropic constitutive model is used to describe the mechanical behavior. Then the virtual fields method(VFM) is applied to characterize all the mechanical parameters(Q, Q, Q, Q) using the full-field strain,which is measured by digital image correlation(DIC). Since the principal axis of the FDM-fabricated structure is sometimes unknown due to the complexity of the manufacturing process, a disk in diametrical compression is used as the load configuration so that the loading angle can be changed conveniently. To verify the feasibility of the proposed method, finite element method(FEM) simulation is conducted to obtain the strain field of the disk. The simulation results show that higher accuracy can be achieved when the loading angle is close to 30?. Finally, a disk fabricated by FDM was used for the experiment. By rotating the disk, several tests with different loading angles were conducted. To determine the position of the principal axis in each test, two groups of parameters(Q, Q, Q, Q) are calculated by two different groups of virtual fields. Then the corresponding loading angle can be determined by minimizing the deviation between two groups of the parameters. After that, the four constants(Q, Q, Q, Q) were determined from the test with an angle of 27?.展开更多
Mixture model based image segmentation method, which assumes that image pixels are independent and do not consider the position relationship between pixels, is not robust to noise and usually leads to misclassificatio...Mixture model based image segmentation method, which assumes that image pixels are independent and do not consider the position relationship between pixels, is not robust to noise and usually leads to misclassification. A new segmentation method, called multi-resolution Ganssian mixture model method, is proposed. First, an image pyramid is constructed and son-father link relationship is built between each level of pyramid. Then the mixture model segmentation method is applied to the top level. The segmentation result on the top level is passed top-down to the bottom level according to the son-father link relationship between levels. The proposed method considers not only local but also global information of image, it overcomes the effect of noise and can obtain better segmentation result. Experimental result demonstrates its effectiveness.展开更多
he objective of the research is to develop a fast procedure for segmenting typical videophone images. In this paper, a new approach to color image segmentation based on HSI(Hue, Saturation, Intensity) color model is r...he objective of the research is to develop a fast procedure for segmenting typical videophone images. In this paper, a new approach to color image segmentation based on HSI(Hue, Saturation, Intensity) color model is reported. It is in contrast to the conventional approaches by using the three components of HSI color model in succession. This strategy makes the segmentation procedure much fast and effective. Experimental results with typical “headandshoulders” real images taken from videophone sequences show that the new appproach can fulfill the application requirements.展开更多
The prediction of fundus fluorescein angiography(FFA)images from fundus structural images is a cutting-edge research topic in ophthalmological image processing.Prediction comprises estimating FFA from fundus camera im...The prediction of fundus fluorescein angiography(FFA)images from fundus structural images is a cutting-edge research topic in ophthalmological image processing.Prediction comprises estimating FFA from fundus camera imaging,single-phase FFA from scanning laser ophthalmoscopy(SLO),and three-phase FFA also from SLO.Although many deep learning models are available,a single model can only perform one or two of these prediction tasks.To accomplish three prediction tasks using a unified method,we propose a unified deep learning model for predicting FFA images from fundus structure images using a supervised generative adversarial network.The three prediction tasks are processed as follows:data preparation,network training under FFA supervision,and FFA image prediction from fundus structure images on a test set.By comparing the FFA images predicted by our model,pix2pix,and CycleGAN,we demonstrate the remarkable progress achieved by our proposal.The high performance of our model is validated in terms of the peak signal-to-noise ratio,structural similarity index,and mean squared error.展开更多
BACKGROUND Patients harboring gene mutations like KRAS,NRAS,and BRAF demonstrate highly variable responses to chemotherapy,posing challenges for treatment optimization.Multiparametric magnetic resonance imaging(MRI),w...BACKGROUND Patients harboring gene mutations like KRAS,NRAS,and BRAF demonstrate highly variable responses to chemotherapy,posing challenges for treatment optimization.Multiparametric magnetic resonance imaging(MRI),with its noninvasive capability to assess tumor characteristics in detail,has shown promise in evaluating treatment response and predicting therapeutic outcomes.This technology holds potential for guiding personalized treatment strategies tailored to individual patient profiles,enhancing the precision and effectiveness of colorectal cancer care.AIM To create a multiparametric MRI-based predictive model for assessing chemotherapy efficacy in colorectal cancer patients with gene mutations.METHODS This retrospective study was conducted in a tertiary hospital,analyzing 157 colorectal cancer patients with gene mutations treated between August 2022 and December 2023.Based on chemotherapy outcomes,the patients were categorized into favorable(n=60)and unfavorable(n=50)response groups.Univariate and multivariate logistic regression analyses were performed to identify independent predictors of chemotherapy efficacy.A predictive nomogram was constructed using significant variables,and its performance was assessed using the area under the receiver operating characteristic curve(AUC)in both training and validation sets.RESULTS Univariate analysis identified that tumor differentiation,T2 signal intensity ratio,tumor-to-anal margin distance,and MRI-detected lymph node metastasis as significantly associated with chemotherapy response(P<0.05).Multivariate Logistics regression confirmed these four parameters as independent predictors.The predictive model demonstrated strong discrimination,with an AUC of 0.938(sensitivity:86%;specificity:92%)in the training set,and 0.942(sensitivity:100%;specificity:83%)in the validation set.CONCLUSION We established and validated a multiparametric MRI-based model for predicting chemotherapy response in colorectal cancer patients with gene mutations.This model holds promise for guiding individualized treatment strategies.展开更多
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord...Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation.展开更多
We present a bidirectional reflection distribution function (BRDF) model for thermal coating surfaces based on a three-component reflection assumption, in which the specular reflection is given according to the micr...We present a bidirectional reflection distribution function (BRDF) model for thermal coating surfaces based on a three-component reflection assumption, in which the specular reflection is given according to the microfacet theory and Snell's law, the multiple reflection is considered Nth cosine distributed, and the volume scattering is uniformly distributed in reflection angles according to the experimental results. This model describes the reflection characteristics of thermal coating surfaces more completely and reasonably. Simulation and measurement results of two thermal coating samples SR107 and S781 are given to validate that this three-component model significantly improves the modeling accuracy for thermal coating surfaces compared with the existing BRDF models.展开更多
In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of th...In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of the space target inverse synthetic aperture radar(ISAR)image recognition model with ultra-lightweight and high accuracy.This method introduces the NAS method into the radar image recognition for the first time,which solves the time-consuming and labor-consuming problems in the artificial design of the space target ISAR image automatic recognition model(STIIARM).On this basis,the NAS model’s knowledge is transferred to the student model with lower computational complexity by the flow of the solution procedure(FSP)distillation method.Thus,the decline of recognition accuracy caused by the direct compression of model structural parameters can be effectively avoided,and the ultralightweight STIIARM can be obtained.In the method,the Inverted Linear Bottleneck(ILB)and Inverted Residual Block(IRB)are firstly taken as each block’s basic structure in CNN.And the expansion ratio,output filter size,number of IRBs,and convolution kernel size are set as the search parameters to construct a hierarchical decomposition search space.Then,the recognition accuracy and computational complexity are taken as the objective function and constraint conditions,respectively,and the global optimization model of the CNN architecture search is established.Next,the simulated annealing(SA)algorithm is used as the search strategy to search out the lightweight and high accuracy STIIARM directly.After that,based on the three principles of similar block structure,the same corresponding channel number,and the minimum computational complexity,the more lightweight student model is designed,and the FSP matrix pairing between the NAS model and student model is completed.Finally,by minimizing the loss between the FSP matrix pairs of the NAS model and student model,the student model’s weight adjustment is completed.Thus the ultra-lightweight and high accuracy STIIARM is obtained.The proposed method’s effectiveness is verified by the simulation experiments on the ISAR image dataset of five types of space targets.展开更多
文摘Low-light image enhancement is one of the most active research areas in the field of computer vision in recent years.In the low-light image enhancement process,loss of image details and increase in noise occur inevitably,influencing the quality of enhanced images.To alleviate this problem,a low-light image enhancement model called RetinexNet model based on Retinex theory was proposed in this study.The model was composed of an image decomposition module and a brightness enhancement module.In the decomposition module,a convolutional block attention module(CBAM)was incorporated to enhance feature representation capacity of the network,focusing on crucial features and suppressing irrelevant ones.A multifeature fusion denoising module was designed within the brightness enhancement module,circumventing the issue of feature loss during downsampling.The proposed model outperforms the existing algorithms in terms of PSNR and SSIM metrics on the publicly available datasets LOL and MIT-Adobe FiveK,as well as gives superior results in terms of NIQE metrics on the publicly available dataset LIME.
基金support from Strategic Project of Precision Surgery,Tsinghua UniversityInitiative Scientific Research Program,Institute for Intelligent Healthcare,Tsinghua University+5 种基金Tsinghua-Foshan Institute of Advanced ManufacturingNational Natural Science Foundation of China(61735016)Beijing Nova Program(20230484308)Young Elite Scientists Sponsorship Program by CAST(2023QNRC001)Youth Elite Program of Beijing Friendship Hospital(YYQCJH2022-9)Science and Technology Program of Beijing Tongzhou District(KJ2023CX012).
文摘Foundation models(FMs)have rapidly evolved and have achieved signicant accomplishments in computer vision tasks.Specically,the prompt mechanism conveniently allows users to integrate image prior information into the model,making it possible to apply models without any training.Therefore,we proposed a workflow based on foundation models and zero training to solve the tasks of photoacoustic(PA)image processing.We employed the Segment Anything Model(SAM)by setting simple prompts and integrating the model's outputs with prior knowledge of the imaged objects to accomplish various tasks,including:(1)removing the skin signal in three-dimensional PA image rendering;(2)dual speed-of-sound reconstruction,and(3)segmentation ofnger blood vessels.Through these demonstrations,we have concluded that FMs can be directly applied in PA imaging without the requirement for network design and training.This potentially allows for a hands-on,convenient approach to achieving efficient and accurate segmentation of PA images.This paper serves as a comprehensive tutorial,facilitating the mastery of the technique through the provision of code and sample datasets.
基金supported in part by the National Natural Science Foundation of China under Grants 62102450,62272478 and the Independent Research Project of a Certain Unit under Grant ZZKY20243127。
文摘Traditional steganography conceals information by modifying cover data,but steganalysis tools easily detect such alterations.While deep learning-based steganography often involves high training costs and complex deployment.Diffusion model-based methods face security vulnerabilities,particularly due to potential information leakage during generation.We propose a fixed neural network image steganography framework based on secure diffu-sion models to address these challenges.Unlike conventional approaches,our method minimizes cover modifications through neural network optimization,achieving superior steganographic performance in human visual perception and computer vision analyses.The cover images are generated in an anime style using state-of-the-art diffusion models,ensuring the transmitted images appear more natural.This study introduces fixed neural network technology that allows senders to transmit only minimal critical information alongside stego-images.Recipients can accurately reconstruct secret images using this compact data,significantly reducing transmission overhead compared to conventional deep steganography.Furthermore,our framework innovatively integrates ElGamal,a cryptographic algorithm,to protect critical information during transmission,enhancing overall system security and ensuring end-to-end information protection.This dual optimization of payload reduction and cryptographic reinforcement establishes a new paradigm for secure and efficient image steganography.
基金National Natural Science Foundation Major Project of China,Grant/Award Number:42192580Guangdong Province Key Construction Discipline Scientific Research Ability Promotion Project,Grant/Award Number:2022ZDJS015。
文摘In Hyperspectral Imaging(HSI),the detrimental influence of noise and distortions on data quality is profound,which has severely affected the following-on analytics and decisionmaking such as land mapping.This study presents an innovative framework for assessing HSI band quality and reconstructing the low-quality bands,based on the Prophet model.By introducing a comprehensive quality metric to start,the authors approach factors in both spatial and spectral characteristics across local and global scales.This metric effectively captures the intricate noise and distortions inherent in the HSI data.Subsequently,the authors employ the Prophet model to forecast information within the low-quality bands,leveraging insights from neighbouring high-quality bands.To validate the effectiveness of the authors’proposed model,extensive experiments on three publicly available uncorrected datasets are conducted.In a head-to-head comparison,the framework against six state-ofthe-art band reconstruction algorithms including three spectral methods,two spatialspectral methods and one deep learning method is benchmarked.The authors’experiments also delve into strategies for band selection based on quality metrics and the quality evaluation of the reconstructed bands.In addition,the authors assess the classification accuracy utilising these reconstructed bands.In various experiments,the results consistently affirm the efficacy of the authors’method in HSI quality assessment and band reconstruction.Notably,the authors’approach obviates the need for manually prefiltering of noisy bands.This comprehensive framework holds promise in addressing HSI data quality concerns whilst enhancing the overall utility of HSI.
基金supported by the Key Research and DevelopmentPlan in Shanxi Province of China(No.201803D421045)the Natural Science Foundation of Shanxi Province(No.2021-0302-123104)。
文摘This study introduces a novel method for reconstructing the 3D model of aluminum foam using cross-sectional sequence images.Combining precision milling and image acquisition,high-qual-ity cross-sectional images are obtained.Pore structures are segmented by the U-shaped network(U-Net)neural network integrated with the Canny edge detection operator,ensuring accurate pore delineation and edge extraction.The trained U-Net achieves 98.55%accuracy.The 2D data are superimposed and processed into 3D point clouds,enabling reconstruction of the pore structure and aluminum skeleton.Analysis of pore 01 shows the cross-sectional area initially increases,and then decreases with milling depth,with a uniform point distribution of 40 per layer.The reconstructed model exhibits a porosity of 77.5%,with section overlap rates between the 2D pore segmentation and the reconstructed model exceeding 96%,confirming high fidelity.Equivalent sphere diameters decrease with size,averaging 1.95 mm.Compression simulations reveal that the stress-strain curve of the 3D reconstruction model of aluminum foam exhibits fluctuations,and the stresses in the reconstruction model concentrate on thin cell walls,leading to localized deformations.This method accurately restores the aluminum foam’s complex internal structure,improving reconstruction preci-sion and simulation reliability.The approach offers a cost-efficient,high-precision technique for optimizing material performance in engineering applications.
基金supported by the National Natural Science Foundation of China(No.41471381)the General Project of Jiangsu Natural Science Foundation(No.BK20171410)the Major Scientific and Technological Achievements Cultivation Fund of Nanjing University of Aeronautics and Astronautics(No.1011-XBD23002)。
文摘Accurately predicting geomagnetic field is of great significance for space environment monitoring and space weather forecasting worldwide.This paper proposes a vision Transformer(ViT)hybrid model that leverages aurora images to predict local geomagnetic station component,breaking the spatial limitations of geomagnetic stations.Our method utilizes the ViT backbone model in combination with convolutional networks to capture both the large-scale spatial correlation and distinct local feature correlation between aurora images and geomagnetic station data.Essentially,the model comprises a visual geometry group(VGG)image feature extraction network,a ViT-based encoder network,and a regression prediction network.Our experimental findings indicate that global features of aurora images play a more substantial role in predicting geomagnetic data than local features.Specifically,the hybrid model achieves a 39.1%reduction in root mean square error compared to the VGG model,a 29.5%reduction compared to the ViT model and a 35.3%reduction relative to the residual network(ResNet)model.Moreover,the fitting accuracy of the model surpasses that of the VGG,ViT,and ResNet models by 2.14%1.58%,and 4.1%,respectively.
基金supported by the Fundamental Research Funds for the Central Universities(No.226-2024-00038),China.
文摘The internal structures of cells as the basic units of life are a major wonder of the microscopic world.Cellular images provide an intriguing window to help explore and understand the composition and function of these structures.Scientific imagery combined with artistic expression can further expand the potential of imaging in educational dissemination and interdisciplinary applications.
基金supported by Natural Science Foundation Programme of Gansu Province(No.24JRRA231)National Natural Science Foundation of China(No.62061023)Gansu Provincial Science and Technology Plan Key Research and Development Program Project(No.24YFFA024).
文摘Despite its remarkable performance on natural images,the segment anything model(SAM)lacks domain-specific information in medical imaging.and faces the challenge of losing local multi-scale information in the encoding phase.This paper presents a medical image segmentation model based on SAM with a local multi-scale feature encoder(LMSFE-SAM)to address the issues above.Firstly,based on the SAM,a local multi-scale feature encoder is introduced to improve the representation of features within local receptive field,thereby supplying the Vision Transformer(ViT)branch in SAM with enriched local multi-scale contextual information.At the same time,a multiaxial Hadamard product module(MHPM)is incorporated into the local multi-scale feature encoder in a lightweight manner to reduce the quadratic complexity and noise interference.Subsequently,a cross-branch balancing adapter is designed to balance the local and global information between the local multi-scale feature encoder and the ViT encoder in SAM.Finally,to obtain smaller input image size and to mitigate overlapping in patch embeddings,the size of the input image is reduced from 1024×1024 pixels to 256×256 pixels,and a multidimensional information adaptation component is developed,which includes feature adapters,position adapters,and channel-spatial adapters.This component effectively integrates the information from small-sized medical images into SAM,enhancing its suitability for clinical deployment.The proposed model demonstrates an average enhancement ranging from 0.0387 to 0.3191 across six objective evaluation metrics on BUSI,DDTI,and TN3K datasets compared to eight other representative image segmentation models.This significantly enhances the performance of the SAM on medical images,providing clinicians with a powerful tool in clinical diagnosis.
文摘Panoramic images, offering a 360-degree view, are essential in virtual reality(VR) and augmented reality(AR), enhancing realism with high-quality textures. However, acquiring complete and high-quality panoramic textures is challenging. This paper introduces a method using generative adversarial networks(GANs) and the contrastive language-image pretraining(CLIP) model to restore and control texture in panoramic images. The GAN model captures complex structures and maintains consistency, while CLIP enables fine-grained texture control via semantic text-image associations. GAN inversion optimizes latent codes for precise texture details. The resulting low dynamic range(LDR) images are converted to high dynamic range(HDR) using the Blender engine for seamless texture blending. Experimental results demonstrate the effectiveness and flexibility of this method in panoramic texture restoration and generation.
基金Supported by the National Natural Science Foundation of China(60505004,60773061)~~
文摘A new two-step framework is proposed for image segmentation. In the first step, the gray-value distribution of the given image is reshaped to have larger inter-class variance and less intra-class variance. In the sec- ond step, the discriminant-based methods or clustering-based methods are performed on the reformed distribution. It is focused on the typical clustering methods-Gaussian mixture model (GMM) and its variant to demonstrate the feasibility of the framework. Due to the independence of the first step in its second step, it can be integrated into the pixel-based and the histogram-based methods to improve their segmentation quality. The experiments on artificial and real images show that the framework can achieve effective and robust segmentation results.
基金supported by the National Natural Science Foundation of China(No.61450008)
文摘During the unmanned aerial vehicles (UAV) reconnaissance missions in the middle-low troposphere, the reconnaissance images are blurred and degraded due to the scattering process of aerosol under fog, haze and other weather conditions, which reduce the image contrast and color fidelity. Considering the characteristics of UAV itself, this paper proposes a new algorithm for dehazing UAV reconnaissance images based on layered scattering model. The algorithm starts with the atmosphere scattering model, using the imaging distance, squint angle and other metadata acquired by the UAV. Based on the original model, a layered scattering model for dehazing is proposed. Considering the relationship between wave-length and extinction coefficient, the airlight intensity and extinction coefficient are calculated in the model. Finally, the restored images are obtained. In addition, a classification method based on Bayesian classification is used for classifica- tion of haze concentration of the image, avoiding the trouble of manual working. Then we evaluate the haze removal results according to both the subjective and objective criteria. The experimental results show that compared with the origin image, the comprehensive index of the image restored by our method increases by 282.84%, which proves that our method can obtain excellent dehazing effect.
基金Supported by National Natural Science Foundation of China ( 60903066,0985244)Natural Science Foundation of Beijing City ( 4102049)+1 种基金 New Teacher Fund of Ministry of Education ( 20090009120006) Basic Scientific Research Expenses of Central College ( 20100008030)~~
文摘Based on the accurate analysis of cucumber disease images, the low level feature of images was effectively extracted, and Gaussian Mixture Model (GMM) for 8 common cucumber diseases was built. The parameters of GMM were estimated by the algorithm of expectation maximum (EM) to accurately charac- terize the feature distribution of 8 cucumber diseases, thus increased the correct identification of cucumber diseases and accurate grasping of damage conditions, and provided basis for achievement of real-time and accurate prediction of cucumber diseases.
基金the financial support from the National Natural Science Foundation of China (Grants 11672153, 11232008, and 11227801)
文摘Fused deposition modelling(FDM), a widely used rapid prototyping process, is a promising technique in manufacturing engineering. In this work, a method for characterizing elastic constants of FDM-fabricated materials is proposed. First of all, according to the manufacturing process of FDM, orthotropic constitutive model is used to describe the mechanical behavior. Then the virtual fields method(VFM) is applied to characterize all the mechanical parameters(Q, Q, Q, Q) using the full-field strain,which is measured by digital image correlation(DIC). Since the principal axis of the FDM-fabricated structure is sometimes unknown due to the complexity of the manufacturing process, a disk in diametrical compression is used as the load configuration so that the loading angle can be changed conveniently. To verify the feasibility of the proposed method, finite element method(FEM) simulation is conducted to obtain the strain field of the disk. The simulation results show that higher accuracy can be achieved when the loading angle is close to 30?. Finally, a disk fabricated by FDM was used for the experiment. By rotating the disk, several tests with different loading angles were conducted. To determine the position of the principal axis in each test, two groups of parameters(Q, Q, Q, Q) are calculated by two different groups of virtual fields. Then the corresponding loading angle can be determined by minimizing the deviation between two groups of the parameters. After that, the four constants(Q, Q, Q, Q) were determined from the test with an angle of 27?.
基金This project was supported by the National Natural Foundation of China (60404022) and the Foundation of Department ofEducation of Hebei Province (2002209).
文摘Mixture model based image segmentation method, which assumes that image pixels are independent and do not consider the position relationship between pixels, is not robust to noise and usually leads to misclassification. A new segmentation method, called multi-resolution Ganssian mixture model method, is proposed. First, an image pyramid is constructed and son-father link relationship is built between each level of pyramid. Then the mixture model segmentation method is applied to the top level. The segmentation result on the top level is passed top-down to the bottom level according to the son-father link relationship between levels. The proposed method considers not only local but also global information of image, it overcomes the effect of noise and can obtain better segmentation result. Experimental result demonstrates its effectiveness.
文摘he objective of the research is to develop a fast procedure for segmenting typical videophone images. In this paper, a new approach to color image segmentation based on HSI(Hue, Saturation, Intensity) color model is reported. It is in contrast to the conventional approaches by using the three components of HSI color model in succession. This strategy makes the segmentation procedure much fast and effective. Experimental results with typical “headandshoulders” real images taken from videophone sequences show that the new appproach can fulfill the application requirements.
基金supported in part by the Gusu Innovation and Entrepreneurship Leading Talents in Suzhou City,grant numbers ZXL2021425 and ZXL2022476Doctor of Innovation and Entrepreneurship Program in Jiangsu Province,grant number JSSCBS20211440+6 种基金Jiangsu Province Key R&D Program,grant number BE2019682Natural Science Foundation of Jiangsu Province,grant number BK20200214National Key R&D Program of China,grant number 2017YFB0403701National Natural Science Foundation of China,grant numbers 61605210,61675226,and 62075235Youth Innovation Promotion Association of Chinese Academy of Sciences,grant number 2019320Frontier Science Research Project of the Chinese Academy of Sciences,grant number QYZDB-SSW-JSC03Strategic Priority Research Program of the Chinese Academy of Sciences,grant number XDB02060000.
文摘The prediction of fundus fluorescein angiography(FFA)images from fundus structural images is a cutting-edge research topic in ophthalmological image processing.Prediction comprises estimating FFA from fundus camera imaging,single-phase FFA from scanning laser ophthalmoscopy(SLO),and three-phase FFA also from SLO.Although many deep learning models are available,a single model can only perform one or two of these prediction tasks.To accomplish three prediction tasks using a unified method,we propose a unified deep learning model for predicting FFA images from fundus structure images using a supervised generative adversarial network.The three prediction tasks are processed as follows:data preparation,network training under FFA supervision,and FFA image prediction from fundus structure images on a test set.By comparing the FFA images predicted by our model,pix2pix,and CycleGAN,we demonstrate the remarkable progress achieved by our proposal.The high performance of our model is validated in terms of the peak signal-to-noise ratio,structural similarity index,and mean squared error.
基金Supported by Shenzhen High-level Hospital Construction Fund.
文摘BACKGROUND Patients harboring gene mutations like KRAS,NRAS,and BRAF demonstrate highly variable responses to chemotherapy,posing challenges for treatment optimization.Multiparametric magnetic resonance imaging(MRI),with its noninvasive capability to assess tumor characteristics in detail,has shown promise in evaluating treatment response and predicting therapeutic outcomes.This technology holds potential for guiding personalized treatment strategies tailored to individual patient profiles,enhancing the precision and effectiveness of colorectal cancer care.AIM To create a multiparametric MRI-based predictive model for assessing chemotherapy efficacy in colorectal cancer patients with gene mutations.METHODS This retrospective study was conducted in a tertiary hospital,analyzing 157 colorectal cancer patients with gene mutations treated between August 2022 and December 2023.Based on chemotherapy outcomes,the patients were categorized into favorable(n=60)and unfavorable(n=50)response groups.Univariate and multivariate logistic regression analyses were performed to identify independent predictors of chemotherapy efficacy.A predictive nomogram was constructed using significant variables,and its performance was assessed using the area under the receiver operating characteristic curve(AUC)in both training and validation sets.RESULTS Univariate analysis identified that tumor differentiation,T2 signal intensity ratio,tumor-to-anal margin distance,and MRI-detected lymph node metastasis as significantly associated with chemotherapy response(P<0.05).Multivariate Logistics regression confirmed these four parameters as independent predictors.The predictive model demonstrated strong discrimination,with an AUC of 0.938(sensitivity:86%;specificity:92%)in the training set,and 0.942(sensitivity:100%;specificity:83%)in the validation set.CONCLUSION We established and validated a multiparametric MRI-based model for predicting chemotherapy response in colorectal cancer patients with gene mutations.This model holds promise for guiding individualized treatment strategies.
基金Natural Science Foundation of Zhejiang Province,Grant/Award Number:LY23F020025Science and Technology Commissioner Program of Huzhou,Grant/Award Number:2023GZ42Sichuan Provincial Science and Technology Support Program,Grant/Award Numbers:2023ZHCG0005,2023ZHCG0008。
文摘Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation.
文摘We present a bidirectional reflection distribution function (BRDF) model for thermal coating surfaces based on a three-component reflection assumption, in which the specular reflection is given according to the microfacet theory and Snell's law, the multiple reflection is considered Nth cosine distributed, and the volume scattering is uniformly distributed in reflection angles according to the experimental results. This model describes the reflection characteristics of thermal coating surfaces more completely and reasonably. Simulation and measurement results of two thermal coating samples SR107 and S781 are given to validate that this three-component model significantly improves the modeling accuracy for thermal coating surfaces compared with the existing BRDF models.
文摘In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of the space target inverse synthetic aperture radar(ISAR)image recognition model with ultra-lightweight and high accuracy.This method introduces the NAS method into the radar image recognition for the first time,which solves the time-consuming and labor-consuming problems in the artificial design of the space target ISAR image automatic recognition model(STIIARM).On this basis,the NAS model’s knowledge is transferred to the student model with lower computational complexity by the flow of the solution procedure(FSP)distillation method.Thus,the decline of recognition accuracy caused by the direct compression of model structural parameters can be effectively avoided,and the ultralightweight STIIARM can be obtained.In the method,the Inverted Linear Bottleneck(ILB)and Inverted Residual Block(IRB)are firstly taken as each block’s basic structure in CNN.And the expansion ratio,output filter size,number of IRBs,and convolution kernel size are set as the search parameters to construct a hierarchical decomposition search space.Then,the recognition accuracy and computational complexity are taken as the objective function and constraint conditions,respectively,and the global optimization model of the CNN architecture search is established.Next,the simulated annealing(SA)algorithm is used as the search strategy to search out the lightweight and high accuracy STIIARM directly.After that,based on the three principles of similar block structure,the same corresponding channel number,and the minimum computational complexity,the more lightweight student model is designed,and the FSP matrix pairing between the NAS model and student model is completed.Finally,by minimizing the loss between the FSP matrix pairs of the NAS model and student model,the student model’s weight adjustment is completed.Thus the ultra-lightweight and high accuracy STIIARM is obtained.The proposed method’s effectiveness is verified by the simulation experiments on the ISAR image dataset of five types of space targets.