Existing lithospheric velocity models exhibit similar structures typically associated with the first-order tectonic features,with dissimilarities due to different data and methods used in model generation.The quantifi...Existing lithospheric velocity models exhibit similar structures typically associated with the first-order tectonic features,with dissimilarities due to different data and methods used in model generation.The quantification of model structural similarity can help in interpreting the geophysical properties of Earth's interior and establishing unified models crucial in natural hazard assessment and resource exploration.Here we employ the complex wavelet structural similarity index measure(CW-SSIM)active in computer image processing to analyze the structural similarity of four lithospheric velocity models of Chinese mainland published in the past decade.We take advantage of this method in its multiscale definition and insensitivity to slight geometrical distortion like translation and scaling,which is particularly crucial in the structural similarity analysis of velocity models accounting for uncertainty and resolution.Our results show that the CW-SSIM values vary in different model pairs,horizontal locations,and depths.While variations in the inter-model CW-SSIM are partly owing to different databases in the model generation,the difference of tomography methods may significantly impact the similar structural features of models,such as the low similarities between the full-wave based FWEA18 and other three models in northeastern China.We finally suggest potential solutions for the next generation of tomographic modeling in different areas according to corresponding structural similarities of existing models.展开更多
It is well-known that classical quality measures,such as Mean Squared Error(MSE),Weighted Mean Squared Error(WMSE)or Peak Signal-to-Noise Ratio(PSNR),are not always corresponding with visual observations.Structural si...It is well-known that classical quality measures,such as Mean Squared Error(MSE),Weighted Mean Squared Error(WMSE)or Peak Signal-to-Noise Ratio(PSNR),are not always corresponding with visual observations.Structural similarity based image quality assessment was proposed under the assumption that the Human Visual System(HVS)is highly adapted for extracting structural information from an image.While the demand on high color quality increases in the media industry,color loss will make the visual quality different.In this paper,we proposed an improved quality assessment(QA)method by adding color comparison into the structural similarity(SSIM)measurement system for evaluating color image quality.Then we divided the task of similarity measurement into four comparisons:luminance,contrast,structure,and color.Experimental results show that the predicted quality scores of the proposed method are more effective and consistent with visual quality than the classical methods using five different distortion types of color image sets.展开更多
Automatic segmentation of ischemic stroke lesions from computed tomography(CT)images is of great significance for identifying and curing this life-threatening condition.However,in addition to the problem of low image ...Automatic segmentation of ischemic stroke lesions from computed tomography(CT)images is of great significance for identifying and curing this life-threatening condition.However,in addition to the problem of low image contrast,it is also challenged by the complex changes in the appearance of the stroke area and the difficulty in obtaining image data.Considering that it is difficult to obtain stroke data and labels,a data enhancement algorithm for one-shot medical image segmentation based on data augmentation using learned transformation was proposed to increase the number of data sets for more accurate segmentation.A deep convolutional neural network based algorithm for stroke lesion segmentation,called structural similarity with light U-structure(USSL)Net,was proposed.We embedded a convolution module that combines switchable normalization,multi-scale convolution and dilated convolution in the network for better segmentation performance.Besides,considering the strong structural similarity between multi-modal stroke CT images,the USSL Net uses the correlation maximized structural similarity loss(SSL)function as the loss function to learn the varying shapes of the lesions.The experimental results show that our framework has achieved results in the following aspects.First,the data obtained by adding our data enhancement algorithm is better than the data directly segmented from the multi-modal image.Second,the performance of our network model is better than that of other models for stroke segmentation tasks.Third,the way SSL functioned as a loss function is more helpful to the improvement of segmentation accuracy than the cross-entropy loss function.展开更多
Objective video quality assessment plays a very important role in multimedia signal processing. Several extensions of the structural similarity (SSIM) index could not predict the quality of the video sequence effect...Objective video quality assessment plays a very important role in multimedia signal processing. Several extensions of the structural similarity (SSIM) index could not predict the quality of the video sequence effectively. In this paper we propose a structural similarity quality metric for videos based on a spatial-temporal visual attention model. This model acquires the motion attended region and the distortion attended region by computing the motion features and the distortion contrast. It mimics the visual attention shifting between the two attended regions and takes the burst of error into account by introducing the non-linear weighting fimctions to give a much higher weighting factor to the extremely damaged frames. The proposed metric based on the model renders the final object quality rating of the whole video sequence and is validated using the 50 Hz video sequences of Video Quality Experts Group Phase I test database.展开更多
Joint inversion is one of the most effective methods for reducing non-uniqueness for geophysical inversion.The current joint inversion methods can be divided into the structural consistency constraint and petrophysica...Joint inversion is one of the most effective methods for reducing non-uniqueness for geophysical inversion.The current joint inversion methods can be divided into the structural consistency constraint and petrophysical consistency constraint methods,which are mutually independent.Currently,there is a need for joint inversion methods that can comprehensively consider the structural consistency constraints and petrophysical consistency constraints.This paper develops the structural similarity index(SSIM)as a new structural and petrophysical consistency constraint for the joint inversion of gravity and vertical gradient data.The SSIM constraint is in the form of a fraction,which may have analytical singularities.Therefore,converting the fractional form to the subtractive form can solve the problem of analytic singularity and finally form a modified structural consistency index of the joint inversion,which enhances the stability of the SSIM constraint applied to the joint inversion.Compared to the reconstructed results from the cross-gradient inversion,the proposed method presents good performance and stability.The SSIM algorithm is a new joint inversion method for petrophysical and structural constraints.It can promote the consistency of the recovered models from the distribution and the structure of the physical property values.Then,applications to synthetic data illustrate that the algorithm proposed in this paper can well process the synthetic data and acquire good reconstructed results.展开更多
Objective evaluations of fused images are important in comparing the performance of different image fusion algorithms. This paper describes a structural similarity metric that does not use a reference image for image ...Objective evaluations of fused images are important in comparing the performance of different image fusion algorithms. This paper describes a structural similarity metric that does not use a reference image for image fusion evaluations. The metric is based on the universal image quality index and addresses not only the similarities between the input images and the fused image, but also the similarities among the input images. The evaluation process distinguishes between complementary information and redundant information using similarities among the input images. The metric uses the information classification to estimate how much structural similarity is preserved in the fused image. Tests demonstrate that the metric correlates well with subjective evaluations of the fused images.展开更多
Network embedding which aims to embed a given network into a low-dimensional vector space has been proved effective in various network analysis and mining tasks such as node classification,link prediction and network ...Network embedding which aims to embed a given network into a low-dimensional vector space has been proved effective in various network analysis and mining tasks such as node classification,link prediction and network visualization.The emerging network embedding methods have shifted of emphasis in utilizing mature deep learning models.The neural-network based network embedding has become a mainstream solution because of its high eficiency and capability of preserv-ing the nonlinear characteristics of the network.In this paper,we propose Adversarial Network Embedding using Structural Similarity(ANESS),a novel,versatile,low-complexity GAN-based network embedding model which utilizes the inherent vertex-to-vertex structural similarity attribute of the network.ANESS learns robustness and ffective vertex embeddings via a adversarial training procedure.Specifically,our method aims to exploit the strengths of generative adversarial networks in generating high-quality samples and utilize the structural similarity identity of vertexes to learn the latent representations of a network.Meanwhile,ANESS can dynamically update the strategy of generating samples during each training iteration.The extensive experiments have been conducted on the several benchmark network datasets,and empirical results demon-strate that ANESS significantly outperforms other state-of-theart network embedding methods.展开更多
This study proposes a new component of the composite loss function minimised during training of the Super-Resolution(SR)algorithms—the normalised structural similarity index loss LSSIMN,which has the potential to imp...This study proposes a new component of the composite loss function minimised during training of the Super-Resolution(SR)algorithms—the normalised structural similarity index loss LSSIMN,which has the potential to improve the natural appearance of reconstructed images.Deep learning-based super-resolution(SR)algorithms reconstruct high-resolution images from low-resolution inputs,offering a practical means to enhance image quality without requiring superior imaging hardware,which is particularly important in medical applications where diagnostic accuracy is critical.Although recent SR methods employing convolutional and generative adversarial networks achieve high pixel fidelity,visual artefacts may persist,making the design of the loss function during training essential for ensuring reliable and naturalistic image reconstruction.Our research shows on two models—SR and Invertible Rescaling Neural Network(IRN)—trained on multiple benchmark datasets that the function LSSIMN significantly contributes to the visual quality,preserving the structural fidelity on the reference datasets.The quantitative analysis of results while incorporating LSSIMN shows that including this loss function component has a mean 2.88%impact on the improvement of the final structural similarity of the reconstructed images in the validation set,in comparison to leaving it out and 0.218%in comparison when this component is non-normalised.展开更多
Biometric template protection is essential for finger-based authentication systems,as template tampering and adversarial attacks threaten the security.This paper proposes a DCT-based fragile watermarking scheme incorp...Biometric template protection is essential for finger-based authentication systems,as template tampering and adversarial attacks threaten the security.This paper proposes a DCT-based fragile watermarking scheme incorporating AI-based tamper detection to improve the integrity and robustness of finger authentication.The system was tested against NIST SD4 and Anguli fingerprint datasets,wherein 10,000 watermarked fingerprints were employed for training.The designed approach recorded a tamper detection rate of 98.3%,performing 3–6%better than current DCT,SVD,and DWT-based watermarking approaches.The false positive rate(≤1.2%)and false negative rate(≤1.5%)were much lower compared to previous research,which maintained high reliability for template change detection.The system showed real-time performance,averaging 12–18 ms processing time per template,and is thus suitable for real-world biometric authentication scenarios.Quality analysis of fingerprints indicated that NFIQ scores were enhanced from 2.07 to 1.81,reflecting improved minutiae clarity and ridge structure preservation.The approach also exhibited strong resistance to compression and noise distortions,with the improvements in PSNR being 2 dB(JPEG compression Q=80)and the SSIM values rising by 3%–5%under noise attacks.Comparative assessment demonstrated that training with NIST SD4 data greatly improved the ridge continuity and quality of fingerprints,resulting in better match scores(260–295)when tested against Bozorth3.Smaller batch sizes(batch=2)also resulted in improved ridge clarity,whereas larger batch sizes(batch=8)resulted in distortions.The DCNN-based tamper detection model supported real-time classification,which greatly minimized template exposure to adversarial attacks and synthetic fingerprint forgeries.Results demonstrate that fragile watermarking with AI indeed greatly enhances fingerprint security,providing privacy-preserving biometric authentication with high robustness,accuracy,and computational efficiency.展开更多
The structure similarity of secreted proteins in rice blast fungus Magnaporthe oryzae and its host Oryza sativa was analyzed. One thousand two hundred and forty one proteins were predicted as secreted proteins using f...The structure similarity of secreted proteins in rice blast fungus Magnaporthe oryzae and its host Oryza sativa was analyzed. One thousand two hundred and forty one proteins were predicted as secreted proteins using four algorithms based on 11 074 proteins in genome of M. oryzae. One hundred and forty six secreted proteins( 11. 8% of M. oryzae secretome) were aligned with 116 rice proteins( 0. 21% of 56 278 rice proteins) using BLAST search on rice genome. One hundred sixteen rice similar proteins participated in rice cell wall modification( cell wall associated enzymes) and signal transduction( proteases). These results imply that both cell wall involved proteins and signal transduction are probably hijacks pathway between host pants and pathogenic fungi. Because these proteins are highly conserved among fungi and plants,the express patterns of these protein coding genes during the interaction process are valuable to study in detail.展开更多
This article presents an adaptive attitude tracking controller with external disturbances and unknown inertia parameters. The similar skew-symmetric structure is extended from the autonomous case to the non-autonomous...This article presents an adaptive attitude tracking controller with external disturbances and unknown inertia parameters. The similar skew-symmetric structure is extended from the autonomous case to the non-autonomous case. The non-autonomous similar skew-symmetric is chosen as the desired structure of the closed loop system for attitude controller design. Based on this structure, a novel adaptive backstepping scheme is proposed to design the attitude controller by taking full advantage of the symmetry and the positive definiteness of the inertia matrix. The attitude tracking precision is enhanced by employing the linear parameterized form of the external disturbance torques. Simulation results demonstrate the effectiveness of the proposed attitude controller.展开更多
Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In ...Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In this paper,letter data obtained from images of invoices are denoised using a modified autoencoder based deep learning method.A stacked denoising autoencoder(SDAE)is implemented with two hidden layers each in encoder network and decoder network.In order to capture the most salient features of training samples,a undercomplete autoencoder is designed with non-linear encoder and decoder function.This autoencoder is regularized for denoising application using a combined loss function which considers both mean square error and binary cross entropy.A dataset consisting of 59,119 letter images,which contains both English alphabets(upper and lower case)and numbers(0 to 9)is prepared from many scanned invoices images and windows true type(.ttf)files,are used for training the neural network.Performance is analyzed in terms of Signal to Noise Ratio(SNR),Peak Signal to Noise Ratio(PSNR),Structural Similarity Index(SSIM)and Universal Image Quality Index(UQI)and compared with other filtering techniques like Nonlocal Means filter,Anisotropic diffusion filter,Gaussian filters and Mean filters.Denoising performance of proposed SDAE is compared with existing SDAE with single loss function in terms of SNR and PSNR values.Results show the superior performance of proposed SDAE method.展开更多
In the last years, digital image processing and analysis are used for computer assisted evaluation of semen quality with therapeutic goals or to estimate its fertility by means of spermatozoid motility and morphology....In the last years, digital image processing and analysis are used for computer assisted evaluation of semen quality with therapeutic goals or to estimate its fertility by means of spermatozoid motility and morphology. Sperm morphology is assessed routinely as part of standard laboratory analysis in the diagnosis of human male infertility. Nowadays assessments of sperm morphology are mostly done based on subjective criteria. In order to avoid subjectivity, numerous studies that incorporate image analysis techniques in the assessment of sperm morphology have been proposed. The primary step of all these methods is segmentation of sperm’s parts. In this paper, we have proposed a new method for segmentation of sperm’s Acrosome, Nucleus, Mid-piece and identification of sperm’s tail through some points which are placed on the sperm’s tail, accurately. These estimated points could be used to verify the morphological characteristics of sperm’s tail such as length, shape and etc. At first, sperm’s Acrosome, Nucleus and Mid-piece are segmented through a method based on a Bayesian classifier which utilizes the entropy based expectation–maximization (EM) algorithm and Markov random field (MRF) model to obtain and upgrade the class conditional probability density function (CCPDF) and the apriori probability of each class. Then, a pixel at the end of sperm’s Mid-piece, is selected as an initial point. To find other pixels which are placed on the sperm’s tail, structural similarity index (SSIM) is used in an iterative scheme. In order to stop the algorithm automatically at the end of sperm’s tail, local entropy is estimated and used as a feature to determine if a point is located on the sperm’s tail or not. To compare the performance of the proposed approach with those of previous approaches including manual segmentation, the Accuracy, Sensitivity and Specificity were calculated.展开更多
With the development of digital information technologies,robust watermarking framework is taken into real consideration as a challenging issue in the area of image processing,due to the large applicabilities and its u...With the development of digital information technologies,robust watermarking framework is taken into real consideration as a challenging issue in the area of image processing,due to the large applicabilities and its utilities in a number of academic and real environments.There are a wide range of solutions to provide image watermarking frameworks,while each one of them is attempted to address an efficient and applicable idea.In reality,the traditional techniques do not have sufficient merit to realize an accurate application.Due to the fact that the main idea behind the approach is organized based on contourlet representation,the only state-of-the-art materials that are investigated along with an integration of the aforementioned contourlet representation in line with watermarking framework are concentrated to be able to propose the novel and skilled technique.In a word,the main process of the proposed robust watermarking framework is organized to deal with both new embedding and de-embedding processes in the area of contourlet transform to generate watermarked image and the corresponding extracted logo image with high accuracy.In fact,the motivation of the approach is that the suggested complexity can be of novelty,which consists of the contourlet representation,the embedding and the corresponding de-embedding modules and the performance monitoring including an analysis of the watermarked image as well as the extracted logo image.There is also a scrambling module that is working in association with levels-directions decomposition in contourlet embedding mechanism,while a decision maker system is designed to deal with the appropriate number of sub-bands to be embedded in the presence of a series of simulated attacks.The required performance is tangibly considered through an integration of the peak signal-to-noise ratio and the structural similarity indices that are related to watermarked image.And the bit error rate and the normal correlation are considered that are related to the extracted logo analysis,as well.Subsequently,the outcomes are fully analyzed to be competitive with respect to the potential techniques in the image colour models including hue or tint in terms of their shade,saturation or amount of gray and their brightness via value or luminance and also hue,saturation and intensity representations,as long as the performance of the whole of channels are concentrated to be presented.The performance monitoring outcomes indicate that the proposed framework is of significance to be verified.展开更多
Authentication of the digital image has much attention for the digital revolution.Digital image authentication can be verified with image watermarking and image encryption schemes.These schemes are widely used to prot...Authentication of the digital image has much attention for the digital revolution.Digital image authentication can be verified with image watermarking and image encryption schemes.These schemes are widely used to protect images against forgery attacks,and they are useful for protecting copyright and rightful ownership.Depending on the desirable applications,several image encryption and watermarking schemes have been proposed to moderate this attention.This framework presents a new scheme that combines a Walsh Hadamard Transform(WHT)-based image watermarking scheme with an image encryption scheme based on Double Random Phase Encoding(DRPE).First,on the sender side,the secret medical image is encrypted using DRPE.Then the encrypted image is watermarking based on WHT.The combination between watermarking and encryption increases the security and robustness of transmitting an image.The performance evaluation of the proposed scheme is obtained by testing Structural Similarity Index(SSIM),Peak Signal-to-Noise Ratio(PSNR),Normalized cross-correlation(NC),and Feature Similarity Index(FSIM).展开更多
Advanced technology used for arithmetic computing application,comprises greater number of approximatemultipliers and approximate adders.Truncation and Rounding-based Scalable ApproximateMultiplier(TRSAM)distinguish a ...Advanced technology used for arithmetic computing application,comprises greater number of approximatemultipliers and approximate adders.Truncation and Rounding-based Scalable ApproximateMultiplier(TRSAM)distinguish a variety of modes based on height(h)and truncation(t)as TRSAM(h,t)in the architecture.This TRSAM operation produces higher absolute error in Least Significant Bit(LSB)data shift unit.A new scalable approximate multiplier approach that uses truncation and rounding TRSAM(3,7)is proposed to increase themultiplier accuracy.With the help of foremost one bit architecture,the proposed scalable approximate multiplier approach reduces the partial products.The proposed approximate TRSAM multiplier architecture gives better results in terms of area,delay,and power.The accuracy of 95.2%and the energy utilization of 24.6 nJ is observed in the proposed multiplier design.The proposed approach shows 0.11%,0.23%,and 0.24%less Mean Absolute Relative Error(MARE)when compared with the existing approach for the input of 8-bit,16-bit,and 32-bit respectively.It also shows 0.13%,0.19%,and 0.2%less Variance of Absolute Relative Error(VARE)when compared with the existing approach for the input of 8-bit,16-bit,and 32-bit respectively.The proposed approach is implemented with Field-Programmable Gate Array(FPGA)and shows the delay of 3.640,6.481,12.505,22.572,and 36.893 ns for the input of 8-bit,16-bit,32-bit,64-bit,and 128-bit respectively.The proposed approach is applied in digital filters designwhich shows the Peak-Signal-to-NoiseRatio(PSNR)of 25.05 dB and Structural Similarity Index Measure(SSIM)of 0.98 with 393 pJ energy consumptions when used in image application.The proposed approach is simulated with Xilinx and MATLAB and implemented with FPGA.展开更多
The existence of shadow leads to the degradation of the image qualities and the defect of ground object information.Shadow removal is therefore an essential research topic in image processing filed.The biggest challen...The existence of shadow leads to the degradation of the image qualities and the defect of ground object information.Shadow removal is therefore an essential research topic in image processing filed.The biggest challenge of shadow removal is how to restore the content of shadow areas correctly while removing the shadow in the image.Paired regions for shadow removal approach based on multi-features is proposed, in which shadow removal is only performed on related sunlit areas.Feature distance between regions is calculated to find the optimal paired regions with considering of multi-features(texture, gradient feature, etc.) comprehensively.Images in different scenes with peak signal-to-noise ratio(PSNR) and structural similarity(SSIM) evaluation indexes are chosen for experiments.The results are shown with six existing comparison methods by visual and quantitative assessments, which verified that the proposed method shows excellent shadow removal effect, the brightness, color of the removed shadow area, and the surrounding non-shadow area can be naturally fused.展开更多
We present the analysis of three independent and most widely used image smoothing techniques on a new fractional based convolution edge detector originally constructed by same authors for image edge analysis. The impl...We present the analysis of three independent and most widely used image smoothing techniques on a new fractional based convolution edge detector originally constructed by same authors for image edge analysis. The implementation was done using only Gaussian function as its smoothing function based on predefined assumptions and therefore did not scale well for some types of edges and noise. The experiments conducted on this mask using known images with realistic geometry suggested the need for image smoothing adaptation to obtain a more optimal performance. In this paper, we use the structural similarity index measure and show that the adaptation technique for choosing smoothing function has significant advantages over a single function implementation. The new adaptive fractional based convolution mask can smoothly find edges of various types in detail quite significantly. The method can now trap both local discontinuities in intensity and its derivatives as well as locating Dirac edges.展开更多
In recent years,enhancement of underwater images is a challenging task,which is gaining priority since the human eye cannot perceive images under water.The significant details underwater are not clearly captured using...In recent years,enhancement of underwater images is a challenging task,which is gaining priority since the human eye cannot perceive images under water.The significant details underwater are not clearly captured using the conventional image acquisition techniques,and also they are expensive.Hence,the quality of the image processing algorithms can be enhanced in the absence of costly and reliable acquisition techniques.Traditional algorithms have certain limitations in the case of these images with varying degrees of fuzziness and color deviation.In the proposed model,the authors used a deep learning model for underwater image enhancement.First,the original image is pre-processed by the white balance algorithm for colour correction and the contrast of the image is improved using the contrast enhancement technique.Next,the pre-processed image is given to the MIRNet for enhancement.MIRNet is a deep learning framework that can be used to enhance the low-light level images.The enhanced image quality is measured using peak signal-to-noise ratio(PSNR),root mean square error(RMSE),and structural similarity index(SSIM)parameters.展开更多
Detection of wood plate surface defects using image processing is a complicated problem in the forest industry as the image of the wood surface contains different kinds of defects. In order to obtain complete defect i...Detection of wood plate surface defects using image processing is a complicated problem in the forest industry as the image of the wood surface contains different kinds of defects. In order to obtain complete defect images, we used convex optimization(CO) with different weights as a pretreatment method for smoothing and the Otsu segmentation method to obtain the target defect area images. Structural similarity(SSIM) results between original image and defect image were calculated to evaluate the performance of segmentation with different convex optimization weights. The geometric and intensity features of defects were extracted before constructing a classification and regression tree(CART) classifier. The average accuracy of the classifier is 94.1% with four types of defects on Xylosma congestum wood plate surface: pinhole, crack,live knot and dead knot. Experimental results showed that CO can save the edge of target defects maximally, SSIM can select the appropriate weight for CO, and the CART classifier appears to have the advantages of good adaptability and high classification accuracy.展开更多
基金supported by the National Natural Science Foundation of China(Nos.42174063,92155307,41976046)Guangdong Provincial Key Laboratory of Geophysical High-resolution Imaging Technology under(No.2022B1212010002)Project for introduced Talents Team of Southern Marine Science and Engineering Guangdong(Guangzhou)(No.GML2019ZD0203)。
文摘Existing lithospheric velocity models exhibit similar structures typically associated with the first-order tectonic features,with dissimilarities due to different data and methods used in model generation.The quantification of model structural similarity can help in interpreting the geophysical properties of Earth's interior and establishing unified models crucial in natural hazard assessment and resource exploration.Here we employ the complex wavelet structural similarity index measure(CW-SSIM)active in computer image processing to analyze the structural similarity of four lithospheric velocity models of Chinese mainland published in the past decade.We take advantage of this method in its multiscale definition and insensitivity to slight geometrical distortion like translation and scaling,which is particularly crucial in the structural similarity analysis of velocity models accounting for uncertainty and resolution.Our results show that the CW-SSIM values vary in different model pairs,horizontal locations,and depths.While variations in the inter-model CW-SSIM are partly owing to different databases in the model generation,the difference of tomography methods may significantly impact the similar structural features of models,such as the low similarities between the full-wave based FWEA18 and other three models in northeastern China.We finally suggest potential solutions for the next generation of tomographic modeling in different areas according to corresponding structural similarities of existing models.
文摘It is well-known that classical quality measures,such as Mean Squared Error(MSE),Weighted Mean Squared Error(WMSE)or Peak Signal-to-Noise Ratio(PSNR),are not always corresponding with visual observations.Structural similarity based image quality assessment was proposed under the assumption that the Human Visual System(HVS)is highly adapted for extracting structural information from an image.While the demand on high color quality increases in the media industry,color loss will make the visual quality different.In this paper,we proposed an improved quality assessment(QA)method by adding color comparison into the structural similarity(SSIM)measurement system for evaluating color image quality.Then we divided the task of similarity measurement into four comparisons:luminance,contrast,structure,and color.Experimental results show that the predicted quality scores of the proposed method are more effective and consistent with visual quality than the classical methods using five different distortion types of color image sets.
基金the National Natural Science Foundation of China(No.61976091)。
文摘Automatic segmentation of ischemic stroke lesions from computed tomography(CT)images is of great significance for identifying and curing this life-threatening condition.However,in addition to the problem of low image contrast,it is also challenged by the complex changes in the appearance of the stroke area and the difficulty in obtaining image data.Considering that it is difficult to obtain stroke data and labels,a data enhancement algorithm for one-shot medical image segmentation based on data augmentation using learned transformation was proposed to increase the number of data sets for more accurate segmentation.A deep convolutional neural network based algorithm for stroke lesion segmentation,called structural similarity with light U-structure(USSL)Net,was proposed.We embedded a convolution module that combines switchable normalization,multi-scale convolution and dilated convolution in the network for better segmentation performance.Besides,considering the strong structural similarity between multi-modal stroke CT images,the USSL Net uses the correlation maximized structural similarity loss(SSL)function as the loss function to learn the varying shapes of the lesions.The experimental results show that our framework has achieved results in the following aspects.First,the data obtained by adding our data enhancement algorithm is better than the data directly segmented from the multi-modal image.Second,the performance of our network model is better than that of other models for stroke segmentation tasks.Third,the way SSL functioned as a loss function is more helpful to the improvement of segmentation accuracy than the cross-entropy loss function.
文摘Objective video quality assessment plays a very important role in multimedia signal processing. Several extensions of the structural similarity (SSIM) index could not predict the quality of the video sequence effectively. In this paper we propose a structural similarity quality metric for videos based on a spatial-temporal visual attention model. This model acquires the motion attended region and the distortion attended region by computing the motion features and the distortion contrast. It mimics the visual attention shifting between the two attended regions and takes the burst of error into account by introducing the non-linear weighting fimctions to give a much higher weighting factor to the extremely damaged frames. The proposed metric based on the model renders the final object quality rating of the whole video sequence and is validated using the 50 Hz video sequences of Video Quality Experts Group Phase I test database.
基金supported by the National Key Research and Development Program(Grant No.2021YFA0716100)the National Key Research and Development Program of China Project(Grant No.2018YFC0603502)+1 种基金the Henan Youth Science Fund Program(Grant No.212300410105)the provincial key R&D and promotion special project of Henan Province(Grant No.222102320279).
文摘Joint inversion is one of the most effective methods for reducing non-uniqueness for geophysical inversion.The current joint inversion methods can be divided into the structural consistency constraint and petrophysical consistency constraint methods,which are mutually independent.Currently,there is a need for joint inversion methods that can comprehensively consider the structural consistency constraints and petrophysical consistency constraints.This paper develops the structural similarity index(SSIM)as a new structural and petrophysical consistency constraint for the joint inversion of gravity and vertical gradient data.The SSIM constraint is in the form of a fraction,which may have analytical singularities.Therefore,converting the fractional form to the subtractive form can solve the problem of analytic singularity and finally form a modified structural consistency index of the joint inversion,which enhances the stability of the SSIM constraint applied to the joint inversion.Compared to the reconstructed results from the cross-gradient inversion,the proposed method presents good performance and stability.The SSIM algorithm is a new joint inversion method for petrophysical and structural constraints.It can promote the consistency of the recovered models from the distribution and the structure of the physical property values.Then,applications to synthetic data illustrate that the algorithm proposed in this paper can well process the synthetic data and acquire good reconstructed results.
基金Supported by the National Natural Science Foundation of China (No.60673024)
文摘Objective evaluations of fused images are important in comparing the performance of different image fusion algorithms. This paper describes a structural similarity metric that does not use a reference image for image fusion evaluations. The metric is based on the universal image quality index and addresses not only the similarities between the input images and the fused image, but also the similarities among the input images. The evaluation process distinguishes between complementary information and redundant information using similarities among the input images. The metric uses the information classification to estimate how much structural similarity is preserved in the fused image. Tests demonstrate that the metric correlates well with subjective evaluations of the fused images.
基金This work was supported by the National Key R&D Program of China(2018YFB1003404)the National Natural Science Foundation of China(Grant Nos.61872070,U1811261)+1 种基金the Fundamental Research Funds for the Central Universities(N171605001)Liao Ning Revitalization Talents Program(XLYC1807158).
文摘Network embedding which aims to embed a given network into a low-dimensional vector space has been proved effective in various network analysis and mining tasks such as node classification,link prediction and network visualization.The emerging network embedding methods have shifted of emphasis in utilizing mature deep learning models.The neural-network based network embedding has become a mainstream solution because of its high eficiency and capability of preserv-ing the nonlinear characteristics of the network.In this paper,we propose Adversarial Network Embedding using Structural Similarity(ANESS),a novel,versatile,low-complexity GAN-based network embedding model which utilizes the inherent vertex-to-vertex structural similarity attribute of the network.ANESS learns robustness and ffective vertex embeddings via a adversarial training procedure.Specifically,our method aims to exploit the strengths of generative adversarial networks in generating high-quality samples and utilize the structural similarity identity of vertexes to learn the latent representations of a network.Meanwhile,ANESS can dynamically update the strategy of generating samples during each training iteration.The extensive experiments have been conducted on the several benchmark network datasets,and empirical results demon-strate that ANESS significantly outperforms other state-of-theart network embedding methods.
基金support from the following institutional grant.Internal Grant Agency of the Faculty of Economics and Management,Czech University of Life Sciences Prague,grant no.2023A0004(https://iga.pef.czu.cz/,accessed on 6 June 2025).
文摘This study proposes a new component of the composite loss function minimised during training of the Super-Resolution(SR)algorithms—the normalised structural similarity index loss LSSIMN,which has the potential to improve the natural appearance of reconstructed images.Deep learning-based super-resolution(SR)algorithms reconstruct high-resolution images from low-resolution inputs,offering a practical means to enhance image quality without requiring superior imaging hardware,which is particularly important in medical applications where diagnostic accuracy is critical.Although recent SR methods employing convolutional and generative adversarial networks achieve high pixel fidelity,visual artefacts may persist,making the design of the loss function during training essential for ensuring reliable and naturalistic image reconstruction.Our research shows on two models—SR and Invertible Rescaling Neural Network(IRN)—trained on multiple benchmark datasets that the function LSSIMN significantly contributes to the visual quality,preserving the structural fidelity on the reference datasets.The quantitative analysis of results while incorporating LSSIMN shows that including this loss function component has a mean 2.88%impact on the improvement of the final structural similarity of the reconstructed images in the validation set,in comparison to leaving it out and 0.218%in comparison when this component is non-normalised.
文摘Biometric template protection is essential for finger-based authentication systems,as template tampering and adversarial attacks threaten the security.This paper proposes a DCT-based fragile watermarking scheme incorporating AI-based tamper detection to improve the integrity and robustness of finger authentication.The system was tested against NIST SD4 and Anguli fingerprint datasets,wherein 10,000 watermarked fingerprints were employed for training.The designed approach recorded a tamper detection rate of 98.3%,performing 3–6%better than current DCT,SVD,and DWT-based watermarking approaches.The false positive rate(≤1.2%)and false negative rate(≤1.5%)were much lower compared to previous research,which maintained high reliability for template change detection.The system showed real-time performance,averaging 12–18 ms processing time per template,and is thus suitable for real-world biometric authentication scenarios.Quality analysis of fingerprints indicated that NFIQ scores were enhanced from 2.07 to 1.81,reflecting improved minutiae clarity and ridge structure preservation.The approach also exhibited strong resistance to compression and noise distortions,with the improvements in PSNR being 2 dB(JPEG compression Q=80)and the SSIM values rising by 3%–5%under noise attacks.Comparative assessment demonstrated that training with NIST SD4 data greatly improved the ridge continuity and quality of fingerprints,resulting in better match scores(260–295)when tested against Bozorth3.Smaller batch sizes(batch=2)also resulted in improved ridge clarity,whereas larger batch sizes(batch=8)resulted in distortions.The DCNN-based tamper detection model supported real-time classification,which greatly minimized template exposure to adversarial attacks and synthetic fingerprint forgeries.Results demonstrate that fragile watermarking with AI indeed greatly enhances fingerprint security,providing privacy-preserving biometric authentication with high robustness,accuracy,and computational efficiency.
基金Supported by National Basic Research Program(2012CB722901)Academic Award for Up-and-coming Doctoral Candidates of Yunnan ProvinceYunnan Agricultural University Innovation Foundation for Postgraduate
文摘The structure similarity of secreted proteins in rice blast fungus Magnaporthe oryzae and its host Oryza sativa was analyzed. One thousand two hundred and forty one proteins were predicted as secreted proteins using four algorithms based on 11 074 proteins in genome of M. oryzae. One hundred and forty six secreted proteins( 11. 8% of M. oryzae secretome) were aligned with 116 rice proteins( 0. 21% of 56 278 rice proteins) using BLAST search on rice genome. One hundred sixteen rice similar proteins participated in rice cell wall modification( cell wall associated enzymes) and signal transduction( proteases). These results imply that both cell wall involved proteins and signal transduction are probably hijacks pathway between host pants and pathogenic fungi. Because these proteins are highly conserved among fungi and plants,the express patterns of these protein coding genes during the interaction process are valuable to study in detail.
文摘This article presents an adaptive attitude tracking controller with external disturbances and unknown inertia parameters. The similar skew-symmetric structure is extended from the autonomous case to the non-autonomous case. The non-autonomous similar skew-symmetric is chosen as the desired structure of the closed loop system for attitude controller design. Based on this structure, a novel adaptive backstepping scheme is proposed to design the attitude controller by taking full advantage of the symmetry and the positive definiteness of the inertia matrix. The attitude tracking precision is enhanced by employing the linear parameterized form of the external disturbance torques. Simulation results demonstrate the effectiveness of the proposed attitude controller.
文摘Invoice document digitization is crucial for efficient management in industries.The scanned invoice image is often noisy due to various reasons.This affects the OCR(optical character recognition)detection accuracy.In this paper,letter data obtained from images of invoices are denoised using a modified autoencoder based deep learning method.A stacked denoising autoencoder(SDAE)is implemented with two hidden layers each in encoder network and decoder network.In order to capture the most salient features of training samples,a undercomplete autoencoder is designed with non-linear encoder and decoder function.This autoencoder is regularized for denoising application using a combined loss function which considers both mean square error and binary cross entropy.A dataset consisting of 59,119 letter images,which contains both English alphabets(upper and lower case)and numbers(0 to 9)is prepared from many scanned invoices images and windows true type(.ttf)files,are used for training the neural network.Performance is analyzed in terms of Signal to Noise Ratio(SNR),Peak Signal to Noise Ratio(PSNR),Structural Similarity Index(SSIM)and Universal Image Quality Index(UQI)and compared with other filtering techniques like Nonlocal Means filter,Anisotropic diffusion filter,Gaussian filters and Mean filters.Denoising performance of proposed SDAE is compared with existing SDAE with single loss function in terms of SNR and PSNR values.Results show the superior performance of proposed SDAE method.
文摘In the last years, digital image processing and analysis are used for computer assisted evaluation of semen quality with therapeutic goals or to estimate its fertility by means of spermatozoid motility and morphology. Sperm morphology is assessed routinely as part of standard laboratory analysis in the diagnosis of human male infertility. Nowadays assessments of sperm morphology are mostly done based on subjective criteria. In order to avoid subjectivity, numerous studies that incorporate image analysis techniques in the assessment of sperm morphology have been proposed. The primary step of all these methods is segmentation of sperm’s parts. In this paper, we have proposed a new method for segmentation of sperm’s Acrosome, Nucleus, Mid-piece and identification of sperm’s tail through some points which are placed on the sperm’s tail, accurately. These estimated points could be used to verify the morphological characteristics of sperm’s tail such as length, shape and etc. At first, sperm’s Acrosome, Nucleus and Mid-piece are segmented through a method based on a Bayesian classifier which utilizes the entropy based expectation–maximization (EM) algorithm and Markov random field (MRF) model to obtain and upgrade the class conditional probability density function (CCPDF) and the apriori probability of each class. Then, a pixel at the end of sperm’s Mid-piece, is selected as an initial point. To find other pixels which are placed on the sperm’s tail, structural similarity index (SSIM) is used in an iterative scheme. In order to stop the algorithm automatically at the end of sperm’s tail, local entropy is estimated and used as a feature to determine if a point is located on the sperm’s tail or not. To compare the performance of the proposed approach with those of previous approaches including manual segmentation, the Accuracy, Sensitivity and Specificity were calculated.
文摘With the development of digital information technologies,robust watermarking framework is taken into real consideration as a challenging issue in the area of image processing,due to the large applicabilities and its utilities in a number of academic and real environments.There are a wide range of solutions to provide image watermarking frameworks,while each one of them is attempted to address an efficient and applicable idea.In reality,the traditional techniques do not have sufficient merit to realize an accurate application.Due to the fact that the main idea behind the approach is organized based on contourlet representation,the only state-of-the-art materials that are investigated along with an integration of the aforementioned contourlet representation in line with watermarking framework are concentrated to be able to propose the novel and skilled technique.In a word,the main process of the proposed robust watermarking framework is organized to deal with both new embedding and de-embedding processes in the area of contourlet transform to generate watermarked image and the corresponding extracted logo image with high accuracy.In fact,the motivation of the approach is that the suggested complexity can be of novelty,which consists of the contourlet representation,the embedding and the corresponding de-embedding modules and the performance monitoring including an analysis of the watermarked image as well as the extracted logo image.There is also a scrambling module that is working in association with levels-directions decomposition in contourlet embedding mechanism,while a decision maker system is designed to deal with the appropriate number of sub-bands to be embedded in the presence of a series of simulated attacks.The required performance is tangibly considered through an integration of the peak signal-to-noise ratio and the structural similarity indices that are related to watermarked image.And the bit error rate and the normal correlation are considered that are related to the extracted logo analysis,as well.Subsequently,the outcomes are fully analyzed to be competitive with respect to the potential techniques in the image colour models including hue or tint in terms of their shade,saturation or amount of gray and their brightness via value or luminance and also hue,saturation and intensity representations,as long as the performance of the whole of channels are concentrated to be presented.The performance monitoring outcomes indicate that the proposed framework is of significance to be verified.
基金Princess Nourah bint Abdulrahman University Researchers Supporting ProjectNumber (PNURSP2022R66), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
文摘Authentication of the digital image has much attention for the digital revolution.Digital image authentication can be verified with image watermarking and image encryption schemes.These schemes are widely used to protect images against forgery attacks,and they are useful for protecting copyright and rightful ownership.Depending on the desirable applications,several image encryption and watermarking schemes have been proposed to moderate this attention.This framework presents a new scheme that combines a Walsh Hadamard Transform(WHT)-based image watermarking scheme with an image encryption scheme based on Double Random Phase Encoding(DRPE).First,on the sender side,the secret medical image is encrypted using DRPE.Then the encrypted image is watermarking based on WHT.The combination between watermarking and encryption increases the security and robustness of transmitting an image.The performance evaluation of the proposed scheme is obtained by testing Structural Similarity Index(SSIM),Peak Signal-to-Noise Ratio(PSNR),Normalized cross-correlation(NC),and Feature Similarity Index(FSIM).
文摘Advanced technology used for arithmetic computing application,comprises greater number of approximatemultipliers and approximate adders.Truncation and Rounding-based Scalable ApproximateMultiplier(TRSAM)distinguish a variety of modes based on height(h)and truncation(t)as TRSAM(h,t)in the architecture.This TRSAM operation produces higher absolute error in Least Significant Bit(LSB)data shift unit.A new scalable approximate multiplier approach that uses truncation and rounding TRSAM(3,7)is proposed to increase themultiplier accuracy.With the help of foremost one bit architecture,the proposed scalable approximate multiplier approach reduces the partial products.The proposed approximate TRSAM multiplier architecture gives better results in terms of area,delay,and power.The accuracy of 95.2%and the energy utilization of 24.6 nJ is observed in the proposed multiplier design.The proposed approach shows 0.11%,0.23%,and 0.24%less Mean Absolute Relative Error(MARE)when compared with the existing approach for the input of 8-bit,16-bit,and 32-bit respectively.It also shows 0.13%,0.19%,and 0.2%less Variance of Absolute Relative Error(VARE)when compared with the existing approach for the input of 8-bit,16-bit,and 32-bit respectively.The proposed approach is implemented with Field-Programmable Gate Array(FPGA)and shows the delay of 3.640,6.481,12.505,22.572,and 36.893 ns for the input of 8-bit,16-bit,32-bit,64-bit,and 128-bit respectively.The proposed approach is applied in digital filters designwhich shows the Peak-Signal-to-NoiseRatio(PSNR)of 25.05 dB and Structural Similarity Index Measure(SSIM)of 0.98 with 393 pJ energy consumptions when used in image application.The proposed approach is simulated with Xilinx and MATLAB and implemented with FPGA.
基金Supported by the National Natural Science Foundation of China (No. 41971356, 41701446)the Open Fund of Key Laboratory of Urban Land Resources Monitoring and Simulation,Ministry of Natural Resources (No. KF-2022-07-001)。
文摘The existence of shadow leads to the degradation of the image qualities and the defect of ground object information.Shadow removal is therefore an essential research topic in image processing filed.The biggest challenge of shadow removal is how to restore the content of shadow areas correctly while removing the shadow in the image.Paired regions for shadow removal approach based on multi-features is proposed, in which shadow removal is only performed on related sunlit areas.Feature distance between regions is calculated to find the optimal paired regions with considering of multi-features(texture, gradient feature, etc.) comprehensively.Images in different scenes with peak signal-to-noise ratio(PSNR) and structural similarity(SSIM) evaluation indexes are chosen for experiments.The results are shown with six existing comparison methods by visual and quantitative assessments, which verified that the proposed method shows excellent shadow removal effect, the brightness, color of the removed shadow area, and the surrounding non-shadow area can be naturally fused.
文摘We present the analysis of three independent and most widely used image smoothing techniques on a new fractional based convolution edge detector originally constructed by same authors for image edge analysis. The implementation was done using only Gaussian function as its smoothing function based on predefined assumptions and therefore did not scale well for some types of edges and noise. The experiments conducted on this mask using known images with realistic geometry suggested the need for image smoothing adaptation to obtain a more optimal performance. In this paper, we use the structural similarity index measure and show that the adaptation technique for choosing smoothing function has significant advantages over a single function implementation. The new adaptive fractional based convolution mask can smoothly find edges of various types in detail quite significantly. The method can now trap both local discontinuities in intensity and its derivatives as well as locating Dirac edges.
文摘In recent years,enhancement of underwater images is a challenging task,which is gaining priority since the human eye cannot perceive images under water.The significant details underwater are not clearly captured using the conventional image acquisition techniques,and also they are expensive.Hence,the quality of the image processing algorithms can be enhanced in the absence of costly and reliable acquisition techniques.Traditional algorithms have certain limitations in the case of these images with varying degrees of fuzziness and color deviation.In the proposed model,the authors used a deep learning model for underwater image enhancement.First,the original image is pre-processed by the white balance algorithm for colour correction and the contrast of the image is improved using the contrast enhancement technique.Next,the pre-processed image is given to the MIRNet for enhancement.MIRNet is a deep learning framework that can be used to enhance the low-light level images.The enhanced image quality is measured using peak signal-to-noise ratio(PSNR),root mean square error(RMSE),and structural similarity index(SSIM)parameters.
基金supported by the Fund of Forestry 948project(2015-4-52)the Fundamental Research Funds for the Central Universities(2572017DB05)the Natural Science Foundation of Heilongjiang Province(C2017005)
文摘Detection of wood plate surface defects using image processing is a complicated problem in the forest industry as the image of the wood surface contains different kinds of defects. In order to obtain complete defect images, we used convex optimization(CO) with different weights as a pretreatment method for smoothing and the Otsu segmentation method to obtain the target defect area images. Structural similarity(SSIM) results between original image and defect image were calculated to evaluate the performance of segmentation with different convex optimization weights. The geometric and intensity features of defects were extracted before constructing a classification and regression tree(CART) classifier. The average accuracy of the classifier is 94.1% with four types of defects on Xylosma congestum wood plate surface: pinhole, crack,live knot and dead knot. Experimental results showed that CO can save the edge of target defects maximally, SSIM can select the appropriate weight for CO, and the CART classifier appears to have the advantages of good adaptability and high classification accuracy.