Objective To evaluate the value of MRI diffusion weighted imaging in localization of prostate cancer with whole-mount step section pathology. Methods We treated 36 patients using laparoscopic radical prostatectomy fro...Objective To evaluate the value of MRI diffusion weighted imaging in localization of prostate cancer with whole-mount step section pathology. Methods We treated 36 patients using laparoscopic radical prostatectomy from Oct. 2009 to Jun. 2010. Patients who did not have an MRL /DWI examination or a surgical history of pros-展开更多
In existing image manipulation localization methods,the receptive field of standard convolution is limited,and during feature transfer,it is easy to lose high-frequency information about traces of manipulation.In addi...In existing image manipulation localization methods,the receptive field of standard convolution is limited,and during feature transfer,it is easy to lose high-frequency information about traces of manipulation.In addition,during feature fusion,the use of fixed sampling kernels makes it difficult to focus on local changes in features,leading to limited localization accuracy.This paper proposes an image manipulation localization method based on dual-branch hybrid convolution.First,a dual-branch hybrid convolution module is designed to expand the receptive field of the model to enhance the feature extraction ability of contextual semantic information,while also enabling the model to focus more on the high-frequency detail features of manipulation traces while localizing the manipulated area.Second,a multiscale content-aware feature fusion module is used to dynamically generate adaptive sampling kernels for each position in the feature map,enabling the model to focus more on the details of local features while locating the manipulated area.Experimental results on multiple datasets show that this method not only effectively improves the accuracy of image manipulation localization but also enhances the robustness of the model.展开更多
Existing image manipulation localization(IML)techniques require large,densely annotated sets of forged images.This requirement greatly increases labeling costs and limits a model’s ability to handle manipulation type...Existing image manipulation localization(IML)techniques require large,densely annotated sets of forged images.This requirement greatly increases labeling costs and limits a model’s ability to handle manipulation types that are novel or absent from the training data.To address these issues,we present CLIP-IML,an IML framework that leverages contrastive language-image pre-training(CLIP).A lightweight feature-reconstruction module transforms CLIP token sequences into spatial tensors,after which a compact feature-pyramid network and a multi-scale fusion decoder work together to capture information from fine to coarse levels.We evaluated CLIP-IML on ten public datasets that cover copy-move,splicing,removal,and artificial intelligence(AI)-generated forgeries.The framework raises the average F1-score by 7.85%relative to the strongest recent baselines and secures either the first-or second-place performance on every dataset.Ablation studies show that CLIP pre-training,higher resolution inputs,and the multi-scale decoder each make complementary contributions.Under six common post-processing perturbations,as well as the compression pipelines used by Facebook,Weibo,and WeChat,the performance decline never exceeds 2.2%,confirming strong practical robustness.Moreover,CLIP-IML requires only a few thousand annotated images for training,which markedly reduces data-collection and labeling effort compared with previous methods.All of these results indicate that CLIP-IML is highly generalizable for image tampering localization across a wide range of tampering scenarios.展开更多
With the rapid development of information technology,digital images have become an important medium for information transmission.However,manipulating images is becoming a common task with the powerful image editing to...With the rapid development of information technology,digital images have become an important medium for information transmission.However,manipulating images is becoming a common task with the powerful image editing tools and software,and people can tamper the images content without leaving any visible traces of splicing in order to gain personal goal.Images are easily spliced and distributed,and the situation will be a great threat to social security.The survey covers splicing image and its localization.The present status of splicing image localization approaches is discussed along with a recommendation for future research.展开更多
Nitric oxide has played an important role in many physiological and pathological processes as a kind of important gas signal molecules. In this work, a new fluorescent probe LysoNO-Naph for detecting NO in lysosomes b...Nitric oxide has played an important role in many physiological and pathological processes as a kind of important gas signal molecules. In this work, a new fluorescent probe LysoNO-Naph for detecting NO in lysosomes based on 1,8-naphthalimide was reported. LysoNO-Naph has sub-groups of o-phenylene- diamine as a NO reaction site and 4-(2-aminoethyl)-morpholine as a lysosome-targetable group. This probe exhibited good selectivity and high sensitivity (4.57 μmol/L) toward NO in a wide pH range from 4 to 12. Furthermore, LysoNO-Naph can be used for imaging NO in lysosomes in living cells.展开更多
As an essential part of artificial intelligence,many works focus on image processing which is the branch of computer vision.Nevertheless,image localization faces complex challenges in image processing with image data ...As an essential part of artificial intelligence,many works focus on image processing which is the branch of computer vision.Nevertheless,image localization faces complex challenges in image processing with image data increases.At the same time,quantum computing has the unique advantages of improving computing power and reducing energy consumption.So,combining the advantage of quantum computing is necessary for studying the quantum image localization algorithms.At present,many quantum image localization algorithms have been proposed,and their efficiency is theoretically higher than the corresponding classical algorithms.But,in quantum computing experiments,quantum gates in quantum computing hardware need to work at very low temperatures,which brings great challenges to experiments.This paper proposes a single-photon-based quantum image localization algorithm based on the fundamental theory of single-photon image classification.This scheme realizes the operation of the mixed national institute of standards and technology database(MNIST)quantum image localization by a learned transformation for non-noise condition,noisy condition,and environmental attack condition,respectively.Compared with the regular use of entanglement between multi-qubits and low-temperature noise reduction conditions for image localization,the advantage of this method is that it does not deliberately require low temperature and entanglement resources,and it improves the lower bound of the localization success rate.This method paves a way to study quantum computer vision.展开更多
In multi-view image localization task,the features of the images captured from different views should be fused properly.This paper considers the classification-based image localization problem.We propose the relationa...In multi-view image localization task,the features of the images captured from different views should be fused properly.This paper considers the classification-based image localization problem.We propose the relational graph location network(RGLN)to perform this task.In this network,we propose a heterogeneous graph construction approach for graph classification tasks,which aims to describe the location in a more appropriate way,thereby improving the expression ability of the location representation module.Experiments show that the expression ability of the proposed graph construction approach outperforms the compared methods by a large margin.In addition,the proposed localization method outperforms the compared localization methods by around 1.7%in terms of meter-level accuracy.展开更多
Localization of the inspected chip image is one of the key problems with machine vision aided surface mount devices (SMD) and other micro-electronic equipments. This paper presents a new edge-directed subpixel edge lo...Localization of the inspected chip image is one of the key problems with machine vision aided surface mount devices (SMD) and other micro-electronic equipments. This paper presents a new edge-directed subpixel edge localization method. The image is divided into two regions, edge and non-edge, using edge detection to emphasize the edge feature. Since the edges of the chip image are straight, they have straight-line characteristics locally and globally. First, the line segments of the straight edge are located to subpixel precision, according to their local straight properties, in a 3×3 neighborhood of the edge region. Second, the subpixel midpoints of the line segments are computed. Finally, the straight edge is fitted using the midpoints and the least square method, according to its global straight property in the entire edge region. In this way, the edge is located to subpixel precision. While fitting the edge, the irregular points are eliminated by the angles of the line segments to improve the precision. We can also distinguish different edges and their intersections using the angles of the line segments and distances between the edge points, then give the vectorial result of the image edge with high precision.展开更多
As image manipulation technology advances rapidly,the malicious use of image tampering has alarmingly escalated,posing a significant threat to social stability.In the realm of image tampering localization,accurately l...As image manipulation technology advances rapidly,the malicious use of image tampering has alarmingly escalated,posing a significant threat to social stability.In the realm of image tampering localization,accurately localizing limited samples,multiple types,and various sizes of regions remains a multitude of challenges.These issues impede the model’s universality and generalization capability and detrimentally affect its performance.To tackle these issues,we propose FL-MobileViT-an improved MobileViT model devised for image tampering localization.Our proposed model utilizes a dual-stream architecture that independently processes the RGB and noise domain,and captures richer traces of tampering through dual-stream integration.Meanwhile,the model incorporating the Focused Linear Attention mechanism within the lightweight network(MobileViT).This substitution significantly diminishes computational complexity and resolves homogeneity problems associated with traditional Transformer attention mechanisms,enhancing feature extraction diversity and improving the model’s localization performance.To comprehensively fuse the generated results from both feature extractors,we introduce the ASPP architecture for multi-scale feature fusion.This facilitates a more precise localization of tampered regions of various sizes.Furthermore,to bolster the model’s generalization ability,we adopt a contrastive learning method and devise a joint optimization training strategy that leverages fused features and captures the disparities in feature distribution in tampered images.This strategy enables the learning of contrastive loss at various stages of the feature extractor and employs it as an additional constraint condition in conjunction with cross-entropy loss.As a result,overfitting issues are effectively alleviated,and the differentiation between tampered and untampered regions is enhanced.Experimental evaluations on five benchmark datasets(IMD-20,CASIA,NIST-16,Columbia and Coverage)validate the effectiveness of our proposed model.The meticulously calibrated FL-MobileViT model consistently outperforms numerous existing general models regarding localization accuracy across diverse datasets,demonstrating superior adaptability.展开更多
Considering the three-dimensional(3D) U-Net lacks sufficient local feature extraction for image features and lacks attention to the fusion of high-and low-level features, we propose a new model called 3DMAU-Net based ...Considering the three-dimensional(3D) U-Net lacks sufficient local feature extraction for image features and lacks attention to the fusion of high-and low-level features, we propose a new model called 3DMAU-Net based on the 3D U-Net architecture for liver region segmentation. Our model replaces the last two layers of the 3D U-Net with a sliding window-based multilayer perceptron(SMLP), enabling better extraction of local image features. We also design a high-and low-level feature fusion dilated convolution block that focuses on local features and better supplements the surrounding information of the target region. This block is embedded in the entire encoding process, ensuring that the overall network is not simply downsampling. Before each feature extraction, the input features are processed by the dilated convolution block. We validate our experiments on the liver tumor segmentation challenge 2017(Lits2017) dataset, and our model achieves a Dice coefficient of 0.95, which is an improvement of 0.015 compared to the 3D U-Net model. Furthermore, we compare our results with other segmentation methods, and our model consistently outperforms them.展开更多
We propose a method based on the Poynting vector that combines angle-domain imaging and image amplitude correction to overcome the shortcomings of reverse-time migration that cannot handle different angles during wave...We propose a method based on the Poynting vector that combines angle-domain imaging and image amplitude correction to overcome the shortcomings of reverse-time migration that cannot handle different angles during wave propagation. First, the local image matrix (LIM) and local illumination matrix are constructed, and the wavefield propagation directions are decomposed. The angle-domain imaging conditions are established in the local imaging matrix to remove low-wavenumber artifacts. Next, the angle-domain common image gathers are extracted and the dip angle is calculated, and the amplitude-corrected factors in the dip angle domain are calculated. The partial images are corrected by factors corresponding to the different angles and then are superimposed to perform the amplitude correction of the final image. Angle-domain imaging based on the Poynting vector improves the computation efficiency compared with local plane-wave decomposition. Finally, numerical simulations based on the SEG/EAGE velocity model are used to validate the proposed method.展开更多
Differences in illumination of the same face can defeat simple face recognition systems,yet most methods that compensate are too difficult to implement. Local quotient image (LQI) is an efficient illumination preproce...Differences in illumination of the same face can defeat simple face recognition systems,yet most methods that compensate are too difficult to implement. Local quotient image (LQI) is an efficient illumination preprocessing method for face recognition systems. An illumination model and a face model were developed,and their use in the new method was analyzed. Analysis of the method's computational complexity showed it to be efficient. Experimental results on Yale Face Database B showed that the method can effectively eliminate the effects of differences in illumination and provides considerable improvement in recognition rates.展开更多
Image forging is the alteration of a digital image to conceal some of the necessary or helpful information.It cannot be easy to distinguish themodified region fromthe original image in somecircumstances.The demand for...Image forging is the alteration of a digital image to conceal some of the necessary or helpful information.It cannot be easy to distinguish themodified region fromthe original image in somecircumstances.The demand for authenticity and the integrity of the image drive the detection of a fabricated image.There have been cases of ownership infringements or fraudulent actions by counterfeiting multimedia files,including re-sampling or copy-moving.This work presents a high-level view of the forensics of digital images and their possible detection approaches.This work presents a thorough analysis of digital image forgery detection techniques with their steps and effectiveness.These methods have identified forgery and its type and compared it with state of the art.This work will help us to find the best forgery detection technique based on the different environments.It also shows the current issues in other methods,which can help researchers find future scope for further research in this field.展开更多
The study aimed to apply to Machine Learning(ML)researchers working in image processing and biomedical analysis who play an extensive role in compre-hending and performing on complex medical data,eventually improving ...The study aimed to apply to Machine Learning(ML)researchers working in image processing and biomedical analysis who play an extensive role in compre-hending and performing on complex medical data,eventually improving patient care.Developing a novel ML algorithm specific to Diabetic Retinopathy(DR)is a chal-lenge and need of the hour.Biomedical images include several challenges,including relevant feature selection,class variations,and robust classification.Although the cur-rent research in DR has yielded favourable results,several research issues need to be explored.There is a requirement to look at novel pre-processing methods to discard irrelevant features,balance the obtained relevant features,and obtain a robust classi-fication.This is performed using the Steerable Kernalized Partial Derivative and Platt Scale Classifier(SKPD-PSC)method.The novelty of this method relies on the appropriate non-linear classification of exclusive image processing models in har-mony with the Platt Scale Classifier(PSC)to improve the accuracy of DR detection.First,a Steerable Filter Kernel Pre-processing(SFKP)model is applied to the Retinal Images(RI)to remove irrelevant and redundant features and extract more meaningful pathological features through Directional Derivatives of Gaussians(DDG).Next,the Partial Derivative Image Localization(PDIL)model is applied to the extracted fea-tures to localize candidate features and suppress the background noise.Finally,a Platt Scale Classifier(PSC)is applied to the localized features for robust classification.For the experiments,we used the publicly available DR detection database provided by Standard Diabetic Retinopathy(SDR),called DIARETDB0.A database of 130 image samples has been collected to train and test the ML-based classifiers.Experimental results show that the proposed method that combines the image processing and ML models can attain good detection performance with a high DR detection accu-racy rate with minimum time and complexity compared to the state-of-the-art meth-ods.The accuracy and speed of DR detection for numerous types of images will be tested through experimental evaluation.Compared to state-of-the-art methods,the method increases DR detection accuracy by 24%and DR detection time by 37.展开更多
Accessing local dynamics within a single macromolecule is the key to understand the physical origin of the viscoelasticity and especially the glass transition. In order to extract specific information on the dynamics ...Accessing local dynamics within a single macromolecule is the key to understand the physical origin of the viscoelasticity and especially the glass transition. In order to extract specific information on the dynamics of the branch point of a star polymer around its glass transition temperature, four-arm star poly (n-butyl methacrylate) with a fluorescent core was synthesized using perylene diimide as initiator and polymerization conducted via atom transfer radical polymerization. The process is found to be effective in positioning the fluorophore at the branch point with the fluorophore intact, which allows the successful application of single molecule fluorescence defocus imaging in examining the local site- sensitive dynamics. The power spectra of rotation trajectories, the population of rotating fluorophores as well as the distribution of angular displacement were used to revel the difference in local dynamics between branch point and the arm's end. It is discovered that the local dynamics at the core of the star polymer is much less activated than that at the arm's end. The results demonstrate the strong effect dues to the topological constrain at the branch point and the more free space at the arm's end.展开更多
For vision-based mobile robot navigation, images of the same scene may undergo a general affine transformation in the case of significant viewpoint changes. So, a novel method for detecting affine invariant interest p...For vision-based mobile robot navigation, images of the same scene may undergo a general affine transformation in the case of significant viewpoint changes. So, a novel method for detecting affine invariant interest points is proposed to obtain the invariant local features, which is coined polynomial local orientation tensor(PLOT). The new detector is based on image local orientation tensor that is constructed from the polynomial expansion of image signal. Firstly, the properties of local orientation tensor of PLOT are analyzed, and a suitable tuning parameter of local orientation tensor is chosen so as to extract invariant features. The initial interest points are detected by local maxima search for the smaller eigenvalues of the orientation tensor. Then, an iterative procedure is used to allow the initial interest points to converge to affine invariant interest points and regions. The performances of this detector are evaluated on the repeatability criteria and recall versus 1-precision graphs, and then are compared with other existing approaches. Experimental results for PLOT show strong performance under affine transformation in the real-world conditions.展开更多
The strain distributions near the interface when the elbow steel fiber is pulled out from the half-mould concrete matrix are directly measured using a combined method of single fiber pull-out test and digital image co...The strain distributions near the interface when the elbow steel fiber is pulled out from the half-mould concrete matrix are directly measured using a combined method of single fiber pull-out test and digital image correlation. Meanwhile, the real-time processes of the bonding, debonding and sliding at the interface are observed. The micro-mechanism of the strain localization in the failure process of interface when debonding occurs and the strengthening mechanism at the imbedded fiber are discussed. The experimental results show that the meso-scale strain localization gives rise to the localization of shear damage near the fiber interface. This strain localization characterized by the debonding process near the interface occurs, develops and moves gradually at an apparently regular interval. At the elbow part of the imbedded fiber, the peak value of the shearing stress occurs. But the primary debonding does not occur at this place because the strength of the shear damage is increased at the local area of the elbow part in the concrete, displaying an apparent reinforced effect at the end of the fiber.展开更多
In this paper the method of approximate expansion is used to analyse a perfect planar surround sound system, resulting in an order of new and upgrade systems. First reproductinn signals of the perfect system and the c...In this paper the method of approximate expansion is used to analyse a perfect planar surround sound system, resulting in an order of new and upgrade systems. First reproductinn signals of the perfect system and the characteristics of different orders systems are analysed. The independent transmission signals and decoding (reproduction) equation of the systexns are given. The compatibility among different orders systems and the problem of simplifying output channels are discussed. The problem of signal picking up, recording,transmitting and the possibility of putting the systems into practical use are studied. A sound hoage localization experiment for the systems is carried out in order to study haage localization in relaion to the numbers of transmission signals and output channels. The experimental result is consistemt with the theoretical result. This work lay down a base for practical use.展开更多
Agricultural crop production is a major contributing element to any country’s economy.To maintain the economic growth of any country plants disease detection is a leading factor in agriculture.The contribution of the...Agricultural crop production is a major contributing element to any country’s economy.To maintain the economic growth of any country plants disease detection is a leading factor in agriculture.The contribution of the proposed algorithm is to optimize the extracted infor-mation from the available resources for the betterment of the result without any additional complexity.The proposed technique basically localizes the leaf region prior to the image classification into healthy and diseased.The novelty of this work is to fuse the information extracted from the available resources and optimize it to enhance the expected outcome.The leaf colors are analyzed using color transformation for the seed region identification.The mapping of a low-dimensional RGB color image into L*a*b color space provides an expansion of the spectral range.The neighboring pixels-based leaf region growing is applied on the initial seeds.In order to refine the leaf boundary and the disease-affected areas,we employed a random sample consensus(RANSAC)for suitable curve fitting.The feature sets using bag of visual words,Fisher vectors,and handcrafted features are extracted followed by classification using logistic regression,multilayer perceptron model,and support vector machine.The performance of the proposal is analyzed through PlantVillage datasets of apple,bell pepper,cherry,corn,grape,potato,and tomato.The simulation-based analysis of the proposed contextualization-based image categorization process outperforms as compared with the state of arts.The proposed approach provides average accuracy and area under the curve of 0.932 and 0.903,respectively.展开更多
文摘Objective To evaluate the value of MRI diffusion weighted imaging in localization of prostate cancer with whole-mount step section pathology. Methods We treated 36 patients using laparoscopic radical prostatectomy from Oct. 2009 to Jun. 2010. Patients who did not have an MRL /DWI examination or a surgical history of pros-
基金National Natural Science Foundation of China(61703363)Shanxi Provincial Basic Research Program(202403021221206)+2 种基金Key Project of Shanxi Provincial Strategic Research on Science and Technology(202304031401011)Funding Project for Scientific Research Innovation Team on Data Mining and Industrial Intelligence Applications(YCXYTD-202402)Yuncheng University Research Project(YQ-2020021)。
文摘In existing image manipulation localization methods,the receptive field of standard convolution is limited,and during feature transfer,it is easy to lose high-frequency information about traces of manipulation.In addition,during feature fusion,the use of fixed sampling kernels makes it difficult to focus on local changes in features,leading to limited localization accuracy.This paper proposes an image manipulation localization method based on dual-branch hybrid convolution.First,a dual-branch hybrid convolution module is designed to expand the receptive field of the model to enhance the feature extraction ability of contextual semantic information,while also enabling the model to focus more on the high-frequency detail features of manipulation traces while localizing the manipulated area.Second,a multiscale content-aware feature fusion module is used to dynamically generate adaptive sampling kernels for each position in the feature map,enabling the model to focus more on the details of local features while locating the manipulated area.Experimental results on multiple datasets show that this method not only effectively improves the accuracy of image manipulation localization but also enhances the robustness of the model.
基金supported by the Natural Science Foundation of Xinjiang Uygur Autonomous Region under Grant No.2023D01C21the National Natural Science Foundation of China under Grant No.62362063.
文摘Existing image manipulation localization(IML)techniques require large,densely annotated sets of forged images.This requirement greatly increases labeling costs and limits a model’s ability to handle manipulation types that are novel or absent from the training data.To address these issues,we present CLIP-IML,an IML framework that leverages contrastive language-image pre-training(CLIP).A lightweight feature-reconstruction module transforms CLIP token sequences into spatial tensors,after which a compact feature-pyramid network and a multi-scale fusion decoder work together to capture information from fine to coarse levels.We evaluated CLIP-IML on ten public datasets that cover copy-move,splicing,removal,and artificial intelligence(AI)-generated forgeries.The framework raises the average F1-score by 7.85%relative to the strongest recent baselines and secures either the first-or second-place performance on every dataset.Ablation studies show that CLIP pre-training,higher resolution inputs,and the multi-scale decoder each make complementary contributions.Under six common post-processing perturbations,as well as the compression pipelines used by Facebook,Weibo,and WeChat,the performance decline never exceeds 2.2%,confirming strong practical robustness.Moreover,CLIP-IML requires only a few thousand annotated images for training,which markedly reduces data-collection and labeling effort compared with previous methods.All of these results indicate that CLIP-IML is highly generalizable for image tampering localization across a wide range of tampering scenarios.
基金This work was supported in part by the Natural Science Foundation of China under Grants(Nos.61772281,U1636219,61502241,61272421,61232016,61402235 and 61572258)in part by the National Key R&D Program of China(Grant Nos.2016YFB0801303 and 2016QY01W0105)+3 种基金in part by the plan for Scientific Talent of Henan Province(Grant No.2018JR0018)in part by the Natural Science Foundation of Jiangsu Province,China under Grant BK20141006in part by the Natural Science Foundation of the Universities in Jiangsu Province under Grant 14KJB520024the PAPD fund and the CICAEET fund.
文摘With the rapid development of information technology,digital images have become an important medium for information transmission.However,manipulating images is becoming a common task with the powerful image editing tools and software,and people can tamper the images content without leaving any visible traces of splicing in order to gain personal goal.Images are easily spliced and distributed,and the situation will be a great threat to social security.The survey covers splicing image and its localization.The present status of splicing image localization approaches is discussed along with a recommendation for future research.
基金financial supports from the National Natural Science Foundation of China (Nos. 21276251, 21506206, 21402191, 21502189)the 100 talents program funded by Chinese Academy of Sciences, Dalian Cultivation Fund for Distinguished Young Scholars (Nos. 2014J11JH130, 2015J12JH205)the National Science Fund for Excellent Young Scholars (No. 21422606)
文摘Nitric oxide has played an important role in many physiological and pathological processes as a kind of important gas signal molecules. In this work, a new fluorescent probe LysoNO-Naph for detecting NO in lysosomes based on 1,8-naphthalimide was reported. LysoNO-Naph has sub-groups of o-phenylene- diamine as a NO reaction site and 4-(2-aminoethyl)-morpholine as a lysosome-targetable group. This probe exhibited good selectivity and high sensitivity (4.57 μmol/L) toward NO in a wide pH range from 4 to 12. Furthermore, LysoNO-Naph can be used for imaging NO in lysosomes in living cells.
基金This work was supported by the National Key R&D Program of China,Grant No.2018YFA0306703Chengdu Innovation and Technology Project,No.2021-YF05-02413-GX.
文摘As an essential part of artificial intelligence,many works focus on image processing which is the branch of computer vision.Nevertheless,image localization faces complex challenges in image processing with image data increases.At the same time,quantum computing has the unique advantages of improving computing power and reducing energy consumption.So,combining the advantage of quantum computing is necessary for studying the quantum image localization algorithms.At present,many quantum image localization algorithms have been proposed,and their efficiency is theoretically higher than the corresponding classical algorithms.But,in quantum computing experiments,quantum gates in quantum computing hardware need to work at very low temperatures,which brings great challenges to experiments.This paper proposes a single-photon-based quantum image localization algorithm based on the fundamental theory of single-photon image classification.This scheme realizes the operation of the mixed national institute of standards and technology database(MNIST)quantum image localization by a learned transformation for non-noise condition,noisy condition,and environmental attack condition,respectively.Compared with the regular use of entanglement between multi-qubits and low-temperature noise reduction conditions for image localization,the advantage of this method is that it does not deliberately require low temperature and entanglement resources,and it improves the lower bound of the localization success rate.This method paves a way to study quantum computer vision.
文摘In multi-view image localization task,the features of the images captured from different views should be fused properly.This paper considers the classification-based image localization problem.We propose the relational graph location network(RGLN)to perform this task.In this network,we propose a heterogeneous graph construction approach for graph classification tasks,which aims to describe the location in a more appropriate way,thereby improving the expression ability of the location representation module.Experiments show that the expression ability of the proposed graph construction approach outperforms the compared methods by a large margin.In addition,the proposed localization method outperforms the compared localization methods by around 1.7%in terms of meter-level accuracy.
文摘Localization of the inspected chip image is one of the key problems with machine vision aided surface mount devices (SMD) and other micro-electronic equipments. This paper presents a new edge-directed subpixel edge localization method. The image is divided into two regions, edge and non-edge, using edge detection to emphasize the edge feature. Since the edges of the chip image are straight, they have straight-line characteristics locally and globally. First, the line segments of the straight edge are located to subpixel precision, according to their local straight properties, in a 3×3 neighborhood of the edge region. Second, the subpixel midpoints of the line segments are computed. Finally, the straight edge is fitted using the midpoints and the least square method, according to its global straight property in the entire edge region. In this way, the edge is located to subpixel precision. While fitting the edge, the irregular points are eliminated by the angles of the line segments to improve the precision. We can also distinguish different edges and their intersections using the angles of the line segments and distances between the edge points, then give the vectorial result of the image edge with high precision.
基金This study was funded by the Science and Technology Project in Xi’an(No.22GXFW0123)this work was supported by the Special Fund Construction Project of Key Disciplines in Ordinary Colleges and Universities in Shaanxi Province,the authors would like to thank the anonymous reviewers for their helpful comments and suggestions.
文摘As image manipulation technology advances rapidly,the malicious use of image tampering has alarmingly escalated,posing a significant threat to social stability.In the realm of image tampering localization,accurately localizing limited samples,multiple types,and various sizes of regions remains a multitude of challenges.These issues impede the model’s universality and generalization capability and detrimentally affect its performance.To tackle these issues,we propose FL-MobileViT-an improved MobileViT model devised for image tampering localization.Our proposed model utilizes a dual-stream architecture that independently processes the RGB and noise domain,and captures richer traces of tampering through dual-stream integration.Meanwhile,the model incorporating the Focused Linear Attention mechanism within the lightweight network(MobileViT).This substitution significantly diminishes computational complexity and resolves homogeneity problems associated with traditional Transformer attention mechanisms,enhancing feature extraction diversity and improving the model’s localization performance.To comprehensively fuse the generated results from both feature extractors,we introduce the ASPP architecture for multi-scale feature fusion.This facilitates a more precise localization of tampered regions of various sizes.Furthermore,to bolster the model’s generalization ability,we adopt a contrastive learning method and devise a joint optimization training strategy that leverages fused features and captures the disparities in feature distribution in tampered images.This strategy enables the learning of contrastive loss at various stages of the feature extractor and employs it as an additional constraint condition in conjunction with cross-entropy loss.As a result,overfitting issues are effectively alleviated,and the differentiation between tampered and untampered regions is enhanced.Experimental evaluations on five benchmark datasets(IMD-20,CASIA,NIST-16,Columbia and Coverage)validate the effectiveness of our proposed model.The meticulously calibrated FL-MobileViT model consistently outperforms numerous existing general models regarding localization accuracy across diverse datasets,demonstrating superior adaptability.
基金supported by the Shandong Provincial Natural Science Foundation (Nos.ZR2023MF062 and ZR2021MF115)the Introduction and Cultivation Program for Young Innovative Talents of Universities in Shandong (No.2021QCYY003)。
文摘Considering the three-dimensional(3D) U-Net lacks sufficient local feature extraction for image features and lacks attention to the fusion of high-and low-level features, we propose a new model called 3DMAU-Net based on the 3D U-Net architecture for liver region segmentation. Our model replaces the last two layers of the 3D U-Net with a sliding window-based multilayer perceptron(SMLP), enabling better extraction of local image features. We also design a high-and low-level feature fusion dilated convolution block that focuses on local features and better supplements the surrounding information of the target region. This block is embedded in the entire encoding process, ensuring that the overall network is not simply downsampling. Before each feature extraction, the input features are processed by the dilated convolution block. We validate our experiments on the liver tumor segmentation challenge 2017(Lits2017) dataset, and our model achieves a Dice coefficient of 0.95, which is an improvement of 0.015 compared to the 3D U-Net model. Furthermore, we compare our results with other segmentation methods, and our model consistently outperforms them.
基金sponsored by the Natural Science Fund of Heilongjiang Province(No.F201404)
文摘We propose a method based on the Poynting vector that combines angle-domain imaging and image amplitude correction to overcome the shortcomings of reverse-time migration that cannot handle different angles during wave propagation. First, the local image matrix (LIM) and local illumination matrix are constructed, and the wavefield propagation directions are decomposed. The angle-domain imaging conditions are established in the local imaging matrix to remove low-wavenumber artifacts. Next, the angle-domain common image gathers are extracted and the dip angle is calculated, and the amplitude-corrected factors in the dip angle domain are calculated. The partial images are corrected by factors corresponding to the different angles and then are superimposed to perform the amplitude correction of the final image. Angle-domain imaging based on the Poynting vector improves the computation efficiency compared with local plane-wave decomposition. Finally, numerical simulations based on the SEG/EAGE velocity model are used to validate the proposed method.
文摘Differences in illumination of the same face can defeat simple face recognition systems,yet most methods that compensate are too difficult to implement. Local quotient image (LQI) is an efficient illumination preprocessing method for face recognition systems. An illumination model and a face model were developed,and their use in the new method was analyzed. Analysis of the method's computational complexity showed it to be efficient. Experimental results on Yale Face Database B showed that the method can effectively eliminate the effects of differences in illumination and provides considerable improvement in recognition rates.
文摘Image forging is the alteration of a digital image to conceal some of the necessary or helpful information.It cannot be easy to distinguish themodified region fromthe original image in somecircumstances.The demand for authenticity and the integrity of the image drive the detection of a fabricated image.There have been cases of ownership infringements or fraudulent actions by counterfeiting multimedia files,including re-sampling or copy-moving.This work presents a high-level view of the forensics of digital images and their possible detection approaches.This work presents a thorough analysis of digital image forgery detection techniques with their steps and effectiveness.These methods have identified forgery and its type and compared it with state of the art.This work will help us to find the best forgery detection technique based on the different environments.It also shows the current issues in other methods,which can help researchers find future scope for further research in this field.
基金supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2022R195),Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.
文摘The study aimed to apply to Machine Learning(ML)researchers working in image processing and biomedical analysis who play an extensive role in compre-hending and performing on complex medical data,eventually improving patient care.Developing a novel ML algorithm specific to Diabetic Retinopathy(DR)is a chal-lenge and need of the hour.Biomedical images include several challenges,including relevant feature selection,class variations,and robust classification.Although the cur-rent research in DR has yielded favourable results,several research issues need to be explored.There is a requirement to look at novel pre-processing methods to discard irrelevant features,balance the obtained relevant features,and obtain a robust classi-fication.This is performed using the Steerable Kernalized Partial Derivative and Platt Scale Classifier(SKPD-PSC)method.The novelty of this method relies on the appropriate non-linear classification of exclusive image processing models in har-mony with the Platt Scale Classifier(PSC)to improve the accuracy of DR detection.First,a Steerable Filter Kernel Pre-processing(SFKP)model is applied to the Retinal Images(RI)to remove irrelevant and redundant features and extract more meaningful pathological features through Directional Derivatives of Gaussians(DDG).Next,the Partial Derivative Image Localization(PDIL)model is applied to the extracted fea-tures to localize candidate features and suppress the background noise.Finally,a Platt Scale Classifier(PSC)is applied to the localized features for robust classification.For the experiments,we used the publicly available DR detection database provided by Standard Diabetic Retinopathy(SDR),called DIARETDB0.A database of 130 image samples has been collected to train and test the ML-based classifiers.Experimental results show that the proposed method that combines the image processing and ML models can attain good detection performance with a high DR detection accu-racy rate with minimum time and complexity compared to the state-of-the-art meth-ods.The accuracy and speed of DR detection for numerous types of images will be tested through experimental evaluation.Compared to state-of-the-art methods,the method increases DR detection accuracy by 24%and DR detection time by 37.
基金supported by National Basic Research Program of China(No. 2014CB643601)
文摘Accessing local dynamics within a single macromolecule is the key to understand the physical origin of the viscoelasticity and especially the glass transition. In order to extract specific information on the dynamics of the branch point of a star polymer around its glass transition temperature, four-arm star poly (n-butyl methacrylate) with a fluorescent core was synthesized using perylene diimide as initiator and polymerization conducted via atom transfer radical polymerization. The process is found to be effective in positioning the fluorophore at the branch point with the fluorophore intact, which allows the successful application of single molecule fluorescence defocus imaging in examining the local site- sensitive dynamics. The power spectra of rotation trajectories, the population of rotating fluorophores as well as the distribution of angular displacement were used to revel the difference in local dynamics between branch point and the arm's end. It is discovered that the local dynamics at the core of the star polymer is much less activated than that at the arm's end. The results demonstrate the strong effect dues to the topological constrain at the branch point and the more free space at the arm's end.
基金Projects(61203332,61203208) supported by the National Natural Science Foundation of China
文摘For vision-based mobile robot navigation, images of the same scene may undergo a general affine transformation in the case of significant viewpoint changes. So, a novel method for detecting affine invariant interest points is proposed to obtain the invariant local features, which is coined polynomial local orientation tensor(PLOT). The new detector is based on image local orientation tensor that is constructed from the polynomial expansion of image signal. Firstly, the properties of local orientation tensor of PLOT are analyzed, and a suitable tuning parameter of local orientation tensor is chosen so as to extract invariant features. The initial interest points are detected by local maxima search for the smaller eigenvalues of the orientation tensor. Then, an iterative procedure is used to allow the initial interest points to converge to affine invariant interest points and regions. The performances of this detector are evaluated on the repeatability criteria and recall versus 1-precision graphs, and then are compared with other existing approaches. Experimental results for PLOT show strong performance under affine transformation in the real-world conditions.
基金the National Natural Science Foundation of China(Nos.10972097,11062007)Specialized Research Fund for the Doctoral Programof Higher Education of China(No.20101514120005)the Inner Mongolia Natural Science Foundation of China(No.2010MS0703)
文摘The strain distributions near the interface when the elbow steel fiber is pulled out from the half-mould concrete matrix are directly measured using a combined method of single fiber pull-out test and digital image correlation. Meanwhile, the real-time processes of the bonding, debonding and sliding at the interface are observed. The micro-mechanism of the strain localization in the failure process of interface when debonding occurs and the strengthening mechanism at the imbedded fiber are discussed. The experimental results show that the meso-scale strain localization gives rise to the localization of shear damage near the fiber interface. This strain localization characterized by the debonding process near the interface occurs, develops and moves gradually at an apparently regular interval. At the elbow part of the imbedded fiber, the peak value of the shearing stress occurs. But the primary debonding does not occur at this place because the strength of the shear damage is increased at the local area of the elbow part in the concrete, displaying an apparent reinforced effect at the end of the fiber.
文摘In this paper the method of approximate expansion is used to analyse a perfect planar surround sound system, resulting in an order of new and upgrade systems. First reproductinn signals of the perfect system and the characteristics of different orders systems are analysed. The independent transmission signals and decoding (reproduction) equation of the systexns are given. The compatibility among different orders systems and the problem of simplifying output channels are discussed. The problem of signal picking up, recording,transmitting and the possibility of putting the systems into practical use are studied. A sound hoage localization experiment for the systems is carried out in order to study haage localization in relaion to the numbers of transmission signals and output channels. The experimental result is consistemt with the theoretical result. This work lay down a base for practical use.
文摘Agricultural crop production is a major contributing element to any country’s economy.To maintain the economic growth of any country plants disease detection is a leading factor in agriculture.The contribution of the proposed algorithm is to optimize the extracted infor-mation from the available resources for the betterment of the result without any additional complexity.The proposed technique basically localizes the leaf region prior to the image classification into healthy and diseased.The novelty of this work is to fuse the information extracted from the available resources and optimize it to enhance the expected outcome.The leaf colors are analyzed using color transformation for the seed region identification.The mapping of a low-dimensional RGB color image into L*a*b color space provides an expansion of the spectral range.The neighboring pixels-based leaf region growing is applied on the initial seeds.In order to refine the leaf boundary and the disease-affected areas,we employed a random sample consensus(RANSAC)for suitable curve fitting.The feature sets using bag of visual words,Fisher vectors,and handcrafted features are extracted followed by classification using logistic regression,multilayer perceptron model,and support vector machine.The performance of the proposal is analyzed through PlantVillage datasets of apple,bell pepper,cherry,corn,grape,potato,and tomato.The simulation-based analysis of the proposed contextualization-based image categorization process outperforms as compared with the state of arts.The proposed approach provides average accuracy and area under the curve of 0.932 and 0.903,respectively.