Neutron radiographic images(NRIs)typically suffer from multiple distortions,including various types of noise,geometric unsharpness,and white spots.Image quality assessment(IQA)can guide on-site image screening and eve...Neutron radiographic images(NRIs)typically suffer from multiple distortions,including various types of noise,geometric unsharpness,and white spots.Image quality assessment(IQA)can guide on-site image screening and even provide metrics for subsequent image processing.However,existing IQA methods for NRIs cannot effectively evaluate the quality of real NRIs with a specific distortion of white spots,limiting their practical application.In this paper,a novel no-reference IQA method is proposed to comprehensively evaluate the quality of real NRIs with multiple distortions.First,we construct large-scale NRI datasets with more than 20,000 images,including high-quality original NRIs and synthetic NRIs with various distortions.Next,an image quality calibration method based on visual salience and a local quality map is introduced to label the NRI dataset with quality scores.Finally,a lightweight convolutional neural network(CNN)model is designed to learn the abstract relationship between the NRIs and quality scores using the constructed NRI training dataset.Extensive experimental results demonstrate that the proposed method exhibits good consistency with human visual perception when evaluating both real NRIs and processed NRIs using enhancement and restoration algorithms,highlighting its application potential.展开更多
Background:Source-free unsupervised domain adaptation(SFUDA)methods aim to address the challenge of domain shift while preserving data privacy.Existing SFUDA approaches construct reliable and confident pseudo-labels f...Background:Source-free unsupervised domain adaptation(SFUDA)methods aim to address the challenge of domain shift while preserving data privacy.Existing SFUDA approaches construct reliable and confident pseudo-labels for target-domain data through denoising methods,thereby guiding the training of the target-domain model.The effectiveness of denoising approaches is influenced by the degree of domain gap between the source and target domains.A marked shift can cause the pseudo-labels to be unreliable,even after applying denoising.Methods:We propose a novel 2-stage framework for SFUDA called visual prompt source-free domain adaptation(VP-SFDA).We propose input-specific visual prompt in the first stage,prompting process,which bridges the target-domain data to source-domain distribution.Our method utilizes visual prompts and batch normalization constraint to enable the alignment model to learn domainspecific knowledge and align the target-domain data with the source-domain contribution.The second stage is the adaptation process,which aims at optimizing the segmentation model from the source domain to the target domain.This is accomplished through the denoising techniques,ultimately enhancing the performance.Results:Our study presents a comparative analysis of several SFUDA techniques in the VPSFDA framework across 4 tasks:abdominal magnetic resonance imaging(MRI)to computed tomography(CT),abdominal CT to MRI,cardiac MRI to CT,and cardiac CT to MRI.Notably,in the abdominal MRI to CT adaptation task,the VP-OS method achieved a remarkable improvement,increasing the average DICE score from 0.658 to 0.773(P<0.01)and reducing the average surface distance(ASD)from 3.489 to 2.961(P<0.01).Similarly,the VP-LD and VP-DPL methods also showed significant improvements over their base algorithms in both abdominal and cardiac MRI to CT tasks.Conclusions:This paper proposes VP-SFDA,a novel 2-stage framework for SFUDA in medical imaging,which achieves superior performance through input-specific visual prompts and batch normalization constraint for domain adaptation,coupled with denoising methods for enhanced results.Comparative experiments on 4 medical SFUDA tasks demonstrate that VO-SFDA surpasses existing methods,with ablation studies confirming the benefits of domain-specific patterns.展开更多
Exploration of artworks is enjoyable but often time consuming.For example,it is not always easy to discover the favorite types of unknown painting works.It is not also always easy to explore unpopular painting works w...Exploration of artworks is enjoyable but often time consuming.For example,it is not always easy to discover the favorite types of unknown painting works.It is not also always easy to explore unpopular painting works which looks similar to painting works created by famous artists.This paper presents a painting image browser which assists the explorative discovery of user-interested painting works.The presented browser applies a new multidimensional data visualization technique that highlights particular ranges of particular numeric values based on association rules to suggest cues to find favorite painting images.This study assumes a large number of painting images are provided where categorical information(e.g.,names of artists,created year)is assigned to the images.The presented system firstly calculates the feature values of the images as a preprocessing step.Then the browser visualizes the multidimensional feature values as a heatmap and highlights association rules discovered from the relationships between the feature values and categorical information.This mechanism enables users to explore favorite painting images or painting images that look similar to famous painting works.Our case study and user evaluation demonstrates the effectiveness of the presented image browser.展开更多
A novel method combining visualization particle tracking with image-based dynamic light scattering was developed to achieve the in situ and real-time size measurement of nanobubbles(NBs).First,the in situ size distrib...A novel method combining visualization particle tracking with image-based dynamic light scattering was developed to achieve the in situ and real-time size measurement of nanobubbles(NBs).First,the in situ size distribution of NBs was visualized by dark-field microscopy.Then,real-time size during the preparation was measured using image-based dynamic light scattering,and the longitudinal size distribution of NBs in the sample cell was obtained in a steady state.Results show that this strategy can provide a detailed and accurate size of bubbles in the whole sample compared with the commercial ZetaSizer Nano equipment.Therefore,the developed method is a real-time and simple technology with excellent accuracy,providing new insights into the accurate measurement of the size distribution of NBs or nanoparticles in solution.展开更多
The advantages and disadvantages of two existing methods for explosive field visualization are analyzed in this paper. And a new method based on image fusion is proposed to integrate their complementary advantages. Wi...The advantages and disadvantages of two existing methods for explosive field visualization are analyzed in this paper. And a new method based on image fusion is proposed to integrate their complementary advantages. With the method, two source images built by equal mapping and modulus mapping are individually decomposed into two Gauss-Laplacian pyramid sequences. Then, the two individual sequences are used to make a composite one according to the process of fusion. Finally, a new image is reconstructed from the composite sequence. Experimental results show that the new images integrate the advantages of sources, effectively improve the visualization, and disclose more information about explosive field.展开更多
Background A medical content-based image retrieval(CBIR)system is designed to retrieve images from large imaging repositories that are visually similar to a user′s query image.CBIR is widely used in evidence-based di...Background A medical content-based image retrieval(CBIR)system is designed to retrieve images from large imaging repositories that are visually similar to a user′s query image.CBIR is widely used in evidence-based diagnosis,teaching,and research.Although the retrieval accuracy has largely improved,there has been limited development toward visualizing important image features that indicate the similarity of retrieved images.Despite the prevalence of 3D volumetric data in medical imaging such as computed tomography(CT),current CBIR systems still rely on 2D cross-sectional views for the visualization of retrieved images.Such 2D visualization requires users to browse through the image stacks to confirm the similarity of the retrieved images and often involves mental reconstruction of 3D information,including the size,shape,and spatial relations of multiple structures.This process is time-consuming and reliant on users'experience.Methods In this study,we proposed an importance-aware 3D volume visualization method.The rendering parameters were automatically optimized to maximize the visibility of important structures that were detected and prioritized in the retrieval process.We then integrated the proposed visualization into a CBIR system,thereby complementing the 2D cross-sectional views for relevance feedback and further analyses.Results Our preliminary results demonstrate that 3D visualization can provide additional information using multimodal positron emission tomography and computed tomography(PETCT)images of a non-small cell lung cancer dataset.展开更多
In this paper, with the general retrospect to the research on surface reconstruction and the marching cubes algorithm, we gave detailed description of an algorithm on the construction of object surfaces. The possible ...In this paper, with the general retrospect to the research on surface reconstruction and the marching cubes algorithm, we gave detailed description of an algorithm on the construction of object surfaces. The possible ambiguity problem in the original marching cubes algorithm was eliminated by its index mechanism. Some results on the MRI images were presented. Based on extracting and clipping contours from a set of medial slice images and setting the patch vertices values according to the gray images, this algorithm may be applied to form the arbitrary section images with three dimensional effects. It can also enhance the visual effect and interpretation of medical data.展开更多
Three-dimensional medical image visualization becomes an essential part for medical field, including computer aided diagnosis, surgery planning and simulation, artificial limb surgery, radiotherapy planning, and teach...Three-dimensional medical image visualization becomes an essential part for medical field, including computer aided diagnosis, surgery planning and simulation, artificial limb surgery, radiotherapy planning, and teaching etc. In this paper, marching cubes algorithm is adopted to reconstruct the 3-D images for the CT image sequence in DICOM format under theVC++6.0 and the visual package VTK platform. The relatively simple interactive operations such as rotation and transfer can be realized on the platform. Moreover, the normal vector and interior point are calculated to form the virtual clipping plane, which is then used to incise the 3-D object. Information of the virtual slice can be obtained, in the mean while the virtual slice images are displayed on the screen. The technique can realize the real time interaction extraction of virtual slice on 3-D CT image. The cuboids structured can be zoomed, moved and eircumrotated by operating mouse to incise the 3-D reconstruction object. Real time interaction can be realized by clipping the reconstruction object. The coordinates can be acquired by the mouse clicking in the 3D space, to realize the point mouse pick-up as well angle and distance interactive measurement. We can get quantitative information about 3-D images through measurement.展开更多
SQL Server 2005的image型数据不能通过INSERT和UPDATE等语句进行插入和更新,这给处理image型数据带来十分不便。讨论了在Visual Basic中处理SQL Server 2005的image型数据的一般方法,即利用ADO数据对象的Fields集合的AppendChunk方法和...SQL Server 2005的image型数据不能通过INSERT和UPDATE等语句进行插入和更新,这给处理image型数据带来十分不便。讨论了在Visual Basic中处理SQL Server 2005的image型数据的一般方法,即利用ADO数据对象的Fields集合的AppendChunk方法和GetChunk方法及ADO Data控件进行数据的填充和读取。展开更多
Synthetic Aperture Radar(SAR) imaging systems have been widely used in civil and military fields due to their all-weather and all-day abilities and various other advantages. However, due to image data exponentially in...Synthetic Aperture Radar(SAR) imaging systems have been widely used in civil and military fields due to their all-weather and all-day abilities and various other advantages. However, due to image data exponentially increasing, there is a need for novel automatic target detection and recognition technologies. In recent years, the visual attention mechanism in the visual system has helped humans effectively deal with complex visual signals. In particular, biologically inspired top-down attention models have garnered much attention recently. This paper presents a visual attention model for SAR target detection, comprising a bottom-up stage and top-down process.In the bottom-up step, the Itti model is improved based on the difference between SAR and optical images. The top-down step fully utilizes prior information to further detect targets. Extensive detection experiments carried out on the benchmark Moving and Stationary Target Acquisition and Recognition(MSTAR) dataset show that, compared with typical visual models and other popular detection methods, our model has increased ability and robustness for SAR target detection, under a range of Signal to Clutter Ratio(SCR) conditions and scenes. In addition, results obtained using only the bottom-up stage are inferior to those of the proposed method, further demonstrating the effectiveness and rationality of a top-down strategy. In summary, our proposed visual attention method can be considered a potential benchmark resource for the SAR research community.展开更多
Methods of arc length control and visual image based weld detection for precision pulse TIG welding were investigated. With a particular all hardware circuit, arc voltage during peak current stage is sampled and inte...Methods of arc length control and visual image based weld detection for precision pulse TIG welding were investigated. With a particular all hardware circuit, arc voltage during peak current stage is sampled and integrated to indicate arc length, deviation of arc length and adjusting parameters are calculated and output to drive a step motor directly. According to the features of welding image grabbed with CCD camera, a special algorithm was developed to detect the central line of weld fast and accurately. Then an application system were established, whose static arc length error is ±0.1 mm with 20 A average current and 1 mm given arc length, static detection precision of weld is 0.01 mm , processing time of each image is less than 120 ms . Precision pulse TIG welding of some given thin stainless steel components with complicated curved surface was successfully realized.展开更多
To overcome the shortcomings of the Lee image enhancement algorithm and its improvement based on the logarithmic image processing(LIP) model, this paper proposes what we believe to be an effective image enhancement al...To overcome the shortcomings of the Lee image enhancement algorithm and its improvement based on the logarithmic image processing(LIP) model, this paper proposes what we believe to be an effective image enhancement algorithm. This algorithm introduces fuzzy entropy, makes full use of neighborhood information, fuzzy information and human visual characteristics.To enhance an image, this paper first carries out the reasonable fuzzy-3 partition of its histogram into the dark region, intermediate region and bright region. It then extracts the statistical characteristics of the three regions and adaptively selects the parameter αaccording to the statistical characteristics of the image’s gray-scale values. It also adds a useful nonlinear transform, thus increasing the ubiquity of the algorithm. Finally, the causes for the gray-scale value overcorrection that occurs in the traditional image enhancement algorithms are analyzed and their solutions are proposed.The simulation results show that our image enhancement algorithm can effectively suppress the noise of an image, enhance its contrast and visual effect, sharpen its edge and adjust its dynamic range.展开更多
A method for creating digital image copyright protection is proposed in this paper. The proposed method in this paper is based on visual cryptography defined by Noor and Shamir. The proposed method is working on selec...A method for creating digital image copyright protection is proposed in this paper. The proposed method in this paper is based on visual cryptography defined by Noor and Shamir. The proposed method is working on selection of random pixels from the original digital image instead of specific selection of pixels. The new method proposed does not require that the watermark pattern to be embedded in to the original digital image. Instead of that, verification information is generated which will be used to verify the ownership of the image. This leaves the marked image equal to the original image. The method is based on the relationship between randomly selected pixels and their 8-neighbors’ pixels. This relationship keeps the marked image coherent against diverse attacks even if the most significant bits of randomly selected pixels have been changed by attacker as we will see later in this paper. Experimental results show the proposed method can recover the watermark pattern from the marked image even if major changes are made to the original digital image.展开更多
With the development of anti-virus technology,malicious documents have gradually become the main pathway of Advanced Persistent Threat(APT)attacks,therefore,the development of effective malicious document classifiers ...With the development of anti-virus technology,malicious documents have gradually become the main pathway of Advanced Persistent Threat(APT)attacks,therefore,the development of effective malicious document classifiers has become particularly urgent.Currently,detection methods based on document structure and behavioral features encounter challenges in feature engineering,these methods not only have limited accuracy,but also consume large resources,and usually can only detect documents in specific formats,which lacks versatility and adaptability.To address such problems,this paper proposes a novel malicious document detection method-visualizing documents as GGE images(Grayscale,Grayscale matrix,Entropy).The GGE method visualizes the original byte sequence of the malicious document as a grayscale image,the information entropy sequence of the document as an entropy image,and at the same time,the grayscale level co-occurrence matrix and the texture and spatial information stored in it are converted into grayscale matrix image,and fuses the three types of images to get the GGE color image.The Convolutional Block Attention Module-EfficientNet-B0(CBAM-EfficientNet-B0)model is then used for classification,combining transfer learning and applying the pre-trained model on the ImageNet dataset to the feature extraction process of GGE images.As shown in the experimental results,the GGE method has superior performance compared with other methods,which is suitable for detecting malicious documents in different formats,and achieves an accuracy of 99.44%and 97.39%on Portable Document Format(PDF)and office datasets,respectively,and consumes less time during the detection process,which can be effectively applied to the task of detecting malicious documents in real-time.展开更多
The key to the wavelet based denoising teehniquea is how to manipulate the wavelet coefficients. By referring to the idea of Inclusive-OR in the design of circuits, this paper proposes a new algorithm called wavelet d...The key to the wavelet based denoising teehniquea is how to manipulate the wavelet coefficients. By referring to the idea of Inclusive-OR in the design of circuits, this paper proposes a new algorithm called wavelet domain Inclusive-OR denoising algorithm(WDIDA), which distinguishes the wavelet coefficients belonging to image or noise by considering their phases and modulus maxima simultaneously. Using this new algorithm, the denoising effects are improved and the computation time is reduced. Furthermore, in order to enhance the edges of the image but not magnify noise, a contrast nonlinear enhancing algorithm is presented according to human visual properties. Compared with traditional enhancing algorithms, the algorithm that we proposed has a better noise reducing performanee , preserving edges and improving the visual quality of images.展开更多
The dynamic multichannel binocular visual image modeling is studied based on Internet of Things (IoT) Perception Layer, using mobile robot self-organizing network. By employing multigroup mobile robots with binocular ...The dynamic multichannel binocular visual image modeling is studied based on Internet of Things (IoT) Perception Layer, using mobile robot self-organizing network. By employing multigroup mobile robots with binocular visual system, the real visual images of the object will be obtained. Then through the mobile self-organizing network, a three-dimensional model is rebuilt by synthesizing the returned images. On this basis, we formalize a novel algorithm for multichannel binocular visual three-dimensional images based on fast three-dimensional modeling. Compared with the method based on single binocular visual system, the new algorithm can improve the Integrity and accuracy of the dynamic three-dimensional object modeling. The simulation results show that the new method can effectively accelerate the modeling speed, improve the similarity and not increase the data size.展开更多
Visual cortical prostheses have the potential to restore partial vision. Still limited by the low-resolution visual percepts provided by visual cortical prostheses, implant wearers can currently only "see" pixelized...Visual cortical prostheses have the potential to restore partial vision. Still limited by the low-resolution visual percepts provided by visual cortical prostheses, implant wearers can currently only "see" pixelized images, and how to obtain the specific brain responses to different pixelized images in the primary visual cortex(the implant area) is still unknown. We conducted a functional magnetic resonance imaging experiment on normal human participants to investigate the brain activation patterns in response to 18 different pixelized images. There were 100 voxels in the brain activation pattern that were selected from the primary visual cortex, and voxel size was 4 mm × 4 mm × 4 mm. Multi-voxel pattern analysis was used to test if these 18 different brain activation patterns were specific. We chose a Linear Support Vector Machine(LSVM) as the classifier in this study. The results showed that the classification accuracies of different brain activation patterns were significantly above chance level, which suggests that the classifier can successfully distinguish the brain activation patterns. Our results suggest that the specific brain activation patterns to different pixelized images can be obtained in the primary visual cortex using a 4 mm × 4 mm × 4 mm voxel size and a 100-voxel pattern.展开更多
A visual sensing system was developed. The system is suitable for titanium-alloy electron-beam welding, and senses and detects molten-pool dynamic processes. A suite of processing programs for colored molten-pool imag...A visual sensing system was developed. The system is suitable for titanium-alloy electron-beam welding, and senses and detects molten-pool dynamic processes. A suite of processing programs for colored molten-pool images in titanium-alloy electron-beam welding was developed using Matlab software; molten-pool edge images are completely obtained using the program. The Matlab software was used to write a program which could extract the molten-pool width. The functional relationship between the molten-pool width and penetration under the experimental conditions was obtained by a curve-fitting method, and provided the theoretical basis for further penetration control.展开更多
Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt ver...Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt vertical variations of story stiffness are known to significantly increase the likelihood of collapse during moderate or severe earthquakes. Identifying and retrofitting buildings with such irregularities—generally termed as soft-story buildings—is, therefore, vital in earthquake preparedness and loss mitigation efforts. Soft-story building identification through conventional means is a labor-intensive and time-consuming process. In this study, an automated procedure was devised based on deep learning techniques for identifying soft-story buildings from street-view images at a regional scale. A database containing a large number of building images and a semi-automated image labeling approach that effectively annotates new database entries was developed for developing the deep learning model. Extensive computational experiments were carried out to examine the effectiveness of the proposed procedure, and to gain insights into automated soft-story building identification.展开更多
基金supported by the National Natural Science Foundation of China(Nos.11905028 and 12105040)Scientific Research Project of the Education Department of Jilin Province(No.JJKH20231294KJ)the Youth Growth Technology Project of the Science and Technology Department of Jilin Province(No.20210508027RQ).
文摘Neutron radiographic images(NRIs)typically suffer from multiple distortions,including various types of noise,geometric unsharpness,and white spots.Image quality assessment(IQA)can guide on-site image screening and even provide metrics for subsequent image processing.However,existing IQA methods for NRIs cannot effectively evaluate the quality of real NRIs with a specific distortion of white spots,limiting their practical application.In this paper,a novel no-reference IQA method is proposed to comprehensively evaluate the quality of real NRIs with multiple distortions.First,we construct large-scale NRI datasets with more than 20,000 images,including high-quality original NRIs and synthetic NRIs with various distortions.Next,an image quality calibration method based on visual salience and a local quality map is introduced to label the NRI dataset with quality scores.Finally,a lightweight convolutional neural network(CNN)model is designed to learn the abstract relationship between the NRIs and quality scores using the constructed NRI training dataset.Extensive experimental results demonstrate that the proposed method exhibits good consistency with human visual perception when evaluating both real NRIs and processed NRIs using enhancement and restoration algorithms,highlighting its application potential.
基金supportted by the Natural Science Foundation of China(62394311,62394310)Beijing Natural Science Foundation(QY24034)National Biomedical Imaging Facility Grant and from the startup funds of Peking University Health Science Center.
文摘Background:Source-free unsupervised domain adaptation(SFUDA)methods aim to address the challenge of domain shift while preserving data privacy.Existing SFUDA approaches construct reliable and confident pseudo-labels for target-domain data through denoising methods,thereby guiding the training of the target-domain model.The effectiveness of denoising approaches is influenced by the degree of domain gap between the source and target domains.A marked shift can cause the pseudo-labels to be unreliable,even after applying denoising.Methods:We propose a novel 2-stage framework for SFUDA called visual prompt source-free domain adaptation(VP-SFDA).We propose input-specific visual prompt in the first stage,prompting process,which bridges the target-domain data to source-domain distribution.Our method utilizes visual prompts and batch normalization constraint to enable the alignment model to learn domainspecific knowledge and align the target-domain data with the source-domain contribution.The second stage is the adaptation process,which aims at optimizing the segmentation model from the source domain to the target domain.This is accomplished through the denoising techniques,ultimately enhancing the performance.Results:Our study presents a comparative analysis of several SFUDA techniques in the VPSFDA framework across 4 tasks:abdominal magnetic resonance imaging(MRI)to computed tomography(CT),abdominal CT to MRI,cardiac MRI to CT,and cardiac CT to MRI.Notably,in the abdominal MRI to CT adaptation task,the VP-OS method achieved a remarkable improvement,increasing the average DICE score from 0.658 to 0.773(P<0.01)and reducing the average surface distance(ASD)from 3.489 to 2.961(P<0.01).Similarly,the VP-LD and VP-DPL methods also showed significant improvements over their base algorithms in both abdominal and cardiac MRI to CT tasks.Conclusions:This paper proposes VP-SFDA,a novel 2-stage framework for SFUDA in medical imaging,which achieves superior performance through input-specific visual prompts and batch normalization constraint for domain adaptation,coupled with denoising methods for enhanced results.Comparative experiments on 4 medical SFUDA tasks demonstrate that VO-SFDA surpasses existing methods,with ablation studies confirming the benefits of domain-specific patterns.
文摘Exploration of artworks is enjoyable but often time consuming.For example,it is not always easy to discover the favorite types of unknown painting works.It is not also always easy to explore unpopular painting works which looks similar to painting works created by famous artists.This paper presents a painting image browser which assists the explorative discovery of user-interested painting works.The presented browser applies a new multidimensional data visualization technique that highlights particular ranges of particular numeric values based on association rules to suggest cues to find favorite painting images.This study assumes a large number of painting images are provided where categorical information(e.g.,names of artists,created year)is assigned to the images.The presented system firstly calculates the feature values of the images as a preprocessing step.Then the browser visualizes the multidimensional feature values as a heatmap and highlights association rules discovered from the relationships between the feature values and categorical information.This mechanism enables users to explore favorite painting images or painting images that look similar to famous painting works.Our case study and user evaluation demonstrates the effectiveness of the presented image browser.
基金The National Key Research and Development Program of China(No.2017YFA0104302)the National Natural Science Foundation of China(No.51832001,61821002,81971750).
文摘A novel method combining visualization particle tracking with image-based dynamic light scattering was developed to achieve the in situ and real-time size measurement of nanobubbles(NBs).First,the in situ size distribution of NBs was visualized by dark-field microscopy.Then,real-time size during the preparation was measured using image-based dynamic light scattering,and the longitudinal size distribution of NBs in the sample cell was obtained in a steady state.Results show that this strategy can provide a detailed and accurate size of bubbles in the whole sample compared with the commercial ZetaSizer Nano equipment.Therefore,the developed method is a real-time and simple technology with excellent accuracy,providing new insights into the accurate measurement of the size distribution of NBs or nanoparticles in solution.
基金Sponsored by the National Natural Science Foundation of China(10625208)the Basic Research Foundation of Beijing Institute of Technology(20061242005)the Foundation of State Key Laboratory of Explosion Science and Technology(ZDKT08-02)
文摘The advantages and disadvantages of two existing methods for explosive field visualization are analyzed in this paper. And a new method based on image fusion is proposed to integrate their complementary advantages. With the method, two source images built by equal mapping and modulus mapping are individually decomposed into two Gauss-Laplacian pyramid sequences. Then, the two individual sequences are used to make a composite one according to the process of fusion. Finally, a new image is reconstructed from the composite sequence. Experimental results show that the new images integrate the advantages of sources, effectively improve the visualization, and disclose more information about explosive field.
文摘Background A medical content-based image retrieval(CBIR)system is designed to retrieve images from large imaging repositories that are visually similar to a user′s query image.CBIR is widely used in evidence-based diagnosis,teaching,and research.Although the retrieval accuracy has largely improved,there has been limited development toward visualizing important image features that indicate the similarity of retrieved images.Despite the prevalence of 3D volumetric data in medical imaging such as computed tomography(CT),current CBIR systems still rely on 2D cross-sectional views for the visualization of retrieved images.Such 2D visualization requires users to browse through the image stacks to confirm the similarity of the retrieved images and often involves mental reconstruction of 3D information,including the size,shape,and spatial relations of multiple structures.This process is time-consuming and reliant on users'experience.Methods In this study,we proposed an importance-aware 3D volume visualization method.The rendering parameters were automatically optimized to maximize the visibility of important structures that were detected and prioritized in the retrieval process.We then integrated the proposed visualization into a CBIR system,thereby complementing the 2D cross-sectional views for relevance feedback and further analyses.Results Our preliminary results demonstrate that 3D visualization can provide additional information using multimodal positron emission tomography and computed tomography(PETCT)images of a non-small cell lung cancer dataset.
文摘In this paper, with the general retrospect to the research on surface reconstruction and the marching cubes algorithm, we gave detailed description of an algorithm on the construction of object surfaces. The possible ambiguity problem in the original marching cubes algorithm was eliminated by its index mechanism. Some results on the MRI images were presented. Based on extracting and clipping contours from a set of medial slice images and setting the patch vertices values according to the gray images, this algorithm may be applied to form the arbitrary section images with three dimensional effects. It can also enhance the visual effect and interpretation of medical data.
基金National 973 Basic Research Program of Chinagrant number:2010CB732600+4 种基金Major Research Equipment Fund of the Chinese Academy of Sciences and Knowledge Innovation Project of the Chinese Academy of Sciences,2008 Shenzhen Controversial Technology Innovation Research Projectsgrant number:FG200805230224AConcentration plan of innovation sources of Shenzhen-R&D projects of international cooperation on science and technologygrant number:ZYA200903260065ANatural Science Foundation of Guangdong Province,China 8478922035-X0007007
文摘Three-dimensional medical image visualization becomes an essential part for medical field, including computer aided diagnosis, surgery planning and simulation, artificial limb surgery, radiotherapy planning, and teaching etc. In this paper, marching cubes algorithm is adopted to reconstruct the 3-D images for the CT image sequence in DICOM format under theVC++6.0 and the visual package VTK platform. The relatively simple interactive operations such as rotation and transfer can be realized on the platform. Moreover, the normal vector and interior point are calculated to form the virtual clipping plane, which is then used to incise the 3-D object. Information of the virtual slice can be obtained, in the mean while the virtual slice images are displayed on the screen. The technique can realize the real time interaction extraction of virtual slice on 3-D CT image. The cuboids structured can be zoomed, moved and eircumrotated by operating mouse to incise the 3-D reconstruction object. Real time interaction can be realized by clipping the reconstruction object. The coordinates can be acquired by the mouse clicking in the 3D space, to realize the point mouse pick-up as well angle and distance interactive measurement. We can get quantitative information about 3-D images through measurement.
文摘SQL Server 2005的image型数据不能通过INSERT和UPDATE等语句进行插入和更新,这给处理image型数据带来十分不便。讨论了在Visual Basic中处理SQL Server 2005的image型数据的一般方法,即利用ADO数据对象的Fields集合的AppendChunk方法和GetChunk方法及ADO Data控件进行数据的填充和读取。
基金supported by the National Natural Science Foundation of China(Nos.61771027,61071139,61471019,61671035)supported in part under the Royal Society of Edinburgh-National Natural Science Foundation of China(RSE-NNSFC)Joint Project(2017–2019)(No.6161101383)with China University of Petroleum(Huadong)partially supported by the UK Engineering and Physical Sciences Research Council(EPSRC)(Nos.EP/I009310/1,EP/M026981/1)
文摘Synthetic Aperture Radar(SAR) imaging systems have been widely used in civil and military fields due to their all-weather and all-day abilities and various other advantages. However, due to image data exponentially increasing, there is a need for novel automatic target detection and recognition technologies. In recent years, the visual attention mechanism in the visual system has helped humans effectively deal with complex visual signals. In particular, biologically inspired top-down attention models have garnered much attention recently. This paper presents a visual attention model for SAR target detection, comprising a bottom-up stage and top-down process.In the bottom-up step, the Itti model is improved based on the difference between SAR and optical images. The top-down step fully utilizes prior information to further detect targets. Extensive detection experiments carried out on the benchmark Moving and Stationary Target Acquisition and Recognition(MSTAR) dataset show that, compared with typical visual models and other popular detection methods, our model has increased ability and robustness for SAR target detection, under a range of Signal to Clutter Ratio(SCR) conditions and scenes. In addition, results obtained using only the bottom-up stage are inferior to those of the proposed method, further demonstrating the effectiveness and rationality of a top-down strategy. In summary, our proposed visual attention method can be considered a potential benchmark resource for the SAR research community.
文摘Methods of arc length control and visual image based weld detection for precision pulse TIG welding were investigated. With a particular all hardware circuit, arc voltage during peak current stage is sampled and integrated to indicate arc length, deviation of arc length and adjusting parameters are calculated and output to drive a step motor directly. According to the features of welding image grabbed with CCD camera, a special algorithm was developed to detect the central line of weld fast and accurately. Then an application system were established, whose static arc length error is ±0.1 mm with 20 A average current and 1 mm given arc length, static detection precision of weld is 0.01 mm , processing time of each image is less than 120 ms . Precision pulse TIG welding of some given thin stainless steel components with complicated curved surface was successfully realized.
基金supported by the National Natural Science Foundation of China(61472324)
文摘To overcome the shortcomings of the Lee image enhancement algorithm and its improvement based on the logarithmic image processing(LIP) model, this paper proposes what we believe to be an effective image enhancement algorithm. This algorithm introduces fuzzy entropy, makes full use of neighborhood information, fuzzy information and human visual characteristics.To enhance an image, this paper first carries out the reasonable fuzzy-3 partition of its histogram into the dark region, intermediate region and bright region. It then extracts the statistical characteristics of the three regions and adaptively selects the parameter αaccording to the statistical characteristics of the image’s gray-scale values. It also adds a useful nonlinear transform, thus increasing the ubiquity of the algorithm. Finally, the causes for the gray-scale value overcorrection that occurs in the traditional image enhancement algorithms are analyzed and their solutions are proposed.The simulation results show that our image enhancement algorithm can effectively suppress the noise of an image, enhance its contrast and visual effect, sharpen its edge and adjust its dynamic range.
文摘A method for creating digital image copyright protection is proposed in this paper. The proposed method in this paper is based on visual cryptography defined by Noor and Shamir. The proposed method is working on selection of random pixels from the original digital image instead of specific selection of pixels. The new method proposed does not require that the watermark pattern to be embedded in to the original digital image. Instead of that, verification information is generated which will be used to verify the ownership of the image. This leaves the marked image equal to the original image. The method is based on the relationship between randomly selected pixels and their 8-neighbors’ pixels. This relationship keeps the marked image coherent against diverse attacks even if the most significant bits of randomly selected pixels have been changed by attacker as we will see later in this paper. Experimental results show the proposed method can recover the watermark pattern from the marked image even if major changes are made to the original digital image.
基金supported by the Natural Science Foundation of Henan Province(Grant No.242300420297)awarded to Yi Sun.
文摘With the development of anti-virus technology,malicious documents have gradually become the main pathway of Advanced Persistent Threat(APT)attacks,therefore,the development of effective malicious document classifiers has become particularly urgent.Currently,detection methods based on document structure and behavioral features encounter challenges in feature engineering,these methods not only have limited accuracy,but also consume large resources,and usually can only detect documents in specific formats,which lacks versatility and adaptability.To address such problems,this paper proposes a novel malicious document detection method-visualizing documents as GGE images(Grayscale,Grayscale matrix,Entropy).The GGE method visualizes the original byte sequence of the malicious document as a grayscale image,the information entropy sequence of the document as an entropy image,and at the same time,the grayscale level co-occurrence matrix and the texture and spatial information stored in it are converted into grayscale matrix image,and fuses the three types of images to get the GGE color image.The Convolutional Block Attention Module-EfficientNet-B0(CBAM-EfficientNet-B0)model is then used for classification,combining transfer learning and applying the pre-trained model on the ImageNet dataset to the feature extraction process of GGE images.As shown in the experimental results,the GGE method has superior performance compared with other methods,which is suitable for detecting malicious documents in different formats,and achieves an accuracy of 99.44%and 97.39%on Portable Document Format(PDF)and office datasets,respectively,and consumes less time during the detection process,which can be effectively applied to the task of detecting malicious documents in real-time.
文摘The key to the wavelet based denoising teehniquea is how to manipulate the wavelet coefficients. By referring to the idea of Inclusive-OR in the design of circuits, this paper proposes a new algorithm called wavelet domain Inclusive-OR denoising algorithm(WDIDA), which distinguishes the wavelet coefficients belonging to image or noise by considering their phases and modulus maxima simultaneously. Using this new algorithm, the denoising effects are improved and the computation time is reduced. Furthermore, in order to enhance the edges of the image but not magnify noise, a contrast nonlinear enhancing algorithm is presented according to human visual properties. Compared with traditional enhancing algorithms, the algorithm that we proposed has a better noise reducing performanee , preserving edges and improving the visual quality of images.
基金supported by HiTech Researchand Development Program of China under Grant No.2007AA10Z235
文摘The dynamic multichannel binocular visual image modeling is studied based on Internet of Things (IoT) Perception Layer, using mobile robot self-organizing network. By employing multigroup mobile robots with binocular visual system, the real visual images of the object will be obtained. Then through the mobile self-organizing network, a three-dimensional model is rebuilt by synthesizing the returned images. On this basis, we formalize a novel algorithm for multichannel binocular visual three-dimensional images based on fast three-dimensional modeling. Compared with the method based on single binocular visual system, the new algorithm can improve the Integrity and accuracy of the dynamic three-dimensional object modeling. The simulation results show that the new method can effectively accelerate the modeling speed, improve the similarity and not increase the data size.
基金supported by the National Natural Science Foundation of China,No.31070758,31271060the Natural Science Foundation of Chongqing in China,No.cstc2013jcyj A10085
文摘Visual cortical prostheses have the potential to restore partial vision. Still limited by the low-resolution visual percepts provided by visual cortical prostheses, implant wearers can currently only "see" pixelized images, and how to obtain the specific brain responses to different pixelized images in the primary visual cortex(the implant area) is still unknown. We conducted a functional magnetic resonance imaging experiment on normal human participants to investigate the brain activation patterns in response to 18 different pixelized images. There were 100 voxels in the brain activation pattern that were selected from the primary visual cortex, and voxel size was 4 mm × 4 mm × 4 mm. Multi-voxel pattern analysis was used to test if these 18 different brain activation patterns were specific. We chose a Linear Support Vector Machine(LSVM) as the classifier in this study. The results showed that the classification accuracies of different brain activation patterns were significantly above chance level, which suggests that the classifier can successfully distinguish the brain activation patterns. Our results suggest that the specific brain activation patterns to different pixelized images can be obtained in the primary visual cortex using a 4 mm × 4 mm × 4 mm voxel size and a 100-voxel pattern.
文摘A visual sensing system was developed. The system is suitable for titanium-alloy electron-beam welding, and senses and detects molten-pool dynamic processes. A suite of processing programs for colored molten-pool images in titanium-alloy electron-beam welding was developed using Matlab software; molten-pool edge images are completely obtained using the program. The Matlab software was used to write a program which could extract the molten-pool width. The functional relationship between the molten-pool width and penetration under the experimental conditions was obtained by a curve-fitting method, and provided the theoretical basis for further penetration control.
基金supported by the US National Science Foundation under Grant No. 1612843. NHERI Design Safe (Rathje et al., 2017)Texas Advanced Computing Center (TACC)。
文摘Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt vertical variations of story stiffness are known to significantly increase the likelihood of collapse during moderate or severe earthquakes. Identifying and retrofitting buildings with such irregularities—generally termed as soft-story buildings—is, therefore, vital in earthquake preparedness and loss mitigation efforts. Soft-story building identification through conventional means is a labor-intensive and time-consuming process. In this study, an automated procedure was devised based on deep learning techniques for identifying soft-story buildings from street-view images at a regional scale. A database containing a large number of building images and a semi-automated image labeling approach that effectively annotates new database entries was developed for developing the deep learning model. Extensive computational experiments were carried out to examine the effectiveness of the proposed procedure, and to gain insights into automated soft-story building identification.