期刊文献+
共找到15篇文章
< 1 >
每页显示 20 50 100
Automatic salient object segmentation using saliency map and color segmentation 被引量:1
1
作者 HAN Sung-ho JUNG Gye-dong +2 位作者 LEE Sangh-yuk HONG Yeong-pyo LEE Sang-hun 《Journal of Central South University》 SCIE EI CAS 2013年第9期2407-2413,共7页
A new method for automatic salient object segmentation is presented.Salient object segmentation is an important research area in the field of object recognition,image retrieval,image editing,scene reconstruction,and 2... A new method for automatic salient object segmentation is presented.Salient object segmentation is an important research area in the field of object recognition,image retrieval,image editing,scene reconstruction,and 2D/3D conversion.In this work,salient object segmentation is performed using saliency map and color segmentation.Edge,color and intensity feature are extracted from mean shift segmentation(MSS)image,and saliency map is created using these features.First average saliency per segment image is calculated using the color information from MSS image and generated saliency map.Then,second average saliency per segment image is calculated by applying same procedure for the first image to the thresholding,labeling,and hole-filling applied image.Thresholding,labeling and hole-filling are applied to the mean image of the generated two images to get the final salient object segmentation.The effectiveness of proposed method is proved by showing 80%,89%and 80%of precision,recall and F-measure values from the generated salient object segmentation image and ground truth image. 展开更多
关键词 salient object visual attention saliency map color segmentation
在线阅读 下载PDF
Rate Control Algorithm of Wireless Video Based on Visual Saliency Map Model 被引量:1
2
作者 阮若林 胡瑞敏 +1 位作者 李忠明 尹黎明 《China Communications》 SCIE CSCD 2011年第7期105-110,共6页
In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on... In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on human visual saliency model in H.264/AVC. Firstly, we modifie Itti's saliency model. Secondly, target bits of each frame are allocated through the correlation of saliency region between the current and previous frame, and the complexity of each MB is modified through the saliency value and its Mean Absolute Difference (MAD) value. Lastly, the algorithm was implemented in JVT JM12.2. Simulation results show that, comparing with traditional rate control algorithm, the proposed one can reduce the coding bit rate and improve the reconstructed video subjective quality, especially for visual saliency region. It is very suitable for wireless video transmission. 展开更多
关键词 human visual system saliency map model wireless video coding rate control H.264/AVC
在线阅读 下载PDF
Deformable Registration Algorithm via Non-subsampled Contourlet Transform and Saliency Map
3
作者 Chang Qing Yang Wenyou Chen Lanlan 《Journal of Shanghai Jiaotong university(Science)》 EI 2022年第4期452-462,共11页
Medical image registration is widely used in image-guided therapy and image-guided surgery to estimate spatial correspondence between planning and treatment images.However,most methods based on intensity have the prob... Medical image registration is widely used in image-guided therapy and image-guided surgery to estimate spatial correspondence between planning and treatment images.However,most methods based on intensity have the problems of matching ambiguity and ignoring the influence of weak correspondence areas on the overall registration.In this study,we propose a novel general-purpose registration algorithm based on free-form deformation by non-subsampled contourlet transform and saliency map,which can reduce the matching ambiguities and maintain the topological structure of weak correspondence areas.An optimization method based on Markov random fields is used to optimize the registration process.Experiments on four public datasets from brain,cardiac,and lung have demonstrated the general applicability and the accuracy of our algorithm compared with two state-of-the-art methods. 展开更多
关键词 medical image registration non-subsampled contourlet transform saliency map Markov random fields
原文传递
Anatomic Boundary-Aware Explanation for Convolutional Neural Networks in Diagnostic Radiology
4
作者 Han Yuan 《iRADIOLOGY》 2025年第1期47-60,共14页
Background:Convolutional neural networks(CNN)have achieved remarkable success in medical image analysis.However,unlike some general-domain tasks where model accuracy is paramount,medical applications demand both accur... Background:Convolutional neural networks(CNN)have achieved remarkable success in medical image analysis.However,unlike some general-domain tasks where model accuracy is paramount,medical applications demand both accuracy and explainability due to the high stakes affecting patients'lives.Based on model explanations,clinicians can evaluate the diagnostic decisions suggested by CNN.Nevertheless,prior explainable artificial intelligence methods treat medical image tasks akin to general vision tasks,following end-to-end paradigms to generate explanations and frequently overlooking crucial clinical domain knowledge.Methods:We propose a plug-and-play module that explicitly integrates anatomic boundary information into the explanation process for CNN-based thoracopathy classifiers.To generate the anatomic boundary of the lung parenchyma,we utilize a lung segmentation model developed on external public datasets and deploy it on the unseen target dataset to constrain model ex-planations within the lung parenchyma for the clinical task of thoracopathy classification.Results:Assessed by the intersection over union and dice similarity coefficient between model-extracted explanations and expert-annotated lesion areas,our method consistently outperformed the baseline devoid of clinical domain knowledge in 71 out of 72 scenarios,encompassing 3 CNN architectures(VGG-11,ResNet-18,and AlexNet),2 classification settings(binary and multi-label),3 explanation methods(Saliency Map,Grad-CAM,and Integrated Gradients),and 4 co-occurred thoracic diseases(Atelectasis,Fracture,Mass,and Pneumothorax).Conclusions:We underscore the effectiveness of leveraging radiology knowledge in improving model explanations for CNN and envisage that it could inspire future efforts to integrate clinical domain knowledge into medical image analysis. 展开更多
关键词 ATELECTASIS convolutional neural networks diagnostic radiology explainable artificial intelligence FRACTURE grad-cam integrated gradients mass PNEUMOTHORAX saliency map
在线阅读 下载PDF
Enhanced Object Detection and Classification via Multi-Method Fusion
5
作者 Muhammad Waqas Ahmed Nouf Abdullah Almujally +2 位作者 Abdulwahab Alazeb Asaad Algarni Jeongmin Park 《Computers, Materials & Continua》 SCIE EI 2024年第5期3315-3331,共17页
Advances in machine vision systems have revolutionized applications such as autonomous driving,robotic navigation,and augmented reality.Despite substantial progress,challenges persist,including dynamic backgrounds,occ... Advances in machine vision systems have revolutionized applications such as autonomous driving,robotic navigation,and augmented reality.Despite substantial progress,challenges persist,including dynamic backgrounds,occlusion,and limited labeled data.To address these challenges,we introduce a comprehensive methodology toenhance image classification and object detection accuracy.The proposed approach involves the integration ofmultiple methods in a complementary way.The process commences with the application of Gaussian filters tomitigate the impact of noise interference.These images are then processed for segmentation using Fuzzy C-Meanssegmentation in parallel with saliency mapping techniques to find the most prominent regions.The Binary RobustIndependent Elementary Features(BRIEF)characteristics are then extracted fromdata derived fromsaliency mapsand segmented images.For precise object separation,Oriented FAST and Rotated BRIEF(ORB)algorithms areemployed.Genetic Algorithms(GAs)are used to optimize Random Forest classifier parameters which lead toimproved performance.Our method stands out due to its comprehensive approach,adeptly addressing challengessuch as changing backdrops,occlusion,and limited labeled data concurrently.A significant enhancement hasbeen achieved by integrating Genetic Algorithms(GAs)to precisely optimize parameters.This minor adjustmentnot only boosts the uniqueness of our system but also amplifies its overall efficacy.The proposed methodologyhas demonstrated notable classification accuracies of 90.9%and 89.0%on the challenging Corel-1k and MSRCdatasets,respectively.Furthermore,detection accuracies of 87.2%and 86.6%have been attained.Although ourmethod performed well in both datasets it may face difficulties in real-world data especially where datasets havehighly complex backgrounds.Despite these limitations,GAintegration for parameter optimization shows a notablestrength in enhancing the overall adaptability and performance of our system. 展开更多
关键词 BRIEF features saliency map fuzzy c-means object detection object recognition
在线阅读 下载PDF
Infrared and Visible Image Fusion Based on Region of Interest Detection and Nonsubsampled Contourlet Transform 被引量:17
6
作者 刘欢喜 朱天竑 赵佳佳 《Journal of Shanghai Jiaotong university(Science)》 EI 2013年第5期526-534,共9页
In order to enhance the contrast of the fused image and reduce the loss of fine details in the process of image fusion,a novel fusion algorithm of infrared and visible images is proposed.First of all,regions of intere... In order to enhance the contrast of the fused image and reduce the loss of fine details in the process of image fusion,a novel fusion algorithm of infrared and visible images is proposed.First of all,regions of interest(RoIs)are detected in two original images by using saliency map.Then,nonsubsampled contourlet transform(NSCT)on both the infrared image and the visible image is performed to get a low-frequency sub-band and a certain amount of high-frequency sub-bands.Subsequently,the coefcients of all sub-bands are classified into four categories based on the result of RoI detection:the region of interest in the low-frequency sub-band(LSRoI),the region of interest in the high-frequency sub-band(HSRoI),the region of non-interest in the low-frequency sub-band(LSNRoI)and the region of non-interest in the high-frequency sub-band(HSNRoI).Fusion rules are customized for each kind of coefcients and fused image is achieved by performing the inverse NSCT to the fused coefcients.Experimental results show that the fusion scheme proposed in this paper achieves better efect than the other fusion algorithms both in visual efect and quantitative metrics. 展开更多
关键词 image fusion region of interest(RoI) detection saliency map nonsubsampled contourlet transform(NSCT)
原文传递
A Convolutional Neural Network Classifier VGG-19 Architecture for Lesion Detection and Grading in Diabetic Retinopathy Based on Deep Learning 被引量:3
7
作者 V.Sudha T.R.Ganeshbabu 《Computers, Materials & Continua》 SCIE EI 2021年第1期827-842,共16页
Diabetic Retinopathy(DR)is a type of disease in eyes as a result of a diabetic condition that ends up damaging the retina,leading to blindness or loss of vision.Morphological and physiological retinal variations invol... Diabetic Retinopathy(DR)is a type of disease in eyes as a result of a diabetic condition that ends up damaging the retina,leading to blindness or loss of vision.Morphological and physiological retinal variations involving slowdown of blood flow in the retina,elevation of leukocyte cohesion,basement membrane dystrophy,and decline of pericyte cells,develop.As DR in its initial stage has no symptoms,early detection and automated diagnosis can prevent further visual damage.In this research,using a Deep Neural Network(DNN),segmentation methods are proposed to detect the retinal defects such as exudates,hemorrhages,microaneurysms from digital fundus images and then the conditions are classified accurately to identify the grades as mild,moderate,severe,no PDR,PDR in DR.Initially,saliency detection is applied on color images to detect maximum salient foreground objects from the background.Next,structure tensor is applied powerfully to enhance the local patterns of edge elements and intensity changes that occur on edges of the object.Finally,active contours approximation is performed using gradient descent to segment the lesions from the images.Afterwards,the output images from the proposed segmentation process are subjected to evaluate the ratio between the total contour area and the total true contour arc length to label the classes as mild,moderate,severe,No PDR and PDR.Based on the computed ratio obtained from segmented images,the severity levels were identified.Meanwhile,statistical parameters like the mean and the standard deviation of pixel intensities,mean of hue,saturation and deviation clustering,are estimated through K-means,which are computed as features from the output images of the proposed segmentation process.Using these derived feature sets as input to the classifier,the classification of DR was performed.Finally,a VGG-19 deep neural network was trained and tested using the derived feature sets from the KAGGLE fundus image dataset containing 35,126 images in total.The VGG-19 is trained with features extracted from 20,000 images and tested with features extracted from 5,000 images to achieve a sensitivity of 82%and an accuracy of 96%.The proposed system was able to label and classify DR grades automatically. 展开更多
关键词 Diabetic retinopathy saliency map structure tensor gradient descent method EXUDATES haemorrhages MICROANEURYSMS VGG-19
在线阅读 下载PDF
Interpreting Randomly Wired Graph Models for Chinese NER
8
作者 Jie Chen Jiabao Xu +2 位作者 Xuefeng Xi Zhiming Cui Victor S.Sheng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第1期747-761,共15页
Interpreting deep neural networks is of great importance to understand and verify deep models for natural language processing(NLP)tasks.However,most existing approaches only focus on improving the performance of model... Interpreting deep neural networks is of great importance to understand and verify deep models for natural language processing(NLP)tasks.However,most existing approaches only focus on improving the performance of models but ignore their interpretability.In this work,we propose a Randomly Wired Graph Neural Network(RWGNN)by using graph to model the structure of Neural Network,which could solve two major problems(word-boundary ambiguity and polysemy)of ChineseNER.Besides,we develop a pipeline to explain the RWGNNby using Saliency Map and Adversarial Attacks.Experimental results demonstrate that our approach can identify meaningful and reasonable interpretations for hidden states of RWGNN. 展开更多
关键词 Named entity recognition graph neural network saliency map random graph network INTERPRETATION
在线阅读 下载PDF
ELM-Based Shape Adaptive DCT Compression Technique for Underwater Image Compression
9
作者 M.Jamuna Rani C.Vasanthanayaki 《Computer Systems Science & Engineering》 SCIE EI 2023年第5期1953-1970,共18页
Underwater imagery and transmission possess numerous challenges like lower signal bandwidth,slower data transmission bit rates,Noise,underwater blue/green light haze etc.These factors distort the estimation of Region ... Underwater imagery and transmission possess numerous challenges like lower signal bandwidth,slower data transmission bit rates,Noise,underwater blue/green light haze etc.These factors distort the estimation of Region of Interest and are prime hurdles in deploying efficient compression techniques.Due to the presence of blue/green light in underwater imagery,shape adaptive or block-wise compression techniques faces failures as it becomes very difficult to estimate the compression levels/coefficients for a particular region.This method is proposed to efficiently deploy an Extreme Learning Machine(ELM)model-based shape adaptive Discrete Cosine Transformation(DCT)for underwater images.Underwater color image restoration techniques based on veiling light estimation and restoration of images followed by Saliency map estimation based on Gray Level Cooccurrence Matrix(GLCM)features are explained.An ELM network is modeled which takes two parameters,signal strength and saliency value of the region to be compressed and level of compression(DCT coefficients and compression steps)are predicted by ELM.This method ensures lesser errors in the Region of Interest and a better trade-off between available signal strength and compression level. 展开更多
关键词 Extreme learning machine discrete cosine transform saliency map
在线阅读 下载PDF
Key Frame Extraction Algorithm of Surveillance Video Based on Quaternion Fourier Significance Detection
10
作者 Zhang Yunzuo Zhang Jiayu Cai Zhaoquan 《Journal of New Media》 2022年第1期1-11,共11页
With the improvement of people’s security awareness,numerous monitoring equipment has been put into use,resulting in the explosive growth of surveillance video data.Key frame extraction technology is a paramount tech... With the improvement of people’s security awareness,numerous monitoring equipment has been put into use,resulting in the explosive growth of surveillance video data.Key frame extraction technology is a paramount technology for improving video storage efficiency and enhancing the accuracy of video retrieval.It can extract key frame sets that can express video content from massive videos.However,the existing key frame extraction algorithms of surveillance video still have deficiencies,such as the destruction of image information integrity and the inability to extract key frames accurately.To this end,this paper proposes a key frame extraction algorithm of surveillance video based on quaternion Fourier saliency detection.Firstly,the algorithm used colors,and intensity features to perform quaternion Fourier transform on surveillance video sequences.Next,the phase spectrum of the quaternion Fourier transformed image was obtained,and he image visual saliency map was obtained according to the quaternion Fourier phase spectrum.Then,the image visual saliency map of two adjacent frames is used to characterize the change of target motion state.Finally,the frames that can accurately express the motion state of the target are selected as key frames.The experimental results show that the method proposed in this paper can accurately capture the changes of the local motion state of the target while maintaining the integrity of the image information. 展开更多
关键词 Quaternion fourier transform phase spectrum image saliency map motion status
在线阅读 下载PDF
Salience adaptive morphological structuring element construction method based on minimum spanning tree 被引量:1
11
作者 YANG Wenting WANG Xiaopeng FANG Chao 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2021年第1期36-43,共8页
Classical mathematical morphology operations use a fixed size and shape structuring element to process the whole image.Due to the diversity of image content and the complexity of target structure,for processed image,i... Classical mathematical morphology operations use a fixed size and shape structuring element to process the whole image.Due to the diversity of image content and the complexity of target structure,for processed image,its shape may be changed and part of the information may be lost.Therefore,we propose a method for constructing salience adaptive morphological structuring elements based on minimum spanning tree(MST).First,the gradient image of the input image is calculated,the edge image is obtained by non-maximum suppression(NMS)of the gradient image,and then chamfer distance transformation is performed on the edge image to obtain a salience map(SM).Second,the radius of structuring element is determined by calculating the maximum and minimum values of SM and then the minimum spanning tree is calculated on the SM.Finally,the radius is used to construct a structuring element whose shape and size adaptively change with the local features of the input image.In addition,the basic morphological operators such as erosion,dilation,opening and closing are redefined using the adaptive structuring elements and then compared with the classical morphological operators.The simulation results show that the proposed method can make full use of the local features of the image and has better processing results in image structure preservation and image filtering. 展开更多
关键词 adaptive structuring element mathematical morphology salience map(SM) minimum spanning tree(MST)
在线阅读 下载PDF
Explaining deep neural network models for electricity price forecasting with XAI
12
作者 Antoine Pesenti Aidan O’Sullivan 《Energy and AI》 2025年第3期202-213,共12页
Electricity markets are highly complex,involving lots of interactions and complex dependencies that make it hard to understand the inner workings of the market and what is driving prices.Econometric methods have been ... Electricity markets are highly complex,involving lots of interactions and complex dependencies that make it hard to understand the inner workings of the market and what is driving prices.Econometric methods have been developed for this,white-box models,however,they are not as powerful as deep neural network models(DNN).In this paper,we use a DNN to forecast the price and then use XAI methods to understand the factors driving the price dynamics in the market.The objective is to increase our understanding of how different electricity markets work.To do that,we apply explainable methods such as SHAP and Gradient,combined with visual techniques like heatmaps(saliency maps)to analyse the behaviour and contributions of various features across five electricity markets.We introduce the novel concepts of SSHAP values and SSHAP lines to enhance the complex representation of high-dimensional tabular models. 展开更多
关键词 Electricity price forecasting EPF Explainable methods XAI Explainable AI SHAP GRADIENT saliency map
在线阅读 下载PDF
Development of a cutting-edge ensemble pipeline for rapid and accurate diagnosis of plant leaf diseases
13
作者 S.M.Nuruzzaman Nobel Maharin Afroj +1 位作者 Md Mohsin Kabir M.F.Mridha 《Artificial Intelligence in Agriculture》 2024年第4期56-72,共17页
Selecting techniques is a crucial aspect of disease detection analysis,particularly in the convergence of computer vision and agricultural technology.Maintaining crop disease detection in a timely and accurate manner ... Selecting techniques is a crucial aspect of disease detection analysis,particularly in the convergence of computer vision and agricultural technology.Maintaining crop disease detection in a timely and accurate manner is essential to maintaining global food security.Deep learning is a viable answer to meet this need.To proceed with this study,we have developed and evaluated a disease detection model using a novel ensemble technique.We propose to introduce DenseNetMini,a smaller version of DenseNet.We propose combining DenseNetMini with a learning resizer in ensemble approach to enhance training accuracy and expedite learning.Another unique proposition involves utilizing Gradient Product(GP)as an optimization technique,effectively reducing the training time and improving the model performance.Examining images at different magnifications reveals noteworthy diagnostic agreement and accuracy improvements.Test accuracy rates of 99.65%,98.96%,and 98.11%are seen in the Plantvillage,Tomato leaf,and Appleleaf9 datasets,respectively.One of the research's main achievements is the significant decrease in processing time,which suggests that using the GP could improve disease detection in agriculture's accessibility and efficiency.Beyond quantitative successes,the study highlights Explainable Artificial Intelligence(XAl)methods,which are essential to improving the disease detection model's interpretability and transparency.XAI enhances the interpretability of the model by visually identifying critical areas on plant leaves for disease identification,which promotes confidence and understanding of the model's functionality. 展开更多
关键词 Leaf disease Transfer learning GradCam saliency map Computer vision SUSTAINABILITY Agriculture
原文传递
Visual perception driven collage synthesis 被引量:2
14
作者 Zuyi Yang Qinghui Dai Junsong Zhang 《Computational Visual Media》 SCIE EI CSCD 2022年第1期79-91,共13页
A collage is a composite artwork made from the spatial layout of multiple pictures on a canvas,collected from the Internet or user photographs.Collages,usually made by skilled artists,involve a complex manual process,... A collage is a composite artwork made from the spatial layout of multiple pictures on a canvas,collected from the Internet or user photographs.Collages,usually made by skilled artists,involve a complex manual process,especially when searching for component pictures and adjusting their spatial layout to meet artistic requirements.In this paper,we present a visual perception driven method for automatically synthesizing visually pleasing collages.Unlike previous works,we focus on how to design a collage layout which not only provides easy access to the theme of the overall image,but also conforms to human visual perception.To achieve this goal,we formulate the generation of collages as a mapping problem:given a canvas image,first,compute a saliency map for it and a vector field for each sub-region of it.Second,using a divide-and-conquer strategy,generate a series of patch sets from the canvas image,where the salient map and the vector field are used to determine each patch’s size and direction respectively.Third,construct a Gestalt-based energy function to choose the most visually pleasing and orderly patch set as the final layout.Finally,using a semantic-color metric,map the picture set to the patch set to generate the final collage.Extensive experimental and user study results show that this method can generate visual pleasing collages. 展开更多
关键词 COLLAGE gestalt psychology saliency map layout optimization human visual perception
原文传递
Salient object extraction for user-targeted video content association
15
作者 Jia LI Han-nan YU +2 位作者 Yong-hong TIAN Tie-jun HUANG Wen GAO 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2010年第11期850-859,共10页
The increasing amount of videos on the Internet and digital libraries highlights the necessity and importance of interactive video services such as automatically associating additional materials(e.g.,advertising logos... The increasing amount of videos on the Internet and digital libraries highlights the necessity and importance of interactive video services such as automatically associating additional materials(e.g.,advertising logos and relevant selling information) with the video content so as to enrich the viewing experience.Toward this end,this paper presents a novel approach for user-targeted video content association(VCA) .In this approach,the salient objects are extracted automatically from the video stream using complementary saliency maps.According to these salient objects,the VCA system can push the related logo images to the users.Since the salient objects often correspond to important video content,the associated images can be considered as content-related.Our VCA system also allows users to associate images to the preferred video content through simple interactions by the mouse and an infrared pen.Moreover,by learning the preference of each user through collecting feedbacks on the pulled or pushed images,the VCA system can provide user-targeted services.Experimental results show that our approach can effectively and efficiently extract the salient objects.Moreover,subjective evaluations show that our system can provide content-related and user-targeted VCA services in a less intrusive way. 展开更多
关键词 Salient object extraction User-targeted video content association Complementary saliency maps
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部