期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Feedback LSTM Network Based on Attention for Image Description Generator 被引量:2
1
作者 Zhaowei Qu Bingyu Cao +3 位作者 Xiaoru Wang Fu Li Peirong Xu Luhan Zhang 《Computers, Materials & Continua》 SCIE EI 2019年第5期575-589,共15页
Images are complex multimedia data which contain rich semantic information.Most of current image description generator algorithms only generate plain description,with the lack of distinction between primary and second... Images are complex multimedia data which contain rich semantic information.Most of current image description generator algorithms only generate plain description,with the lack of distinction between primary and secondary object,leading to insufficient high-level semantic and accuracy under public evaluation criteria.The major issue is the lack of effective network on high-level semantic sentences generation,which contains detailed description for motion and state of the principal object.To address the issue,this paper proposes the Attention-based Feedback Long Short-Term Memory Network(AFLN).Based on existing codec framework,there are two independent sub tasks in our method:attention-based feedback LSTM network during decoding and the Convolutional Block Attention Module(CBAM)in the coding phase.First,we propose an attentionbased network to feedback the features corresponding to the generated word from the previous LSTM decoding unit.We implement feedback guidance through the related field mapping algorithm,which quantifies the correlation between previous word and latter word,so that the main object can be tracked with highlighted detailed description.Second,we exploit the attention idea and apply a lightweight and general module called CBAM after the last layer of VGG 16 pretraining network,which can enhance the expression of image coding features by combining channel and spatial dimension attention maps with negligible overheads.Extensive experiments on COCO dataset validate the superiority of our network over the state-of-the-art algorithms.Both scores and actual effects are proved.The BLEU 4 score increases from 0.291 to 0.301 while the CIDEr score rising from 0.912 to 0.952. 展开更多
关键词 image description generator feedback LSTM network ATTENTION CBAM
在线阅读 下载PDF
2D registration based on contour matching for partial matching images 被引量:1
2
作者 张见威 黄达承 +1 位作者 桂姜琴 叶文忠 《Journal of Central South University》 SCIE EI CAS 2014年第12期4553-4562,共10页
The mean Hausdorff distance, though highly applicable in image registration, does not work well on partial matching images. An improvement upon traditional Hausdorff-distance-based image registration method is propose... The mean Hausdorff distance, though highly applicable in image registration, does not work well on partial matching images. An improvement upon traditional Hausdorff-distance-based image registration method is proposed, which consists of the following two aspects. One is to estimate transformation parameters between two images from the distributions of geometric property differences instead of establishing explicit feature correspondences. This procedure is treated as the pre-registration. The other aspect is that mean Hausdorff distance computation is replaced with the analysis of the second difference of generalized Hausdorff distance so as to eliminate the redundant points. Experimental results show that our registration method outperforms the method based on mean Hausdorff distance. The registration errors are noticeably reduced in the partial matching images. 展开更多
关键词 image registration generalized Hausdorff distance partial matching image
在线阅读 下载PDF
Analysis of normal human retinal vascular network architecture using multifractal geometry 被引量:1
3
作者 Stefan Talu Sebastian Stach +2 位作者 Dan Mihai Calugaru Carmen Alina Lupascu Simona Delia Nicoara 《International Journal of Ophthalmology(English edition)》 SCIE CAS 2017年第3期434-438,共5页
AIM:To apply the multifractal analysis method as a quantitative approach to a comprehensive description of the microvascular network architecture of the normal human retina.METHODS:Fifty volunteers were enrolled in ... AIM:To apply the multifractal analysis method as a quantitative approach to a comprehensive description of the microvascular network architecture of the normal human retina.METHODS:Fifty volunteers were enrolled in this study in the Ophthalmological Clinic of Cluj-Napoca,Romania,between January 2012 and January 2014. A set of 100 segmented and skeletonised human retinal images,corresponding to normal states of the retina were studied. An automatic unsupervised method for retinal vessel segmentation was applied before multifractal analysis. The multifractal analysis of digital retinal images was made with computer algorithms,applying the standard boxcounting method. Statistical analyses were performed using the Graph Pad In Stat software.RESULTS:The architecture of normal human retinal microvascular network was able to be described using the multifractal geometry. The average of generalized dimensions(D_q)for q=0,1,2,the width of the multifractal spectrum(Δα=α_(max)-α_(min))and the spectrum arms' heights difference(│Δf│)of the normal images were expressed as mean±standard deviation(SD):for segmented versions,D_0=1.7014±0.0057; D_1=1.6507±0.0058; D_2=1.5772±0.0059; Δα=0.92441±0.0085; │Δf│= 0.1453±0.0051; for skeletonised versions,D_0=1.6303±0.0051; D_1=1.6012±0.0059; D_2=1.5531± 0.0058; Δα=0.65032±0.0162; │Δf│= 0.0238±0.0161. The average of generalized dimensions(D_q)for q=0,1,2,the width of the multifractal spectrum(Δα)and the spectrum arms' heights difference(│Δf│)of the segmented versions was slightly greater than the skeletonised versions.CONCLUSION:The multifractal analysis of fundus photographs may be used as a quantitative parameter for the evaluation of the complex three-dimensional structure of the retinal microvasculature as a potential marker for early detection of topological changes associated with retinal diseases. 展开更多
关键词 generalized dimensions multifractal retinal vessel segmentation retinal image analysis retinal microvasculature standard box-counting method
原文传递
Can You Sue Al for Lying?A Landmark Ruling Says No
4
《Beijing Review》 2026年第8期48-48,共1页
In a landmark decision,the Hangzhou Internet Court has ruled that AI-generated content,including its infamous“hallucinations,”constitutes a service,not a product with independent liability.“Hallucination”is the un... In a landmark decision,the Hangzhou Internet Court has ruled that AI-generated content,including its infamous“hallucinations,”constitutes a service,not a product with independent liability.“Hallucination”is the universally adopted term in the world of AI for when a chatbot or image generator confidently outputs information that is incorrect,nonsensical or entirely fabricated. 展开更多
关键词 Hangzhou Internet Court image generator landmark decision LIABILITY AI generated content service product chatbot
原文传递
A Survey of Image Synthesis and Editing with Generative Adversarial Networks 被引量:21
5
作者 Xian Wu Kun Xu Peter Hall 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2017年第6期660-674,共15页
This paper presents a survey of image synthesis and editing with Generative Adversarial Networks(GANs). GANs consist of two deep networks, a generator and a discriminator, which are trained in a competitive way. Due... This paper presents a survey of image synthesis and editing with Generative Adversarial Networks(GANs). GANs consist of two deep networks, a generator and a discriminator, which are trained in a competitive way. Due to the power of deep networks and the competitive training manner, GANs are capable of producing reasonable and realistic images, and have shown great capability in many image synthesis and editing applications.This paper surveys recent GAN papers regarding topics including, but not limited to, texture synthesis, image inpainting, image-to-image translation, and image editing. 展开更多
关键词 image synthesis image editing constrained image synthesis generative adversarial networks imageto-image translation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部