2 articles found
Open-Vocabulary 3D Scene Segmentation via Dual-Modal Interaction
1
Authors: Wuyang Luan, Lei Pan, +2 authors, Junhui Li, Yuan Zheng, Chang Xu. IEEE/CAA Journal of Automatica Sinica, 2025, No. 10, pp. 2156-2158 (3 pages)
Dear Editor, This letter proposes an open-vocabulary 3D scene understanding model based on a visual-language model. By integrating 3D point cloud data, image data, and text data, our model overcomes the segmentation problem [1], [2] that traditional models face when dealing with unknown categories [3]. By learning the deep semantic mapping between vision and language, the network significantly improves its ability to recognize unlabeled categories and exceeds current state-of-the-art methods on the task of open-vocabulary scene understanding.
Keywords: segmentation problem; open vocabulary; recognize unlabeled categories; deeply learning; deep semantic mapping; traditional models; 3D scene segmentation; text data; visual language model
A novel image super-resolution reconstruction algorithm based on improved GANs and gradient penalty (cited by: 2)
2
Authors: Shuangshuang Liu, Xiaoling Li. International Journal of Intelligent Computing and Cybernetics (EI-indexed), 2019, No. 3, pp. 400-413 (14 pages)
Purpose: Image super-resolution reconstruction with conventional deep learning architectures suffers from difficult training and vanishing gradients. To address these problems, this paper proposes a novel image super-resolution algorithm based on improved generative adversarial networks (GANs) with Wasserstein distance and gradient penalty. Design/methodology/approach: The proposed algorithm combines the conventional GAN architecture, the Wasserstein distance and the gradient penalty for the task of image super-resolution reconstruction (SRWGANs-GP). In addition, a novel perceptual loss function is designed for SRWGANs-GP to suit this task. The content loss is extracted from the deep model's feature maps: the mean square error (MSE) computed on these features enters the generator's loss. Findings: To validate the effectiveness and feasibility of the proposed algorithm, comparative experiments are conducted on three common data sets, i.e. Set5, Set14 and BSD100. The experimental results show that the proposed SRWGANs-GP architecture has a stable error gradient and converges iteratively. Compared with the baseline deep models, the proposed GAN models significantly improve performance and efficiency for image super-resolution reconstruction. The MSE calculated on the deep model's feature maps is more advantageous for reconstructing contour and texture. Originality/value: Compared with state-of-the-art algorithms, the proposed algorithm achieves better performance on image super-resolution and better reconstruction of contour and texture.
Keywords: deep model's feature maps; generative adversarial networks; gradient penalty; image super-resolution; Wasserstein distance
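The abstract above describes a content loss computed as the MSE between feature maps of a deep model. The following is only a minimal sketch of that idea, not the authors' implementation: the feature extractor `toy_features` (a single random convolution layer followed by ReLU), the kernel sizes, and the image sizes are all hypothetical stand-ins, since the listing does not say which deep model or layers the paper uses.

```python
import numpy as np


def toy_features(img, kernels):
    """Hypothetical stand-in for a deep model's feature maps.

    Applies a valid 2D cross-correlation with each kernel, then ReLU.
    img: (H, W) array; kernels: (C, k, k) array; returns (C, H-k+1, W-k+1).
    """
    k = kernels.shape[-1]
    # All k-by-k patches of the image, shape (H-k+1, W-k+1, k, k).
    windows = np.lib.stride_tricks.sliding_window_view(img, (k, k))
    maps = np.einsum("hwij,cij->chw", windows, kernels)
    return np.maximum(maps, 0.0)


def content_loss(sr, hr, kernels):
    """MSE between the feature maps of a super-resolved and a reference image."""
    f_sr = toy_features(sr, kernels)
    f_hr = toy_features(hr, kernels)
    return float(np.mean((f_sr - f_hr) ** 2))


rng = np.random.default_rng(0)
kernels = rng.standard_normal((4, 3, 3))         # assumed: 4 random 3x3 filters
hr = rng.random((16, 16))                        # assumed: toy "ground truth" image
sr = hr + 0.05 * rng.standard_normal((16, 16))   # assumed: noisy reconstruction

loss_same = content_loss(hr, hr, kernels)  # identical inputs give zero loss
loss_diff = content_loss(sr, hr, kernels)  # a perturbed reconstruction gives a positive loss
print(loss_same, loss_diff)
```

In a real setting the random filters would be replaced by feature maps from a trained network, and this term would be added to the adversarial part of the generator loss.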