期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Narrow Pooling Clothing Classification Based on Attention Mechanism 被引量:3
1
作者 MA Xiao WANG Shaoyu +3 位作者 YE Shaoping FAN Jingyi XU An XIA Xiaoling 《Journal of Donghua University(English Edition)》 CAS 2022年第4期367-372,共6页
In recent years,with the rapid development of e-commerce,people need to classify the wide variety and a large number of clothing images appearing on e-commerce platforms.In order to solve the problems of long time con... In recent years,with the rapid development of e-commerce,people need to classify the wide variety and a large number of clothing images appearing on e-commerce platforms.In order to solve the problems of long time consumption and unsatisfactory classification accuracy arising from the classification of a large number of clothing images,researchers have begun to exploit deep learning techniques instead of traditional learning methods.The paper explores the use of convolutional neural networks(CNNs)for feature learning to enhance global feature information interactions by adding an improved hybrid attention mechanism(HAM)that fully utilizes feature weights in three dimensions:channel,height,and width.Moreover,the improved pooling layer not only captures local feature information,but also fuses global and local information to improve the misclassification problem that occurs between similar categories.Experiments on the Fashion-MNIST and DeepFashion datasets show that the proposed method significantly improves the accuracy of clothing classification(93.62%and 67.9%)compared with residual network(ResNet)and convolutional block attention module(CBAM). 展开更多
关键词 clothing classification convolutional neural network(CNN) residual network(resnet) attention mechanism narrow pooling
在线阅读 下载PDF
Improved Medical Image Segmentation Model Based on 3D U-Net 被引量:2
2
作者 LIN Wei FAN Hong +3 位作者 HU Chenxi YANG Yi YU Suping NI Lin 《Journal of Donghua University(English Edition)》 CAS 2022年第4期311-316,共6页
With the widespread application of deep learning in the field of computer vision,gradually allowing medical image technology to assist doctors in making diagnoses has great practical and research significance.Aiming a... With the widespread application of deep learning in the field of computer vision,gradually allowing medical image technology to assist doctors in making diagnoses has great practical and research significance.Aiming at the shortcomings of the traditional U-Net model in 3D spatial information extraction,model over-fitting,and low degree of semantic information fusion,an improved medical image segmentation model has been used to achieve more accurate segmentation of medical images.In this model,we make full use of the residual network(ResNet)to solve the over-fitting problem.In order to process and aggregate data at different scales,the inception network is used instead of the traditional convolutional layer,and the dilated convolution is used to increase the receptive field.The conditional random field(CRF)can complete the contour refinement work.Compared with the traditional 3D U-Net network,the segmentation accuracy of the improved liver and tumor images increases by 2.89%and 7.66%,respectively.As a part of the image processing process,the method in this paper not only can be used for medical image segmentation,but also can lay the foundation for subsequent image 3D reconstruction work. 展开更多
关键词 medical image segmentation 3D U-Net residual network(resnet) inception model conditional random field(CRF)
在线阅读 下载PDF
Multidimensional attention and multiscale upsampling for semantic segmentation
3
作者 LU Zhongda ZHANG Chunda +1 位作者 WANG Lijing XU Fengxia 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2022年第1期68-78,共11页
Semantic segmentation is for pixel-level classification tasks,and contextual information has an important impact on the performance of segmentation.In order to capture richer contextual information,we adopt ResNet as ... Semantic segmentation is for pixel-level classification tasks,and contextual information has an important impact on the performance of segmentation.In order to capture richer contextual information,we adopt ResNet as the backbone network and designs an encoder-decoder architecture based on multidimensional attention(MDA)module and multiscale upsampling(MSU)module.The MDA module calculates the attention matrices of the three dimensions to capture the dependency of each position,and adaptively captures the image features.The MSU module adopts parallel branches to capture the multiscale features of the images,and multiscale feature aggregation can enhance contextual information.A series of experiments demonstrate the validity of the model on Cityscapes and Camvid datasets. 展开更多
关键词 semantic segmentation attention mechanism multiscale feature convolutional neural network(CNN) residual network(resnet)
在线阅读 下载PDF
EPIMR:Prediction of Enhancer-Promoter Interactions by Multi-Scale ResNet on Image Representation
4
作者 Qiaozhen Meng Yinuo Lyu +3 位作者 Xiaoqing Peng Junhai Xu Jijun Tang Fei Guo 《Big Data Mining and Analytics》 EI CSCD 2024年第3期668-681,共14页
Prediction of enhancer-promoter interactions(EPIs)is key to regulating gene expression and diagnosing genetic diseases.Due to limited resolution,biological experiments perform not as well as expected while precisely i... Prediction of enhancer-promoter interactions(EPIs)is key to regulating gene expression and diagnosing genetic diseases.Due to limited resolution,biological experiments perform not as well as expected while precisely identifying specific interactions,giving rise to computational biology approaches.Many EPI predictors have been developed,but their prediction accuracy still needs to be enhanced.Here,we design a new model named EPIMR to identify enhancer-promoter interactions.First,Hilbert Curve is utilized to represent sequences to images to preserve the position and spatial information.Second,a multi-scale residual neural network(ResNet)is used to learn the distinguishing features of different abstraction levels.Finally,matching heuristics are adopted to concatenate the learned features of enhancers and promoters,which pays attention to their potential interaction information.Experimental results on six cell lines indicate that EPIMR performs better than existing methods,with higher area under the precision-recall curve(AUPR)and area under the receiver operating characteristic(AUROC)results on benchmark and under-sampling datasets.Furthermore,our model is pre-trained on all cell lines,which improves not only the transferability of cross-cell line prediction,but also cell line-specific prediction ability.In conclusion,our method serves as a valuable technical tool for predicting enhancer-promoter interactions,contributing to the understanding of gene transcription mechanisms.Our code and results are available at https://github.com/guofei-tju/EPIMR. 展开更多
关键词 enhancer-promoter interactions Hilbert Curve multi-scale residual neural network(resnet)
原文传递
Glaucoma Detection with Retinal Fundus Images Using Segmentation and Classification 被引量:2
5
作者 Thisara Shyamalee Dulani Meedeniya 《Machine Intelligence Research》 EI CSCD 2022年第6期563-580,共18页
Glaucoma is a prevalent cause of blindness worldwide.If not treated promptly,it can cause vision and quality of life to deteriorate.According to statistics,glaucoma affects approximately 65 million individuals globall... Glaucoma is a prevalent cause of blindness worldwide.If not treated promptly,it can cause vision and quality of life to deteriorate.According to statistics,glaucoma affects approximately 65 million individuals globally.Fundus image segmentation depends on the optic disc(OD)and optic cup(OC).This paper proposes a computational model to segment and classify retinal fundus images for glaucoma detection.Different data augmentation techniques were applied to prevent overfitting while employing several data pre-processing approaches to improve the image quality and achieve high accuracy.The segmentation models are based on an attention U-Net with three separate convolutional neural networks(CNNs)backbones:Inception-v3,visual geometry group 19(VGG19),and residual neural network 50(ResNet50).The classification models also employ a modified version of the above three CNN architectures.Using the RIM-ONE dataset,the attention U-Net with the ResNet50 model as the encoder backbone,achieved the best accuracy of 99.58%in segmenting OD.The Inception-v3 model had the highest accuracy of 98.79%for glaucoma classification among the evaluated segmentation,followed by the modified classification architectures. 展开更多
关键词 Attention U-Net SEGMENTATION classification Inception-v3 visual geometry group 19(VGG19) residual neural network 50(resnet50) GLAUCOMA fundus images
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部