期刊文献+

融合注意力机制的移动端人像分割网络 被引量:4

Mobile-based portrait segmentation network with attention mechanism
在线阅读 下载PDF
导出
摘要 现有的移动端人像分割网络存在分割精度差、分割边缘模糊等问题。为此,提出了一种融合注意力机制的轻量化人像分割网络。首先,利用MobileNetV2网络提取图像特征。然后对注意力模块NLNet(Non-local neural networks)进行轻量化处理,随后将优化过的注意力模块嵌入到四层解码网络中。利用融合注意力机制的解码网络自适应地学习有效特征,最后通过SoftMax层得到人像分割结果图。同时改进了损失函数,引入多损失函数(Multi-Loss),使网络更容易收敛。解码网络融合注意力机制的方式使得轻量化网络可以在语义分割任务上取得较好的效果。实验结果表明,模型在550张自采集的人像测试集上达到了92.29%的交并比(MeanIOU),单张图片在Inter(R)Core i5 CPU上的分割时间为0.74 s。与传统的人像分割网络相比,研究网络的分割精度和分割速度优势明显,适合应用于移动端设备。 Existing mobile terminal segmentation network has problems such as poor segmentation accuracy and segmentation edge blurring.To this end,a lightweight portrait segmentation network that incorporates attention mechanisms is proposed.Firstly,the image features are extracted by MobileNetV2 network.Then,the attention module NLNet(Non-local neural networks)is lightened,and then the optimized attention module is embedded in the four-layer decoding network.The decoding network with attention mechanism adaptively learns the effective features,and finally obtains the portrait segmentation result map through SoftMax.At the same time,the loss function is improved,and Multi-Loss(multi-loss function)is introduced to make the network easier to converge.The way of attention mechanisms fused in decoding networks makes the lightweight network achieve better results in the semantic segmentation task.Experimental results show that the model achieves 92.29%MeanIOU on the 550 self-collected portrait test set,and the split time of the single picture on the Inter(R)Core i5 CPU is 0.74 s.Compared with the traditional portrait segmentation network,the research network has obvious advantages in segmentation precision and segmentation speed,and is suitable for mobile devices.
作者 周鹏 姚剑敏 林志贤 严群 郭太良 ZHOU Peng;YAO Jian-min;LIN Zhi-xian;YAN Qun;GUO Tai-liang(Nation & Local United Engineering Laboratory of Flat Panel DisplayTechnology,College of Physics and Information Engineering, Fuzhou University, Fuzhou 350108, China;Jinjiang RichSense Electronic Technology Co., Ltd., Jinjiang 362200, China)
出处 《液晶与显示》 CAS CSCD 北大核心 2020年第6期547-554,共8页 Chinese Journal of Liquid Crystals and Displays
基金 国家重点研发计划课题(No.2016YFB0401503) 广东省科技重大专项(No.2016B090906001) 福建省科技重大专项(No.2014HZ0003-1) 广东省光信息材料与技术重点实验室开放基金资助项目(No.2017B030301007)。
关键词 人像分割 注意力机制 轻量化 卷积神经网络 portrait segmentation attention mechanism lightweight convolutional neural network
  • 相关文献

参考文献4

二级参考文献53

  • 1BENGIO Y, DELALLEAU O. On the expressive power of deep archi- tectures[ C ]//Proc of the 14th International Conference on Discovery Science. Berlin : Springer-Verlag, 2011 : 18 - 36.
  • 2BENGIO Y. Leaming deep architectures for AI[ J]. Foundations and Trends in Machine Learning ,2009,2 ( 1 ) : 1-127.
  • 3HINTON G,OSINDERO S,TEH Y. A fast learning algorithm for deep belief nets [ J ]. Neural Computation ,2006,18 (7) : 1527-1554.
  • 4BENGIO Y, LAMBLIN P, POPOVICI D, et al. Greedy layer-wise training of deep networks [ C ]//Proc of the 12th Annual Conference on Neural Information Processing System. 2006:153-160.
  • 5LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-based learning ap- plied to document recognition[ J]. Proceedings of the iEEE, 1998, 86( 11 ) :2278-2324.
  • 6VINCENT P, LAROCHELLE H, BENGIO Y, et al. Extracting and composing robust features with denoising autoencoders[ C ]//Proc of the 25th International Conference on Machine Learning. New York: ACM Press ,2008 : 1096-1103.
  • 7VINCENT P, LAROCHELLE H, LAJOIE I, et aL Stacked denoising autoencoders:learning useftd representations in a deep network with a local denoising criterion [ J ]. Journal of Machine Learning Re- search ,2010,11 ( 12 ) :3371-3408.
  • 8YU Dong, DENG Li. Deep convex net: a scalable architecture for speech pattern classification [ C]//Proc of the 12th Annual Confe-rence of International Speech Comunication Association. 2011 : 2285- 2288.
  • 9POON H, DOMINGOS P. Sum-product networks:a new deep architec- ture[ C ]//Proc of IEEE Intemational Conference on Computer Vi- sion. 2011:689-690.
  • 10BENGIO Y,LECUN Y. Scaling learning algorithms towards AI[ M]// BOTTOU L,CHAPELLE O, DeCOSTE D,et al. Large-Scale Kernel Machines. Cambridge: MIT Press ,2007:321-358.

共引文献737

同被引文献30

引证文献4

二级引证文献69

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部