期刊文献+

复杂场景下的水表示数检测与识别 被引量:7

Watermeter representation number detection and recognition in complex scenes
在线阅读 下载PDF
导出
摘要 目前自然场景下水表示数的统计工作由人工完成,但在大工作量的情况下人工统计容易出现统计错误和遗漏的情况,而且这一过程繁琐,耗时耗力。针对这种情况,提出一种基于深度学习的数字识别与检测算法。首先,使用旋转区域卷积神经网络(R2CNN)检测出文本框;然后,对该文本框区域使用基于注意机制(Attention)的文本识别算法预测出最终结果;最终,通过对不同深度的卷积神经网络(CNN)的速度和准确度进行对比,使用了一种准确有效的配置。结果显示训练好的网络模型可以应用于自然环境下的水表示数检测,能够达到实时检测的效果,并且优于其他检测识别算法的效果。不同数据下的结果验证了算法的有效性和鲁棒性。 At present,the statistical work of watermeter number in natural scenes is done manually,but in the case of large workloads,manual statistics are prone to statistical errors and omissions,and this process is cumbersome,time consuming and labor intensive.Aiming at this situation,a digital recognition and detection algorithm based on deep learning was proposed.Firstly,a text box was detected using a Rotating Region Convolutional Neural Network(R2CNN);then a text recognition algorithm based on attention is used for the text box region to predict the final result.Finally,by comparing the speed and accuracy of different depth Convolutional Neural Networks(CNN),an accurate and efficient configuration was found.The results show that the trained network model can be applied to the detection of water representation in the natural environment,which can achieve real-time detection and is superior to other detection and recognition algorithms.The results under different data verify the effectiveness and robustness of the algorithm.
作者 康鑫 孙晓刚 万磊 KANG Xin;SUN Xiaogang;WAN Lei(Chengdu Institute of Computer Application,Chinese Academy of Sciences,Chengdu Sichuan 610041,China;University of Chinese Academy of Science,Beijing 100049,China;Chengdu Zhongcai Xinda Technology Company Limited,Chengdu Sichuan 610041,China)
出处 《计算机应用》 CSCD 北大核心 2019年第S02期63-67,共5页 journal of Computer Applications
基金 四川省重点研发计划项目(2018GZ0231)
关键词 文本检测 文本识别 深度学习 旋转区域卷积神经网络 注意机制 text detection text recognition deep learning Rotational Region Convolutional Neural Network(R2CNN) attention
  • 相关文献

参考文献2

二级参考文献12

  • 1赵万鹏,古乐野.基于Adaboost的手写体数字识别[J].计算机应用,2005,25(10):2413-2414. 被引量:9
  • 2潘石柱,殳伟群,王令群.基于Adaboost的汽车牌照快速定位[J].计算机工程,2006,32(12):187-188. 被引量:9
  • 3VIOLA P, JONES M. Robust real-time face detection[J]. International Journal of Computer Vision, 2004, 57(2) : 137 -154.
  • 4PAUL B, ATHITHAN G, MURTY M N. Speeding up AdaBoost classifier with random projection[ C] // 7th International Conference on Advances in Pattern Recognition. Washington, DC: IEEE Computer Society, 2009:251 -254.
  • 5VALIANT L G. A theory of the learnable[J]. Communication of the ACM, 1984, 27 (11): 1134-1142.
  • 6SCHAPIRE R E. The strength of weak learnability[ C]//30th Annual Symposium on Foundations of Computer Science. Washington, DC: IEEE Computer Society, 1989:28 -33.
  • 7FREUND Y, SCHAPIRE R E. A decision theoretic generalization of online learning and an application to boosting[J]. Journal of Computer and System Sciences, 1997, 55 (1) : 119 - 139.
  • 8WU JIAN XIN, BRUBAKER S C, MULLIN D M, et al. Fast asymmetric learning for cascade face detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(3): 369-382.
  • 9种衍文,匡湖林,李清泉.一种基于多特征和机器学习的分级行人检测方法[J].自动化学报,2012,38(3):375-381. 被引量:28
  • 10曹莹,苗启广,刘家辰,高琳.AdaBoost算法研究进展与展望[J].自动化学报,2013,39(6):745-758. 被引量:298

共引文献17

同被引文献64

引证文献7

二级引证文献27

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部