摘要
针对图像视频中数字自动识别处理的需求,提出了一种改进的数字区域定位及读数识别方法。该方法使用自适应阈值进行图像整体二值化,然后设计改进的笔画宽度变化算法(SWT)来确定仪表数字显示的大体位置,再根据数字的颜色、宽高比以及空间排列等特征来过滤得到准确位置,并使用多层次扩展合并处理方法去除遮挡粘连影响,实现读数区域的精确定位,效果理想。最后对数字区域提取多种高区分度特征,通过训练好的多分类模型即可准确识别得到对应数字值,实现图像视频中读数的自动识别。实验结果表明,该方法具有很高的准确度及较强的鲁棒性,能避免光照、倾斜、部分遮挡的影响,准确找到读数区域,并据此识别出其中的数字,适用于自动巡检、远程抄表等多种应用。
According to the requirement of the automatic recognition for digital video, an improved digits auto-locating and recognition method is presented. It adopts self-adaptive threshold for binarization of image and then an improved algorithm of Stroke Width Trans- form (SWT) is designed to make a coarse locating of the digits' regions. After that, the precise positions of the digits are determined by filtering them with some useful features, such as its height-width-ratio, color and spatial arrangement, and the multi-level extension and merging is applied to eliminate the influence on shield and adhesion for the exact locating of digits region with perfection. At last, after extraction of the high discriminative features in digital regions, the digits can be accurately recognized and achieved by trained multi-clas- sifted models, which can implement the automatic recognition of digits in videos. The experimental results show that the proposed method owns high accuracy and strong robustness, without impact on light, titlt and partial shield, and locate the correct digits regions for recogni- tion of digits. It is suitable for automatic inspection, remote meter reading and so on.
出处
《计算机技术与发展》
2017年第12期67-70,共4页
Computer Technology and Development
基金
国家自然科学基金资助项目(61201396)
国家电网公司科技项目(5212D01502DB)
关键词
笔画宽度变换算法
读数精确定位
多层次扩展合并
读数识别
stroke width transform
digits auto-locating
multi-level extension and merging
reading recognition