期刊文献+

快速实用的通用表格分析方法

Fast and practical method for universal form analysis
在线阅读 下载PDF
导出
摘要 表格分析是对表格的基本结构及形状进行识别的过程,是以后能否从表格单元中正确提取文本信息的关键。在结合表格特点的基础上,采用了表格线检测与处理相结合的方法获取表格框线。检测表格线过程中,通过定义了主表格线长度来加快扫描的速度;在表格线的处理中,针对杂线的剔除、表格线的调整及最终获得表格结构等方面进行了系统的探讨。大量的实验结果表明所提方法是可行的。 Form analysis is a recognition process for a basic structure of form and shape. It is crucial to extract text information correctly from form unit. Based on the property of form, the method that combine lines detecting and processing together to detect the table line effectively is used. In the process of line detecting, by defining main line to accelerate scan. The elimination of the other lines is discussed, the table line is adjusted and finally the structure of form is gotten when process line. Lots of experimental results show that the method is feasible.
出处 《计算机工程与设计》 CSCD 北大核心 2008年第19期5114-5116,共3页 Computer Engineering and Design
基金 广东省自然科学基金项目(032356、07010869) 北京大学视觉与听觉信息处理国家重点实验室开放课题基金项目(0505) 江门市科技计划基金项目(【2007】28号)
关键词 表格分析 表格识别 直线提取 直线检测 表格结构 form analysis form recognition line extraction line detection form structure
  • 相关文献

参考文献8

二级参考文献36

  • 1张圣希,张薇,李国强,顾国庆.利用顶点链编码探测表格的斜率[J].华东师范大学学报(自然科学版),2004(3):54-58. 被引量:5
  • 2S. Gopisetty, R. Lorie, J. Mao, M. Mohiuddin, A. Sorin, and E. Yair. Automated Forms-Processing Software and Services[J]. IBM Journal of Research and Development, 1996, 40(2) : 211 - 230.
  • 3J. Kittler, and J. Illingworth. Minimum Error Thresholding[J]. Pattern Recognition, 1986, 19(1): 41-47.
  • 4N. Otsu. A Threshold Selection Method from Gray Level Histograms[J]. Pattern Recognition, 1979, 9(1): 62-66.
  • 5H. Sako, M. Seki, N. Furukawa, H. Ikeda, and A. Imaizuml. Form Reading Based on Form-Type Identification and Form-Data Recognition[ A]. Proceedings of 7th International Conference on Document Analysis and Recognition[C]. IEEE Computer Society Press, 2003, 926-930.
  • 6Y.Y. Tang, C.Y. Suen, C.D. Yah, and M. Cheriet. Financial Document Processing Based on Staff Line and Description Language[J]. IEEE Trans. Systems, Man, and Cybernetics, 1995, 25(5): 738- 753.
  • 7A. Ting, and K.H. Leung. Form Recognition Using Linear Structure[J]. Pattern Recognition, 1999, 32(4): 645- 656.
  • 8T. Watanabe, Q. Iato, and N. Sugie. Layout Recognition of Multi-Kinds of Table-Form Documents[J]. IEEE Trans. Pattern Analysis and Machine Intelligence, 1995, 17(4): 432- 445.
  • 9B. Yu, and A.K. Jain. A Generic System for Form Dropout[J]. IEEE Trans. Pattern Analysis and Machine Intelligence, 1996, 18(11): 1127-1131.
  • 10C. Zhang, and P. Wang. A New Method of Color Image Segmentation Based on Intensity and Hue Clnstering[A].Proceedings of 15th International Conference on Pattern Recognition[ C ]. IEEE Computer Society Press, 2000,617 - 620.

共引文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部