Document Image Classification Based on Structured Local Edge Pattern

Cited by: 3
Abstract: This paper classifies document images using the structured local edge pattern (SLEP) feature. Because SLEP precisely describes the spatial distribution of edge directions within a local neighborhood, a classifier trained on it discriminates well between document image types. Compared with methods based on complex structural layout features, or on features produced by an optical character recognition (OCR) system, the SLEP-based method is simpler and more effective. The experiments use a purpose-built document image database and a support vector machine (SVM) classifier to distinguish four document image types: academic papers (paper), photographs (photo), tables (table), and presentation slides (slide). The results show that the SLEP-based method clearly outperforms the compared methods in precision and recall, and that it still performs well on low-resolution document images.
Source: Journal of Xiamen University (Natural Science), indexed in CAS, CSCD, and the Peking University Core Journals list; 2013, No. 3, pp. 349-355 (7 pages)
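Funding: National Natural Science Foundation of China (60873179, 61202143); Specialized Research Fund for the Doctoral Program of the Ministry of Education of China (20090121110032); National Science Council, Executive Yuan, Taiwan (NSC 100-2221-E-155-086); Natural Science Foundation of Fujian Province (2011J01367); Shenzhen Science and Technology Research Fund (JC200903180630A, ZYB200907110169A); Shenzhen Special Fund for the Development of Strategic Emerging Industries (JCYJ20120614164600201)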
Keywords: genre identification; image processing; structured local edge pattern; pattern classification
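
The abstract reports results as precision and recall over four document image types. As a minimal sketch of how such per-class figures are computed, the following assumes the classifier's output is available as a list of predicted labels alongside the true labels. The function name `per_class_precision_recall` and the sample labels are hypothetical and are not taken from the paper; the example does not reproduce the SLEP feature or the SVM itself.

```python
from collections import Counter

# The four document image types named in the abstract.
GENRES = ["paper", "photo", "table", "slide"]


def per_class_precision_recall(y_true, y_pred, labels=GENRES):
    """Return {label: (precision, recall)} from two parallel label lists.

    precision = correct predictions of a label / all predictions of that label
    recall    = correct predictions of a label / all true instances of that label
    A class with no predictions (or no true instances) scores 0.0.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    true_count = Counter(y_true)   # how often each label really occurs
    pred_count = Counter(y_pred)   # how often each label was predicted
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    scores = {}
    for label in labels:
        precision = correct[label] / pred_count[label] if pred_count[label] else 0.0
        recall = correct[label] / true_count[label] if true_count[label] else 0.0
        scores[label] = (precision, recall)
    return scores


# Made-up labels for illustration only (not data from the paper).
y_true = ["paper", "paper", "photo", "table", "slide", "slide"]
y_pred = ["paper", "slide", "photo", "table", "slide", "slide"]
scores = per_class_precision_recall(y_true, y_pred)

for genre in GENRES:
    precision, recall = scores[genre]
    print(f"{genre}: precision={precision:.2f} recall={recall:.2f}")
```

In this toy run one paper is misclassified as a slide, which lowers recall for paper and precision for slide while leaving the other two classes unaffected.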