期刊文献+

数学公式中数学符号的特征分析及提取 被引量:1

Feature analysis of mathematical symbol in mathematical formula and its extraction
在线阅读 下载PDF
导出
摘要 为了研制高性能的数学公式自动识别系统,在详细分析印刷体公式中各种数学符号(包括大小写英文字母、数字及各种运算符、关系符)特点的基础上,选择字符图像中的字符宽高比、网格特征、穿线特征、直线特征、孔洞数、交叉点、端点、质心位置、直方图峰值等9个特征共29维作为字符识别的特征进行识别,并在识别过程中,训练一个高性能、高识别率的支持向量机分类器.实验结果表明,使用这些特征,可以很好地区分数学公式中出现的符号,提高整个系统的识别率. In order to develop efficient automatic recognition system of mathematical formula,the characteristics of various symbols in printed mathematical expression(including English alphabets in upper/lower case,digits,operators,and relation characters) was analyzed in detail and on this basis,nine features of mathematical symbol with 29 dimensions in all were chosen and extracted as the symbols being recognized;they were width-to-height ratio of the symbol in its image,mesh feature,line crossing feature,straightness,number of holes,intersection point,end point,mass-center position,and peak value of histogram.In addition,a supporting vector machine classifier with high performance and recognition rate was trained in the process of recognition.It was shown by experimental result that by using these features,the symbols appearing in a mathematical formula could well be distinguished so that the recognition rate of entire system would be improved.
出处 《兰州理工大学学报》 CAS 北大核心 2012年第5期98-101,共4页 Journal of Lanzhou University of Technology
基金 湖南文理学院科研基金项目(JJYB0914)的资助
关键词 数学公式 符号 特征提取 分类 识别 支持向量机 mathematical formula symbol feature extraction classification recognition support vector machine
  • 相关文献

参考文献12

  • 1陈德裕,朱学芳,苏啸晨,杭月芹.印刷体文献中数学公式识别及描述系统研究[J].计算机应用,2009,29(3):789-791. 被引量:1
  • 2CHAUDHURI B B,GARAIN U. An approach for recognition and interpretation of mathematical expressions in printed docu- ment [J]. Pattern Analysis : Applications, 2002,3 : 120-131.
  • 3LEE H J ,WANG J S. Design of a mathematical expression un- derstanding system [J]. Pattern Recognition Letters 1997,18: 289-298.
  • 4李永华,王科俊,上官伟,唐立群.数学公式基线结构分析及识别算法研究[J].计算机工程与应用,2008,44(16):18-22. 被引量:4
  • 5SAIN K, DASGUPTA A, GARAIN U. EMERS: a tree mate- hing - based performance evaluation of mathematical expres- sion recognition systems [J]. International Journal on Docu- ment Analysis and Recognition, 2011 (14) : 75-85.
  • 6GENOE R, KECHADI T. Fuzzy spatial analysis techniques for mathematical expression recognition [J]. Lecture Notes in Computer Science, Artificial Intelligence and Soft Computing, 2010(6113) :80-87.
  • 7TOYOTA S, UCHIDA S, SUZUKI M. Structural analysis of mathematical formulae with verification based on formula de- scription grammar [J]. Lecture Notes in Computer Science, Document Analysis Systems VII, 2006(3872) : 153-163.
  • 8赵学军.手写数学表达式自动识别的研究[D].重庆:重庆大学,1998.
  • 9TRIER O D,JAIN A K,TAXT T. Feature extraction methods for character recognition - A survey [J]. Pattern Recognition, 1996,9(4) : 641-662.
  • 10MORI S, SUEN C Y, YAMAMOTO K. Historical review of OCR research and development [J]. Proceedings of the IEEE, 1992,80(7) : 1029-1058.

二级参考文献40

  • 1刘峰,袁春风.基于MathML的数学表达式等价性的研究[J].计算机应用研究,2004,21(11):54-56. 被引量:8
  • 2马洪庆.汽车牌照自动识别[M].杭州:浙江大学,1997,3..
  • 3Zanibbi R,Blostein D,Cordy J.Baseline structure analysis of handwritten mathematics notation.Department of Computing and Information Science Queen's University,Kingston,Ontario,Canada,February 14,2001.
  • 4Lee Hsi Jian,Lee Min-Chou.Understanding mathematical expressions using procedure-oriented transformation[J].Pattern Recognition, 1994,27 ( 3 ) : 447-457.
  • 5Fateman R J.How to find mathematics on a scanned page[C]// Lopresti D P,Zhou Jiang-ying.Proc SPIE Vol 3967:Document recognition and retrieval VII,SPIE, 1999,3967:89-109.
  • 6Yang M, Fateman R.Extracting mathematical expressions from postscript documents[C]//ISSAC, 2003.
  • 7Chan Kam-Fail,Yeung Dit-Yan.Mathematical expression recognition: a survey[J].International Journal on Document Analysis and Recognition, 2001,3 ( 1 ) : 3-15.
  • 8Hazewinkel M.Key words and key phrases in scientific databases. aspects of guaranteeing output quality for databases of information[C]//Proceedings of the ISI Conference on Statistical Publishing,Warsaw,August 1999, ISI, 1999: 44-48.
  • 9Zanibbi R.Recognition of mathematics notation via computer using baseline structure,ISBN-0836-0227-2000-439[R].Dept Computing and Information Science,Queen' s University, Kingston,Ontario, August 2000.
  • 10Zanibbi R,Blostein D,Cordy J R.Directions in recognition tabular structure of handwritten mathematics notation.Department of Computing and Information Science, Queen's University, Kingston, Ontario, Canada, 2000.

共引文献56

同被引文献6

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部