摘要
联机连续文本识别是字符识别技术领域中新的研究方向.基于分层构筑法(Level-Building,LB)和动态时间规整算法(Dynamic Time Warping,DTW)建立了面向连续手写文本识别的手写部件识别器.将部件看作笔段和连续文本的中间模式,根据手写文本的特点建立了由484个手写部件构成的部件集.提取笔段的长度、角度等特征用于LB中每一层的DTW网格匹配中.测试样本包括6 763个汉字和303个连续手写文本.实验结果表明手写体部件集能够有效地支撑笔段和连续文本之间的联系,串识别率达到86.47%.
In this paper, a handwritten radical recognizer, with the purpose of obtaining reliable radicals in online Chinese words or sentences recognition task, was designed based on a hybrid method of Level-Building (LB) and Dynamic Time Warping (DTW) algorithm. Radicals were considered as mid-patterns between strokes and continuous handwritten script in the recognizer. A handwritten radical set was established in terms of handwritten script characteristics. Adjacent stroke relative feature vector sequences were put into the grid point matching process of DTW in each level of LB structure. The test samples include 303 handwritten sequences and 6763 Chinese characters. It is shown that the radical set couht be in effect between strokes and a handwritten sequence, 86.47% of recognition rate is obtained.
出处
《交通运输系统工程与信息》
EI
CSCD
2006年第1期51-54,122,共5页
Journal of Transportation Systems Engineering and Information Technology
关键词
联机连续手写文本
手写部件
对数正态分布
LB与DTW融合算法
on-line continuous handwritten text
handwritten radicals
logarithmic normal distribution
hybrid of LB and DTW algorithm