期刊文献+
共找到332篇文章
< 1 2 17 >
每页显示 20 50 100
A Convolutional Neural Network Based Optical Character Recognition for Purely Handwritten Characters and Digits
1
作者 Syed Atir Raza Muhammad Shoaib Farooq +3 位作者 Uzma Farooq Hanen Karamti Tahir Khurshaid Imran Ashraf 《Computers, Materials & Continua》 2025年第8期3149-3173,共25页
Urdu,a prominent subcontinental language,serves as a versatile means of communication.However,its handwritten expressions present challenges for optical character recognition(OCR).While various OCR techniques have bee... Urdu,a prominent subcontinental language,serves as a versatile means of communication.However,its handwritten expressions present challenges for optical character recognition(OCR).While various OCR techniques have been proposed,most of them focus on recognizing printed Urdu characters and digits.To the best of our knowledge,very little research has focused solely on Urdu pure handwriting recognition,and the results of such proposed methods are often inadequate.In this study,we introduce a novel approach to recognizing Urdu pure handwritten digits and characters using Convolutional Neural Networks(CNN).Our proposed method utilizes convolutional layers to extract important features from input images and classifies them using fully connected layers,enabling efficient and accurate detection of Urdu handwritten digits and characters.We implemented the proposed technique on a large publicly available dataset of Urdu handwritten digits and characters.The findings demonstrate that the CNN model achieves an accuracy of 98.30%and an F1 score of 88.6%,indicating its effectiveness in detecting and classifyingUrdu handwritten digits and characters.These results have far-reaching implications for various applications,including document analysis,text recognition,and language understanding,which have previously been unexplored in the context of Urdu handwriting data.This work lays a solid foundation for future research and development in Urdu language detection and processing,opening up new opportunities for advancement in this field. 展开更多
关键词 Image processing natural language processing handwritten Urdu characters optical character recognition deep learning feature extraction CLASSIFICATION
在线阅读 下载PDF
Deep Learning-Based Natural Language Processing Model and Optical Character Recognition for Detection of Online Grooming on Social Networking Services
2
作者 Sangmin Kim Byeongcheon Lee +2 位作者 Muazzam Maqsood Jihoon Moon Seungmin Rho 《Computer Modeling in Engineering & Sciences》 2025年第5期2079-2108,共30页
The increased accessibility of social networking services(SNSs)has facilitated communication and information sharing among users.However,it has also heightened concerns about digital safety,particularly for children a... The increased accessibility of social networking services(SNSs)has facilitated communication and information sharing among users.However,it has also heightened concerns about digital safety,particularly for children and adolescents who are increasingly exposed to online grooming crimes.Early and accurate identification of grooming conversations is crucial in preventing long-term harm to victims.However,research on grooming detection in South Korea remains limited,as existing models trained primarily on English text and fail to reflect the unique linguistic features of SNS conversations,leading to inaccurate classifications.To address these issues,this study proposes a novel framework that integrates optical character recognition(OCR)technology with KcELECTRA,a deep learning-based natural language processing(NLP)model that shows excellent performance in processing the colloquial Korean language.In the proposed framework,the KcELECTRA model is fine-tuned by an extensive dataset,including Korean social media conversations,Korean ethical verification data from AI-Hub,and Korean hate speech data from Hug-gingFace,to enable more accurate classification of text extracted from social media conversation images.Experimental results show that the proposed framework achieves an accuracy of 0.953,outperforming existing transformer-based models.Furthermore,OCR technology shows high accuracy in extracting text from images,demonstrating that the proposed framework is effective for online grooming detection.The proposed framework is expected to contribute to the more accurate detection of grooming text and the prevention of grooming-related crimes. 展开更多
关键词 Online grooming KcELECTRA natural language processing optical character recognition social networking service text classification
在线阅读 下载PDF
Arabic Optical Character Recognition:A Review 被引量:1
3
作者 Salah Alghyaline 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第6期1825-1861,共37页
This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingl... This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingly.The study describes the characteristics of the Arabic language,different types of OCR systems,different stages of the Arabic OCR system,the researcher’s contributions in each step,and the evaluationmetrics for OCR.The study reviews the existing datasets for the Arabic OCR and their characteristics.Additionally,this study implemented some preprocessing and segmentation stages of Arabic OCR.The study compares the performance of the existing methods in terms of recognition accuracy.In addition to researchers’OCRmethods,commercial and open-source systems are used in the comparison.The Arabic language is morphologically rich and written cursive with dots and diacritics above and under the characters.Most of the existing approaches in the literature were evaluated on isolated characters or isolated words under a controlled environment,and few approaches were tested on pagelevel scripts.Some comparative studies show that the accuracy of the existing Arabic OCR commercial systems is low,under 75%for printed text,and further improvement is needed.Moreover,most of the current approaches are offline OCR systems,and there is no remarkable contribution to online OCR systems. 展开更多
关键词 Arabic optical character recognition(ocr) Arabic ocr software Arabic ocr datasets Arabic ocr evaluation
在线阅读 下载PDF
Review of Optical Character Recognition for Power System Image Based on Artificial Intelligence Algorithm
4
作者 Xun Zhang Wanrong Bai Haoyang Cui 《Energy Engineering》 EI 2023年第3期665-679,共15页
Optical Character Recognition(OCR)refers to a technology that uses image processing technology and character recognition algorithms to identify characters on an image.This paper is a deep study on the recognition effe... Optical Character Recognition(OCR)refers to a technology that uses image processing technology and character recognition algorithms to identify characters on an image.This paper is a deep study on the recognition effect of OCR based on Artificial Intelligence(AI)algorithms,in which the different AI algorithms for OCR analysis are classified and reviewed.Firstly,the mechanisms and characteristics of artificial neural network-based OCR are summarized.Secondly,this paper explores machine learning-based OCR,and draws the conclusion that the algorithms available for this form of OCR are still in their infancy,with low generalization and fixed recognition errors,albeit with better recognition effect and higher recognition accuracy.Finally,this paper explores several of the latest algorithms such as deep learning and pattern recognition algorithms.This paper concludes that OCR requires algorithms with higher recognition accuracy. 展开更多
关键词 optical character recognition artificial intelligence power system image artificial neural network machine leaning deep learning
在线阅读 下载PDF
Optical Character Recognition Functionality Introduction in Mobile Application for Car Diary
5
作者 Ioannis Patias 《Journal of Electrical Engineering》 2017年第6期335-339,共5页
The purpose of the paper is to develop a mobile Android application--"Car Log" that gives to users the ability to track all the costs for a vehicle and the ability to add fuel cost data by taking a photo of the cash... The purpose of the paper is to develop a mobile Android application--"Car Log" that gives to users the ability to track all the costs for a vehicle and the ability to add fuel cost data by taking a photo of the cash receipt from the respective gas station where the charging was performed. OCR (optical character recognition) is the conversion of images of typed, handwritten or printed text into machine-encoded text. Once we have the text machine-encoded we can further use it in machine processes, like translation, or extracted, meaning text-to-speech transformed, helping people in simple everyday tasks. Users of the application will be able to enter other completely different costs grouped into categories and other charges. Car Log application quickly and easily can visualize, edit and add different costs for a ear. It also supports the ability to add multiple profiles, by entering data for all ears in a single family, for example, or a small business. The test results are positive thus we intend to further develop a cloud ready application. 展开更多
关键词 optical character recognition mobile application car diary.
在线阅读 下载PDF
A Fast Recognition System for Isolated Printed Characters Using Center of Gravity and Principal Axis 被引量:1
6
作者 Ahmed M. Shaffie Galal A. Elkobrosy 《Applied Mathematics》 2013年第9期1313-1319,共7页
The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A f... The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A fast recognition system for isolated printed characters using center of gravity”, LAP LAMBERT Academic Publishing 2011, ISBN: 978-38465-0002-6), but here we add using principal axis in order to make the algorithm rotation invariant. In my previous work which is published in LAP LAMBERT, I face a big problem that when the character is rotated I can’t recognize the character. So this adds constrain on the document to be well oriented but here I use the principal axis in order to unify the orientation of the character set and the characters in the scanned document. The algorithm can be applied for any isolated character such as Latin, Chinese, Japanese, and Arabic characters but it has been applied in this paper for Arabic characters. The approach uses normalized and isolated characters of the same size and extracts an image signature based on the center of gravity of the character after making the character principal axis vertical, and then the system compares these values to a set of signatures for typical characters of the set. The system then provides the closeness of match to all other characters in the set. 展开更多
关键词 ocr Pattern recognition CONFUSION Matrix Image SIGNATURE Word Segmentation character FRAGMENTATION
暂未订购
CHARACTER DETECTION AND RECOGNITION SYSTEM OF BEER BOTTLES 被引量:1
7
作者 Shen Bangxing Wu Wenjun +2 位作者 Zhang Yepeng Shen Gang Yang Liangen 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2005年第3期467-469,共3页
An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer b... An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer bottles. The system consists of a hardware subsystem, including a rotating device, CCD, 16 mm focus lens, a frame grabber card, a penetrating lighting and a computer, and a software subsystem. The software subsystem performs pretreatment, character segmentation and character recognition. In the pretreatment, the original image is filtered with preset threshold to remove isolated spots. Then the horizontal projection and the vertical projection are used respectively to retrieve the character segmentation. Subsequently, the configuration characteristic algorithm is applied to recognize the characters. The experimental results demonstrate that this system can recognize the characters on beer bottles accurately and effectively; the algorithm is proven fast, stable and robust, making it suitable in the industrial environment. 展开更多
关键词 optical imaging system Raised character recognition Configuration characteristic algorithm
在线阅读 下载PDF
Optimised CNN Architectures for Handwritten Arabic Character Recognition
8
作者 Salah Alghyaline 《Computers, Materials & Continua》 SCIE EI 2024年第6期4905-4924,共20页
Handwritten character recognition is considered challenging compared with machine-printed characters due to the different human writing styles.Arabic is morphologically rich,and its characters have a high similarity.T... Handwritten character recognition is considered challenging compared with machine-printed characters due to the different human writing styles.Arabic is morphologically rich,and its characters have a high similarity.The Arabic language includes 28 characters.Each character has up to four shapes according to its location in the word(at the beginning,middle,end,and isolated).This paper proposed 12 CNN architectures for recognizing handwritten Arabic characters.The proposed architectures were derived from the popular CNN architectures,such as VGG,ResNet,and Inception,to make them applicable to recognizing character-size images.The experimental results on three well-known datasets showed that the proposed architectures significantly enhanced the recognition rate compared to the baseline models.The experiments showed that data augmentation improved the models’accuracies on all tested datasets.The proposed model outperformed most of the existing approaches.The best achieved results were 93.05%,98.30%,and 96.88%on the HIJJA,AHCD,and AIA9K datasets. 展开更多
关键词 optical character recognition(ocr) handwritten arabic characters deep learning
在线阅读 下载PDF
Support Vector Machine Based Handwritten Hindi Character Recognition and Summarization
9
作者 Sunil Dhankhar Mukesh Kumar Gupta +3 位作者 Fida Hussain Memon Surbhi Bhatia Pankaj Dadheech Arwa Mashat 《Computer Systems Science & Engineering》 SCIE EI 2022年第10期397-412,共16页
In today’s digital era,the text may be in form of images.This research aims to deal with the problem by recognizing such text and utilizing the support vector machine(SVM).A lot of work has been done on the English l... In today’s digital era,the text may be in form of images.This research aims to deal with the problem by recognizing such text and utilizing the support vector machine(SVM).A lot of work has been done on the English language for handwritten character recognition but very less work on the under-resourced Hindi language.A method is developed for identifying Hindi language characters that use morphology,edge detection,histograms of oriented gradients(HOG),and SVM classes for summary creation.SVM rank employs the summary to extract essential phrases based on paragraph position,phrase position,numerical data,inverted comma,sentence length,and keywords features.The primary goal of the SVM optimization function is to reduce the number of features by eliminating unnecessary and redundant features.The second goal is to maintain or improve the classification system’s performance.The experiment included news articles from various genres,such as Bollywood,politics,and sports.The proposed method’s accuracy for Hindi character recognition is 96.97%,which is good compared with baseline approaches,and system-generated summaries are compared to human summaries.The evaluated results show a precision of 72%at a compression ratio of 50%and a precision of 60%at a compression ratio of 25%,in comparison to state-of-the-art methods,this is a decent result. 展开更多
关键词 Support vector machine(SVM) optimization PRECISION Hindi character recognition optical character recognition(ocr) automatic summarization and compression ratio
在线阅读 下载PDF
面向智能巡检的电柜OCR识别方法研究 被引量:1
10
作者 康智信 林姝 +1 位作者 徐俊刚 孔庆杰 《石油化工自动化》 2025年第4期59-64,共6页
化工厂配电室电柜标签检测是智能自主巡检中的重要一环。针对传统人工检测方法存在检测效率低、漏检错检率高、危险性高等缺点,基于光学字符识别技术(OCR)的深度学习模型可以高效地检测出电柜上标签的文本框位置并准确地识别出文本内容... 化工厂配电室电柜标签检测是智能自主巡检中的重要一环。针对传统人工检测方法存在检测效率低、漏检错检率高、危险性高等缺点,基于光学字符识别技术(OCR)的深度学习模型可以高效地检测出电柜上标签的文本框位置并准确地识别出文本内容。在自建电柜数据集上,训练一个可以检测出电柜文本框位置的文本检测模型DBNet和一个文本识别模型ABINet。DBNet模型在测试集上检测文本框的准确率可达96%,ABINet识别文本的准确率可达89%,模型的平均检测速度可达1张/s。通过实验表明,该方法可满足工业应用的检测精度与检测速度需求。 展开更多
关键词 光学字符识别 文本检测 文本识别 深度学习
在线阅读 下载PDF
基于PP-OCRv3迁移学习的煤矿数字显示器字符识别研究
11
作者 包姣 肖粲俊 +2 位作者 石发强 王晨宇 杨竣翔 《煤矿安全》 北大核心 2025年第12期239-248,共10页
煤矿设备数字显示器(简称“数显屏”)字符识别是煤矿智能化建设中的研究热点之一,然而该领域仍然面临2个显著的问题:(1)由于有效识别区域小,井下光照条件复杂,图像质量低等干扰因素导致的识别效果不佳;(2)受煤矿井下作业环境条件制约引... 煤矿设备数字显示器(简称“数显屏”)字符识别是煤矿智能化建设中的研究热点之一,然而该领域仍然面临2个显著的问题:(1)由于有效识别区域小,井下光照条件复杂,图像质量低等干扰因素导致的识别效果不佳;(2)受煤矿井下作业环境条件制约引起样本数据采集受限从而导致模型泛化能力不足。针对上述存在的问题,提出了一种基于PP-OCRv3(A Practical Ultra Light Weight Optical-Character Recognition,PP-OCR)迁移学习的煤矿数显屏字符识别算法。首先,采用PP-OCRv3作为预训练模型,提高文本通用特征表达能力,提升煤矿复杂环境中字符检测和识别的精度;其次,分别以文本识别公共数据集、自制数显屏字符数据集、真实和模拟煤矿数显屏字符数据集为驱动,多次逐步迁移PP-OCRv3模型,驱使模型从一般场景自适应转变到煤矿的特殊场景,实现模型跨场景泛化性能和识别速度的提升。试验验证表明:在抗干扰能力测试中,迁移优化模型平均准确度达78.83%(提升17.29%),其中在干扰块场景下的提升尤为显著,高达79.73%(提升29.32%);实时性评估显示,推理帧率平均提升27.295帧/s,其中在模糊场景下提升高达57.67帧/s;多次迁移后的PP-OCRv3模型在有效降低对标注数据样本的依赖性的同时,识别准确性和识别速度均优于对比模型。 展开更多
关键词 煤矿智能化 数字显示器(数显屏) 字符识别 PP-ocrv3 迁移学习
在线阅读 下载PDF
基于OCR的药品信息识别系统设计
12
作者 张定华 马小强 +2 位作者 赖佳华 官敬超 巢建树 《中国医疗设备》 2025年第8期13-19,63,共8页
目的 为解决传统药品生产过程中人工检测药品包装三期信息(批号、生产日期和有效期)效率低、错误率高的问题,设计一种融合光学成像优化与深度学习算法改进的光学字符识别系统。方法 通过构建多角度LED光源与工业相机协同的硬件检测平台... 目的 为解决传统药品生产过程中人工检测药品包装三期信息(批号、生产日期和有效期)效率低、错误率高的问题,设计一种融合光学成像优化与深度学习算法改进的光学字符识别系统。方法 通过构建多角度LED光源与工业相机协同的硬件检测平台,结合光度立体技术增强压印字符成像质量;软件算法采用改进的DBNet检测模型集成可变形卷积模块和特征金字塔增强模块-特征融合模块架构与集成可变形注意力机制的ABINet识别模型,用于提升复杂场景下的文本检测与低质量字符识别能力。结果 系统在1200张低质量药品图像测试中,识别准确度为89.0%,较传统ABINet模型提高了3.7%;同时,在生产线实测中,对5类药品包装检测准确度达98.5%以上,系统检测平均耗时约为人工检测平均耗时的1/4。结论 该研究通过硬件成像优化与算法架构创新,有效地解决了当前药企中对于药品信息识别的易错难控问题。 展开更多
关键词 药品信息识别 光度立体技术 光学字符识别 工业相机 特征增强 可变形注意力机制
在线阅读 下载PDF
基于OCR技术的远程工业数据采集系统的设计
13
作者 吕宜光 郝明 +2 位作者 孙凯明 高亮 周丽丽 《黑龙江科学》 2025年第4期131-133,共3页
为引导企业数字化转型,推动新型生产力的发展,需要对老旧工厂进行升级改造。但在进行工业数据采集时,很多生产设备由于年代久远,与现有的数据通信技术难以兼容,而完全更新这些设备又会带来巨大的经济成本。针对此问题,提出一种远程工业... 为引导企业数字化转型,推动新型生产力的发展,需要对老旧工厂进行升级改造。但在进行工业数据采集时,很多生产设备由于年代久远,与现有的数据通信技术难以兼容,而完全更新这些设备又会带来巨大的经济成本。针对此问题,提出一种远程工业数据采集系统,通过摄像头拍摄设备的人机界面(HMI),用光学字符识别(OCR)技术从画面中提取生产数据信息,并将这些数据整合到厂区现有的工业信息化系统中。该系统不仅有效解决了老旧设备数据通信与新型信息化平台不兼容的问题,还显著降低了升级成本,提高了数据采集的灵活性和准确性。 展开更多
关键词 机器视觉 光学字符识别 工业信息化
在线阅读 下载PDF
利用OCR识别技术实现视频中文字的提取 被引量:22
14
作者 陈义 李言俊 孙小炜 《计算机工程与应用》 CSCD 北大核心 2010年第10期180-183,共4页
为了在视频图像中进行字幕信息的实时提取,提出了一套简捷而有效的方法。首先进行文字事件检测,然后进行边缘检测、阈值计算和边缘尺寸限制,最后依据文字像素密度范围进一步滤去非文字区域的视频字幕,提出的叠加水平和垂直方向边缘的方... 为了在视频图像中进行字幕信息的实时提取,提出了一套简捷而有效的方法。首先进行文字事件检测,然后进行边缘检测、阈值计算和边缘尺寸限制,最后依据文字像素密度范围进一步滤去非文字区域的视频字幕,提出的叠加水平和垂直方向边缘的方法,加强了检测到的文字的边缘;对边缘进行尺寸限制过滤掉了不符合文字尺寸的边缘。应用投影法最终确定视频字幕所在区域。最后,利用OCR识别技术对提取出来的文字区域进行识别,完成视频中文字的提取。以上方法的结合保证了提出算法的正确率和鲁棒性。 展开更多
关键词 光学文字识别 文字事件检测 数字视频
在线阅读 下载PDF
新的基于统计熵功率的OCR算法及其DMCU实现 被引量:4
15
作者 吴永亮 万旺根 +1 位作者 钱锋 徐鸿玮 《计算机工程与应用》 CSCD 北大核心 2009年第1期195-197,共3页
使用摄像头进行文字识别最大的问题在于图像抖动。为了有效地消除图像抖动并正确实现文字识别,提出了一种基于统计熵功率的新的识别算法。这种方法将采集到的数据作为随机信号处理。实验证明,此算法计算复杂度低,识别率高,适用于低成本... 使用摄像头进行文字识别最大的问题在于图像抖动。为了有效地消除图像抖动并正确实现文字识别,提出了一种基于统计熵功率的新的识别算法。这种方法将采集到的数据作为随机信号处理。实验证明,此算法计算复杂度低,识别率高,适用于低成本嵌入式系统,在中国台湾俊亿公司24MHZ16位DMCU嵌入式系统上,获得了94%以上的正确识别率。 展开更多
关键词 统计 熵功率 文字识别
在线阅读 下载PDF
基于OCR识别技术的自动阅卷系统的研究 被引量:4
16
作者 马壮 赵国权 任占鹏 《河北工业科技》 CAS 2005年第6期354-357,共4页
利用光学字符识别(OCR)技术,设计的自动阅卷系统,不仅能够实现对答题卡的自动阅卷工作,还可以完成成绩的统计和成绩单的打印工作,已在实际中得到应用。结果表明该系统提高了考试阅卷工作的效率,与一般的自动阅卷系统相比,有着独特的优点... 利用光学字符识别(OCR)技术,设计的自动阅卷系统,不仅能够实现对答题卡的自动阅卷工作,还可以完成成绩的统计和成绩单的打印工作,已在实际中得到应用。结果表明该系统提高了考试阅卷工作的效率,与一般的自动阅卷系统相比,有着独特的优点,应用前景广阔。 展开更多
关键词 ocr 识别技术 自动阅卷 DELPHI7.0
在线阅读 下载PDF
基于霍夫变换的铭牌OCR图像旋转矫正方法 被引量:14
17
作者 张勇红 《电测与仪表》 北大核心 2015年第8期125-128,共4页
在对互感器铭牌图像进行扫描输入时,铭牌图像或多或少会出现一定程度的倾斜,这种图像的倾斜最终会导致其字符识别准确率下降。针对此问题提出一种基于霍夫变换获取图像倾斜角度,进而通过图像旋转矫正提高光学字符识别(Optical Character... 在对互感器铭牌图像进行扫描输入时,铭牌图像或多或少会出现一定程度的倾斜,这种图像的倾斜最终会导致其字符识别准确率下降。针对此问题提出一种基于霍夫变换获取图像倾斜角度,进而通过图像旋转矫正提高光学字符识别(Optical Character Recognition,OCR)准确率的方法:首先对原始图像进行二值化,进而获得铭牌的轮廓,再采用基于霍夫变换的方法获得铭牌中的水平线段,通过计算得到线段的水平倾斜角,利用此倾角对图像进行还原。实验结果表明,该方法能快速地计算图像的倾斜角度,提高了OCR识别准确率且准确率可达95%以上。 展开更多
关键词 ocr 字符识别 霍夫变换 旋转矫正
在线阅读 下载PDF
基于OCR光学字符识别的翻译优化方法 被引量:10
18
作者 王晓艺 高挺挺 《激光杂志》 北大核心 2020年第12期156-160,共5页
以转化并翻译纸张等载体上文本资料为数字化信息为目标,提出一种基于OCR光学字符识别的翻译优化方法。利用具备摄像功能的设备拍摄含待翻译字符的图像,图像预处理时采用区域灰度差生长算法和叠加灰度值方式判断原始图像中背景信息和表... 以转化并翻译纸张等载体上文本资料为数字化信息为目标,提出一种基于OCR光学字符识别的翻译优化方法。利用具备摄像功能的设备拍摄含待翻译字符的图像,图像预处理时采用区域灰度差生长算法和叠加灰度值方式判断原始图像中背景信息和表格线条,去除原始图像中非字符像素干扰,得到二值化文本图像;图像分割时采用基于改进FCM聚类算法的图像分割方法,利用小波多尺度图像框架,引入时效性函数,降低二值化文本图像分割计算量,充分考虑相邻域信息,解决图像分割缺陷及干扰,获取二值化文本图像单个字符或单词;依据一阶Minkowski距离实现分割后的图像特征分类后,利用后处理方式结合上下文信息展开特征分类结果的进一步处理,提升翻译准确度。实验结果表明:所提方法可实现文字的精准翻译,应用效果较好。 展开更多
关键词 ocr 光学字符 识别 翻译 优化 图像
原文传递
西文OCR后处理中的有限自动机模型 被引量:2
19
作者 王恺 靳简明 王庆人 《计算机工程与应用》 CSCD 北大核心 2004年第23期26-29,共4页
在西文OCR中,从候选结果中挑选最佳结果的后处理操作是必不可少的,并且利用单词拼写检查进行后处理是完全可行的。但是,以往的方法分别在不同程度上具有低可靠性和局限性。为此,该文提出将有限自动机模型应用于西文OCR后处理中,该方法... 在西文OCR中,从候选结果中挑选最佳结果的后处理操作是必不可少的,并且利用单词拼写检查进行后处理是完全可行的。但是,以往的方法分别在不同程度上具有低可靠性和局限性。为此,该文提出将有限自动机模型应用于西文OCR后处理中,该方法有效地将拼写检查和识别结果信息结合起来,克服了以往方法中存在的低可靠性和局限性,并通过实验验证了该方法的有效性。以识别后处理辅助识别,错误率从0.79%降到0.59%;以识别后处理和系统后处理结合辅助识别,错误率降低到0.55%。 展开更多
关键词 字符串匹配 有限自动机 光学字符识别 文档图像处理 光学字符识别 ocr 文字信息电子化
在线阅读 下载PDF
基于图元识别的OCR文本图像倾斜矫正快速算法 被引量:2
20
作者 张秀山 吴产乐 《海军工程大学学报》 CAS 2004年第4期48-52,共5页
提出了一种基于文本图元识别以跟踪字符中心线的高精度矫正OCR图像倾斜的快速算法,该算法思想虽然简单,却具有高效和高精度的特点,同时还具有高可靠性和良好的抗噪特征.实验表明,该方法完全满足实时应用的需要.
关键词 光学字符识别 倾斜矫正 图元识别 图元标准包围盒
在线阅读 下载PDF
上一页 1 2 17 下一页 到第
使用帮助 返回顶部