期刊文献+
共找到64篇文章
< 1 2 4 >
每页显示 20 50 100
Fireworks Optimization with Deep Learning-Based Arabic Handwritten Characters Recognition Model
1
作者 Abdelwahed Motwakel Badriyya B.Al-onazi +5 位作者 Jaber S.Alzahrani Ayman Yafoz Mahmoud Othman Abu Sarwar Zamani Ishfaq Yaseen Amgad Atta Abdelmageed 《Computer Systems Science & Engineering》 2024年第5期1387-1403,共17页
Handwritten character recognition becomes one of the challenging research matters.More studies were presented for recognizing letters of various languages.The availability of Arabic handwritten characters databases wa... Handwritten character recognition becomes one of the challenging research matters.More studies were presented for recognizing letters of various languages.The availability of Arabic handwritten characters databases was confined.Almost a quarter of a billion people worldwide write and speak Arabic.More historical books and files indicate a vital data set for many Arab nationswritten in Arabic.Recently,Arabic handwritten character recognition(AHCR)has grabbed the attention and has become a difficult topic for pattern recognition and computer vision(CV).Therefore,this study develops fireworks optimizationwith the deep learning-based AHCR(FWODL-AHCR)technique.Themajor intention of the FWODL-AHCR technique is to recognize the distinct handwritten characters in the Arabic language.It initially pre-processes the handwritten images to improve their quality of them.Then,the RetinaNet-based deep convolutional neural network is applied as a feature extractor to produce feature vectors.Next,the deep echo state network(DESN)model is utilized to classify handwritten characters.Finally,the FWO algorithm is exploited as a hyperparameter tuning strategy to boost recognition performance.Various simulations in series were performed to exhibit the enhanced performance of the FWODL-AHCR technique.The comparison study portrayed the supremacy of the FWODL-AHCR technique over other approaches,with 99.91%and 98.94%on Hijja and AHCD datasets,respectively. 展开更多
关键词 Arabic language handwritten character recognition deep learning CLASSIFICATION parameter tuning
在线阅读 下载PDF
Offline Handwritten Characters Recognition Using Moments Features and Neural Networks
2
作者 Mohamed Abaynarh Lahbib Zenkouar 《Computer Technology and Application》 2015年第1期19-29,共11页
In this paper we revise the moment theory for pattern recognition designed, to extract patterns from the noisy character datas, and develop unconstrained handwritten. Amazigh character recognition method based upon or... In this paper we revise the moment theory for pattern recognition designed, to extract patterns from the noisy character datas, and develop unconstrained handwritten. Amazigh character recognition method based upon orthogonal moments and neural networks classifier. We argue that, given the natural flexibility of neural network models and the extent of parallel processing that they allow, our algorithm is a step forward in character recognition. More importantly, following the approach proposed, we apply our system to two different databases, to examine the ability to recognize patterns under noise. We discover overwhelming support for different style of writing. Moreover, this basic conclusion appears to remain valid across different levels of smoothing and insensitive to the nuances of character patterns. Experiments tested the effect of set size on recognition accuracy which can reach 97.46%. The novelty of the proposed method is independence of size, slant, orientation, and translation. The performance of the proposed method is experimentally evaluated and the promising results and findings are presented. Our method is compared to K-NN (k-nearest neighbors) classifier algorithm; results show performances of our method. 展开更多
关键词 Neural network character recognition orthogonal moments pattern recognition.
在线阅读 下载PDF
A Convolutional Neural Network Based Optical Character Recognition for Purely Handwritten Characters and Digits
3
作者 Syed Atir Raza Muhammad Shoaib Farooq +3 位作者 Uzma Farooq Hanen Karamti Tahir Khurshaid Imran Ashraf 《Computers, Materials & Continua》 2025年第8期3149-3173,共25页
Urdu,a prominent subcontinental language,serves as a versatile means of communication.However,its handwritten expressions present challenges for optical character recognition(OCR).While various OCR techniques have bee... Urdu,a prominent subcontinental language,serves as a versatile means of communication.However,its handwritten expressions present challenges for optical character recognition(OCR).While various OCR techniques have been proposed,most of them focus on recognizing printed Urdu characters and digits.To the best of our knowledge,very little research has focused solely on Urdu pure handwriting recognition,and the results of such proposed methods are often inadequate.In this study,we introduce a novel approach to recognizing Urdu pure handwritten digits and characters using Convolutional Neural Networks(CNN).Our proposed method utilizes convolutional layers to extract important features from input images and classifies them using fully connected layers,enabling efficient and accurate detection of Urdu handwritten digits and characters.We implemented the proposed technique on a large publicly available dataset of Urdu handwritten digits and characters.The findings demonstrate that the CNN model achieves an accuracy of 98.30%and an F1 score of 88.6%,indicating its effectiveness in detecting and classifyingUrdu handwritten digits and characters.These results have far-reaching implications for various applications,including document analysis,text recognition,and language understanding,which have previously been unexplored in the context of Urdu handwriting data.This work lays a solid foundation for future research and development in Urdu language detection and processing,opening up new opportunities for advancement in this field. 展开更多
关键词 Image processing natural language processing handwritten Urdu characters optical character recognition deep learning feature extraction CLASSIFICATION
在线阅读 下载PDF
Recognition of Characters by Adaptive Combination of Classifiers
4
作者 王飞 李在铭 《Journal of Electronic Science and Technology of China》 2004年第2期7-9,共3页
In this paper, the visual feature space based on the long Horizontals, the long Verticals, and the radicals are given. An adaptive combination of classifiers, whose coefficients vary with the input pattern, is also pr... In this paper, the visual feature space based on the long Horizontals, the long Verticals, and the radicals are given. An adaptive combination of classifiers, whose coefficients vary with the input pattern, is also proposed. Experiments show that the approach is promising for character recognition in video sequences. 展开更多
关键词 character recognition adaptive combination multiple classifiers
在线阅读 下载PDF
Deep Learning-Based Natural Language Processing Model and Optical Character Recognition for Detection of Online Grooming on Social Networking Services
5
作者 Sangmin Kim Byeongcheon Lee +2 位作者 Muazzam Maqsood Jihoon Moon Seungmin Rho 《Computer Modeling in Engineering & Sciences》 2025年第5期2079-2108,共30页
The increased accessibility of social networking services(SNSs)has facilitated communication and information sharing among users.However,it has also heightened concerns about digital safety,particularly for children a... The increased accessibility of social networking services(SNSs)has facilitated communication and information sharing among users.However,it has also heightened concerns about digital safety,particularly for children and adolescents who are increasingly exposed to online grooming crimes.Early and accurate identification of grooming conversations is crucial in preventing long-term harm to victims.However,research on grooming detection in South Korea remains limited,as existing models trained primarily on English text and fail to reflect the unique linguistic features of SNS conversations,leading to inaccurate classifications.To address these issues,this study proposes a novel framework that integrates optical character recognition(OCR)technology with KcELECTRA,a deep learning-based natural language processing(NLP)model that shows excellent performance in processing the colloquial Korean language.In the proposed framework,the KcELECTRA model is fine-tuned by an extensive dataset,including Korean social media conversations,Korean ethical verification data from AI-Hub,and Korean hate speech data from Hug-gingFace,to enable more accurate classification of text extracted from social media conversation images.Experimental results show that the proposed framework achieves an accuracy of 0.953,outperforming existing transformer-based models.Furthermore,OCR technology shows high accuracy in extracting text from images,demonstrating that the proposed framework is effective for online grooming detection.The proposed framework is expected to contribute to the more accurate detection of grooming text and the prevention of grooming-related crimes. 展开更多
关键词 Online grooming KcELECTRA natural language processing optical character recognition social networking service text classification
在线阅读 下载PDF
Traffic light detection and recognition in intersections based on intelligent vehicle
6
作者 张宁 何铁军 +1 位作者 高朝晖 黄卫 《Journal of Southeast University(English Edition)》 EI CAS 2008年第4期517-521,共5页
To ensure revulsive driving of intelligent vehicles at intersections, a method is presented to detect and recognize the traffic lights. First, the stabling siding at intersections is detected by applying Hough transfo... To ensure revulsive driving of intelligent vehicles at intersections, a method is presented to detect and recognize the traffic lights. First, the stabling siding at intersections is detected by applying Hough transformation. Then, the colors of traffic lights are detected with color space transformation. Finally, self-associative memory is used to recognize the countdown characters of the traffic lights. Test results at 20 real intersections show that the ratio of correct stabling siding recognition reaches up to 90%;and the ratios of recognition of traffic lights and divided characters are 85% and 97%, respectively. The research proves that the method is efficient for the detection of stabling siding and is robust enough to recognize the characters from images with noise and broken edges. 展开更多
关键词 intelligent vehicle stabling siding detection traffic lights detection self-associative memory light-emitting diode (LED) characters recognition
在线阅读 下载PDF
SVM Model Selection Using PSO for Learning Handwritten Arabic Characters 被引量:2
7
作者 Mamouni El Mamoun Zennaki Mahmoud Sadouni Kaddour 《Computers, Materials & Continua》 SCIE EI 2019年第9期995-1008,共14页
Using Support Vector Machine(SVM)requires the selection of several parameters such as multi-class strategy type(one-against-all or one-against-one),the regularization parameter C,kernel function and their parameters.T... Using Support Vector Machine(SVM)requires the selection of several parameters such as multi-class strategy type(one-against-all or one-against-one),the regularization parameter C,kernel function and their parameters.The choice of these parameters has a great influence on the performance of the final classifier.This paper considers the grid search method and the particle swarm optimization(PSO)technique that have allowed to quickly select and scan a large space of SVM parameters.A comparative study of the SVM models is also presented to examine the convergence speed and the results of each model.SVM is applied to handwritten Arabic characters learning,with a database containing 4840 Arabic characters in their different positions(isolated,beginning,middle and end).Some very promising results have been achieved. 展开更多
关键词 SVM PSO handwritten Arabic grid search character recognition
在线阅读 下载PDF
ERCS: An Efficient and Robust Card Recognition System for Camera-Based Image 被引量:1
8
作者 Zhonghong Ou Baiqiao Xiong +1 位作者 Fenrui Xiao Meina Song 《China Communications》 SCIE CSCD 2020年第12期247-264,共18页
Cards Recognition Systems,(CRSs)are representative computer vision-based applications.They have a broad range of usage scenarios.For example,they can be used to recognize images containing business cards,personal iden... Cards Recognition Systems,(CRSs)are representative computer vision-based applications.They have a broad range of usage scenarios.For example,they can be used to recognize images containing business cards,personal identification cards,and bank cards etc.Even though CRSs have been studied for many years,it is still difficult to recognize cards in camera-based images taken by ordinary devices,e.g.,mobile phones.Diversity of viewpoints and complex backgrounds in the images make the recognition task challenging.Existing systems employing traditional image processing schemes are not robust to varied environment,and are inefficient in dealing with natural images,e.g.,taken by mobile phones.To tackle the problem,we propose a novel framework for card recognition by employing a Convolutional Neutral Network(CNN)based approach.The system localizes the foreground of the image by utilizing a Fully Convolutional Network(FCN).With the help of the foreground map,the system localizes the corners of the card region and employs perspective transformation to alleviate the effects from distortion.Text lines in the card region are detected and recognized by utilizing CNN and Long Short Term Memory,(LSTM).To evaluate the proposed scheme,we collect a large dataset which contains 4,065 images in a variety of shooting scenarios.Experimental results demonstrate the efficacy of the proposed scheme.Specifically,it is able to achieve an accuracy of 90.62%in the end-toend test,outperforming the state-of-the-art. 展开更多
关键词 card localization card recognition optical character recognition CNN
在线阅读 下载PDF
Arabic Optical Character Recognition:A Review 被引量:1
9
作者 Salah Alghyaline 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第6期1825-1861,共37页
This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingl... This study aims to review the latest contributions in Arabic Optical Character Recognition(OCR)during the last decade,which helps interested researchers know the existing techniques and extend or adapt them accordingly.The study describes the characteristics of the Arabic language,different types of OCR systems,different stages of the Arabic OCR system,the researcher’s contributions in each step,and the evaluationmetrics for OCR.The study reviews the existing datasets for the Arabic OCR and their characteristics.Additionally,this study implemented some preprocessing and segmentation stages of Arabic OCR.The study compares the performance of the existing methods in terms of recognition accuracy.In addition to researchers’OCRmethods,commercial and open-source systems are used in the comparison.The Arabic language is morphologically rich and written cursive with dots and diacritics above and under the characters.Most of the existing approaches in the literature were evaluated on isolated characters or isolated words under a controlled environment,and few approaches were tested on pagelevel scripts.Some comparative studies show that the accuracy of the existing Arabic OCR commercial systems is low,under 75%for printed text,and further improvement is needed.Moreover,most of the current approaches are offline OCR systems,and there is no remarkable contribution to online OCR systems. 展开更多
关键词 Arabic Optical Character recognition(OCR) Arabic OCR software Arabic OCR datasets Arabic OCR evaluation
在线阅读 下载PDF
A novel license plate recognition method using HTD and VTD features 被引量:2
10
作者 Zhang Xiangdong Shen Peiyi Li Liangchao Wang Wei Bai Jianhua Zhang Wenbo 《Engineering Sciences》 EI 2010年第1期71-76,共6页
In this paper, a novel method of licence plate recognition (LPR) using the vertical traverse density (VTD) and horizontal traverse density (HTD) is presented. The neutral network algorithm using VTD and HTD features i... In this paper, a novel method of licence plate recognition (LPR) using the vertical traverse density (VTD) and horizontal traverse density (HTD) is presented. The neutral network algorithm using VTD and HTD features is also an innovation. In addition, a so called secondary recognition method which splits characters into different parts is developed. Experimental results show that it is a simple and fast algorithm, which meets the request of real time and nicety performances of LPR and thus has applied value in intelligence transportation system (ITS). 展开更多
关键词 license plate recognition character segment character recognition VTD and HTD features
在线阅读 下载PDF
An on-line free handwritten Chinese character recognition method based on component cascaded HMMs 被引量:1
11
作者 Zhao Wei(赵巍) Liu Jiafeng Tang Xianglong 《High Technology Letters》 EI CAS 2005年第3期301-305,共5页
This paper presents a cascaded Hidden Markov Model (HMM), which allows state's transition, skip and duration. The cascaded HMM extends the way of HMM pattern description of Handwritten Chinese Character (HCC) and... This paper presents a cascaded Hidden Markov Model (HMM), which allows state's transition, skip and duration. The cascaded HMM extends the way of HMM pattern description of Handwritten Chinese Character (HCC) and depicts the behavior of handwritten curve more reliably in terms of the statistic probability. Hence character segmentation and labeling are unnecessary. Viterbi algorithm is integrated in the cascaded HMM after the whole sample sequence of a HCC is input. More than 26,000 component samples are used tor training 407 handwritten component HMMs. At the improved training stage 94 models of 94 Chinese characters are gained by 32,000 samples, Compared with the Segment HMMs approach, the recognition rate of this model tier the tirst candidate is 87.89% and the error rate could be reduced by 12.4%. 展开更多
关键词 chinese character recognition handwritten component HMM cascaded model
在线阅读 下载PDF
CHARACTER DETECTION AND RECOGNITION SYSTEM OF BEER BOTTLES 被引量:1
12
作者 Shen Bangxing Wu Wenjun +2 位作者 Zhang Yepeng Shen Gang Yang Liangen 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2005年第3期467-469,共3页
An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer b... An optical imaging system and a configuration characteristic algorithm are presented to reduce the difficulties in extracting intact characters image with weak contrast, in recognizing characters on fast moving beer bottles. The system consists of a hardware subsystem, including a rotating device, CCD, 16 mm focus lens, a frame grabber card, a penetrating lighting and a computer, and a software subsystem. The software subsystem performs pretreatment, character segmentation and character recognition. In the pretreatment, the original image is filtered with preset threshold to remove isolated spots. Then the horizontal projection and the vertical projection are used respectively to retrieve the character segmentation. Subsequently, the configuration characteristic algorithm is applied to recognize the characters. The experimental results demonstrate that this system can recognize the characters on beer bottles accurately and effectively; the algorithm is proven fast, stable and robust, making it suitable in the industrial environment. 展开更多
关键词 Optical imaging system Raised character recognition Configuration characteristic algorithm
在线阅读 下载PDF
A Vision-Based Fingertip-Writing Character Recognition System 被引量:1
13
作者 Ching-Long Shih Wen-Yo Lee Yu-Te Ku 《Journal of Computer and Communications》 2016年第4期160-168,共9页
This paper presents a vision-based fingertip-writing character recognition system. The overall system is implemented through a CMOS image camera on a FPGA chip. A blue cover is mounted on the top of a finger to simpli... This paper presents a vision-based fingertip-writing character recognition system. The overall system is implemented through a CMOS image camera on a FPGA chip. A blue cover is mounted on the top of a finger to simplify fingertip detection and to enhance recognition accuracy. For each character stroke, 8 sample points (including start and end points) are recorded. 7 tangent angles between consecutive sampled points are also recorded as features. In addition, 3 features angles are extracted: angles of the triangle consisting of the start point, end point and average point of all (8 total) sampled points. According to these key feature angles, a simple template matching K-nearest-neighbor classifier is applied to distinguish each character stroke. Experimental result showed that the system can successfully recognize fingertip-writing character strokes of digits and small lower case letter alphabets with an accuracy of almost 100%. Overall, the proposed finger-tip-writing recognition system provides an easy-to-use and accurate visual character input method. 展开更多
关键词 Visual Character recognition Fingertip Detection Template Matching K-Nearest-Neighbor Classifier FPGA
在线阅读 下载PDF
Libyan Licenses Plate Recognition Using Template Matching Method 被引量:1
14
作者 Alla A. El. Senoussi Abdella 《Journal of Computer and Communications》 2016年第7期62-71,共10页
License plate recognition (LPR) applies image processing and character recognition technology to identify vehicles by automatically reading their license plates. The work presented in this paper aims to create a compu... License plate recognition (LPR) applies image processing and character recognition technology to identify vehicles by automatically reading their license plates. The work presented in this paper aims to create a computer vision system capable of taking real-time input image from a static camera and identifying the license plate from extracted image. This problem is examined in two stages: First the license plate region detection and extraction from background and plate segmentation to sub-images, and second the character recognition stage. The method used for the license plate region detection is based on the assumption that the license plate area is a high concentration of smaller details, making it a region of high intensity of edges. The Sobel filter and their vertical and horizontal projections are used to identify the plate region. The result of testing this stage was an accuracy of 67.5%. The final stage of the LPR system is optical character recognition (OCR). The method adopted for this stage is based on template matching using correlation. Testing the performance of OCR resulted in an overall recognition rate of 87.76%. 展开更多
关键词 License Plate recognition Optical Character recognition Computer Vision System
在线阅读 下载PDF
Design and Implementation of Prototype System for Online Handwritten Uyghur Character Recognition 被引量:1
15
作者 IBRAYIM Mayire HAMDULLA Askar 《Wuhan University Journal of Natural Sciences》 CAS 2012年第2期131-136,共6页
Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on i... Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on its modules.In the preprocessing procedure,we use the linear and nonlinear normalization based on dot density method.Both structural and statistical features are extracted due to the fact that there are some very similar characters in Uyghur literature.In clustering analysis,we adopt the dynamic clustering algorithm based on the minimum spanning tree(MST),and use the k-nearest neighbor matching classification as classifier.The testing results of prototype system show that the recognition rates for characters of the four different types(independent,suffix,intermediate,and initial type) are 74.67%,70.42%,63.33%,and 72.02%,respectively;the recognition rates for the case of five candidates for those characters are 94.34%,94.19%,93.15%,and 95.86%,respectively.The ideas and methods used in this paper have some commonality and usefulness for the recognition of other characters that belong to Altaic languages family. 展开更多
关键词 online handwriting recognition Uyghur characters feature extraction cluster analysis
原文传递
Research on Handwritten Chinese Character Recognition Based on BP Neural Network 被引量:2
16
作者 Zihao Ning 《Modern Electronic Technology》 2022年第1期12-32,共21页
The application of pattern recognition technology enables us to solve various human-computer interaction problems that were difficult to solve before.Handwritten Chinese character recognition,as a hot research object ... The application of pattern recognition technology enables us to solve various human-computer interaction problems that were difficult to solve before.Handwritten Chinese character recognition,as a hot research object in image pattern recognition,has many applications in people’s daily life,and more and more scholars are beginning to study off-line handwritten Chinese character recognition.This paper mainly studies the recognition of handwritten Chinese characters by BP(Back Propagation)neural network.Establish a handwritten Chinese character recognition model based on BP neural network,and then verify the accuracy and feasibility of the neural network through GUI(Graphical User Interface)model established by Matlab.This paper mainly includes the following aspects:Firstly,the preprocessing process of handwritten Chinese character recognition in this paper is analyzed.Among them,image preprocessing mainly includes six processes:graying,binarization,smoothing and denoising,character segmentation,histogram equalization and normalization.Secondly,through the comparative selection of feature extraction methods for handwritten Chinese characters,and through the comparative analysis of the results of three different feature extraction methods,the most suitable feature extraction method for this paper is found.Finally,it is the application of BP neural network in handwritten Chinese character recognition.The establishment,training process and parameter selection of BP neural network are described in detail.The simulation software platform chosen in this paper is Matlab,and the sample images are used to train BP neural network to verify the feasibility of Chinese character recognition.Design the GUI interface of human-computer interaction based on Matlab,show the process and results of handwritten Chinese character recognition,and analyze the experimental results. 展开更多
关键词 Pattern recognition Handwritten Chinese character recognition BP neural network
在线阅读 下载PDF
Instance Segmentation of Characters Recognized in Palmyrene Aramaic Inscriptions
17
作者 Adéla Hamplová Alexey Lyavdansky +3 位作者 TomášNovák Ondrej Svojše David Franc Arnošt Veselý 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第9期2869-2889,共21页
This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The go... This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The goal is to contribute to the preservation and understanding of historical texts,showcasing the potential of modern deep learning methods in archaeological research.Our research culminates in several key findings and scientific contributions.We comprehensively compare the performance of YOLOv8 and Roboflow 3.0 in the context of Palmyrene character segmentation—this comparative analysis mainly focuses on the strengths and weaknesses of each algorithm in this context.We also created and annotated an extensive dataset of Palmyrene inscriptions,a crucial resource for further research in the field.The dataset serves for training and evaluating the segmentation models.We employ comparative evaluation metrics to quantitatively assess the segmentation results,ensuring the reliability and reproducibility of our findings and we present custom visualization tools for predicted segmentation masks.Our study advances the state of the art in semi-automatic reading of Palmyrene inscriptions and establishes a benchmark for future research.The availability of the Palmyrene dataset and the insights into algorithm performance contribute to the broader understanding of historical text analysis. 展开更多
关键词 Optical character recognition instance segmentation Palmyrene ancient languages computer vision
在线阅读 下载PDF
Improved Approach Based on SVM for License Plate Character Recognition
18
作者 王晓华 王晓光 《Journal of Beijing Institute of Technology》 EI CAS 2005年第4期378-381,共4页
An improved approach based on support vector machine (SVM) called the center distance ratio method is presented for license plate character recognition. First the support vectors are pre-extraeted. A minimal set cal... An improved approach based on support vector machine (SVM) called the center distance ratio method is presented for license plate character recognition. First the support vectors are pre-extraeted. A minimal set called the margin vector set, which contains all support vectors, is extracted. These margin vectors compose new training data and construct the classifier by using the general SVM optimized. The experimental resuhs show that the improved SVM method does well at correct rate and training speed. 展开更多
关键词 support vector machine(SVM) center distance ratio method margin vector support vector character recognition
在线阅读 下载PDF
A Novel Siamese Network for Few/Zero-Shot Handwritten Character Recognition Tasks
19
作者 Nagwa Elaraby Sherif Barakat Amira Rezk 《Computers, Materials & Continua》 SCIE EI 2023年第1期1837-1854,共18页
Deep metric learning is one of the recommended methods for the challenge of supporting few/zero-shot learning by deep networks.It depends on building a Siamese architecture of two homogeneous Convolutional Neural Netw... Deep metric learning is one of the recommended methods for the challenge of supporting few/zero-shot learning by deep networks.It depends on building a Siamese architecture of two homogeneous Convolutional Neural Networks(CNNs)for learning a distance function that can map input data from the input space to the feature space.Instead of determining the class of each sample,the Siamese architecture deals with the existence of a few training samples by deciding if the samples share the same class identity or not.The traditional structure for the Siamese architecture was built by forming two CNNs from scratch with randomly initialized weights and trained by binary cross-entropy loss.Building two CNNs from scratch is a trial and error and time-consuming phase.In addition,training with binary crossentropy loss sometimes leads to poor margins.In this paper,a novel Siamese network is proposed and applied to few/zero-shot Handwritten Character Recognition(HCR)tasks.The novelties of the proposed network are in.1)Utilizing transfer learning and using the pre-trained AlexNet as a feature extractor in the Siamese architecture.Fine-tuning a pre-trained network is typically faster and easier than building from scratch.2)Training the Siamese architecture with contrastive loss instead of the binary cross-entropy.Contrastive loss helps the network to learn a nonlinear mapping function that enables it to map the extracted features in the vector space with an optimal way.The proposed network is evaluated on the challenging Chars74K datasets by conducting two experiments.One is for testing the proposed network in few-shot learning while the other is for testing it in zero-shot learning.The recognition accuracy of the proposed network reaches to 85.6%and 82%in few-and zero-shot learning respectively.In addition,a comparison between the performance of the proposed Siamese network and the traditional Siamese CNNs is conducted.The comparison results show that the proposed network achieves higher recognition results in less time.The proposed network reduces the training time from days to hours in both experiments. 展开更多
关键词 Handwritten character recognition(HCR) few-shot learning zero-shot learning deep metric learning transfer learning contrastive loss Chars74K datasets
在线阅读 下载PDF
Sailfish Optimizer with Deep Transfer Learning-Enabled Arabic Handwriting Character Recognition
20
作者 Mohammed Maray Badriyya B.Al-onazi +5 位作者 Jaber S.Alzahrani Saeed Masoud Alshahrani Najm Alotaibi Sana Alazwari Mahmoud Othman Manar Ahmed Hamza 《Computers, Materials & Continua》 SCIE EI 2023年第3期5467-5482,共16页
The recognition of the Arabic characters is a crucial task incomputer vision and Natural Language Processing fields. Some major complicationsin recognizing handwritten texts include distortion and patternvariabilities... The recognition of the Arabic characters is a crucial task incomputer vision and Natural Language Processing fields. Some major complicationsin recognizing handwritten texts include distortion and patternvariabilities. So, the feature extraction process is a significant task in NLPmodels. If the features are automatically selected, it might result in theunavailability of adequate data for accurately forecasting the character classes.But, many features usually create difficulties due to high dimensionality issues.Against this background, the current study develops a Sailfish Optimizer withDeep Transfer Learning-Enabled Arabic Handwriting Character Recognition(SFODTL-AHCR) model. The projected SFODTL-AHCR model primarilyfocuses on identifying the handwritten Arabic characters in the inputimage. The proposed SFODTL-AHCR model pre-processes the input imageby following the Histogram Equalization approach to attain this objective.The Inception with ResNet-v2 model examines the pre-processed image toproduce the feature vectors. The Deep Wavelet Neural Network (DWNN)model is utilized to recognize the handwritten Arabic characters. At last,the SFO algorithm is utilized for fine-tuning the parameters involved in theDWNNmodel to attain better performance. The performance of the proposedSFODTL-AHCR model was validated using a series of images. Extensivecomparative analyses were conducted. The proposed method achieved a maximum accuracy of 99.73%. The outcomes inferred the supremacy of theproposed SFODTL-AHCR model over other approaches. 展开更多
关键词 Arabic language handwritten character recognition deep learning feature extraction hyperparameter tuning
在线阅读 下载PDF
上一页 1 2 4 下一页 到第
使用帮助 返回顶部