期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Pashto Characters Recognition Using Multi-Class Enabled Support Vector Machine
1
作者 Sulaiman Khan Shah Nazir +1 位作者 Habib Ullah Khan Anwar Hussain 《Computers, Materials & Continua》 SCIE EI 2021年第6期2831-2844,共14页
During the last two decades signicant work has been reported in the eld of cursive language’s recognition especially,in the Arabic,the Urdu and the Persian languages.The unavailability of such work in the Pashto lang... During the last two decades signicant work has been reported in the eld of cursive language’s recognition especially,in the Arabic,the Urdu and the Persian languages.The unavailability of such work in the Pashto language is because of:the absence of a standard database and of signicant research work that ultimately acts as a big barrier for the research community.The slight change in the Pashto characters’shape is an additional challenge for researchers.This paper presents an efcient OCR system for the handwritten Pashto characters based on multi-class enabled support vector machine using manifold feature extraction techniques.These feature extraction techniques include,tools such as zoning feature extractor,discrete cosine transform,discrete wavelet transform,and Gabor lters and histogram of oriented gradients.A hybrid feature map is developed by combining the manifold feature maps.This research work is performed by developing a medium-sized dataset of handwritten Pashto characters that encapsulate 200 handwritten samples for each 44 characters in the Pashto language.Recognition results are generated for the proposed model based on a manifold and hybrid feature map.An overall accuracy rates of 63.30%,65.13%,68.55%,68.28%,67.02%and 83%are generated based on a zoning technique,HoGs,Gabor lter,DCT,DWT and hybrid feature maps respectively.Applicability of the proposed model is also tested by comparing its results with a convolution neural network model.The convolution neural network-based model generated an accuracy rate of 81.02%smaller than the multi-class support vector machine.The highest accuracy rate of 83%for the multi-class SVM model based on a hybrid feature map reects the applicability of the proposed model. 展开更多
关键词 pashto multi-class support vector machine handwritten characters database ZONING and histogram of oriented gradients
在线阅读 下载PDF
Baseline Isolated Printed Text Image Database for Pashto Script Recognition
2
作者 Arfa Siddiqu Abdul Basit +3 位作者 Waheed Noor Muhammad Asfandyar Khan M.Saeed H.Kakar Azam Khan 《Intelligent Automation & Soft Computing》 SCIE 2023年第7期875-885,共11页
The optical character recognition for the right to left and cursive languages such as Arabic is challenging and received little attention from researchers in the past compared to the other Latin languages.Moreover,the... The optical character recognition for the right to left and cursive languages such as Arabic is challenging and received little attention from researchers in the past compared to the other Latin languages.Moreover,the absence of a standard publicly available dataset for several low-resource lan-guages,including the Pashto language remained a hurdle in the advancement of language processing.Realizing that,a clean dataset is the fundamental and core requirement of character recognition,this research begins with dataset generation and aims at a system capable of complete language understanding.Keeping in view the complete and full autonomous recognition of the cursive Pashto script.The first achievement of this research is a clean and standard dataset for the isolated characters of the Pashto script.In this paper,a database of isolated Pashto characters for forty four alphabets using various font styles has been introduced.In order to overcome the font style shortage,the graphical software Inkscape has been used to generate sufficient image data samples for each character.The dataset has been pre-processed and reduced in dimensions to 32×32 pixels,and further converted into the binary format with a black background and white text so that it resembles the Modified National Institute of Standards and Technology(MNIST)database.The benchmark database is publicly available for further research on the standard GitHub and Kaggle database servers both in pixel and Comma Separated Values(CSV)formats. 展开更多
关键词 Text-image database optical character recognition(OCR) pashto isolated characters visual recognition autonomous language understanding deep learning convolutional neural network(CNN)
在线阅读 下载PDF
阿富汗主流媒体涉华报道信源选择及影响分析 被引量:3
3
作者 何杰 《情报杂志》 CSSCI 北大核心 2022年第8期80-86,共7页
[研究目的]阿富汗主流媒体是了解阿富汗对华认知的重要窗口。目前阿富汗局势发生重大变化,媒体生态格局可能发生改变,有必要加强和改善中国与阿富汗媒体和社会的对话沟通。[研究方法]以阿富汗主流媒体黎明新闻频道、帕支瓦克阿富汗新闻... [研究目的]阿富汗主流媒体是了解阿富汗对华认知的重要窗口。目前阿富汗局势发生重大变化,媒体生态格局可能发生改变,有必要加强和改善中国与阿富汗媒体和社会的对话沟通。[研究方法]以阿富汗主流媒体黎明新闻频道、帕支瓦克阿富汗新闻社2012-2020年的716篇普什图语涉华报道为研究样本,从信源的角度探讨阿富汗主流媒体涉华报道的特性,进而探讨这些特性对涉华报道本身的态度、传播效果的影响。[研究结论]研究发现,阿富汗主流媒体涉华报道在信源选择上主要依赖来自阿富汗、中国政府官方的信源,同时信源的态度倾向对涉华报道的态度有明显的潜在影响。基于此,中国应构建“政府—媒体—社会(公众)”多元主体协同参与的“大合唱”式的传播主体格局,通过汇聚更多的资源和力量,达到增强对阿富汗国际传播效果的目的。 展开更多
关键词 阿富汗 主流媒体 普什图语 涉华报道 信源 中国国家形象 媒体合作
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部