Resource Efficient Hardware Implementation for Real-Time Traffic Sign Recognition
1
Authors: Huai-Mao Weng, Ching-Te Chiu | Journal of Transportation Technologies, 2018, Issue 3, pp. 209-231 (23 pages)
Traffic sign recognition (TSR, also called road sign recognition, RSR) is one of the Advanced Driver Assistance System (ADAS) functions in modern cars. To address the two most important concerns, real-time operation and resource efficiency, we propose a high-efficiency hardware implementation for TSR. We divide the TSR procedure into two stages, detection and recognition. In the detection stage, under the assumption that most German traffic signs are red or blue with circular, triangular or rectangular shapes, we use a normalized RGB color transform and single-pass connected component labeling (CCL) to find potential traffic signs efficiently. For single-pass CCL, our contribution is to eliminate the “merge-stack” operations by recording the connected relations of regions in the scan phase and updating the labels in the iterating phase. In the recognition stage, the Histogram of Oriented Gradient (HOG) is used to generate the descriptor of the signs, and the signs are classified with a Support Vector Machine (SVM). In the HOG module, we analyze the minimum number of bits required for different recognition rates. The proposed method achieves a 96.61% detection rate and a 90.85% recognition rate when tested with the GTSDB dataset. Our hardware implementation reduces the storage of CCL and simplifies the HOG computation. The main CCL storage size is reduced by 20% compared with the most advanced design under typical conditions. Using TSMC 90 nm technology, the proposed design operates at a 105 MHz clock rate and processes 135 fps at an image size of 1360 × 800. The chip size is about 1 mm² and the power consumption is close to 8 mW. Therefore, this work is resource efficient and meets the real-time requirement.
Keywords: Traffic Sign Recognition; Advanced Driver Assistance System; Real-Time Processing; Color Segmentation; Connected Component Analysis; Histogram of Oriented Gradient; Support Vector Machine; German Traffic Sign Detection Benchmark; CMOS; ASIC; VLSI
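The detection stage described in this abstract (normalized RGB followed by connected component labeling) can be sketched in software as follows. This is only an illustration under stated assumptions: the function names and the 0.5 red-share threshold are hypothetical, and the labeling shown is the textbook two-pass union-find method, not the paper's single-pass hardware design.

```python
def normalized_rgb(pixel):
    """Return the (r, g, b) shares of a pixel so that they sum to 1 (all 0 for black)."""
    r, g, b = pixel
    total = r + g + b
    if total == 0:
        return (0.0, 0.0, 0.0)
    return (r / total, g / total, b / total)


def is_red(pixel, threshold=0.5):
    # Hypothetical rule: treat a pixel as "red" when its normalized red share is high.
    return normalized_rgb(pixel)[0] >= threshold


def label_components(mask):
    """Two-pass, 4-connected labeling of a binary mask (list of lists of bool)."""
    parent = [0]  # parent[i] is the union-find parent of label i; 0 is background

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    rows, cols = len(mask), len(mask[0])
    labels = [[0] * cols for _ in range(rows)]
    # First pass: assign provisional labels and record equivalences.
    for y in range(rows):
        for x in range(cols):
            if not mask[y][x]:
                continue
            up = labels[y - 1][x] if y > 0 else 0
            left = labels[y][x - 1] if x > 0 else 0
            if up == 0 and left == 0:
                parent.append(len(parent))  # new label is its own root
                labels[y][x] = len(parent) - 1
            elif up and left:
                a, b = find(up), find(left)
                parent[max(a, b)] = min(a, b)  # merge the two regions
                labels[y][x] = min(a, b)
            else:
                labels[y][x] = up or left
    # Second pass: replace each provisional label with its root.
    for y in range(rows):
        for x in range(cols):
            if labels[y][x]:
                labels[y][x] = find(labels[y][x])
    return labels
```

Applying `is_red` to every pixel yields the binary mask that `label_components` consumes; each resulting label marks one candidate sign region.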
Cosic’s Resonance Recognition Model for Protein Sequences and Photon Emission Differentiates Lethal and Non-Lethal Ebola Strains: Implications for Treatment
2
Authors: Nirosha J. Murugan, Lukasz M. Karbowski, Michael A. Persinger | Open Journal of Biophysics, 2015, Issue 1, pp. 35-43 (9 pages)
The Cosic Resonance Recognition Model (RRM) for amino acid sequences was applied to the classes of proteins displayed by four strains (Sudan, Zaire, Reston, Ivory Coast) of Ebola virus that produced either high or minimal numbers of human fatalities. The results clearly differentiated highly lethal and non-lethal strains. Solutions for the two lethal strains exhibited near ultraviolet (~230 nm) photon values while the two asymptomatic forms displayed near infrared (~1000 nm) values. Cross-correlations of spectral densities of the RRM values of the different classes of proteins associated with the genome of the viruses supported this dichotomy. The strongest coefficient occurred only between the Sudan and Zaire strains, and not for any of the other pairs of strains, for sGP, the small glycoprotein that intercalates with the plasma cell membrane to promote insertion of viral contents into cellular space. A surprising, statistically significant cross-spectral correlation occurred between the “spike” glycoprotein component (GP1) of the virus, which is associated with anchoring the virus to the mammalian cell plasma membrane, and the Schumann resonance of the earth, whose intensities are determined by the incidence of equatorial thunderstorms. Previous applications of the RRM to shifting photon wavelengths emitted by melanoma cells adapting to reduced ambient temperature have validated Cosic’s model and have demonstrated very narrow wavelength (about 10 nm) specificity.
One possible ancillary and non-invasive treatment for people in whom the fatal Ebola strains reside would be whole-body application of narrow-band near-infrared light, pulsed as specific physiologically patterned sequences with sufficient radiant flux density to perfuse the entire body volume.
Keywords: Cosic Resonance Recognition Model; Ebola Virus; Fatal vs. Asymptomatic Forms; Ultraviolet vs. Infrared Photon Equivalents; Schumann Resonance; Cross-Spectral Analyses of Viral Proteins
Feature Recognition and Selection Method of the Equipment State Based on Improved Mahalanobis-Taguchi System (Cited by: 1)
3
Authors: WANG Ning, ZHANG Zhuo | Journal of Shanghai Jiaotong University (Science) (indexed in EI), 2020, Issue 2, pp. 214-222 (9 pages)
The Mahalanobis-Taguchi system (MTS) is a data mining and pattern recognition method that identifies the attribute characteristics of multidimensional data by constructing a Mahalanobis distance (MD) measurement scale. In this paper, considering the influence of irregular distribution of the sample data and abnormal variation of the normal data on the accuracy of MTS, a feature recognition and selection model of the equipment state based on the improved MTS is proposed, and two aspects of the model, namely construction of the original Mahalanobis space (MS) and determination of the threshold, are studied. Firstly, the original training sample space is statistically controlled by the X-bar-S control chart, and extreme data of a single characteristic attribute are filtered to reduce the impact of extreme conditions on the accuracy of the model, so as to construct a more robust MS. Furthermore, the box plot method is used to determine the threshold of the model. The stability of the model and its tolerance to extreme conditions are improved by leaving a sufficient range of variation for extreme conditions that are still identified as within the normal range. Finally, the improved model is compared with the traditional one based on the unimproved MTS by using data from the literature. The result shows that, compared with the traditional model, the accuracy and sensitivity of the improved model for state identification are greatly enhanced.
Keywords: Mahalanobis-Taguchi system (MTS); extreme condition; X-bar-S control chart; box plot method; Mahalanobis space (MS); Mahalanobis distance (MD); threshold; feature recognition; equipment state
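The two ingredients named in this abstract, a Mahalanobis distance computed against a reference (normal) sample and a box-plot threshold, can be illustrated with a minimal sketch. The assumptions are loud ones: only two features are used so the covariance inverse can be written by hand, the upper fence is the usual Q3 + 1.5·IQR rule, and the function names are hypothetical. The paper's X-bar-S filtering step and its exact threshold rule are not reproduced here.

```python
import statistics


def mahalanobis_sq_2d(point, samples):
    """Squared Mahalanobis distance of `point` from a set of 2-feature samples."""
    xs = [s[0] for s in samples]
    ys = [s[1] for s in samples]
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    n = len(samples)
    # Sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum((x - mx) ** 2 for x in xs) / (n - 1)
    syy = sum((y - my) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in samples) / (n - 1)
    det = sxx * syy - sxy ** 2
    dx, dy = point[0] - mx, point[1] - my
    # d^T C^{-1} d with the closed-form inverse of a 2x2 matrix.
    return (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det


def boxplot_upper_fence(values, k=1.5):
    """Upper box-plot fence Q3 + k * IQR (assumed threshold rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    return q3 + k * (q3 - q1)


# Hypothetical "normal state" reference data with two features.
normal = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 5)]
reference_distances = [mahalanobis_sq_2d(p, normal) for p in normal]
threshold = boxplot_upper_fence(reference_distances)
is_abnormal = mahalanobis_sq_2d((10, 0), normal) > threshold
```

A new observation is flagged as abnormal when its distance exceeds the fence derived from the normal group's own distances.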
A Study of Visual Recognition of Facial Emotional Expressions in a Normal Aging Population in the Absence of Cognitive Disorders
4
Authors: Philippe Granato, Shreekumar Vinekar, Olivier Godefroy, Jean-Pierre Vangansberghe, Raymond Bruyer | Open Journal of Psychiatry, 2014, Issue 3, pp. 251-260 (10 pages)
Objective: To examine and measure the decision-making processes involved in Visual Recognition of Facial Emotional Expressions (VRFEE) and to study the effects of demographic factors on this process. Method: We evaluated a newly designed software application (M.A.R.I.E.) that permits computerized metric measurement of VRFEE. We administered it to 204 cognitively normal participants ranging in age from 20 to 70 years. Results: We established normative values for the recognition of anger, disgust, joy, fear, surprise and sadness expressed on the faces of three individuals. There was a significant difference in the: 1) measurement (F(8.189) = 3896, p = 0.0001); 2) education level (χ²(12) = 28.4, p = 0.005); 3) face (F(2.195) = 10, p = 0.0001); 4) series (F(8.189) = 28, p = 0.0001); 5) interaction between the identity and recognition of emotions (F(16, 181) = 11, p = 0.0001). However, performance did not differ according to: 1) age (F(6.19669) = 1.35, p = 0.2) or 2) level of education (F(1, 1587) = 0.6, p = 0.4). Conclusions: In healthy participants, the VRFEE remains stable throughout the lifespan when cognitive functions remain optimal. Disgust, sadness, fear, and joy seem to be the four most easily recognized facial emotions, while anger and surprise are not easily recognized. Visual recognition of disgust and fear is independent of aging. The characteristics of a face have a significant influence on the ease with which people recognize expressed emotions (idiosyncrasy). Perception and recognition of emotions is categorical, even when the facial images are integrated in a spectrum of morphs reflecting two different emotions on either side.
Keywords: Recognition; Emotions; M.A.R.I.E.; Aging; Healthy Participants; Emotion Stimulus (ES); Emotion Set (ESet); Emotion Series (ESr); VRFEE; Emotion Recognition (ER); Canonical Emotions (CE); Intermediate Emotions (IE)
MCS HOG Features and SVM Based Handwritten Digit Recognition System
5
Author: Hamayun A. Khan | Journal of Intelligent Learning Systems and Applications, 2017, Issue 2, pp. 21-33 (13 pages)
Digit recognition is an essential element of the process of scanning and converting documents into electronic format. In this work, a new Multiple-Cell Size (MCS) approach is proposed that uses Histogram of Oriented Gradient (HOG) features and a Support Vector Machine (SVM) based classifier for efficient classification of handwritten digits. The HOG-based technique is sensitive to the cell size selected for the relevant feature extraction computations. Hence a new MCS approach has been used to perform HOG analysis and compute the HOG features. The system has been tested on the benchmark MNIST database of handwritten digits, and a classification accuracy of 99.36% has been achieved using an independent test set strategy. A cross-validation analysis of the classification system has also been performed using the 10-fold cross-validation strategy, and a 10-fold classification accuracy of 99.26% has been obtained. The classification performance of the proposed system is superior to existing techniques that use complex procedures, since it achieves on-par or better results using simple operations in both the feature space and the classifier space. The plots of the system’s confusion matrix and Receiver Operating Characteristics (ROC) provide evidence of the superior performance of the proposed MCS HOG and SVM based digit classification system.
Keywords: Handwritten Digit Recognition; MNIST Benchmark Database; HOG Analysis; Multiple-Cell Size HOG Analysis; SVM Classifier; 10-Fold Cross-Validation; Confusion Matrix; Receiver Operating Characteristics
Automatic de-noising and recognition algorithm for drilling fluid pulse signal (Cited by: 1)
6
Authors: HU Yongjian, HUANG Yanfu, LI Xianyi | Petroleum Exploration and Development, 2019, Issue 2, pp. 393-400 (8 pages)
The wavelet forced de-noising algorithm is suitable for de-noising unsteady drilling fluid pulse signals, including baseline drift rectification and two-stage de-noising of the frame synchronization signal and the instruction signal. Two-stage de-noising reduces the impact of baseline drift and determines the automatic peak detection threshold range for signal recognition by distinguishing the different features of the frame synchronization pulse and the instruction pulse. A rising and falling edge relative protruding threshold is defined for peak detection in signal recognition; it makes full use of the degree of change at the signal peak and detects peaks flexibly through combinations of the rising-edge and falling-edge thresholds. A synchronous decoding method was designed to reduce the position uncertainty of the frame synchronization pulse and eliminate the accumulated error of time-base drift: it determines the first instruction pulse position from the position of the frame synchronization pulse and decodes each subsequent instruction pulse by taking the current instruction pulse as the new bit synchronization pulse. Special tool software was developed to tune the algorithm parameters; it has a decoding success rate of about 95% for universal coded signals. For special coded signals with a check byte, the decoding success rate using the automatic threshold adjustment algorithm is as high as 99%.
Keywords: drilling fluid pulse signal; signal processing; decoding success rate; automatic de-noising and recognition; wavelet forced de-noising; peak detection; synchronous decoding
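The abstract does not define the rising and falling edge relative protruding threshold precisely, so the following is only one plausible reading, offered as a sketch: a local maximum is accepted as a pulse peak when both its rise from the preceding valley and its fall to the following valley, each expressed as a fraction of the signal's overall range, reach configurable ratios. The function name `detect_peaks` and the parameters `rise_ratio` and `fall_ratio` are hypothetical.

```python
def detect_peaks(signal, rise_ratio=0.3, fall_ratio=0.3):
    """Return indices of local maxima whose rising and falling edges are both prominent."""
    span = max(signal) - min(signal)
    if span == 0:
        return []  # flat signal: nothing protrudes
    peaks = []
    for i in range(1, len(signal) - 1):
        # Candidate peak: strictly higher than the left neighbor, not lower than the right.
        if not (signal[i - 1] < signal[i] >= signal[i + 1]):
            continue
        # Walk downhill to the left until the preceding valley.
        left = i
        while left > 0 and signal[left - 1] <= signal[left]:
            left -= 1
        # Walk downhill to the right until the following valley.
        right = i
        while right < len(signal) - 1 and signal[right + 1] <= signal[right]:
            right += 1
        rise = (signal[i] - signal[left]) / span
        fall = (signal[i] - signal[right]) / span
        if rise >= rise_ratio and fall >= fall_ratio:
            peaks.append(i)
    return peaks
```

Using separate ratios for the two edges allows asymmetric pulses to be accepted, for example a sharp rise followed by a slow decay, by lowering only `fall_ratio`.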
A YOLOv11-Based Deep Learning Framework for Multi-Class Human Action Recognition
7
Authors: Nayeemul Islam Nayeem, Shirin Mahbuba, Sanjida Islam Disha, Md Rifat Hossain Buiyan, Shakila Rahman, M. Abdullah-Al-Wadud, Jia Uddin | Computers, Materials & Continua, 2025, Issue 10, pp. 1541-1557 (17 pages)
Human activity recognition is a significant area of research in artificial intelligence for surveillance, healthcare, sports, and human-computer interaction applications. The article benchmarks the performance of a You Only Look Once version 11-based (YOLOv11-based) architecture for multi-class human activity recognition. The dataset consists of 14,186 images across 19 activity classes, from dynamic activities such as running and swimming to static activities such as sitting and sleeping. Preprocessing included resizing all images to 512 × 512 pixels, annotating them in YOLO’s bounding box format, and applying data augmentation methods such as flipping, rotation, and cropping to enhance model generalization. The proposed model was trained for 100 epochs with adaptive learning rate methods and hyperparameter optimization for performance improvement, reaching a mAP@0.5 of 74.93% and a mAP@0.5-0.95 of 64.11%, outperforming previous versions of YOLO (v10, v9, and v8) and general-purpose architectures like ResNet50 and EfficientNet. It exhibited improved precision and recall for all activity classes, with high precision values of 0.76 for running, 0.79 for swimming, 0.80 for sitting, and 0.81 for sleeping, and was tested for real-time deployment with an inference time of 8.9 ms per image, being computationally light. The proposed YOLOv11’s improvements are attributed to architectural advancements such as a more complex feature extraction process, better attention modules, and an anchor-free detection mechanism. While YOLOv10 was extremely stable in static activity recognition, YOLOv9 performed well in dynamic environments but suffered from overfitting, and YOLOv8, while being a decent baseline, failed to differentiate between overlapping static activities. The experimental results show the proposed YOLOv11 to be the most appropriate model, providing a good balance between accuracy, computational efficiency, and robustness for real-world deployment. Nevertheless, certain issues remain to be addressed, particularly discriminating between visually similar activities and the reliance on publicly available datasets. Future research will include 3D data and multimodal sensor inputs, such as depth and motion information, to enhance recognition accuracy and generalizability to challenging real-world environments.
Keywords: Human activity recognition; YOLOv11; deep learning; real-time detection; anchor-free detection; attention mechanisms; object detection; image classification; multi-class recognition; surveillance applications
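The mAP@0.5 figure quoted in this abstract rests on intersection over union (IoU): a predicted box counts as a true positive when its IoU with a ground-truth box of the same class is at least 0.5, and mAP@0.5-0.95 averages over thresholds from 0.5 to 0.95. A minimal sketch of that building block follows, together with the conversion from YOLO's normalized (center x, center y, width, height) label format to pixel corners. The function names are illustrative and not taken from the paper or from any YOLO library.

```python
def yolo_to_corners(cx, cy, w, h, img_w, img_h):
    """Convert a normalized YOLO box (center, size) to pixel corners (x1, y1, x2, y2)."""
    x1 = (cx - w / 2) * img_w
    y1 = (cy - h / 2) * img_h
    x2 = (cx + w / 2) * img_w
    y2 = (cy + h / 2) * img_h
    return (x1, y1, x2, y2)


def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Overlap width and height, clamped at zero when the boxes are disjoint.
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, truth_box, threshold=0.5):
    # Matching rule behind mAP@0.5: IoU must reach the threshold.
    return iou(pred_box, truth_box) >= threshold
```

For the 512 × 512 images mentioned above, a label of `0.5 0.5 0.5 0.5` (after the class index) corresponds to a box centered in the image that covers a quarter of its area.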
Real-Time Face Detection and Recognition in Complex Background
8
Authors: Xin Zhang, Thomas Gonnot, Jafar Saniie | Journal of Signal and Information Processing, 2017, Issue 2, pp. 99-112 (14 pages)
This paper provides efficient and robust algorithms for real-time face detection and recognition in complex backgrounds. The algorithms are implemented using a series of signal processing methods including AdaBoost, cascade classifier, Local Binary Pattern (LBP), Haar-like features, facial image pre-processing and Principal Component Analysis (PCA). The AdaBoost algorithm is implemented in a cascade classifier to train the face and eye detectors with robust detection accuracy. The LBP descriptor is used to extract facial features for fast face detection. The eye detection algorithm reduces the false face detection rate. The detected facial image is then processed to correct the orientation and increase the contrast, thereby maintaining high facial recognition accuracy. Finally, the PCA algorithm is used to recognize faces efficiently. Large databases of face and non-face images are used to train and validate the face detection and facial recognition algorithms. The algorithms achieve an overall true-positive rate of 98.8% for face detection and 99.2% for correct facial recognition.
Keywords: Face Detection; Facial Recognition; AdaBoost Algorithm; Cascade Classifier; Local Binary Pattern; Haar-Like Features; Principal Component Analysis
STUDY OF RECOGNITION TECHNIQUE OF RADAR TARGET'S ONE-DIMENSIONAL IMAGES BASED ON RADIAL BASIS FUNCTION NETWORK (Cited by: 1)
9
Authors: Huang Deshuang (黄德双), Bao Zheng (保铮) | Journal of Electronics (China), 1995, Issue 3, pp. 200-210 (11 pages)
This paper studies the application of a Radial Basis Function Network (RBFN), trained by the Recursive Least Square Algorithm (RLSA), to the recognition of one-dimensional images of radar targets. The equivalence between the RBFN and the Parzen window estimate of probability density is proved. It is also pointed out that the I/O functions of the RBFN hidden units can be generalized to a general Parzen window probabilistic kernel function or potential function. The paper discusses the effects of the shape parameter a in the RBFN and the forgetting factor A in the RLSA on the recognition results for three kinds of kernel function (Gaussian, triangle, and double-exponential), and also discusses the relationship between A and the training time of the RBFN.
Keywords: Recognition; Kernel Function; Shape Parameter; Forgotten Factor; One-Dimensional Image; Recursive Least Square; Radial Basis Function Network
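The link between an RBFN and a Parzen window estimate stated in this abstract can be made concrete with a small sketch. It assumes scalar inputs for brevity (the paper works with one-dimensional range profiles, which are vectors), uses common textbook forms of the three kernels, and takes `a` as the shape (width) parameter. All function names are hypothetical. With one hidden unit centered on each training sample and equal output weights 1/(N·a·Z), where Z is the area under the kernel, the network output is exactly the Parzen density estimate.

```python
import math


def gaussian(u):
    return math.exp(-0.5 * u * u)


def triangle(u):
    return max(0.0, 1.0 - abs(u))


def double_exponential(u):
    return math.exp(-abs(u))


def rbf_output(x, centers, weights, kernel, a):
    """Weighted sum of hidden-unit responses; `a` scales the kernel width."""
    return sum(w * kernel(abs(x - c) / a) for c, w in zip(centers, weights))


def parzen_density(x, samples, kernel, a, kernel_area):
    """Parzen window estimate written as an RBFN with equal output weights."""
    weight = 1.0 / (len(samples) * a * kernel_area)
    return rbf_output(x, samples, [weight] * len(samples), kernel, a)


# Areas under the unnormalized kernels above, needed for a proper density.
KERNEL_AREAS = {
    gaussian: math.sqrt(2.0 * math.pi),
    triangle: 1.0,
    double_exponential: 2.0,
}
```

A trained RBFN differs only in that its output weights (and possibly its centers) are learned, for example by recursive least squares, rather than fixed to equal values.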
An Integrated Face Tracking and Facial Expression Recognition System
10
Authors: Angappan Geetha, Venkatachalam Ramalingam, Sengottaiyan Palanivel | Journal of Intelligent Learning Systems and Applications, 2011, Issue 4, pp. 201-208 (8 pages)
This article proposes a feature extraction method for integrated face tracking and facial expression recognition in real-time video. The method proposed by Viola and Jones [1] is used to detect the face region in the first frame of the video. A rectangular bounding box is fitted over the face region, and the detected face is tracked in successive frames using a cascaded Support Vector Machine (SVM) and a cascaded Radial Basis Function Neural Network (RBFNN). Haar-like features are extracted from the detected face region and used to create the cascaded SVM and RBFNN classifiers. Each stage of the SVM classifier and the RBFNN classifier rejects non-face regions and passes face regions to the next stage in the cascade, thereby tracking the face efficiently. Tracking performance is evaluated using one hour of video data, and the performance of the cascaded SVM is compared with that of the cascaded RBFNN. The experimental results show that the proposed cascaded SVM classifier performs better than the RBFNN and than the methods described in the literature that use a single SVM classifier [2]. While the face is being tracked, features are extracted from the mouth region for expression recognition. The features are modelled using a multi-class SVM. The SVM finds an optimal hyperplane to distinguish different facial expressions with an accuracy of 96.0%.
Keywords: Face Detection; Face Tracking; Feature Extraction; Facial Expression Recognition; Cascaded Support Vector Machine; Cascaded Radial Basis Function Neural Network
Using Speech Recognition in Learning Primary School Mathematics via Explain, Instruct and Facilitate Techniques (Cited by: 1)
11
Authors: Ab Rahman Ahmad, Sami M. Halawani, Samir K. Boucetta | Journal of Software Engineering and Applications, 2014, Issue 4, pp. 233-255 (23 pages)
Over the past decade, the application of Information and Communication Technologies has transformed traditional teaching and learning into a computer-based practice. This evolution has resulted from the emergence of digital systems and has greatly affected global education and socio-cultural development. Multimedia has been absorbed into the education sector to produce a new learning concept that combines educational and entertainment approaches. This research is concerned with the application of Windows Speech Recognition and the Microsoft Visual Basic 2008 Integrated/Interactive Development Environment in developing a Multimedia-Assisted Courseware prototype for Primary School Mathematics content, namely single digits and addition. The teaching and learning techniques Explain, Instruct and Facilitate are proposed; these can be viewed as an instructor-centered strategy, two-way communication between instructors and learners, and learners' active participation, respectively. The prototype is called M-EIF and uses only users' voices as input; hence Windows Speech Recognition must be activated before a test run.
Keywords: Explain, Instruct and Facilitate Techniques; Multimedia-Assisted Courseware; Primary School Mathematics; Visual Natural Language; Windows Speech Recognition
Neural Network-Powered License Plate Recognition System Design
12
Authors: Sakib Hasan, Md Nagib Mahfuz Sunny, Abdullah Al Nahian, Mohammad Yasin | Engineering (Scientific Research), 2024, Issue 9, pp. 284-300 (17 pages)
Scientific inquiry and research have yielded numerous benefits for intelligent traffic control systems, particularly in automatic license plate recognition for vehicles. The design of license plate recognition algorithms has been digitalized through the use of neural networks. There is a growing demand for vehicle surveillance due to the need for efficient vehicle processing and traffic management. The design, development, and implementation of a license plate recognition system hold significant social, economic, and academic importance. The study aims to present contemporary methodologies and empirical findings pertaining to automated license plate recognition. The automatic license plate recognition algorithm focuses on image extraction, character segmentation, and recognition. Character segmentation was identified as the most challenging of these tasks based on our observations. The license plate recognition project that we designed demonstrated the effectiveness of this method across various observed conditions, particularly in low-light environments such as periods of limited illumination or inclement weather with precipitation. The method was tested on a sample of fifty images, resulting in a 100% accuracy rate. The findings of this study demonstrate the project’s ability to effectively determine the optimal outcomes of simulations.
Keywords: Intelligent Traffic Control Systems; Automatic License Plate Recognition (ALPR); Neural Networks; Vehicle Surveillance; Traffic Management; License Plate Recognition Algorithms; Image Extraction; Character Segmentation; Character Recognition; Low-Light Environments; Inclement Weather; Empirical Findings; Algorithm Accuracy; Simulation Outcomes; Digitalization
DM-L Based Feature Extraction and Classifier Ensemble for Object Recognition
13
Author: Hamayun A. Khan | Journal of Signal and Information Processing, 2018, Issue 2, pp. 92-110 (19 pages)
Deep Learning is a powerful technique that is widely applied to image recognition and natural language processing, among many other tasks. In this work, we propose an efficient technique that uses pre-trained Convolutional Neural Network (CNN) architectures to extract powerful features from images for object recognition. We build on the existing concept of extending the learning from pre-trained CNNs to new databases through activations by proposing to consider multiple deep layers. We exploit the progressive learning that happens at the various intermediate layers of the CNNs to construct Deep Multi-Layer (DM-L) based feature extraction vectors and achieve excellent object recognition performance. Two popular pre-trained CNN architecture models, VGG_16 and VGG_19, are used in this work to extract the feature sets from three deep fully connected layers inside the models, namely “fc6”, “fc7” and “fc8”. Using the Principal Component Analysis (PCA) technique, the dimensionality of the DM-L feature vectors is reduced to form powerful feature vectors that are fed to an external classifier ensemble for classification, instead of the Softmax-based classification layers of the two original pre-trained CNN models. The proposed DM-L technique has been applied to the benchmark Caltech-101 object recognition database. Conventional wisdom may suggest that feature extraction based on the deepest layer, i.e. “fc8” rather than “fc6”, will result in the best recognition performance, but our results show otherwise for the two models considered.
Our experiments have revealed that, for the two models under consideration, the “fc6” based feature vectors achieve the best recognition performance. State-of-the-art recognition performances of 91.17% and 91.35% have been achieved with the “fc6” based feature vectors for the VGG_16 and VGG_19 models respectively. This performance was achieved using 30 sample images per class, whereas the proposed system is capable of improved performance when all sample images per class are considered. Our research shows that, for feature extraction based on CNNs, multiple layers should be considered, and the layer that maximizes recognition performance can then be selected.
Keywords: Deep Learning; Object Recognition; CNN; Deep Multi-Layer Feature Extraction; Principal Component Analysis; Classifier Ensemble; Caltech-101 Benchmark Database
Augmented Deep-Feature-Based Ear Recognition Using Increased Discriminatory Soft Biometrics
14
Author: Emad Sami Jaha | Computer Modeling in Engineering & Sciences, 2025, Issue 9, pp. 3645-3678 (34 pages)
The human ear has been substantiated as a viable nonintrusive biometric modality for identification or verification. Among many feasible techniques for ear biometric recognition, convolutional neural network (CNN) models have recently offered high-performance and reliable systems. However, their performance can still be further improved using the capabilities of soft biometrics, a research question yet to be investigated. This research aims to augment traditional CNN-based ear recognition performance by adding more discriminative ear soft biometric traits. It proposes a novel framework of augmented ear identification/verification that uses a group of discriminative categorical soft biometrics and derives new, more perceptive, comparative soft biometrics for feature-level fusion with hard biometric deep features. It conducts several identification and verification experiments for performance evaluation, analysis, and comparison while varying the ear image datasets, hard biometric deep-feature extractors, soft biometric augmentation methods, and classifiers used. The experimental work yields promising results, reaching up to 99.94% accuracy and up to 14% improvement using the AMI and AMIC datasets, along with their corresponding soft biometric label data. The results confirm the superiority of the proposed augmented approaches over their standard counterparts and emphasize the robustness of the new comparative ear soft biometrics over their categorical peers.
Keywords: Ear recognition; soft biometrics; human identification; human verification; comparative labeling; ranking SVM; deep features; feature-level fusion; convolutional neural networks (CNNs); deep learning
Research on the visualization method of lithology intelligent recognition based on deep learning using mine tunnel images
15
作者 Aiai Wang Shuai Cao +1 位作者 Erol Yilmaz Hui Cao 《International Journal of Minerals,Metallurgy and Materials》 2026年第1期141-152,共12页
An image processing and deep learning method for identifying different types of rock images was proposed.Preprocessing,such as rock image acquisition,gray scaling,Gaussian blurring,and feature dimensionality reduction... An image processing and deep learning method for identifying different types of rock images was proposed.Preprocessing,such as rock image acquisition,gray scaling,Gaussian blurring,and feature dimensionality reduction,was conducted to extract useful feature information and recognize and classify rock images using Tensor Flow-based convolutional neural network(CNN)and Py Qt5.A rock image dataset was established and separated into workouts,confirmation sets,and test sets.The framework was subsequently compiled and trained.The categorization approach was evaluated using image data from the validation and test datasets,and key metrics,such as accuracy,precision,and recall,were analyzed.Finally,the classification model conducted a probabilistic analysis of the measured data to determine the equivalent lithological type for each image.The experimental results indicated that the method combining deep learning,Tensor Flow-based CNN,and Py Qt5 to recognize and classify rock images has an accuracy rate of up to 98.8%,and can be successfully utilized for rock image recognition.The system can be extended to geological exploration,mine engineering,and other rock and mineral resource development to more efficiently and accurately recognize rock samples.Moreover,it can match them with the intelligent support design system to effectively improve the reliability and economy of the support scheme.The system can serve as a reference for supporting the design of other mining and underground space projects. 展开更多
Keywords: rock picture recognition; convolutional neural network; intelligent support for roadways; deep learning; lithology determination
From ChatGPT to DeepSeek: Potential uses of artificial intelligence in early symptom recognition for stroke care (Cited by: 1)
16
Authors: Wai Yan Lam, Sunny Chi Lik Au. Journal of Acute Disease, 2025, Issue 3, pp. 13-16 (4 pages)
In the era of artificial intelligence (AI), healthcare and medical sciences are inseparable from different AI technologies [1]. ChatGPT once shocked the medical field, but the latest AI model, DeepSeek, has recently taken the lead [2]. PubMed-indexed publications on DeepSeek are evolving [3] but are limited to editorials and news articles. In this Letter, we explore the use of DeepSeek in early symptom recognition for stroke care. To the best of our knowledge, this is the first DeepSeek-related writing on stroke.
Keywords: stroke care; indexed publications; medical sciences; DeepSeek; artificial intelligence; AI healthcare; early symptom recognition; early symptoms recognition
Pattern recognition of messily grown nanowire morphologies applying multi-layer connected self-organized feature maps
17
Authors: Qing Liu, Hejun Li, [+1 more author], Yulei Zhang, Zhigang Zhao. Journal of Materials Science & Technology (SCIE, EI, CAS, CSCD), 2019, Issue 5, pp. 946-956 (11 pages)
Multi-layer connected self-organizing feature maps (SOFMs) and the associated learning procedure were proposed to achieve efficient recognition and clustering of messily grown nanowire morphologies. The network is made up of several paratactic 2-D SOFMs with inter-layer connections. Virtual morphologies were generated by Monte Carlo simulations to serve as the training samples. With unsupervised inner-layer and inter-layer learning, the neural network can cluster different morphologies of messily grown nanowires and build connections between the morphological microstructure and the geometrical features of the nanowires within. The proposed networks were then applied to the recognition and quantitative estimation of experimental morphologies. Results show that the trained SOFMs are able to cluster the morphologies and recognize the average length and quantity of the messily grown nanowires within. The inter-layer connections between winning neurons on each competitive layer have a significant influence on the relations between the microstructure of the morphology and the physical parameters of the nanowires within.
Keywords: Artificial neural networks; Self-organizing feature maps; Monte Carlo simulation; Pattern recognition; Messily grown nanowire morphologies
Analytic Methods for Quality Control of Scientific Publications Part VI: Presentation in Research Gate, Journal Indexing, and Recognition
18
Author: Ilia Brondz. International Journal of Analytical Mass Spectrometry and Chromatography, 2019, Issue 4, pp. 37-44 (8 pages)
In the world of science, recognition of scientific performance is strongly correlated with publication visibility and the interest generated among other researchers, as evidenced by downloads and citations. A published paper's numbers of downloads and citations are the best indices of its importance and are useful measures of a researcher's performance. However, a published paper should be evaluated and indexed independently, and the prestige of the journal in which it is published should not influence the value of the paper itself. By participating in and presenting at congresses and international meetings, scientists strongly increase the visibility of their results and the recognition of their research; this also promotes their publications. Status in Research Gate (RG), expressed as the RG Score, the Percentile, and the h-index, gives researchers feedback about their performance and their place and prestige within the scientific community. RG has become an excellent tool for disseminating scientific results and connecting researchers worldwide. RG also allows researchers to present achievements other than publications (e.g., membership in recognized associations such as the American Chemical Society, a biography in Marquis Who's Who in the World, awards received, and/or ongoing projects). This paper discusses how the RG Score, Percentile, and h-index are calculated, whether these methods are correct, and alternative criteria. RG also lists papers with falsified results and the journals that publish them. Thus, it may be appropriate to reduce the indices for such journals, authors, and the institutions with which these authors are affiliated.
Keywords: Education Institution; Quality of Publication; Recognition in Scientific Community; Criteria of Judgment for Publication; Indexing; Falsified Research
Age Invariant Face Recognition Using Convolutional Neural Networks and Set Distances (Cited by: 4)
19
Authors: Hachim El Khiyari, Harry Wechsler. Journal of Information Security, 2017, Issue 3, pp. 174-185 (12 pages)
Biometric security systems based on facial characteristics face a challenging task due to variability in the intrapersonal facial appearance of subjects, traced to factors such as pose, illumination, expression, and aging. This paper proposes a deep learning and set-based approach to face recognition subject to aging. The images for each subject taken at various times are treated as a single set, which is then compared to sets of images belonging to other subjects. Facial features are extracted using a convolutional neural network characteristic of deep learning. Our experimental results show that set-based recognition performs better than the singleton-based approach for both face identification and face verification. We also find that, with set-based recognition, it is easier to recognize older subjects from younger ones than younger subjects from older ones.
Keywords: Aging; Biometrics; Convolutional Neural Networks (CNN); Deep Learning; Image Set-Based Face Recognition (ISFR); Transfer Learning
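The set-based comparison described in the abstract above can be illustrated with a minimal sketch. The abstract does not say which set distances the authors use, so the two shown here (minimum pairwise distance and Hausdorff distance) are assumptions chosen for illustration. The feature vectors, subject names, and the `identify` helper are hypothetical stand-ins; in the paper the features come from a convolutional neural network.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def min_set_distance(set_a, set_b):
    """Smallest pairwise distance between any vector in set_a and any in set_b."""
    return min(euclidean(a, b) for a in set_a for b in set_b)


def hausdorff_distance(set_a, set_b):
    """Symmetric Hausdorff distance between two sets of feature vectors."""
    def directed(src, dst):
        # For each source vector, find its nearest neighbor in dst,
        # then take the worst (largest) of those nearest distances.
        return max(min(euclidean(s, d) for d in dst) for s in src)

    return max(directed(set_a, set_b), directed(set_b, set_a))


def identify(probe_set, gallery, distance=min_set_distance):
    """Return the gallery subject whose image set is closest to the probe set."""
    return min(gallery, key=lambda subject: distance(probe_set, gallery[subject]))


# Hypothetical 2-D features; real CNN features would have many more dimensions.
gallery = {
    "subject_a": [[0.0, 0.0], [0.1, 0.2]],
    "subject_b": [[5.0, 5.0], [5.5, 4.5]],
}
probe = [[0.2, 0.1], [0.0, 0.3]]

print(identify(probe, gallery))
print(identify(probe, gallery, distance=hausdorff_distance))
```

Treating each subject as a set means one probe image that happens to resemble another subject is less likely to decide the match on its own, which is one plausible reading of why the abstract reports better results for set-based than for singleton-based recognition.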
A Comparison of Classifiers in Performing Speaker Accent Recognition Using MFCCs
20
Authors: Zichen Ma, Ernest Fokoué. Open Journal of Statistics, 2014, Issue 4, pp. 258-266 (9 pages)
An algorithm involving Mel-Frequency Cepstral Coefficients (MFCCs) is provided to perform signal feature extraction for the task of speaker accent recognition. Different classifiers are then compared based on the MFCC feature. For each signal, the mean vector of the MFCC matrix is used as the input vector for pattern recognition. A sample of 330 signals, containing 165 US voices and 165 non-US voices, is analyzed. In the comparison, k-nearest neighbors yields the highest average test accuracy over a cross-validation of size 500 and requires the least computation time.
Keywords: Speaker Accent Recognition; Mel-Frequency Cepstral Coefficients (MFCCs); Discriminant Analysis; Support Vector Machines (SVMs); k-Nearest Neighbors
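The feature step in the abstract above (the mean vector of each signal's MFCC matrix, fed to a classifier) can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' implementation: the MFCC matrices are assumed to have one row per frame and one column per coefficient, the numbers are invented, and `mean_vector` and `knn_predict` are hypothetical helper names. Computing real MFCCs from audio would need a signal-processing library and is not shown.

```python
import math
from collections import Counter


def mean_vector(mfcc_matrix):
    """Average each coefficient over all frames (assumed layout: rows are frames)."""
    n_frames = len(mfcc_matrix)
    return [sum(column) / n_frames for column in zip(*mfcc_matrix)]


def knn_predict(train_vectors, train_labels, query, k=3):
    """Majority vote among the k training vectors nearest to the query."""
    ranked = sorted(
        zip(train_vectors, train_labels),
        key=lambda pair: math.dist(pair[0], query),
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Invented MFCC matrices (2 frames x 2 coefficients) with accent labels.
toy_signals = [
    ([[0.0, 0.0], [0.2, 0.2]], "US"),
    ([[0.0, 0.2], [0.2, 0.0]], "US"),
    ([[0.3, 0.1], [0.1, 0.1]], "US"),
    ([[1.0, 1.0], [1.2, 0.8]], "non-US"),
    ([[0.9, 1.1], [0.9, 1.1]], "non-US"),
    ([[1.0, 0.8], [1.0, 1.2]], "non-US"),
]

train_vectors = [mean_vector(matrix) for matrix, _ in toy_signals]
train_labels = [label for _, label in toy_signals]

# Classify a new signal from the mean vector of its (invented) MFCC matrix.
query = mean_vector([[0.0, 0.1], [0.2, 0.1]])
print(knn_predict(train_vectors, train_labels, query, k=3))
```

An odd `k` is used so that a two-class vote cannot tie.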