Funding: Funded by the Deanship of Graduate Studies and Scientific Research at Jouf University under Grant No. (DGSSR-2024-02-01264).
Abstract: Automated essay scoring (AES) systems have gained significant importance in educational settings, offering a scalable, efficient, and objective method for evaluating student essays. However, developing AES systems for Arabic poses distinct challenges due to the language’s complex morphology, diglossia, and the scarcity of annotated datasets. This paper presents a hybrid approach to Arabic AES that combines text-based, vector-based, and embedding-based similarity measures to improve essay scoring accuracy while minimizing the training data required. Using a large Arabic essay dataset categorized into thematic groups, the study conducted four experiments to evaluate the impact of feature selection, data size, and model performance. Experiment 1 established a baseline using a non-machine-learning approach, selecting the top-N correlated features to predict essay scores. The subsequent experiments employed 5-fold cross-validation. Experiment 2 showed that combining embedding-based, text-based, and vector-based features in a Random Forest (RF) model achieved an R² of 88.92% and an accuracy of 83.3% within a 0.5-point tolerance. Experiment 3 further refined the feature selection process, demonstrating that 19 correlated features yielded optimal results, improving R² to 88.95%. In Experiment 4, a data-efficient training approach was introduced in which the training data portion increased from 5% to 50%. The study found that using just 10% of the data achieved near-peak performance, with an R² of 85.49%, an effective trade-off between performance and computational cost. These findings highlight the potential of the hybrid approach for developing scalable Arabic AES systems, especially in low-resource environments, addressing linguistic challenges while ensuring efficient data usage.
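The two metrics reported for the essay-scoring experiments, R² and accuracy within a 0.5-point tolerance, can be illustrated with a minimal sketch. The function names and the sample scores below are assumptions made for illustration; they are not taken from the paper, and the paper's own evaluation code may differ.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def accuracy_within_tolerance(y_true, y_pred, tolerance=0.5):
    """Share of predictions whose absolute error is at most `tolerance`."""
    hits = sum(1 for t, p in zip(y_true, y_pred) if abs(t - p) <= tolerance)
    return hits / len(y_true)


# Made-up scores, for illustration only (not data from the study).
human_scores = [2.0, 3.5, 4.0, 1.0, 5.0]
model_scores = [2.25, 3.0, 4.75, 1.0, 4.5]

print(f"R^2: {r_squared(human_scores, model_scores):.4f}")
print(f"Accuracy within 0.5: {accuracy_within_tolerance(human_scores, model_scores):.2f}")
```

A tolerance-based accuracy treats a regression output as correct when it lands close enough to the human score, which is why it can be reported alongside R² for the same model.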
Funding: This work was funded by the Deanship of Scientific Research at Jouf University under Grant No. (DSR-2022-RG-0104).
Abstract: Palmprint identification has been used over the last two decades in many biometric systems. High-dimensional data with many uncorrelated and duplicated features remains difficult to handle because of its computational complexity. This paper presents an interactive authentication approach based on deep learning and feature selection that supports Palmprint authentication. The proposed model has two stages of learning. The first stage transfers the VGG-16 model pre-trained on ImageNet to a Palmprint-specific feature extraction model. The second stage uses the VGG-16 Palmprint feature extractor in a Siamese network to learn Palmprint similarity. The proposed model achieves robust and reliable end-to-end Palmprint authentication by extracting convolutional features with VGG-16 Palmprint and learning the similarity of two input Palmprints with the Siamese network. The second stage uses the CASIA dataset to train and test the Siamese network. The suggested model outperforms comparable deep learning studies, achieving accuracy and EER of 91.8% and 0.082%, respectively, on the CASIA left-hand images and accuracy and EER of 91.7% and 0.084, respectively, on the CASIA right-hand images.
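The equal error rate (EER) quoted above is the operating point at which the false acceptance rate and the false rejection rate coincide. The sketch below estimates it from two lists of pair distances, as a Siamese network might produce them. The distance values and the simple threshold sweep are assumptions for illustration, not the paper's procedure.

```python
def equal_error_rate(genuine, impostor):
    """Estimate the EER from distances of genuine and impostor pairs.

    A pair is accepted when its distance is <= threshold. The threshold
    with the smallest |FAR - FRR| gap is chosen and the mean of the two
    rates at that point is returned.
    """
    best_gap, eer = None, None
    for threshold in sorted(set(genuine) | set(impostor)):
        far = sum(d <= threshold for d in impostor) / len(impostor)  # false accepts
        frr = sum(d > threshold for d in genuine) / len(genuine)     # false rejects
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Made-up distances, for illustration only.
genuine_distances = [0.1, 0.2, 0.3, 0.6]    # same-palm pairs
impostor_distances = [0.4, 0.7, 0.8, 0.9]   # different-palm pairs
print(equal_error_rate(genuine_distances, impostor_distances))
```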
Funding: Funded by the Deanship of Scientific Research at Jouf University under Grant No. (DSR-2022-RG-0104).
Abstract: The convolutional neural network (CNN) is one of the main algorithms applied in deep transfer learning for classifying two essential types of liver lesions: hemangioma and hepatocellular carcinoma (HCC). Ultrasound images, which are commonly available and have low cost and low risk compared to computerized tomography (CT) scan images, are used as input for the model. A total of 350 ultrasound images belonging to 59 patients are used. The two lesion classes contain 202 and 148 images, respectively. These images were collected from ultrasoundcases.info (28 hemangioma patients and 11 HCC patients), the Department of Radiology at the University of Washington (7 HCC patients), the Atlas of Ultrasound Germany (3 HCC patients), and Radiopedia and others (10 HCC patients). The ultrasound images are divided into 225, 52, and 73 for training, validation, and testing. A data augmentation technique is used to enhance the validation performance. We propose an approach based on ensembles of the best-selected deep transfer models from the off-the-shelf models: VGG16, VGG19, DenseNet, Inception, InceptionResNet, ResNet, and EfficientNet. After tuning both the feature extraction and the classification layers, the best models are selected. Validation accuracy is used for model tuning and selection. Accuracy, sensitivity, specificity, and AUROC are used to evaluate the performance. The experiments are conducted in five stages. The first stage evaluates the base model performance by training the off-the-shelf models. The best accuracy obtained in the first stage is 83.5%. In the second stage, we augmented the data and retrained the off-the-shelf models with the augmented data. The best accuracy obtained in the second stage is 86.3%. In the third stage, we tuned the feature extraction layers of the off-the-shelf models. The best accuracy obtained in the third stage is 89%. In the fourth stage, we fine-tuned the classification layer and obtained a best accuracy of 93%. In the fifth stage, we applied the ensemble approach using the three best-performing models and obtained an accuracy, specificity, sensitivity, and AUROC of 94%, 93.7%, 95.1%, and 0.944, respectively.
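A minimal sketch of how three binary classifiers could be combined and then scored with accuracy, sensitivity, and specificity follows. The abstract does not state how the ensemble merges its members, so majority voting is an assumption here, as are the label encoding (1 for HCC, 0 for hemangioma) and the sample predictions.

```python
def majority_vote(predictions_per_model):
    """Combine 0/1 predictions from an odd number of models by majority."""
    n_models = len(predictions_per_model)
    return [1 if sum(votes) * 2 > n_models else 0
            for votes in zip(*predictions_per_model)]


def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity (recall on 1s), and specificity (recall on 0s)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Made-up labels and predictions; assumed encoding: 1 = HCC, 0 = hemangioma.
y_true = [1, 1, 1, 0, 0, 0]
model_a = [1, 1, 0, 0, 0, 1]
model_b = [1, 0, 1, 0, 1, 0]
model_c = [1, 1, 1, 0, 0, 0]

ensemble = majority_vote([model_a, model_b, model_c])
print(binary_metrics(y_true, model_a))
print(binary_metrics(y_true, ensemble))
```

In this toy case the individual errors of the first two models fall on different images, so the vote corrects them. That is the usual motivation for ensembling models whose mistakes are not strongly correlated.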
Funding: This work is supported in part by the Deanship of Scientific Research at Jouf University under Grant No. (CV-28–41).
Abstract: Accurate forecasting of emerging infectious diseases can guide public health officials in making appropriate decisions related to the allocation of public health resources. Due to the exponential spread of the COVID-19 infection worldwide, several computational models for forecasting the transmission and mortality rates of COVID-19 have been proposed in the literature. To accelerate scientific and public health insights into the spread and impact of COVID-19, Google released the Google COVID-19 search trends symptoms open-access dataset. Our objective is to develop 7- and 14-day-ahead forecasting models of COVID-19 transmission and mortality in the US using the Google search trends for COVID-19-related symptoms. Specifically, we propose a stacked long short-term memory (SLSTM) architecture for predicting COVID-19 confirmed and death cases using historical time series data combined with auxiliary time series data from the Google COVID-19 search trends symptoms dataset. Taking the SLSTM networks trained on historical data only as the base models, the base models for 7- and 14-day-ahead forecasting of COVID-19 cases had mean absolute percentage error (MAPE) values of 6.6% and 8.8%, respectively. In contrast, our proposed models had improved MAPE values of 3.2% and 5.6%, respectively. For 7- and 14-day-ahead forecasting of COVID-19 deaths, the MAPE values of the base models were 4.8% and 11.4%, while the improved MAPE values of our proposed models were 4.7% and 7.8%, respectively. We found that the Google search trends for “pneumonia,” “shortness of breath,” and “fever” are the most informative search trends for predicting COVID-19 transmission. We also found that the search trends for “hypoxia” and “fever” were the most informative trends for forecasting COVID-19 mortality.
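For reference, the mean absolute percentage error used to compare the base and proposed forecasting models can be computed as in the sketch below. The function name and the sample counts are illustrative assumptions, not values from the study.

```python
def mape(actual, forecast):
    """Mean absolute percentage error, in percent.

    Assumes every actual value is non-zero, since each error is divided
    by the corresponding actual value.
    """
    ratios = [abs((a - f) / a) for a, f in zip(actual, forecast)]
    return 100.0 * sum(ratios) / len(ratios)


# Made-up daily counts, for illustration only.
actual_cases = [100, 200, 400]
forecast_cases = [110, 190, 400]
print(f"MAPE: {mape(actual_cases, forecast_cases):.1f}%")
```

Because each error is scaled by the actual value, MAPE lets case and death forecasts be compared even though their absolute magnitudes differ.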
Funding: This work is supported by the Deanship of Scientific Research at Jouf University under Grant No. (CV-33-41).
Abstract: In managing COVID-19, it is important to determine early on which patients require ICU admission, especially when medical resources are limited. Delay in ICU admission is associated with negative outcomes such as mortality and cost. Therefore, early identification of patients with a high risk of respiratory failure can prevent complications, enhance risk stratification, and improve the outcomes of severely ill hospitalized patients. In this paper, we develop a model that uses the characteristics and information collected at the time of patients’ admission and during their early period of hospitalization to accurately predict whether they will need ICU admission. We use the data explained and organized in a window-based manner by the Sírio-Libanês hospital team (published on Kaggle). Preprocessing is applied, including imputation, cleaning, and feature selection. In the cleaning process, we remove zero-variance, redundant, and/or highly correlated (measured by the Pearson correlation coefficient) features. We use Extreme Gradient Boosting (XGBoost) with early stopping as the predictor in our model. We run the experiment in four stages, starting from the features of Window 1 in Stage 1 and then incrementally adding the features of Windows 2–4 in Stages 2–4, respectively. We achieve AUCs of 0.73, 0.92, 0.95, and 0.97 in those four stages.
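The cleaning step described above, dropping zero-variance features and one member of each highly correlated pair, can be sketched as follows. The 0.95 threshold, the column names, and the keep-first policy are assumptions for illustration; the abstract does not specify them.

```python
from statistics import mean, pvariance


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def clean_features(columns, threshold=0.95):
    """Return the names of the columns to keep, in input order.

    `columns` maps a feature name to its list of values. A column is
    dropped if it has zero variance or if its absolute correlation with
    an already kept column reaches `threshold` (an assumed cut-off).
    """
    kept = []
    for name, values in columns.items():
        if pvariance(values) == 0:
            continue  # constant column carries no information
        if any(abs(pearson(values, columns[k])) >= threshold for k in kept):
            continue  # redundant with a column that was kept earlier
        kept.append(name)
    return kept


# Hypothetical columns, for illustration only.
columns = {
    "age": [30, 45, 60, 75],
    "age_months": [360, 540, 720, 900],  # perfectly correlated with age
    "constant": [1, 1, 1, 1],            # zero variance
    "heart_rate": [88, 70, 95, 72],
}
print(clean_features(columns))
```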
Funding: This work was funded by the Deanship of Scientific Research at Jouf University under Grant No. (DSR-2022-RG-0104).
Abstract: Many studies focus on using Palmprint recognition for user identification and authentication. The Palmprint is a biometric authentication factor (something you are) that remains invariable during a person’s life and needs careful protection during enrollment into different biometric authentication systems. Accuracy and irreversibility are critical requirements for securing the Palmprint template during enrollment and verification. This paper proposes an innovative HAMTE neural network model that contains a Hetero-Associative Memory for Palmprint template translation and projection using matrix multiplication and dot product multiplication. A HAMTE-Siamese network is constructed, which accepts two Palmprint templates and predicts whether they belong to the same user or to different users. The HAMTE is generated for each user during the enrollment phase and is responsible for generating a secure template for the enrolled user. The proposed network secures the person’s Palmprint template by translating it into an irreversible template (a different feature space). The secure template can be stored safely in a trusted or untrusted third-party authentication system, which protects the person’s original template from being stolen. Experiments are conducted on the CASIA database, where the proposed network achieved accuracy close to the original accuracy for the unprotected Palmprint templates. The recognition accuracy deviated by around 3%, and the equal error rate (EER) by approximately 0.02 compared to the original data, with appropriate performance (approximately 13 ms) while preserving the irreversibility property of the secure template. Moreover, the brute-force attack has been analyzed under the new Palmprint protection scheme.
Funding: This work is partially supported by the Deanship of Scientific Research at Jouf University under Grant No. (DSR-2021–02–0369).
Abstract: Face authentication is an important biometric authentication method commonly used in security applications. It is vulnerable to different types of attacks that use authorized users’ facial images and videos captured from social media to perform spoofing attacks and dynamic movements for penetrating security applications. This paper presents an innovative challenge-response emotions authentication model based on the horizontal ensemble technique. The proposed model provides a highly accurate face authentication process by challenging the authorized user with a random sequence of emotions, so that every authentication trial requires a specific response to a different sequence of emotions. The proposed model is applied to the KDEF dataset using 10-fold cross-validation. Several improvements are made to the proposed model. First, the VGG16 model is applied to the seven common emotions. Second, the system usability is enhanced by analyzing and selecting only the four common and easy-to-use emotions. Third, the horizontal ensemble technique is applied to enhance the emotion recognition accuracy and minimize the error during authentication processes. Finally, the Horizontal Ensemble Best N-Losses (HEBNL) is applied using challenge-response emotion to improve the authentication efficiency and minimize the computational power. The successive improvements implemented on the proposed model led to an improvement in accuracy from 92.1% to 99.27%.
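The challenge-response idea, issuing a fresh random sequence of emotions per trial and accepting only a matching response, can be sketched as below. The four emotion names are hypothetical (the abstract does not list which four were selected), and the recognizer that turns video frames into emotion labels is outside this sketch.

```python
import secrets

# Hypothetical set of four emotions; the abstract does not name them.
EMOTIONS = ("happy", "surprised", "angry", "neutral")


def make_challenge(length=3):
    """Draw a random emotion sequence using a cryptographic RNG."""
    return [secrets.choice(EMOTIONS) for _ in range(length)]


def verify_response(challenge, recognized):
    """Accept only if the recognized emotions match the challenge in order."""
    return len(challenge) == len(recognized) and all(
        expected == observed for expected, observed in zip(challenge, recognized)
    )


challenge = make_challenge()
print("Please show, in order:", challenge)
# `recognized` would come from an emotion classifier; here it is simulated.
recognized = list(challenge)
print("Authenticated:", verify_response(challenge, recognized))
```

Because the sequence changes on every trial, a replayed photo or video recorded earlier is unlikely to match the new challenge.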
Funding: Funded by the Deanship of Scientific Research at Jouf University under Grant No. (DSR-2021-02-0303).
Abstract: Identity management is based on the creation and management of user identities for granting access to cloud resources based on user attributes. Cloud identity and access management (IAM) grants end-users authorization to perform different actions on specified cloud resources. The authorizations in IAM are grouped into roles instead of being granted directly to end-users. Because data resides in multiple cloud locations and there is no centralized user authority for granting or denying cloud user requests, several security strategies and models are needed to overcome these issues. Another major concern in IAM services is an excessive or insufficient access level for users with previously granted authorizations. This paper presents a comprehensive review of security services and threats. Based on the presented services and threats, advanced frameworks for IAM are proposed that provide authentication mechanisms in public and private cloud platforms. A threat model has been applied to validate the proposed authentication frameworks against different security threats. The proposed models proved highly efficient in protecting cloud platforms from insider attacks, single sign-on failure, brute-force attacks, denial of service, user privacy threats, and data privacy threats.
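The role-based grouping of authorizations described above can be illustrated with a small sketch. All role, user, resource, and action names are hypothetical, and the check is a simplified model, not the API of any specific cloud provider.

```python
# Hypothetical roles, each bundling (resource, action) permissions.
ROLE_PERMISSIONS = {
    "storage_viewer": {("bucket", "read")},
    "storage_admin": {("bucket", "read"), ("bucket", "write"), ("bucket", "delete")},
}

# Hypothetical users; permissions are never attached to users directly.
USER_ROLES = {
    "alice": {"storage_admin"},
    "bob": {"storage_viewer"},
}


def is_authorized(user, resource, action):
    """Allow the request if any of the user's roles grants the permission."""
    return any(
        (resource, action) in ROLE_PERMISSIONS.get(role, set())
        for role in USER_ROLES.get(user, set())
    )


print(is_authorized("alice", "bucket", "delete"))  # alice holds storage_admin
print(is_authorized("bob", "bucket", "write"))     # storage_viewer is read-only
```

Attaching permissions to roles keeps access levels auditable: changing what a role may do updates every user who holds it, which addresses the concern about excessive or missing access.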