Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate...Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate answer.In this paper,we propose a VQA system intended to answer yes/no questions about real-world images,in Arabic.To support a robust VQA system,we work in two directions:(1)Using deep neural networks to semantically represent the given image and question in a fine-grainedmanner,namely ResNet-152 and Gated Recurrent Units(GRU).(2)Studying the role of the utilizedmultimodal bilinear pooling fusion technique in the trade-o.between the model complexity and the overall model performance.Some fusion techniques could significantly increase the model complexity,which seriously limits their applicability for VQA models.So far,there is no evidence of how efficient these multimodal bilinear pooling fusion techniques are for VQA systems dedicated to yes/no questions.Hence,a comparative analysis is conducted between eight bilinear pooling fusion techniques,in terms of their ability to reduce themodel complexity and improve themodel performance in this case of VQA systems.Experiments indicate that these multimodal bilinear pooling fusion techniques have improved the VQA model’s performance,until reaching the best performance of 89.25%.Further,experiments have proven that the number of answers in the developed VQA system is a critical factor that a.ects the effectiveness of these multimodal bilinear pooling techniques in achieving their main objective of reducing the model complexity.The Multimodal Local Perception Bilinear Pooling(MLPB)technique has shown the best balance between the model complexity and its performance,for VQA systems designed to answer yes/no questions.展开更多
Touch gesture biometrics authentication system is the study of user's touching behavior on his touch device to identify him.The features traditionally used in touch gesture authentication systems are extracted usi...Touch gesture biometrics authentication system is the study of user's touching behavior on his touch device to identify him.The features traditionally used in touch gesture authentication systems are extracted using hand-crafted feature extraction approach.In this work,we investigate the ability of Deep Learning(DL)to automatically discover useful features of touch gesture and use them to authenticate the user.Four different models are investigated Long-Short Term Memory(LSTM),Gated Recurrent Unit(GRU),Convolutional Neural Network(CNN)combined with LSTM(CNN-LSTM),and CNN combined with GRU(CNN-GRU).In addition,different regularization techniques are investigated such as Activity Regularizer,Batch Normalization(BN),Dropout,and LeakyReLU.These deep networks were trained from scratch and tested using TouchAlytics and BioIdent datasets for dynamic touch authentication.The result reported in terms of authentication accuracy,False Acceptance Rate(FAR),False Rejection Rate(FRR).The best result we have been obtained was 96.73%,96.07%and 96.08%for training,validation and testing accuracy respectively with dynamic touch authentication system on TouchAlytics dataset with CNN-GRU DL model,while the best result of FAR and FRR obtained on TouchAlytics dataset was with CNN-LSTM were FAR was 0.0009 and FRR was 0.0530.For BioIdent dataset the best results have been obtained was 84.87%,78.28%and 78.35%for Training,validation and testing accuracy respectively with CNN-LSTM model.The use of a learning based approach in touch authentication system has shown good results comparing with other state-of-the-art using TouchAlytics dataset.展开更多
Interactive learning tools can facilitate the learning process and increase student engagement,especially tools such as computer programs that are designed for human-computer interaction.Thus,this paper aims to help s...Interactive learning tools can facilitate the learning process and increase student engagement,especially tools such as computer programs that are designed for human-computer interaction.Thus,this paper aims to help students learn five different methods for solving nonlinear equations using an interactive learning tool designed with common principles such as feedback,visibility,affordance,consistency,and constraints.It also compares these methods by the number of iterations and time required to display the result.This study helps students learn these methods using interactive learning tools instead of relying on traditional teaching methods.The tool is implemented using the MATLAB app and is evaluated through usability testing with two groups of users that are categorized by their level of experience with root-finding.Users with no knowledge in root-finding confirmed that they understood the root-finding concept when interacting with the designed tool.The positive results of the user evaluation showed that the tool can be recommended to other users.展开更多
文摘Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate answer.In this paper,we propose a VQA system intended to answer yes/no questions about real-world images,in Arabic.To support a robust VQA system,we work in two directions:(1)Using deep neural networks to semantically represent the given image and question in a fine-grainedmanner,namely ResNet-152 and Gated Recurrent Units(GRU).(2)Studying the role of the utilizedmultimodal bilinear pooling fusion technique in the trade-o.between the model complexity and the overall model performance.Some fusion techniques could significantly increase the model complexity,which seriously limits their applicability for VQA models.So far,there is no evidence of how efficient these multimodal bilinear pooling fusion techniques are for VQA systems dedicated to yes/no questions.Hence,a comparative analysis is conducted between eight bilinear pooling fusion techniques,in terms of their ability to reduce themodel complexity and improve themodel performance in this case of VQA systems.Experiments indicate that these multimodal bilinear pooling fusion techniques have improved the VQA model’s performance,until reaching the best performance of 89.25%.Further,experiments have proven that the number of answers in the developed VQA system is a critical factor that a.ects the effectiveness of these multimodal bilinear pooling techniques in achieving their main objective of reducing the model complexity.The Multimodal Local Perception Bilinear Pooling(MLPB)technique has shown the best balance between the model complexity and its performance,for VQA systems designed to answer yes/no questions.
文摘Touch gesture biometrics authentication system is the study of user's touching behavior on his touch device to identify him.The features traditionally used in touch gesture authentication systems are extracted using hand-crafted feature extraction approach.In this work,we investigate the ability of Deep Learning(DL)to automatically discover useful features of touch gesture and use them to authenticate the user.Four different models are investigated Long-Short Term Memory(LSTM),Gated Recurrent Unit(GRU),Convolutional Neural Network(CNN)combined with LSTM(CNN-LSTM),and CNN combined with GRU(CNN-GRU).In addition,different regularization techniques are investigated such as Activity Regularizer,Batch Normalization(BN),Dropout,and LeakyReLU.These deep networks were trained from scratch and tested using TouchAlytics and BioIdent datasets for dynamic touch authentication.The result reported in terms of authentication accuracy,False Acceptance Rate(FAR),False Rejection Rate(FRR).The best result we have been obtained was 96.73%,96.07%and 96.08%for training,validation and testing accuracy respectively with dynamic touch authentication system on TouchAlytics dataset with CNN-GRU DL model,while the best result of FAR and FRR obtained on TouchAlytics dataset was with CNN-LSTM were FAR was 0.0009 and FRR was 0.0530.For BioIdent dataset the best results have been obtained was 84.87%,78.28%and 78.35%for Training,validation and testing accuracy respectively with CNN-LSTM model.The use of a learning based approach in touch authentication system has shown good results comparing with other state-of-the-art using TouchAlytics dataset.
文摘Interactive learning tools can facilitate the learning process and increase student engagement,especially tools such as computer programs that are designed for human-computer interaction.Thus,this paper aims to help students learn five different methods for solving nonlinear equations using an interactive learning tool designed with common principles such as feedback,visibility,affordance,consistency,and constraints.It also compares these methods by the number of iterations and time required to display the result.This study helps students learn these methods using interactive learning tools instead of relying on traditional teaching methods.The tool is implemented using the MATLAB app and is evaluated through usability testing with two groups of users that are categorized by their level of experience with root-finding.Users with no knowledge in root-finding confirmed that they understood the root-finding concept when interacting with the designed tool.The positive results of the user evaluation showed that the tool can be recommended to other users.