期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Performance vs.Complexity Comparative Analysis of Multimodal Bilinear Pooling Fusion Approaches for Deep Learning-Based Visual Arabic-Question Answering Systems
1
作者 Sarah M.Kamel Mai A.Fadel +1 位作者 lamiaa elrefaei Shimaa I.Hassan 《Computer Modeling in Engineering & Sciences》 2025年第4期373-411,共39页
Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate... Visual question answering(VQA)is a multimodal task,involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate answer.In this paper,we propose a VQA system intended to answer yes/no questions about real-world images,in Arabic.To support a robust VQA system,we work in two directions:(1)Using deep neural networks to semantically represent the given image and question in a fine-grainedmanner,namely ResNet-152 and Gated Recurrent Units(GRU).(2)Studying the role of the utilizedmultimodal bilinear pooling fusion technique in the trade-o.between the model complexity and the overall model performance.Some fusion techniques could significantly increase the model complexity,which seriously limits their applicability for VQA models.So far,there is no evidence of how efficient these multimodal bilinear pooling fusion techniques are for VQA systems dedicated to yes/no questions.Hence,a comparative analysis is conducted between eight bilinear pooling fusion techniques,in terms of their ability to reduce themodel complexity and improve themodel performance in this case of VQA systems.Experiments indicate that these multimodal bilinear pooling fusion techniques have improved the VQA model’s performance,until reaching the best performance of 89.25%.Further,experiments have proven that the number of answers in the developed VQA system is a critical factor that a.ects the effectiveness of these multimodal bilinear pooling techniques in achieving their main objective of reducing the model complexity.The Multimodal Local Perception Bilinear Pooling(MLPB)technique has shown the best balance between the model complexity and its performance,for VQA systems designed to answer yes/no questions. 展开更多
关键词 Arabic-VQA deep learning-based VQA deep multimodal information fusion multimodal representation learning VQA of yes/no questions VQA model complexity VQA model performance performance-complexity trade-off
在线阅读 下载PDF
Evaluation of Deep Learning Models for Person Authentication Based on Touch Gesture
2
作者 Asrar Bajaber Mai Fadel lamiaa elrefaei 《Computer Systems Science & Engineering》 SCIE EI 2022年第8期465-481,共17页
Touch gesture biometrics authentication system is the study of user's touching behavior on his touch device to identify him.The features traditionally used in touch gesture authentication systems are extracted usi... Touch gesture biometrics authentication system is the study of user's touching behavior on his touch device to identify him.The features traditionally used in touch gesture authentication systems are extracted using hand-crafted feature extraction approach.In this work,we investigate the ability of Deep Learning(DL)to automatically discover useful features of touch gesture and use them to authenticate the user.Four different models are investigated Long-Short Term Memory(LSTM),Gated Recurrent Unit(GRU),Convolutional Neural Network(CNN)combined with LSTM(CNN-LSTM),and CNN combined with GRU(CNN-GRU).In addition,different regularization techniques are investigated such as Activity Regularizer,Batch Normalization(BN),Dropout,and LeakyReLU.These deep networks were trained from scratch and tested using TouchAlytics and BioIdent datasets for dynamic touch authentication.The result reported in terms of authentication accuracy,False Acceptance Rate(FAR),False Rejection Rate(FRR).The best result we have been obtained was 96.73%,96.07%and 96.08%for training,validation and testing accuracy respectively with dynamic touch authentication system on TouchAlytics dataset with CNN-GRU DL model,while the best result of FAR and FRR obtained on TouchAlytics dataset was with CNN-LSTM were FAR was 0.0009 and FRR was 0.0530.For BioIdent dataset the best results have been obtained was 84.87%,78.28%and 78.35%for Training,validation and testing accuracy respectively with CNN-LSTM model.The use of a learning based approach in touch authentication system has shown good results comparing with other state-of-the-art using TouchAlytics dataset. 展开更多
关键词 Touch authentication system touch gestures behavioral biometric deep learning classification CNN RNN LSTM
在线阅读 下载PDF
Design Principles-Based Interactive Learning Tool for Solving Nonlinear Equations
3
作者 Ahad Alloqmani Omimah Alsaedi +2 位作者 Nadia Bahatheg Reem Alnanih lamiaa elrefaei 《Computer Systems Science & Engineering》 SCIE EI 2022年第3期1023-1042,共20页
Interactive learning tools can facilitate the learning process and increase student engagement,especially tools such as computer programs that are designed for human-computer interaction.Thus,this paper aims to help s... Interactive learning tools can facilitate the learning process and increase student engagement,especially tools such as computer programs that are designed for human-computer interaction.Thus,this paper aims to help students learn five different methods for solving nonlinear equations using an interactive learning tool designed with common principles such as feedback,visibility,affordance,consistency,and constraints.It also compares these methods by the number of iterations and time required to display the result.This study helps students learn these methods using interactive learning tools instead of relying on traditional teaching methods.The tool is implemented using the MATLAB app and is evaluated through usability testing with two groups of users that are categorized by their level of experience with root-finding.Users with no knowledge in root-finding confirmed that they understood the root-finding concept when interacting with the designed tool.The positive results of the user evaluation showed that the tool can be recommended to other users. 展开更多
关键词 Graphical user interface(GUI) interactive learning tool design principles nonlinear equations experimental design
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部