18 articles found
Deep Convolution Neural Networks for Image-Based Android Malware Classification
1
Authors: Amel Ksibi, Mohammed Zakariah, Latifah Almuqren, Ala Saleh Alluhaidan. Computers, Materials & Continua, 2025, Issue 3, pp. 4093-4116 (24 pages)
The analysis of Android malware shows that this threat is constantly increasing and is a real danger to mobile devices, since traditional approaches such as signature-based detection are no longer effective against its continuously advancing sophistication. To resolve this problem, efficient and flexible malware detection tools are needed. This work examines the possibility of employing deep CNNs to detect Android malware by transforming network traffic into image data representations. The dataset used in this study is CIC-AndMal2017, which contains 20,000 instances of network traffic across five distinct malware categories: Trojan, Adware, Ransomware, Spyware, and Worm. These network traffic features are converted to image formats for deep learning, which is applied in a CNN framework that includes the pre-trained VGG16 model. The approach yielded high performance, with an accuracy of 0.92, accuracy of 99.1%, precision of 98.2%, recall of 99.5%, and F1 score of 98.7%. Subsequent improvements to the classification model through changes within the VGG19 framework raised the classification rate to 99.25%. The results show that CNNs are a very effective way to classify Android malware, providing greater accuracy than conventional techniques. The success of this approach also shows the applicability of deep learning in mobile security and points toward future real-time detection systems and deeper learning techniques to counter the growing number of emerging threats.
Keywords: Android malware detection; deep convolutional neural network (DCNN); image processing; CIC-AndMal2017 dataset; exploratory data analysis; VGG16 model
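Protective Face Shield Wearing Recognition Based on an Improved VGG-16 Deep Learning Network
2
Authors: 陈威, 张皓亮, 高崇阳. 《安全、健康和环境》, 2024, Issue 4, pp. 14-20 (7 pages)
To efficiently identify whether grinding and welding workers are wearing protective face shields, an improved VGG-16 deep learning model is proposed. A deep feature extraction network based on VGG-16 is built to mine the important information in images. To address VGG-16's weakness in capturing local image features and global structural information, a spatial position-aware mechanism based on coordinate attention is established, strengthening attention to positional and channel information in the image. Finally, a classification network based on multiple fully connected layers outputs the recognition result. Experiments show that the model achieves an accuracy, precision, recall, and F1 score of 95.88%, 96.48%, 95.25%, and 95.86%, respectively, in recognizing whether grinding and welding workers are wearing face shields, outperforming traditional manual inspection.
Keywords: grinding and welding operations; protective face shield; coordinate attention mechanism; VGG-16 network; deep learning; convolutional neural network (CNN); intelligent recognition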
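The F1 score reported in this abstract can be cross-checked against the reported precision and recall, since F1 is conventionally their harmonic mean. The sketch below is only an illustrative check under that standard definition; the helper name `f1_score` is hypothetical and not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given as fractions in [0, 1])."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract (96.48% and 95.25%).
precision = 0.9648
recall = 0.9525

f1 = f1_score(precision, recall)
print(f"F1 = {f1:.4f}")  # prints "F1 = 0.9586", matching the reported 95.86%
```

The recomputed value agrees with the reported figure to two decimal places, which suggests the paper uses the usual harmonic-mean definition.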
Fruits and Vegetables Freshness Categorization Using Deep Learning (Cited by: 4)
3
Authors: Labiba Gillani Fahad, Syed Fahad Tahir, Usama Rasheed, Hafsa Saqib, Mehdi Hassan, Hani Alquhayz. Computers, Materials & Continua (SCIE, EI), 2022, Issue 6, pp. 5083-5098 (16 pages)
The nutritional value of perishable food items, such as fruits and vegetables, depends on their freshness levels. Existing approaches solve a binary problem by classifying a known fruit or vegetable class as fresh or rotten only. We propose an automated fruit and vegetable categorization approach that first recognizes the class of the object in an image and then assigns that fruit or vegetable to one of three categories: pure-fresh, medium-fresh, and rotten. Using hand-held cameras, we gathered a dataset comprising 60K images of 11 fruits and vegetables, each further divided into three freshness categories. Recognition and categorization are performed with two deep learning models, Visual Geometry Group (VGG-16) and You Only Look Once (YOLO), and their results are compared. VGG-16 classifies fruits and vegetables and categorizes their freshness, while YOLO also localizes them within the image. Furthermore, we have developed an Android-based application that takes an image of a fruit or vegetable as input and returns its class label and degree of freshness. A comprehensive experimental evaluation demonstrates that the proposed approach achieves high accuracy and F1 score on the gathered FruitVeg Freshness dataset. The dataset is publicly available for further evaluation by the research community.
Keywords: fruits and vegetables classification; degree of freshness; deep learning; object detection model; VGG-16; YOLO-v5
An Efficient Indoor Localization Based on Deep Attention Learning Model (Cited by: 1)
4
Authors: Amr Abozeid, Ahmed I. Taloba, Rasha M. Abd El-Aziz, Alhanoof Faiz Alwaghid, Mostafa Salem, Ahmed Elhadad. Computer Systems Science & Engineering (SCIE, EI), 2023, Issue 8, pp. 2637-2650 (14 pages)
Indoor localization methods can help many sectors, such as healthcare centers, smart homes, museums, warehouses, and retail malls, improve their service areas. As a result, it is crucial to look for low-cost methods that can provide exact localization in indoor locations. In this context, image-based localization methods can play an important role in estimating both the position and the orientation of cameras relative to an object. Image-based localization faces many issues, such as image scale and rotation variance, and its accuracy and speed (latency) are two critical factors. This paper proposes an efficient 6-DoF deep learning model for image-based localization. The model incorporates a channel attention module and a Scale Pyramid Module (SPM), which enhances accuracy while preserving real-time performance. In complex scenes, the channel attention module is employed to distinguish between the textures of foregrounds and backgrounds. The SPM, a feature pyramid module, deals with image scale and rotation variance. Furthermore, the proposed model employs two regressions (two fully connected layers), one for position and the other for orientation, which increases outcome accuracy. Experiments on standard indoor and outdoor datasets show that the proposed model has a significantly lower Mean Squared Error (MSE) for both position and orientation. On the indoor 7-Scenes dataset, the MSE is reduced to 0.19 m for position and 6.25° for orientation. On the outdoor Cambridge Landmarks dataset, the MSE is reduced to 0.63 m for position and 2.03° for orientation. According to the findings, the proposed approach outperforms the baseline methods.
Keywords: image-based localization; computer vision; deep learning; attention module; VGG-16
Deep Learning-Based Classification of Rotten Fruits and Identification of Shelf Life (Cited by: 2)
5
Authors: S. Sofana Reka, Ankita Bagelikar, Prakash Venugopal, V. Ravi, Harimurugan Devarajan. Computers, Materials & Continua (SCIE, EI), 2024, Issue 1, pp. 781-794 (14 pages)
The freshness of fruits is one of the essential characteristics consumers use to judge their quality, flavor, and nutritional value. The primary need for identifying rotten fruits is to ensure that only fresh, high-quality fruits are sold to consumers. Rotten fruits can foster harmful bacteria, molds, and other microorganisms that cause food poisoning and other illnesses. The overall purpose of the study is to classify rotten fruits, which can affect the taste, texture, and appearance of other fresh fruits and thereby reduce their shelf life. The agriculture and food industries are increasingly adopting computer vision technology to detect rotten fruits and forecast their shelf life. Hence, this research focuses mainly on a Convolutional Neural Network (CNN) deep learning model for classifying rotten fruits. The proposed methodology involves real-time analysis of a dataset of various types of fruits, including apples, bananas, oranges, papayas, and guavas. Machine learning models such as Gaussian Naïve Bayes (GNB) and random forest are used to predict the fruit's shelf life. The results obtained from various pre-trained models for rotten fruit detection are analysed by accuracy score to determine the best model. Compared with the other pre-trained models, Visual Geometry Group 16 (VGG16) obtained a higher accuracy score of 95%. Likewise, the random forest model delivers a better accuracy score of 88% than GNB in forecasting shelf life. With an accurate classification model, only fresh and safe fruits reach consumers, reducing the risks associated with contaminated produce. The proposed approach can therefore have a significant impact on the food industry through efficient fruit distribution and can help customers purchase fresh fruits.
Keywords: rotten fruit detection; shelf life; deep learning; convolutional neural network; machine learning; Gaussian naïve Bayes; random forest; Visual Geometry Group 16
Detecting Driver Distraction Using Deep-Learning Approach
6
Authors: Khalid A. AlShalfan, Mohammed Zakariah. Computers, Materials & Continua (SCIE, EI), 2021, Issue 7, pp. 689-704 (16 pages)
Currently, distracted driving is among the most important causes of traffic accidents. Consequently, intelligent vehicle driving systems have become increasingly important. Recently, interest has grown in driver-assistance systems that detect driver actions and help drivers drive safely. Although these studies use several distinct data types, such as the physical condition of the driver, audio and visual features, and vehicle information, the primary data source is images of the driver, including the face, arms, and hands, taken with a camera inside the car. In this study, an architecture based on a convolutional neural network (CNN) is proposed to classify and detect driver distraction. An efficient CNN with high accuracy is implemented, and to implement deep convolutional networks for large-scale image recognition, a new architecture is proposed based on the available Visual Geometry Group (VGG-16) architecture. The proposed architecture was evaluated on the StateFarm dataset for driver-distraction detection, which is publicly available on Kaggle and frequently used for this type of research. The proposed architecture achieved 96.95% accuracy.
Keywords: deep learning; driver-distraction detection; convolution neural networks; VGG-16
Performance Analysis of Intelligent Neural-Based Deep Learning System on Rank Images Classification
7
Authors: Muhammad Hameed Siddiqi, Asfandyar Khan, Muhammad Bilal Khan, Abdullah Khan, Madallah Alruwaili, Saad Alanazi. Computer Systems Science & Engineering (SCIE, EI), 2023, Issue 11, pp. 2219-2239 (21 pages)
Internet use has increased daily across the world over the last two decades. This growth has contributed to many sexual crimes, such as sexual misuse, domestic violence, and child pornography. Various studies have addressed pornographic image detection and classification. Most existing models use machine learning techniques and deep learning models that show limited accuracy, although the deep learning models used for classification and detection perform better than machine learning. Therefore, this research evaluates the performance of intelligent neural-based deep learning models based on a convolutional neural network (CNN), Visual Geometry Group (VGG-16), VGG-14, and Residual Network (ResNet-50) with an expanded dataset, trained using transfer learning applied in the fully connected layer, to classify images by rank (pornographic vs. non-pornographic). The simulation results show that VGG-16 without augmented data performed better than the other models in this study. With augmented data, the VGG-16 model reached training and validation accuracies of 0.97 and 0.94, with losses of 0.070 and 0.16; the precision, recall, and F-measure values for explicit and non-explicit images are (0.94, 0.94, 0.94) and (0.94, 0.94, 0.94). Similarly, the VGG-14 model with augmented data reached training and validation accuracies of 0.98 and 0.96, with losses of 0.059 and 0.11; the F-measure, recall, and precision values for explicit and non-explicit images are (0.98, 0.98, 0.98) and (0.98, 0.98, 0.98). The CNN model with augmented data reached training and validation accuracies of 0.776 and 0.78, with losses of 0.48 and 0.46; the F-measure, recall, and precision values for explicit and non-explicit images are (0.80, 0.80, 0.80) and (0.78, 0.79, 0.78). The ResNet-50 model with expanded data reached a training accuracy of 0.89 with a loss of 0.389 and a validation accuracy of 0.86 with a loss of 0.47; the F-measure, recall, and precision values for explicit and non-explicit images are (0.86, 0.97, 0.91) and (0.86, 0.93, 0.89). Without augmented data, the VGG-16 model reached training and validation accuracies of 0.997 and 0.986, with losses of 0.008 and 0.056; the F-measure, recall, and precision values for explicit and non-explicit images are (0.94, 0.99, 0.97) and (0.99, 0.93, 0.96), outperforming the models trained with the augmented dataset in this study.
Keywords: VGG-16; VGG-14; pornography detection; expansion; ResNet-50; convolution neural network (CNN); machine learning
Improved Siamese Palmprint Authentication Using Pre-Trained VGG16-Palmprint and Element-Wise Absolute Difference
8
Authors: Mohamed Ezz, Waad Alanazi, Ayman Mohamed Mostafa, Eslam Hamouda, Murtada K. Elbashir, Meshrif Alruily. Computer Systems Science & Engineering (SCIE, EI), 2023, Issue 8, pp. 2299-2317 (19 pages)
Palmprint identification has been used in many biometric systems over the last two decades. High-dimensional data with many uncorrelated and duplicated features remains difficult to handle because of computational complexity. This paper presents an interactive authentication approach based on deep learning and feature selection that supports palmprint authentication. The proposed model has two stages of learning. The first stage transfers a VGG-16 model pre-trained on ImageNet to palmprint-specific features, producing the extraction model. The second stage uses the VGG-16 Palmprint feature extractor in a Siamese network to learn palmprint similarity. The proposed model achieves robust and reliable end-to-end palmprint authentication by extracting convolutional features with VGG-16 Palmprint and measuring the similarity of two input palmprints with the Siamese network. The second stage uses the CASIA dataset to train and test the Siamese network. The suggested model outperforms comparable deep learning studies, achieving accuracy and EER of 91.8% and 0.082%, respectively, on the CASIA left-hand images and accuracy and EER of 91.7% and 0.084, respectively, on the CASIA right-hand images.
Keywords: palmprint authentication; transfer learning; feature extraction; classification; VGG-16 and Siamese network
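The title of this entry names an element-wise absolute difference as the way the two Siamese branches are compared. The sketch below illustrates that operation on toy vectors only; the scoring head (`match_probability`, its `weights`, and its `bias`) is a hypothetical stand-in and is not taken from the paper, which does not describe its head in the abstract.

```python
import math


def abs_difference(a, b):
    """Element-wise absolute difference |a_i - b_i| of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return [abs(x - y) for x, y in zip(a, b)]


def match_probability(a, b, weights, bias):
    """Hypothetical scoring head: weighted sum of the difference vector, then a sigmoid."""
    diff = abs_difference(a, b)
    logit = sum(w * d for w, d in zip(weights, diff)) + bias
    return 1.0 / (1.0 + math.exp(-logit))


# Toy 4-dimensional vectors standing in for features from the two Siamese branches.
left = [0.2, 0.9, 0.1, 0.5]
right = [0.1, 0.7, 0.4, 0.5]
print(abs_difference(left, right))
```

Because the difference vector is symmetric in its two inputs, swapping the two palmprints gives the same score, which is one plausible reason to prefer it over simple concatenation.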
Research on Vector Road Data Matching Method Based on Deep Learning
9
Authors: Lin Zhao, Yanru Liu, Yuefeng Lu, Ying Sun, Jing Li, Kaizhong Yao. Journal of Applied Mathematics and Physics, 2023, Issue 1, pp. 303-315 (13 pages)
Most existing vector data matching methods rely on traditional geometric and attribute features. However, many of the similarity indicators are not suitable for cross-scale data, which reduces accuracy in identifying objects. To solve this problem, a deep learning model for vector road data matching is proposed based on a Siamese neural network and the VGG16 convolutional neural network, and matching experiments are carried out. Experimental results show that the proposed model can achieve an accuracy of more than 90% under certain data support and threshold conditions.
Keywords: deep learning; vector matching; similarity; VGG16; Siamese network
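A Rapid Diagnosis Method for Bearing Vibration Faults in Centrifugal Chemical Pumps
10
Authors: 翟法军, 张元元. 《机械与电子》, 2025, Issue 2, pp. 40-44 (5 pages)
To ensure the normal operation of centrifugal chemical pump bearings, a rapid diagnosis method for bearing vibration faults in centrifugal chemical pumps is proposed. Variational mode decomposition and singular value decomposition are combined to obtain time-frequency features of the bearing vibration signal; these are integrated with the signal's time-domain and frequency-domain features to build a multi-domain feature set, so that signal features are captured and analyzed comprehensively. Labeled datasets from various specifications and operating conditions serve as multiple source domains, and labeled data from other specifications and operating conditions serve as the target domain. Knowledge from the multiple source domains is transferred to a VGG-16 deep network to obtain multiple target-domain models, and the representative fault features they extract are fed in turn into the same extreme learning machine for feature fusion. Based on the classification results output by the extreme learning machine, a rapid diagnosis model for bearing vibration faults in centrifugal chemical pumps is built. Experimental results show that the proposed method achieves high accuracy in rapid diagnosis of bearing vibration faults.
Keywords: centrifugal; chemical pump bearing; vibration fault; fault diagnosis; VGG-16 deep network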
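An FD-means Clustering Cleaning Algorithm for Near-Duplicate Videos (Cited by: 2)
11
Authors: 付燕, 韩泽, 叶鸥. 《计算机工程与应用》 (CSCD, PKU Core), 2022, Issue 1, pp. 197-203 (7 pages)
In recent years, as the scale of video data has grown, near-duplicate videos have proliferated and video data quality problems have become increasingly prominent. Near-duplicate video cleaning helps improve the data quality of video collections. However, little research currently addresses near-duplicate video cleaning; most work focuses on near-duplicate video retrieval. Existing methods can identify near-duplicate videos effectively, but they struggle to clean near-duplicate video data automatically while preserving data integrity, which limits their ability to improve video data quality. To address this, a near-duplicate video cleaning method that combines the VGG-16 deep network with FD-means (feature distance-means) clustering is proposed. The method uses the MOG2 model and a median filter to segment the background and denoise the foreground of each video, extracts deep spatial features with the VGG-16 deep network model, and builds a new FD-means clustering model that updates the cluster center points using the near-duplicate video clusters produced by iteration and finally deletes the near-duplicate videos in each cluster other than the center point. Experimental results show that the method effectively solves the near-duplicate video cleaning problem and improves video data quality.
Keywords: video data quality; near-duplicate video; video cleaning; VGG-16 deep network; FD-means clustering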
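Design of a Crowd Counting Model Based on a Lightweight Neural Network (Cited by: 2)
12
Authors: 平嘉蓉, 张正华, 沈逸, 陈豪, 刘源, 杨意, 尤倩, 苏权. 《无线电工程》, 2020, Issue 6, pp. 442-446 (5 pages)
Traditional convolutional neural networks applied to crowd counting have many parameters and high computational cost, making them hard to run on lightweight platforms. To address this, a crowd counting model based on a lightweight neural network is proposed. Guided by crowd feature extraction, the model redeploys the VGG-16 network. Training is done on a GPU; in a containerized development environment, deep learning methods are used for compression, quantization, and encoding to generate a lightweight neural network and improve resource utilization. The lightweight network model is deployed on an FPGA to perform hardware-software co-inference. The system was validated on the Mall Dataset. Experimental results show that after lightweighting the system reaches a mean squared error of 18.4, and the energy efficiency ratio rises from 0.35 on a PC to 1.13 on the FPGA, achieving both accuracy and low power consumption for the lightweight neural network.
Keywords: crowd counting; VGG-16; lightweight neural network; deep learning; field-programmable gate array (FPGA)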
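Research on Vehicle Detection in Aerial Images Based on Transfer Learning (Cited by: 6)
13
Authors: 袁功霖, 尹奎英, 李绮雪. 《电子测量技术》, 2018, Issue 22, pp. 77-81 (5 pages)
To effectively recognize vehicle targets in aerial images, transfer learning is applied to training the Faster-RCNN model: a network trained on a large-scale dataset is used to initialize the model parameters, reducing training time and improving recognition accuracy. For two classic network models, ZF and VGG-16, several groups of comparative experiments were run with different hyperparameters to select the optimal ones, and the detection performance of the two models was compared. Experimental results show that the method effectively detects vehicle targets in aerial image sets, outperforms traditional machine learning methods, and is fast enough for real-time detection, giving it application value in military reconnaissance and traffic control.
Keywords: vehicle detection; deep learning; convolutional neural network; Faster-RCNN algorithm; transfer learning; ZF model; VGG-16 model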
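A Power Transformer Image Recognition System Based on Deep Learning (Cited by: 7)
14
Authors: 薛阳, 吴海东, 俞志程, 张宁, 叶晓康, 华茜. 《上海电力大学学报》 (CAS), 2021, Issue 1, pp. 51-56 (6 pages)
Power transformers come in many models and their images are complex, and traditional machine learning methods based on hand-designed features cannot accurately classify large-scale transformer image sets. To address these problems, a deep learning-based transformer image recognition system is proposed that learns end to end directly from raw images. To classify transformer images accurately, a transformer image recognition model based on an improved VGG-16 convolutional neural network is proposed. Building on the VGG-16 model, the fully connected layers are rebuilt and the original SoftMax classifier is replaced with a 3-label SoftMax classifier to optimize the network structure; the weights of the VGG-16 convolutional and downsampling layers are shared through transfer learning. Training and test sets of transformer images were built to train the improved model and test its performance. The results show that, compared with deep neural network and convolutional neural network models, the improved VGG-16 model performs better, with a recognition error of 9.17%, and accurately distinguishes the three types of transformer.
Keywords: deep learning; power transformer; image recognition; transfer learning; improved VGG-16 network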
Ultra reliability and massive connectivity provision in integrated internet of military things (IoMT) based on tactical datalink (Cited by: 1)
15
Authors: Li Bing, Yating Gu, Lanke Hu, Li Bowen, Yang Lihua, Jue Wang, Yue Yin. Defence Technology (防务技术) (SCIE, EI, CAS, CSCD), 2024, Issue 3, pp. 386-398 (13 pages)
One of the major challenges in the internet of military things (IoMT) is accommodating massive connectivity while guaranteeing quality of service (QoS) in terms of ultra-high reliability. In this regard, this paper presents a class of code-domain nonorthogonal multiple access (NOMA) schemes for uplink ultra-reliable networking of massive IoMT based on tactical datalinks such as Link-16 and the joint tactical information distribution system (JTIDS). In the considered scenario, a satellite equipped with Nr antennas serves K devices, including vehicles, drones, ships, sensors, and handset radios. Nonorthogonal coded modulation, a special form of multiple-input multiple-output (MIMO) NOMA, is proposed. The discussion starts by evaluating the output signal-to-interference-plus-noise ratio (SINR) of the receiver filter, which leads to a closed-form expression for overloaded systems, where the number of users is significantly larger than the number of devices admitted, so that massive connectivity is provided. The expression allows the development of simple yet successful interference suppression based on power allocation and phase shaping techniques that maximize the sum rate, since the problem can be proved equivalent to fixed-point programming. The proposed design is exemplified by nonlinear modulation schemes such as minimum shift keying (MSK) and Gaussian MSK (GMSK), two pivotal modulation formats in IoMT standards such as Link-16 and JTIDS. Numerical results show near-capacity performance. This performance is obtained using simple forward error correction (FEC) codes with a higher coding rate than existing schemes, while the transmit power is reduced by 6 dB. The proposed design has wide applications not only in IoMT but also in deep space communications, where ultra reliability and massive connectivity are key concerns.
Keywords: satellite network; deep space communications; internet of military things; non-orthogonal multiple access; MIMO; Link-16; JTIDS
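PSWGAN-GP: A Generative Adversarial Network with Improved Gradient Penalty
16
Authors: 陈云翔, 王巍, 宁娟, 陈怡丹, 赵永新, 周庆华. 《计算机与现代化》, 2022, Issue 4, pp. 21-26 (6 pages)
The emergence of generative adversarial networks (GANs) has greatly advanced research on the shortage of sample data in deep learning. To address detail-quality problems in GAN-generated images, such as blurred contours and separation of foreground and background, a Wasserstein GAN algorithm with improved gradient penalty (PSWGAN-GP) is proposed. Building on the Wasserstein distance loss and gradient penalty of WGAN-GP, the algorithm uses, in the discriminator, features extracted from three pooling layers of the VGG-16 network, and from these features computes a style loss and a perceptual loss that serve as penalty terms on the original loss. This strengthens the discriminator's ability to capture and discriminate deep features and corrects and improves the details of generated images. Experimental results show that, with identical generator and discriminator architectures and identical hyperparameters, PSWGAN-GP improves IS and FID scores relative to the other image generation algorithms compared and effectively improves the detail quality of generated images.
Keywords: deep learning; Wasserstein generative adversarial network with gradient penalty; VGG-16 network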
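The loss structure described in this abstract can be written out roughly as follows. The first line is the standard WGAN-GP critic loss; the second line adds style and perceptual terms as penalties. The weights \alpha and \beta, the layer set \mathcal{S} (the three VGG-16 pooling layers), and the Gram-matrix form of the style term are assumptions based on common definitions of these losses, not details confirmed by the abstract.

```latex
\begin{aligned}
L_{\text{WGAN-GP}} &= \mathbb{E}_{\tilde{x}\sim P_g}\big[D(\tilde{x})\big]
  - \mathbb{E}_{x\sim P_r}\big[D(x)\big]
  + \lambda\,\mathbb{E}_{\hat{x}\sim P_{\hat{x}}}\Big[\big(\lVert\nabla_{\hat{x}} D(\hat{x})\rVert_2 - 1\big)^2\Big] \\
L_{D} &= L_{\text{WGAN-GP}} + \alpha\,L_{\text{style}} + \beta\,L_{\text{perc}} \\
L_{\text{perc}} &= \sum_{l\in\mathcal{S}} \big\lVert \phi_l(x) - \phi_l(\tilde{x}) \big\rVert_2^2, \qquad
L_{\text{style}} = \sum_{l\in\mathcal{S}} \big\lVert G\big(\phi_l(x)\big) - G\big(\phi_l(\tilde{x})\big) \big\rVert_F^2
\end{aligned}
```

Here \phi_l denotes the VGG-16 feature map at layer l, G(\cdot) its Gram matrix, x a real image, \tilde{x} a generated image, and \hat{x} a random interpolation between the two.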
Contemporary Study for Detection of COVID-19 Using Machine Learning with Explainable AI
17
Authors: Saad Akbar, Humera Azam, Sulaiman Sulmi Almutairi, Omar Alqahtani, Habib Shah, Aliya Aleryani. Computers, Materials & Continua (SCIE, EI), 2024, Issue 7, pp. 1075-1104 (30 pages)
The rapid spread of COVID-19 has emphasized the necessity for effective and precise diagnostic tools. This article proposes an approach that is hybrid in both datasets and methodology, using a previously unexplored dataset obtained from a private hospital to detect COVID-19, pneumonia, and normal conditions in chest X-ray images (CXIs), coupled with Explainable Artificial Intelligence (XAI). The study uses little preprocessing together with pre-trained cutting-edge models such as InceptionV3, VGG16, and VGG19, which excel at feature extraction. The methodology is further enhanced by the t-SNE (t-Distributed Stochastic Neighbor Embedding) technique for visualizing the extracted image features and by Contrast Limited Adaptive Histogram Equalization (CLAHE) to improve images before feature extraction. Additionally, an attention mechanism is used to clarify how the model makes decisions, which builds trust in artificial intelligence (AI) systems. To evaluate the effectiveness of the proposed approach, both benchmark datasets and a private dataset obtained with permission from Jinnah Postgraduate Medical Center (JPMC) in Karachi, Pakistan, are used. In 12 experiments, VGG19 showed remarkable performance in the hybrid dataset approach, achieving 100% accuracy in COVID-19 vs. pneumonia classification and 97% in distinguishing normal cases. Overall, across all classes, the approach achieved 98% accuracy, demonstrating its efficiency in detecting COVID-19 and differentiating it from other chest conditions (pneumonia and healthy) while also providing insight into the models' decision-making process.
Keywords: COVID-19 detection; deep neural networks; support vector machine; CXIs; InceptionV3; VGG16; VGG19; t-SNE embedding; CLAHE; attention mechanism; XAI
Crowd Density Estimation Based on Multi-scale Feature Fusion and Information Enhancement
18
Authors: Lina Zou. IJLAI Transactions on Science and Engineering, 2025, Issue 3, pp. 1-11 (11 pages)
To address problems such as diverse target scales and large scale changes in dense crowd scenes, a crowd density estimation method based on multi-scale feature fusion and information enhancement is proposed. First, because small-scale targets account for a relatively large proportion of the image, a dilated convolution module is added to the VGG-16 network to mine detailed image information. Second, to make full use of the target's multi-scale information, a new context-aware module is constructed to extract contrast features between different scales. Finally, given that the target scale changes continuously, a multi-scale feature aggregation module is designed to widen the sampling range of dense scales and strengthen multi-scale information interaction, thereby improving network performance. Experiments on public datasets show that the proposed method estimates crowd density effectively compared with other advanced methods.
Keywords: crowd density estimation; multi-scale feature fusion; information enhancement; VGG-16 network
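The dilated convolution mentioned in this abstract widens a filter's receptive field without adding weights: a kernel of size k with dilation d spans (k - 1) * d + 1 input positions. The following is a minimal one-dimensional sketch of that idea in plain Python; the function name and toy data are illustrative assumptions and do not reproduce the paper's module.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """'Valid' 1-D dilated convolution (cross-correlation, as in CNN layers)."""
    if dilation < 1:
        raise ValueError("dilation must be >= 1")
    # Number of input positions covered by one application of the kernel.
    span = (len(kernel) - 1) * dilation + 1
    return [
        sum(k * signal[start + i * dilation] for i, k in enumerate(kernel))
        for start in range(len(signal) - span + 1)
    ]


signal = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 0, -1]

print(dilated_conv1d(signal, kernel, dilation=1))  # [-2, -2, -2, -2, -2]
print(dilated_conv1d(signal, kernel, dilation=2))  # [-4, -4, -4]
```

With the same three weights, dilation 2 compares samples four positions apart instead of two, which is the sense in which dilation captures wider context at no extra parameter cost.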