In order to accomplish the task of object recognition in natural scenes,a new object recognition algorithm based on an improved convolutional neural network(CNN)is proposed.First,candidate object windows are extracted...In order to accomplish the task of object recognition in natural scenes,a new object recognition algorithm based on an improved convolutional neural network(CNN)is proposed.First,candidate object windows are extracted from the original image.Then,candidate object windows are input into the improved CNN model to obtain deep features.Finally,the deep features are input into the Softmax and the confidence scores of classes are obtained.The candidate object window with the highest confidence score is selected as the object recognition result.Based on AlexNet,Inception V1 is introduced into the improved CNN and the fully connected layer is replaced by the average pooling layer,which widens the network and deepens the network at the same time.Experimental results show that the improved object recognition algorithm can obtain better recognition results in multiple natural scene images,and has a higher degree of accuracy than the classical algorithms in the field of object recognition.展开更多
Plant species recognition is an important research area in image recognition in recent years.However,the existing plant species recognition methods have low recognition accuracy and do not meet professional requiremen...Plant species recognition is an important research area in image recognition in recent years.However,the existing plant species recognition methods have low recognition accuracy and do not meet professional requirements in terms of recognition accuracy.Therefore,ShuffleNetV2 was improved by combining the current hot concern mechanism,convolution kernel size adjustment,convolution tailoring,and CSP technology to improve the accuracy and reduce the amount of computation in this study.Six convolutional neural network models with sufficient trainable parameters were designed for differentiation learning.The SGD algorithm is used to optimize the training process to avoid overfitting or falling into the local optimum.In this paper,a conventional plant image dataset TJAU10 collected by cell phones in a natural context was constructed,containing 3000 images of 10 plant species on the campus of Tianjin Agricultural University.Finally,the improved model is compared with the baseline version of the model,which achieves better results in terms of improving accuracy and reducing the computational effort.The recognition accuracy tested on the TJAU10 dataset reaches up to 98.3%,and the recognition precision reaches up to 93.6%,which is 5.1%better than the original model and reduces the computational effort by about 31%compared with the original model.In addition,the experimental results were evaluated using metrics such as the confusion matrix,which can meet the requirements of professionals for the accurate identification of plant species.展开更多
The data generated from non-Euclidean domains and its graphical representation(with complex-relationship object interdependence)applications has observed an exponential growth.The sophistication of graph data has pose...The data generated from non-Euclidean domains and its graphical representation(with complex-relationship object interdependence)applications has observed an exponential growth.The sophistication of graph data has posed consequential obstacles to the existing machine learning algorithms.In this study,we have considered a revamped version of a semi-supervised learning algorithm for graph-structured data to address the issue of expanding deep learning approaches to represent the graph data.Additionally,the quantum information theory has been applied through Graph Neural Networks(GNNs)to generate Riemannian metrics in closed-form of several graph layers.In further,to pre-process the adjacency matrix of graphs,a new formulation is established to incorporate high order proximities.The proposed scheme has shown outstanding improvements to overcome the deficiencies in Graph Convolutional Network(GCN),particularly,the information loss and imprecise information representation with acceptable computational overhead.Moreover,the proposed Quantum Graph Convolutional Network(QGCN)has significantly strengthened the GCN on semi-supervised node classification tasks.In parallel,it expands the generalization process with a significant difference by making small random perturbationsG of the graph during the training process.The evaluation results are provided on three benchmark datasets,including Citeseer,Cora,and PubMed,that distinctly delineate the superiority of the proposed model in terms of computational accuracy against state-of-the-art GCN and three other methods based on the same algorithms in the existing literature.展开更多
Different devices in the recent era generated a vast amount of digital video.Generally,it has been seen in recent years that people are forging the video to use it as proof of evidence in the court of justice.Many kin...Different devices in the recent era generated a vast amount of digital video.Generally,it has been seen in recent years that people are forging the video to use it as proof of evidence in the court of justice.Many kinds of researches on forensic detection have been presented,and it provides less accuracy.This paper proposed a novel forgery detection technique in image frames of the videos using enhanced Convolutional Neural Network(CNN).In the initial stage,the input video is taken as of the dataset and then converts the videos into image frames.Next,perform pre-sampling using the Adaptive Rood Pattern Search(ARPS)algorithm intended for reducing the useless frames.In the next stage,perform preprocessing for enhancing the image frames.Then,face detection is done as of the image utilizing the Viola-Jones algorithm.Finally,the improved Crow Search Algorithm(ICSA)has been used to select the extorted features and inputted to the Enhanced Convolutional Neural Network(ECNN)classifier for detecting the forged image frames.The experimental outcome of the proposed system has achieved 97.21%accuracy compared to other existing methods.展开更多
There is a problem of real-time detection difficulty in road surface damage detection. This paper proposes an improved lightweight model based on you only look once version 5(YOLOv5). Firstly, this paper fully utilize...There is a problem of real-time detection difficulty in road surface damage detection. This paper proposes an improved lightweight model based on you only look once version 5(YOLOv5). Firstly, this paper fully utilized the convolutional neural network(CNN) + ghosting bottleneck(G_bneck) architecture to reduce redundant feature maps. Afterwards, we upgraded the original upsampling algorithm to content-aware reassembly of features(CARAFE) and increased the receptive field. Finally, we replaced the spatial pyramid pooling fast(SPPF) module with the basic receptive field block(Basic RFB) pooling module and added dilated convolution. After comparative experiments, we can see that the number of parameters and model size of the improved algorithm in this paper have been reduced by nearly half compared to the YOLOv5s. The frame rate per second(FPS) has been increased by 3.25 times. The mean average precision(m AP@0.5: 0.95) has increased by 8%—17% compared to other lightweight algorithms.展开更多
Accurate ultra-short-term photovoltaic(PV)power forecasting is crucial for mitigating variations caused by PV power generation and ensuring the stable and efficient operation of power grids.To capture intricate tempor...Accurate ultra-short-term photovoltaic(PV)power forecasting is crucial for mitigating variations caused by PV power generation and ensuring the stable and efficient operation of power grids.To capture intricate temporal relationships and enhance the precision of multi-step time forecast,this paper introduces an innovative approach for ultra-short-term photovoltaic(PV)power prediction,leveraging an enhanced Temporal Convolutional Neural Network(TCN)architecture and feature modeling.First,this study introduces a method employing the Spearman coefficient for meteorological feature filtration.Integrated with three-dimensional PV panel modeling,key factors influencing PV power generation are identified and prioritized.Second,the analysis of the correlation coefficient between astronomical features and PV power prediction demonstrates the theoretical substantiation for the practicality and essentiality of incorporating astronomical features.Third,an enhanced TCN model is introduced,augmenting the original TCN structure with a projection head layer to enhance its capacity for learning and expressing nonlinear features.Meanwhile,a new rolling timing network mechanism is constructed to guarantee the segmentation prediction of future long-time output sequences.Multiple experiments demonstrate the superior performance of the proposed forecasting method compared to existing models.The accuracy of PV power prediction in the next 4 hours,devoid of meteorological conditions,increases by 20.5%.Furthermore,incorporating shortwave radiation for predictions over 4 hours,2 hours,and 1 hour enhances accuracy by 11.1%,9.1%,and 8.8%,respectively.展开更多
Human Action Recognition(HAR)in uncontrolled environments targets to recognition of different actions froma video.An effective HAR model can be employed for an application like human-computer interaction,health care,p...Human Action Recognition(HAR)in uncontrolled environments targets to recognition of different actions froma video.An effective HAR model can be employed for an application like human-computer interaction,health care,person tracking,and video surveillance.Machine Learning(ML)approaches,specifically,Convolutional Neural Network(CNN)models had beenwidely used and achieved impressive results through feature fusion.The accuracy and effectiveness of these models continue to be the biggest challenge in this field.In this article,a novel feature optimization algorithm,called improved Shark Smell Optimization(iSSO)is proposed to reduce the redundancy of extracted features.This proposed technique is inspired by the behavior ofwhite sharks,and howthey find the best prey in thewhole search space.The proposed iSSOalgorithmdivides the FeatureVector(FV)into subparts,where a search is conducted to find optimal local features fromeach subpart of FV.Once local optimal features are selected,a global search is conducted to further optimize these features.The proposed iSSO algorithm is employed on nine(9)selected CNN models.These CNN models are selected based on their top-1 and top-5 accuracy in ImageNet competition.To evaluate the model,two publicly available datasets UCF-Sports and Hollywood2 are selected.展开更多
Realizing accurate perception of urban boundary changes is conducive to the formulation of regional development planning and researches of urban sustainable development.In this paper,an improved fully convolution neur...Realizing accurate perception of urban boundary changes is conducive to the formulation of regional development planning and researches of urban sustainable development.In this paper,an improved fully convolution neural network was provided for perceiving large-scale urban change,by modifying network structure and updating network strategy to extract richer feature information,and to meet the requirement of urban construction land extraction under the background of large-scale low-resolution image.This paper takes the Yangtze River Economic Belt of China as an empirical object to verify the practicability of the network,the results show the extraction results of the improved fully convolutional neural network model reached a precision of kappa coefficient of 0.88,which is better than traditional fully convolutional neural networks,it performs well in the construction land extraction at the scale of small and medium-sized cities.展开更多
针对焊接缺陷具有多尺度,形态复杂和易受背景干扰等特点,提出一种基于YOLOv8n的焊接缺陷检测算法YOLOSBRS.首先利用空间和通道重建卷积(spatial and channel reconstruction convolution,SCConv)卷积改进主干网络的C2f模块;同时设计一...针对焊接缺陷具有多尺度,形态复杂和易受背景干扰等特点,提出一种基于YOLOv8n的焊接缺陷检测算法YOLOSBRS.首先利用空间和通道重建卷积(spatial and channel reconstruction convolution,SCConv)卷积改进主干网络的C2f模块;同时设计一种具有双层路由注意力机制的空间金字塔快速平均池化(spatial pyramid pooling fast average pooling with biformer attention module, SPPF_ABF)模块,将原始SPPF模块的最大池化替换为平均池化操作,并引入双层路由Transformer注意力机制;其次采用重参数化广义特征金字塔网络(reparameterized generalized feature pyramid network,RepGFPN)优化特征融合部分;最后,基于参数共享原理及引入联合空间到深度层和非跨步卷积层模块(a module combining space-to-depth and non-strided convolutional layers,SPD_Conv)改进检测头,实现轻量化的同时提升网络对复杂缺陷的检测能力.试验结果表明,改进后算法的精度和交并比为50%的平均精度均值(mean average precision at 50%intersection over union,mAP50)分别提高3.1%和2.8%,为焊接缺陷检测提供一种高效且可行的解决方案.展开更多
基金Supported by the National Natural Science Foundation of China(61701029)Basic Research Foundation of Beijing Institute of Technology(20170542008)Industry-University Research Innovation Foundation of the Science and Technology Development Center of the Ministry of Education(2018A02012)。
文摘In order to accomplish the task of object recognition in natural scenes,a new object recognition algorithm based on an improved convolutional neural network(CNN)is proposed.First,candidate object windows are extracted from the original image.Then,candidate object windows are input into the improved CNN model to obtain deep features.Finally,the deep features are input into the Softmax and the confidence scores of classes are obtained.The candidate object window with the highest confidence score is selected as the object recognition result.Based on AlexNet,Inception V1 is introduced into the improved CNN and the fully connected layer is replaced by the average pooling layer,which widens the network and deepens the network at the same time.Experimental results show that the improved object recognition algorithm can obtain better recognition results in multiple natural scene images,and has a higher degree of accuracy than the classical algorithms in the field of object recognition.
基金supported by the Key Project Supported by Science and Technology of Tianjin Key Research and Development Plan[Grant No.20YFZCSN00220]Tianjin Science and Technology Plan Project[Grant No.21YFSNSN00040]+1 种基金Central Government Guides Local Science and Technology Development Project[Grant No.21ZYCGSN00590]Inner Mongolia Autonomous Region Department of Science and Technology Project[Grant No.2020GG0068].
文摘Plant species recognition is an important research area in image recognition in recent years.However,the existing plant species recognition methods have low recognition accuracy and do not meet professional requirements in terms of recognition accuracy.Therefore,ShuffleNetV2 was improved by combining the current hot concern mechanism,convolution kernel size adjustment,convolution tailoring,and CSP technology to improve the accuracy and reduce the amount of computation in this study.Six convolutional neural network models with sufficient trainable parameters were designed for differentiation learning.The SGD algorithm is used to optimize the training process to avoid overfitting or falling into the local optimum.In this paper,a conventional plant image dataset TJAU10 collected by cell phones in a natural context was constructed,containing 3000 images of 10 plant species on the campus of Tianjin Agricultural University.Finally,the improved model is compared with the baseline version of the model,which achieves better results in terms of improving accuracy and reducing the computational effort.The recognition accuracy tested on the TJAU10 dataset reaches up to 98.3%,and the recognition precision reaches up to 93.6%,which is 5.1%better than the original model and reduces the computational effort by about 31%compared with the original model.In addition,the experimental results were evaluated using metrics such as the confusion matrix,which can meet the requirements of professionals for the accurate identification of plant species.
基金supported by the National Key Research and Development Program of China(2018YFB1600600)the National Natural Science Foundation of China under(61976034,U1808206)the Dalian Science and Technology Innovation Fund(2019J12GX035).
文摘The data generated from non-Euclidean domains and its graphical representation(with complex-relationship object interdependence)applications has observed an exponential growth.The sophistication of graph data has posed consequential obstacles to the existing machine learning algorithms.In this study,we have considered a revamped version of a semi-supervised learning algorithm for graph-structured data to address the issue of expanding deep learning approaches to represent the graph data.Additionally,the quantum information theory has been applied through Graph Neural Networks(GNNs)to generate Riemannian metrics in closed-form of several graph layers.In further,to pre-process the adjacency matrix of graphs,a new formulation is established to incorporate high order proximities.The proposed scheme has shown outstanding improvements to overcome the deficiencies in Graph Convolutional Network(GCN),particularly,the information loss and imprecise information representation with acceptable computational overhead.Moreover,the proposed Quantum Graph Convolutional Network(QGCN)has significantly strengthened the GCN on semi-supervised node classification tasks.In parallel,it expands the generalization process with a significant difference by making small random perturbationsG of the graph during the training process.The evaluation results are provided on three benchmark datasets,including Citeseer,Cora,and PubMed,that distinctly delineate the superiority of the proposed model in terms of computational accuracy against state-of-the-art GCN and three other methods based on the same algorithms in the existing literature.
文摘Different devices in the recent era generated a vast amount of digital video.Generally,it has been seen in recent years that people are forging the video to use it as proof of evidence in the court of justice.Many kinds of researches on forensic detection have been presented,and it provides less accuracy.This paper proposed a novel forgery detection technique in image frames of the videos using enhanced Convolutional Neural Network(CNN).In the initial stage,the input video is taken as of the dataset and then converts the videos into image frames.Next,perform pre-sampling using the Adaptive Rood Pattern Search(ARPS)algorithm intended for reducing the useless frames.In the next stage,perform preprocessing for enhancing the image frames.Then,face detection is done as of the image utilizing the Viola-Jones algorithm.Finally,the improved Crow Search Algorithm(ICSA)has been used to select the extorted features and inputted to the Enhanced Convolutional Neural Network(ECNN)classifier for detecting the forged image frames.The experimental outcome of the proposed system has achieved 97.21%accuracy compared to other existing methods.
基金supported by the Shanghai Sailing Program,China (No.20YF1447600)the Research Start-Up Project of Shanghai Institute of Technology (No.YJ2021-60)+1 种基金the Collaborative Innovation Project of Shanghai Institute of Technology (No.XTCX2020-12)the Science and Technology Talent Development Fund for Young and Middle-Aged Teachers at Shanghai Institute of Technology (No.ZQ2022-6)。
文摘There is a problem of real-time detection difficulty in road surface damage detection. This paper proposes an improved lightweight model based on you only look once version 5(YOLOv5). Firstly, this paper fully utilized the convolutional neural network(CNN) + ghosting bottleneck(G_bneck) architecture to reduce redundant feature maps. Afterwards, we upgraded the original upsampling algorithm to content-aware reassembly of features(CARAFE) and increased the receptive field. Finally, we replaced the spatial pyramid pooling fast(SPPF) module with the basic receptive field block(Basic RFB) pooling module and added dilated convolution. After comparative experiments, we can see that the number of parameters and model size of the improved algorithm in this paper have been reduced by nearly half compared to the YOLOv5s. The frame rate per second(FPS) has been increased by 3.25 times. The mean average precision(m AP@0.5: 0.95) has increased by 8%—17% compared to other lightweight algorithms.
基金supported by National Key Research and Development Program of China(Key Techniques of Adaptive Grid Integration and Active Synchronization for Extremely High Penetration Distributed Photovoltaic Power Generation,2022YFB2402900).
文摘Accurate ultra-short-term photovoltaic(PV)power forecasting is crucial for mitigating variations caused by PV power generation and ensuring the stable and efficient operation of power grids.To capture intricate temporal relationships and enhance the precision of multi-step time forecast,this paper introduces an innovative approach for ultra-short-term photovoltaic(PV)power prediction,leveraging an enhanced Temporal Convolutional Neural Network(TCN)architecture and feature modeling.First,this study introduces a method employing the Spearman coefficient for meteorological feature filtration.Integrated with three-dimensional PV panel modeling,key factors influencing PV power generation are identified and prioritized.Second,the analysis of the correlation coefficient between astronomical features and PV power prediction demonstrates the theoretical substantiation for the practicality and essentiality of incorporating astronomical features.Third,an enhanced TCN model is introduced,augmenting the original TCN structure with a projection head layer to enhance its capacity for learning and expressing nonlinear features.Meanwhile,a new rolling timing network mechanism is constructed to guarantee the segmentation prediction of future long-time output sequences.Multiple experiments demonstrate the superior performance of the proposed forecasting method compared to existing models.The accuracy of PV power prediction in the next 4 hours,devoid of meteorological conditions,increases by 20.5%.Furthermore,incorporating shortwave radiation for predictions over 4 hours,2 hours,and 1 hour enhances accuracy by 11.1%,9.1%,and 8.8%,respectively.
基金supported by the Collabo R&D between Industry,Academy,and Research Institute(S3250534)funded by the Ministry of SMEs and Startups(MSS,Korea)the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(No.RS-2023-00218176)the Soonchunhyang University Research Fund.
文摘Human Action Recognition(HAR)in uncontrolled environments targets to recognition of different actions froma video.An effective HAR model can be employed for an application like human-computer interaction,health care,person tracking,and video surveillance.Machine Learning(ML)approaches,specifically,Convolutional Neural Network(CNN)models had beenwidely used and achieved impressive results through feature fusion.The accuracy and effectiveness of these models continue to be the biggest challenge in this field.In this article,a novel feature optimization algorithm,called improved Shark Smell Optimization(iSSO)is proposed to reduce the redundancy of extracted features.This proposed technique is inspired by the behavior ofwhite sharks,and howthey find the best prey in thewhole search space.The proposed iSSOalgorithmdivides the FeatureVector(FV)into subparts,where a search is conducted to find optimal local features fromeach subpart of FV.Once local optimal features are selected,a global search is conducted to further optimize these features.The proposed iSSO algorithm is employed on nine(9)selected CNN models.These CNN models are selected based on their top-1 and top-5 accuracy in ImageNet competition.To evaluate the model,two publicly available datasets UCF-Sports and Hollywood2 are selected.
基金supported by Natural Science Foundation of Chongqing in China(No.cstc2020jcyj-jqX0004)the Ministry of education of Humanities and Social Science project(No.20YJA790016)+1 种基金the National Natural Science Foundation of China(Grant No.42171298)We thank the patent supporting the method section of the paper(No.202110750360.1).
文摘Realizing accurate perception of urban boundary changes is conducive to the formulation of regional development planning and researches of urban sustainable development.In this paper,an improved fully convolution neural network was provided for perceiving large-scale urban change,by modifying network structure and updating network strategy to extract richer feature information,and to meet the requirement of urban construction land extraction under the background of large-scale low-resolution image.This paper takes the Yangtze River Economic Belt of China as an empirical object to verify the practicability of the network,the results show the extraction results of the improved fully convolutional neural network model reached a precision of kappa coefficient of 0.88,which is better than traditional fully convolutional neural networks,it performs well in the construction land extraction at the scale of small and medium-sized cities.
文摘针对焊接缺陷具有多尺度,形态复杂和易受背景干扰等特点,提出一种基于YOLOv8n的焊接缺陷检测算法YOLOSBRS.首先利用空间和通道重建卷积(spatial and channel reconstruction convolution,SCConv)卷积改进主干网络的C2f模块;同时设计一种具有双层路由注意力机制的空间金字塔快速平均池化(spatial pyramid pooling fast average pooling with biformer attention module, SPPF_ABF)模块,将原始SPPF模块的最大池化替换为平均池化操作,并引入双层路由Transformer注意力机制;其次采用重参数化广义特征金字塔网络(reparameterized generalized feature pyramid network,RepGFPN)优化特征融合部分;最后,基于参数共享原理及引入联合空间到深度层和非跨步卷积层模块(a module combining space-to-depth and non-strided convolutional layers,SPD_Conv)改进检测头,实现轻量化的同时提升网络对复杂缺陷的检测能力.试验结果表明,改进后算法的精度和交并比为50%的平均精度均值(mean average precision at 50%intersection over union,mAP50)分别提高3.1%和2.8%,为焊接缺陷检测提供一种高效且可行的解决方案.