Cracks represent a significant hazard to pavement integrity,making their efficient and automated extraction essential for effective road health monitoring and maintenance.In response to this challenge,we propose a cra...Cracks represent a significant hazard to pavement integrity,making their efficient and automated extraction essential for effective road health monitoring and maintenance.In response to this challenge,we propose a crack automatic extraction network model that integrates multi⁃scale image features,thereby enhancing the model’s capability to capture crack characteristics and adaptation to complex scenarios.This model is based on the ResUNet architecture,makes modification to the convolutional layer of the model,proposes to construct multiple branches utilizing different convolution kernel sizes,and adds a atrous spatial pyramid pooling module within the intermediate layers.In this paper,comparative experiments on the performance of the basic model,ablation experiments,comparative experiments before and after data augmentation,and generalization verification experiments are conducted.Comparative experimental results indicate that the improved model exhibits superior detail processing capability at crack edges.The overall performance of the model,as measured by the F1⁃score,reaches 71.03%,reflecting a 2.1%improvement over the conventional ResUNet.展开更多
There is instability in the distributed energy storage cloud group end region on the power grid side.In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capaci...There is instability in the distributed energy storage cloud group end region on the power grid side.In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capacitor components showa continuous and stable charging and discharging state,a hierarchical time-sharing configuration algorithm of distributed energy storage cloud group end region on the power grid side based on multi-scale and multi feature convolution neural network is proposed.Firstly,a voltage stability analysis model based onmulti-scale and multi feature convolution neural network is constructed,and the multi-scale and multi feature convolution neural network is optimized based on Self-OrganizingMaps(SOM)algorithm to analyze the voltage stability of the cloud group end region of distributed energy storage on the grid side under the framework of credibility.According to the optimal scheduling objectives and network size,the distributed robust optimal configuration control model is solved under the framework of coordinated optimal scheduling at multiple time scales;Finally,the time series characteristics of regional power grid load and distributed generation are analyzed.According to the regional hierarchical time-sharing configuration model of“cloud”,“group”and“end”layer,the grid side distributed energy storage cloud group end regional hierarchical time-sharing configuration algorithm is realized.The experimental results show that after applying this algorithm,the best grid side distributed energy storage configuration scheme can be determined,and the stability of grid side distributed energy storage cloud group end region layered timesharing configuration can be improved.展开更多
Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale inf...Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale information without reducing the resolution.The first layer of the network used spectral convolutional step to reduce dimensionality.Then the multi⁃scale aggregation extracted multi⁃scale features through applying dilated convolution and shortcut connection.The extracted features which represent properties of data were fed through Softmax to predict the samples.MDCNN achieved the overall accuracy of 99.58% and 99.92% on two public datasets,Indian Pines and Pavia University.Compared with four other existing models,the results illustrate that MDCNN can extract better discriminative features and achieve higher classification performance.展开更多
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso...Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline.展开更多
Pedestrian attribute classification from a pedestrian image captured in surveillance scenarios is challenging due to diverse clothing appearances,varied poses and different camera views. A multiscale and multi-label c...Pedestrian attribute classification from a pedestrian image captured in surveillance scenarios is challenging due to diverse clothing appearances,varied poses and different camera views. A multiscale and multi-label convolutional neural network( MSMLCNN) is proposed to predict multiple pedestrian attributes simultaneously. The pedestrian attribute classification problem is firstly transformed into a multi-label problem including multiple binary attributes needed to be classified. Then,the multi-label problem is solved by fully connecting all binary attributes to multi-scale features with logistic regression functions. Moreover,the multi-scale features are obtained by concatenating those featured maps produced from multiple pooling layers of the MSMLCNN at different scales. Extensive experiment results show that the proposed MSMLCNN outperforms state-of-the-art pedestrian attribute classification methods with a large margin.展开更多
An improved model based on you only look once version 8(YOLOv8)is proposed to solve the problem of low detection accuracy due to the diversity of object sizes in optical remote sensing images.Firstly,the feature pyram...An improved model based on you only look once version 8(YOLOv8)is proposed to solve the problem of low detection accuracy due to the diversity of object sizes in optical remote sensing images.Firstly,the feature pyramid network(FPN)structure of the original YOLOv8 mode is replaced by the generalized-FPN(GFPN)structure in GiraffeDet to realize the"cross-layer"and"cross-scale"adaptive feature fusion,to enrich the semantic information and spatial information on the feature map to improve the target detection ability of the model.Secondly,a pyramid-pool module of multi atrous spatial pyramid pooling(MASPP)is designed by using the idea of atrous convolution and feature pyramid structure to extract multi-scale features,so as to improve the processing ability of the model for multi-scale objects.The experimental results show that the detection accuracy of the improved YOLOv8 model on DIOR dataset is 92%and mean average precision(mAP)is 87.9%,respectively 3.5%and 1.7%higher than those of the original model.It is proved the detection and classification ability of the proposed model on multi-dimensional optical remote sensing target has been improved.展开更多
We propose a hierarchical multi-scale attention mechanism-based model in response to the low accuracy and inefficient manual classification of existing oceanic biological image classification methods. Firstly, the hie...We propose a hierarchical multi-scale attention mechanism-based model in response to the low accuracy and inefficient manual classification of existing oceanic biological image classification methods. Firstly, the hierarchical efficient multi-scale attention(H-EMA) module is designed for lightweight feature extraction, achieving outstanding performance at a relatively low cost. Secondly, an improved EfficientNetV2 block is used to integrate information from different scales better and enhance inter-layer message passing. Furthermore, introducing the convolutional block attention module(CBAM) enhances the model's perception of critical features, optimizing its generalization ability. Lastly, Focal Loss is introduced to adjust the weights of complex samples to address the issue of imbalanced categories in the dataset, further improving the model's performance. The model achieved 96.11% accuracy on the intertidal marine organism dataset of Nanji Islands and 84.78% accuracy on the CIFAR-100 dataset, demonstrating its strong generalization ability to meet the demands of oceanic biological image classification.展开更多
Multi-label image classification is a challenging task due to the diverse sizes and complex backgrounds of objects in images.Obtaining class-specific precise representations at different scales is a key aspect of feat...Multi-label image classification is a challenging task due to the diverse sizes and complex backgrounds of objects in images.Obtaining class-specific precise representations at different scales is a key aspect of feature representation.However,existing methods often rely on the single-scale deep feature,neglecting shallow and deeper layer features,which poses challenges when predicting objects of varying scales within the same image.Although some studies have explored multi-scale features,they rarely address the flow of information between scales or efficiently obtain class-specific precise representations for features at different scales.To address these issues,we propose a two-stage,three-branch Transformer-based framework.The first stage incorporates multi-scale image feature extraction and hierarchical scale attention.This design enables the model to consider objects at various scales while enhancing the flow of information across different feature scales,improving the model’s generalization to diverse object scales.The second stage includes a global feature enhancement module and a region selection module.The global feature enhancement module strengthens interconnections between different image regions,mitigating the issue of incomplete represen-tations,while the region selection module models the cross-modal relationships between image features and labels.Together,these components enable the efficient acquisition of class-specific precise feature representations.Extensive experiments on public datasets,including COCO2014,VOC2007,and VOC2012,demonstrate the effectiveness of our proposed method.Our approach achieves consistent performance gains of 0.3%,0.4%,and 0.2%over state-of-the-art methods on the three datasets,respectively.These results validate the reliability and superiority of our approach for multi-label image classification.展开更多
To improve image quality under low illumination conditions,a novel low-light image enhancement method is proposed in this paper based on multi-illumination estimation and multi-scale fusion(MIMS).Firstly,the illuminat...To improve image quality under low illumination conditions,a novel low-light image enhancement method is proposed in this paper based on multi-illumination estimation and multi-scale fusion(MIMS).Firstly,the illumination is processed by contrast-limited adaptive histogram equalization(CLAHE),adaptive complementary gamma function(ACG),and adaptive detail preserving S-curve(ADPS),respectively,to obtain three components.Then,the fusion-relevant features,exposure,and color contrast are selected as the weight maps.Subsequently,these components and weight maps are fused through multi-scale to generate enhanced illumination.Finally,the enhanced images are obtained by multiplying the enhanced illumination and reflectance.Compared with existing approaches,this proposed method achieves an average increase of 0.81%and 2.89%in the structural similarity index measurement(SSIM)and peak signal-to-noise ratio(PSNR),and a decrease of 6.17%and 32.61%in the natural image quality evaluator(NIQE)and gradient magnitude similarity deviation(GMSD),respectively.展开更多
Aiming at the problem of low detection accuracy due to the different scale sizes of apple leaf disease spots and their similarity to the background,this paper proposes a multi-scale lightweight network(MSL-Net).Firstl...Aiming at the problem of low detection accuracy due to the different scale sizes of apple leaf disease spots and their similarity to the background,this paper proposes a multi-scale lightweight network(MSL-Net).Firstly,a multiplexed aggregated feature extraction network is proposed using residual bottleneck block(RES-Bottleneck)and middle partial-convolution(MP-Conv)to capture multi-scale spatial features and enhance focus on disease features for better differentiation between disease targets and background information.Secondly,a lightweight feature fusion network is designed using scale-fuse concatenation(SF-Cat)and triple-scale sequence feature fusion(TSSF)module to merge multi-scale feature maps comprehensively.Depthwise convolution(DWConv)and GhostNet lighten the network,while the cross stage partial bottleneck with 3 convolutions ghost-normalization attention module(C3-GN)reduces missed detections by suppressing irrelevant background information.Finally,soft non-maximum suppression(Soft-NMS)is used in the post-processing stage to improve the problem of misdetection of dense disease sites.The results show that the MSL-Net improves mean average precision at intersection over union of 0.5(mAP@0.5)by 2.0%over the baseline you only look once version 5s(YOLOv5s)and reduces parameters by 44%,reducing computation by 27%,outperforming other state-of-the-art(SOTA)models overall.This method also shows excellent performance compared to the latest research.展开更多
Aspect-oriented sentiment analysis is a meticulous sentiment analysis task that aims to analyse the sentiment polarity of specific aspects. Most of the current research builds graph convolutional networks based on dep...Aspect-oriented sentiment analysis is a meticulous sentiment analysis task that aims to analyse the sentiment polarity of specific aspects. Most of the current research builds graph convolutional networks based on dependent syntactic trees, which improves the classification performance of the models to some extent. However, the technical limitations of dependent syntactic trees can introduce considerable noise into the model. Meanwhile, it is difficult for a single graph convolutional network to aggregate both semantic and syntactic structural information of nodes, which affects the final sentence classification. To cope with the above problems, this paper proposes a bi-channel graph convolutional network model. The model introduces a phrase structure tree and transforms it into a hierarchical phrase matrix. The adjacency matrix of the dependent syntactic tree and the hierarchical phrase matrix are combined as the initial matrix of the graph convolutional network to enhance the syntactic information. The semantic information feature representations of the sentences are obtained by the graph convolutional network with a multi-head attention mechanism and fused to achieve complementary learning of dual-channel features. Experimental results show that the model performs well and improves the accuracy of sentiment classification on three public benchmark datasets, namely Rest14, Lap14 and Twitter.展开更多
To address the difficulties in fusing multi-mode sensor data for complex industrial machinery, an adaptive deep coupling convolutional auto-encoder (ADCCAE) fusion method was proposed. First, the multi-mode features e...To address the difficulties in fusing multi-mode sensor data for complex industrial machinery, an adaptive deep coupling convolutional auto-encoder (ADCCAE) fusion method was proposed. First, the multi-mode features extracted synchronously by the CCAE were stacked and fed to the multi-channel convolution layers for fusion. Then, the fused data was passed to all connection layers for compression and fed to the Softmax module for classification. Finally, the coupling loss function coefficients and the network parameters were optimized through an adaptive approach using the gray wolf optimization (GWO) algorithm. Experimental comparisons showed that the proposed ADCCAE fusion model was superior to existing models for multi-mode data fusion.展开更多
In the analysis of functionally graded materials (FGMs), the uncoupled approach is used broadly, which is based on homogenized material property and ignores the effect Of local micro-structural interaction. The high...In the analysis of functionally graded materials (FGMs), the uncoupled approach is used broadly, which is based on homogenized material property and ignores the effect Of local micro-structural interaction. The higher-order theory for FGMs (HOTFGM) is a coupled approach that explicitly takes the effect of micro-structural gradation and the local interaction of the spatially variable inclusion phase into account. Based on the HOTFGM, this article presents a quadrilateral element-based method for the calculation of multi-scale temperature field (QTF). In this method, the discrete cells are quadrilateral including rectangular while the surface-averaged quantities are the primary variables which replace the coefficients employed in the temperature function. In contrast with the HOTFGM, this method improves the efficiency, eliminates the restriction of being rectangular cells and expands the solution scale. The presented results illustrate the efficiency of the QTF and its advantages in analyzing FGMs.展开更多
Bearings are crucial components in rotating machines,which have direct effects on industrial productivity and safety.To fast and accurately identify the operating condition of bearings,a novel method based on multi⁃sc...Bearings are crucial components in rotating machines,which have direct effects on industrial productivity and safety.To fast and accurately identify the operating condition of bearings,a novel method based on multi⁃scale permutation entropy(MPE)and morphology similarity distance(MSD)is proposed in this paper.Firstly,the MPE values of the original signals were calculated to characterize the complexity in different scales and they constructed feature vectors after normalization.Then,the MSD was employed to measure the distance among test samples from different fault types and the reference samples,and achieved classification with the minimum MSD.Finally,the proposed method was verified with two experiments concerning artificially seeded damage bearings and run⁃to⁃failure bearings,respectively.Different categories were considered for the two experiments and high classification accuracies were obtained.The experimental results indicate that the proposed method is effective and feasible in bearing fault diagnosis.展开更多
A new radar echo tracking algorithm known as multi-scale tracking radar echoes by cross-correlation (MTREC) was developed in this study to analyze movements of radar echoes at different spatial scales. Movement of r...A new radar echo tracking algorithm known as multi-scale tracking radar echoes by cross-correlation (MTREC) was developed in this study to analyze movements of radar echoes at different spatial scales. Movement of radar echoes, particularly associated with convective storms, exhibits different characteristics at various spatial scales as a result of complex interactions among meteorological systems leading to the formation of convective storms. For the null echo region, the usual correlation technique produces zero or a very small magnitude of motion vectors. To mitigate these constraints, MTREC uses the tracking radar echoes by correlation (TREC) technique with a large "box" to determine the systematic movement driven by steering wind, and MTREC applies the TREC technique with a small "box" to estimate small-scale internal motion vectors. Eventually, the MTREC vectors are obtained by synthesizing the systematic motion and the small-scale internal motion. Performance of the MTREC technique was compared with TREC technique using case studies: the Khanun typhoon on 11 September 2005 observed by Wenzhou radar and a squall-line system on 23 June 2011 detected by Beijing radar. The results demonstrate that more spatially smoothed and continuous vector fields can be generated by the MTREC technique, which leads to improvements in tracking the entire radar reflectivity pattern. The new multi-scMe tracking scheme was applied to study its impact on the performance of quantitative precipitation nowcasting. The location and intensity of heavy precipitation at a 1-h lead time was more consistent with quantitative precipitation estimates using radar and rain gauges.展开更多
Based on the recognition framework of the outermost closed contours of cyclones, an automated identification algorithm capable of identifying the multi-scale cyclones that occur during spring in the Changjiang River-H...Based on the recognition framework of the outermost closed contours of cyclones, an automated identification algorithm capable of identifying the multi-scale cyclones that occur during spring in the Changjiang River-Huaihe River valleys (CHV) were developed. We studied the characteristics of the multi-scale cyclone activity that affects CHV and its relationship with rainfall during spring since 1979. The results indicated that the automated identification algorithm for cyclones proposed in this paper could intuitively identify multi-scale cyclones that affect CHV. The algorithm allows for effectively describing the shape and coverage area of the closed contours around the periphery of cyclones. We found that, compared to the meso- and sub-synoptic scale cyclone activities, the synoptic-scale cyclone activity showed more intimate correlation with the overall activity intensity of multi-scale CHV cyclones during spring. However, the frequency of occurrence of sub-synoptic scale cyclones was the highest, and their effect on changes in CHV cyclone activity could not be ignored. Based on the area of impact and the depth of the cyclones, the sub-synoptic scale, synoptic scale and comprehensive cyclone intensity indices were further defined, which showed a positive correlation with rainfall in CHV during spring. Additionally, the comprehensive cyclone intensity index was a good indicator of strong rainfall events.展开更多
基金supported in part by the National Natural Science Foundation of China(No.42401166)the Open Fund of Key Laboratory of Polar Environment Monitoring and Public Governance,Ministry of Education(No.202405)the Key Research and Development Program of Hebei Province(No.23375405D).
文摘Cracks represent a significant hazard to pavement integrity,making their efficient and automated extraction essential for effective road health monitoring and maintenance.In response to this challenge,we propose a crack automatic extraction network model that integrates multi⁃scale image features,thereby enhancing the model’s capability to capture crack characteristics and adaptation to complex scenarios.This model is based on the ResUNet architecture,makes modification to the convolutional layer of the model,proposes to construct multiple branches utilizing different convolution kernel sizes,and adds a atrous spatial pyramid pooling module within the intermediate layers.In this paper,comparative experiments on the performance of the basic model,ablation experiments,comparative experiments before and after data augmentation,and generalization verification experiments are conducted.Comparative experimental results indicate that the improved model exhibits superior detail processing capability at crack edges.The overall performance of the model,as measured by the F1⁃score,reaches 71.03%,reflecting a 2.1%improvement over the conventional ResUNet.
基金supported by State Grid Corporation Limited Science and Technology Project Funding(Contract No.SGCQSQ00YJJS2200380).
文摘There is instability in the distributed energy storage cloud group end region on the power grid side.In order to avoid large-scale fluctuating charging and discharging in the power grid environment and make the capacitor components showa continuous and stable charging and discharging state,a hierarchical time-sharing configuration algorithm of distributed energy storage cloud group end region on the power grid side based on multi-scale and multi feature convolution neural network is proposed.Firstly,a voltage stability analysis model based onmulti-scale and multi feature convolution neural network is constructed,and the multi-scale and multi feature convolution neural network is optimized based on Self-OrganizingMaps(SOM)algorithm to analyze the voltage stability of the cloud group end region of distributed energy storage on the grid side under the framework of credibility.According to the optimal scheduling objectives and network size,the distributed robust optimal configuration control model is solved under the framework of coordinated optimal scheduling at multiple time scales;Finally,the time series characteristics of regional power grid load and distributed generation are analyzed.According to the regional hierarchical time-sharing configuration model of“cloud”,“group”and“end”layer,the grid side distributed energy storage cloud group end regional hierarchical time-sharing configuration algorithm is realized.The experimental results show that after applying this algorithm,the best grid side distributed energy storage configuration scheme can be determined,and the stability of grid side distributed energy storage cloud group end region layered timesharing configuration can be improved.
基金Sponsored by the Project of Multi Modal Monitoring Information Learning Fusion and Health Warning Diagnosis of Wind Power Transmission System(Grant No.61803329)the Research on Product Quality Inspection Method Based on Time Series Analysis(Grant No.201703A020)the Research on the Theory and Reliability of Group Coordinated Control of Hydraulic System for Large Engineering Transportation Vehicles(Grant No.51675461).
文摘Aiming at the problem of image information loss,dilated convolution is introduced and a novel multi⁃scale dilated convolutional neural network(MDCNN)is proposed.Dilated convolution can polymerize image multi⁃scale information without reducing the resolution.The first layer of the network used spectral convolutional step to reduce dimensionality.Then the multi⁃scale aggregation extracted multi⁃scale features through applying dilated convolution and shortcut connection.The extracted features which represent properties of data were fed through Softmax to predict the samples.MDCNN achieved the overall accuracy of 99.58% and 99.92% on two public datasets,Indian Pines and Pavia University.Compared with four other existing models,the results illustrate that MDCNN can extract better discriminative features and achieve higher classification performance.
文摘Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline.
基金Supported by the National Natural Science Foundation of China(No.61602191,61672521,61375037,61473291,61572501,61572536,61502491,61372107,61401167)the Natural Science Foundation of Fujian Province(No.2016J01308)+3 种基金the Scientific and Technology Funds of Quanzhou(No.2015Z114)the Scientific and Technology Funds of Xiamen(No.3502Z20173045)the Promotion Program for Young and Middle aged Teacher in Science and Technology Research of Huaqiao University(No.ZQN-PY418,ZQN-YX403)the Scientific Research Funds of Huaqiao University(No.16BS108)
文摘Pedestrian attribute classification from a pedestrian image captured in surveillance scenarios is challenging due to diverse clothing appearances,varied poses and different camera views. A multiscale and multi-label convolutional neural network( MSMLCNN) is proposed to predict multiple pedestrian attributes simultaneously. The pedestrian attribute classification problem is firstly transformed into a multi-label problem including multiple binary attributes needed to be classified. Then,the multi-label problem is solved by fully connecting all binary attributes to multi-scale features with logistic regression functions. Moreover,the multi-scale features are obtained by concatenating those featured maps produced from multiple pooling layers of the MSMLCNN at different scales. Extensive experiment results show that the proposed MSMLCNN outperforms state-of-the-art pedestrian attribute classification methods with a large margin.
基金supported by the National Natural Science Foundation of China(No.62241109)the Tianjin Science and Technology Commissioner Project(No.20YDTPJC01110)。
文摘An improved model based on you only look once version 8(YOLOv8)is proposed to solve the problem of low detection accuracy due to the diversity of object sizes in optical remote sensing images.Firstly,the feature pyramid network(FPN)structure of the original YOLOv8 mode is replaced by the generalized-FPN(GFPN)structure in GiraffeDet to realize the"cross-layer"and"cross-scale"adaptive feature fusion,to enrich the semantic information and spatial information on the feature map to improve the target detection ability of the model.Secondly,a pyramid-pool module of multi atrous spatial pyramid pooling(MASPP)is designed by using the idea of atrous convolution and feature pyramid structure to extract multi-scale features,so as to improve the processing ability of the model for multi-scale objects.The experimental results show that the detection accuracy of the improved YOLOv8 model on DIOR dataset is 92%and mean average precision(mAP)is 87.9%,respectively 3.5%and 1.7%higher than those of the original model.It is proved the detection and classification ability of the proposed model on multi-dimensional optical remote sensing target has been improved.
基金supported by the National Natural Science Foundation of China (Nos.61806107 and 61702135)。
文摘We propose a hierarchical multi-scale attention mechanism-based model in response to the low accuracy and inefficient manual classification of existing oceanic biological image classification methods. Firstly, the hierarchical efficient multi-scale attention(H-EMA) module is designed for lightweight feature extraction, achieving outstanding performance at a relatively low cost. Secondly, an improved EfficientNetV2 block is used to integrate information from different scales better and enhance inter-layer message passing. Furthermore, introducing the convolutional block attention module(CBAM) enhances the model's perception of critical features, optimizing its generalization ability. Lastly, Focal Loss is introduced to adjust the weights of complex samples to address the issue of imbalanced categories in the dataset, further improving the model's performance. The model achieved 96.11% accuracy on the intertidal marine organism dataset of Nanji Islands and 84.78% accuracy on the CIFAR-100 dataset, demonstrating its strong generalization ability to meet the demands of oceanic biological image classification.
基金supported by the National Natural Science Foundation of China(62302167,62477013)Natural Science Foundation of Shanghai(No.24ZR1456100)+1 种基金Science and Technology Commission of Shanghai Municipality(No.24DZ2305900)the Shanghai Municipal Special Fund for Promoting High-Quality Development of Industries(2211106).
文摘Multi-label image classification is a challenging task due to the diverse sizes and complex backgrounds of objects in images.Obtaining class-specific precise representations at different scales is a key aspect of feature representation.However,existing methods often rely on the single-scale deep feature,neglecting shallow and deeper layer features,which poses challenges when predicting objects of varying scales within the same image.Although some studies have explored multi-scale features,they rarely address the flow of information between scales or efficiently obtain class-specific precise representations for features at different scales.To address these issues,we propose a two-stage,three-branch Transformer-based framework.The first stage incorporates multi-scale image feature extraction and hierarchical scale attention.This design enables the model to consider objects at various scales while enhancing the flow of information across different feature scales,improving the model’s generalization to diverse object scales.The second stage includes a global feature enhancement module and a region selection module.The global feature enhancement module strengthens interconnections between different image regions,mitigating the issue of incomplete represen-tations,while the region selection module models the cross-modal relationships between image features and labels.Together,these components enable the efficient acquisition of class-specific precise feature representations.Extensive experiments on public datasets,including COCO2014,VOC2007,and VOC2012,demonstrate the effectiveness of our proposed method.Our approach achieves consistent performance gains of 0.3%,0.4%,and 0.2%over state-of-the-art methods on the three datasets,respectively.These results validate the reliability and superiority of our approach for multi-label image classification.
基金supported by the National Key R&D Program of China(No.2022YFB3205101)NSAF(No.U2230116)。
文摘To improve image quality under low illumination conditions,a novel low-light image enhancement method is proposed in this paper based on multi-illumination estimation and multi-scale fusion(MIMS).Firstly,the illumination is processed by contrast-limited adaptive histogram equalization(CLAHE),adaptive complementary gamma function(ACG),and adaptive detail preserving S-curve(ADPS),respectively,to obtain three components.Then,the fusion-relevant features,exposure,and color contrast are selected as the weight maps.Subsequently,these components and weight maps are fused through multi-scale to generate enhanced illumination.Finally,the enhanced images are obtained by multiplying the enhanced illumination and reflectance.Compared with existing approaches,this proposed method achieves an average increase of 0.81%and 2.89%in the structural similarity index measurement(SSIM)and peak signal-to-noise ratio(PSNR),and a decrease of 6.17%and 32.61%in the natural image quality evaluator(NIQE)and gradient magnitude similarity deviation(GMSD),respectively.
文摘Aiming at the problem of low detection accuracy due to the different scale sizes of apple leaf disease spots and their similarity to the background,this paper proposes a multi-scale lightweight network(MSL-Net).Firstly,a multiplexed aggregated feature extraction network is proposed using residual bottleneck block(RES-Bottleneck)and middle partial-convolution(MP-Conv)to capture multi-scale spatial features and enhance focus on disease features for better differentiation between disease targets and background information.Secondly,a lightweight feature fusion network is designed using scale-fuse concatenation(SF-Cat)and triple-scale sequence feature fusion(TSSF)module to merge multi-scale feature maps comprehensively.Depthwise convolution(DWConv)and GhostNet lighten the network,while the cross stage partial bottleneck with 3 convolutions ghost-normalization attention module(C3-GN)reduces missed detections by suppressing irrelevant background information.Finally,soft non-maximum suppression(Soft-NMS)is used in the post-processing stage to improve the problem of misdetection of dense disease sites.The results show that the MSL-Net improves mean average precision at intersection over union of 0.5(mAP@0.5)by 2.0%over the baseline you only look once version 5s(YOLOv5s)and reduces parameters by 44%,reducing computation by 27%,outperforming other state-of-the-art(SOTA)models overall.This method also shows excellent performance compared to the latest research.
文摘Aspect-oriented sentiment analysis is a meticulous sentiment analysis task that aims to analyse the sentiment polarity of specific aspects. Most of the current research builds graph convolutional networks based on dependent syntactic trees, which improves the classification performance of the models to some extent. However, the technical limitations of dependent syntactic trees can introduce considerable noise into the model. Meanwhile, it is difficult for a single graph convolutional network to aggregate both semantic and syntactic structural information of nodes, which affects the final sentence classification. To cope with the above problems, this paper proposes a bi-channel graph convolutional network model. The model introduces a phrase structure tree and transforms it into a hierarchical phrase matrix. The adjacency matrix of the dependent syntactic tree and the hierarchical phrase matrix are combined as the initial matrix of the graph convolutional network to enhance the syntactic information. The semantic information feature representations of the sentences are obtained by the graph convolutional network with a multi-head attention mechanism and fused to achieve complementary learning of dual-channel features. Experimental results show that the model performs well and improves the accuracy of sentiment classification on three public benchmark datasets, namely Rest14, Lap14 and Twitter.
文摘To address the difficulties in fusing multi-mode sensor data for complex industrial machinery, an adaptive deep coupling convolutional auto-encoder (ADCCAE) fusion method was proposed. First, the multi-mode features extracted synchronously by the CCAE were stacked and fed to the multi-channel convolution layers for fusion. Then, the fused data was passed to all connection layers for compression and fed to the Softmax module for classification. Finally, the coupling loss function coefficients and the network parameters were optimized through an adaptive approach using the gray wolf optimization (GWO) algorithm. Experimental comparisons showed that the proposed ADCCAE fusion model was superior to existing models for multi-mode data fusion.
基金National Natural Science Foundation of China (2009ZB52028,05C52013)Ph.D. Programs Foundation of Ministry of Education of China (20070287039)
文摘In the analysis of functionally graded materials (FGMs), the uncoupled approach is used broadly, which is based on homogenized material property and ignores the effect Of local micro-structural interaction. The higher-order theory for FGMs (HOTFGM) is a coupled approach that explicitly takes the effect of micro-structural gradation and the local interaction of the spatially variable inclusion phase into account. Based on the HOTFGM, this article presents a quadrilateral element-based method for the calculation of multi-scale temperature field (QTF). In this method, the discrete cells are quadrilateral including rectangular while the surface-averaged quantities are the primary variables which replace the coefficients employed in the temperature function. In contrast with the HOTFGM, this method improves the efficiency, eliminates the restriction of being rectangular cells and expands the solution scale. The presented results illustrate the efficiency of the QTF and its advantages in analyzing FGMs.
基金Sponsored by the National Natural Science Foundation of China(Grant No.51505100)
文摘Bearings are crucial components in rotating machines,which have direct effects on industrial productivity and safety.To fast and accurately identify the operating condition of bearings,a novel method based on multi⁃scale permutation entropy(MPE)and morphology similarity distance(MSD)is proposed in this paper.Firstly,the MPE values of the original signals were calculated to characterize the complexity in different scales and they constructed feature vectors after normalization.Then,the MSD was employed to measure the distance among test samples from different fault types and the reference samples,and achieved classification with the minimum MSD.Finally,the proposed method was verified with two experiments concerning artificially seeded damage bearings and run⁃to⁃failure bearings,respectively.Different categories were considered for the two experiments and high classification accuracies were obtained.The experimental results indicate that the proposed method is effective and feasible in bearing fault diagnosis.
基金This study was supported by the Special Fund for Basic Research and Operation of Chinese Academy of Meteorological Science:Development on quantitative precipitation forecasts for 0-6 h lead times by blending radar-based extrapolation and GRAPES-meso,Observation and retrieval methods of micro-physics,the National Natural Science Foundation of China
文摘A new radar echo tracking algorithm known as multi-scale tracking radar echoes by cross-correlation (MTREC) was developed in this study to analyze movements of radar echoes at different spatial scales. Movement of radar echoes, particularly associated with convective storms, exhibits different characteristics at various spatial scales as a result of complex interactions among meteorological systems leading to the formation of convective storms. For the null echo region, the usual correlation technique produces zero or a very small magnitude of motion vectors. To mitigate these constraints, MTREC uses the tracking radar echoes by correlation (TREC) technique with a large "box" to determine the systematic movement driven by steering wind, and MTREC applies the TREC technique with a small "box" to estimate small-scale internal motion vectors. Eventually, the MTREC vectors are obtained by synthesizing the systematic motion and the small-scale internal motion. Performance of the MTREC technique was compared with TREC technique using case studies: the Khanun typhoon on 11 September 2005 observed by Wenzhou radar and a squall-line system on 23 June 2011 detected by Beijing radar. The results demonstrate that more spatially smoothed and continuous vector fields can be generated by the MTREC technique, which leads to improvements in tracking the entire radar reflectivity pattern. The new multi-scMe tracking scheme was applied to study its impact on the performance of quantitative precipitation nowcasting. The location and intensity of heavy precipitation at a 1-h lead time was more consistent with quantitative precipitation estimates using radar and rain gauges.
基金jointly sponsored by the National Natural Science Foundation of China(Grant No.41575081)the National Basic Research Program of China(Grant No.2015CB953904)+3 种基金the Public Sector(Meteorology)Special Research Foundation(Grant Nos.GYHY201406024 and GYHY201306022)the Special Fund for Core Operational Development of Forecast and Prediction of the China Meteorological Administration(Grant No.CMAHX20160405)the Natural Science Foundation of Jiangsu Province(Grant No.BK20161603,BK2012465)the Priority Academic Program Development of Jiangsu Higher Education Institutions
文摘Based on the recognition framework of the outermost closed contours of cyclones, an automated identification algorithm capable of identifying the multi-scale cyclones that occur during spring in the Changjiang River-Huaihe River valleys (CHV) were developed. We studied the characteristics of the multi-scale cyclone activity that affects CHV and its relationship with rainfall during spring since 1979. The results indicated that the automated identification algorithm for cyclones proposed in this paper could intuitively identify multi-scale cyclones that affect CHV. The algorithm allows for effectively describing the shape and coverage area of the closed contours around the periphery of cyclones. We found that, compared to the meso- and sub-synoptic scale cyclone activities, the synoptic-scale cyclone activity showed more intimate correlation with the overall activity intensity of multi-scale CHV cyclones during spring. However, the frequency of occurrence of sub-synoptic scale cyclones was the highest, and their effect on changes in CHV cyclone activity could not be ignored. Based on the area of impact and the depth of the cyclones, the sub-synoptic scale, synoptic scale and comprehensive cyclone intensity indices were further defined, which showed a positive correlation with rainfall in CHV during spring. Additionally, the comprehensive cyclone intensity index was a good indicator of strong rainfall events.