This paper addresses the performance degradation issue in a fast radio burst search pipeline based on deep learning.This issue is caused by the class imbalance of the radio frequency interference samples in the traini...This paper addresses the performance degradation issue in a fast radio burst search pipeline based on deep learning.This issue is caused by the class imbalance of the radio frequency interference samples in the training dataset,and one solution is applied to improve the distribution of the training data by augmenting minority class samples using a deep convolutional generative adversarial network.Experi.mental results demonstrate that retraining the deep learning model with the newly generated dataset leads to a new fast radio burst classifier,which effectively reduces false positives caused by periodic wide-band impulsive radio frequency interference,thereby enhancing the performance of the search pipeline.展开更多
The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the ...The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the continuously advancing level of sophistication.To resolve this problem,efficient and flexible malware detection tools are needed.This work examines the possibility of employing deep CNNs to detect Android malware by transforming network traffic into image data representations.Moreover,the dataset used in this study is the CIC-AndMal2017,which contains 20,000 instances of network traffic across five distinct malware categories:a.Trojan,b.Adware,c.Ransomware,d.Spyware,e.Worm.These network traffic features are then converted to image formats for deep learning,which is applied in a CNN framework,including the VGG16 pre-trained model.In addition,our approach yielded high performance,yielding an accuracy of 0.92,accuracy of 99.1%,precision of 98.2%,recall of 99.5%,and F1 score of 98.7%.Subsequent improvements to the classification model through changes within the VGG19 framework improved the classification rate to 99.25%.Through the results obtained,it is clear that CNNs are a very effective way to classify Android malware,providing greater accuracy than conventional techniques.The success of this approach also shows the applicability of deep learning in mobile security along with the direction for the future advancement of the real-time detection system and other deeper learning techniques to counter the increasing number of threats emerging in the future.展开更多
The development of precision agriculture demands high accuracy and efficiency of cultivated land information extraction. As a new means of monitoring the ground in recent years, unmanned aerial vehicle (UAV) low-hei...The development of precision agriculture demands high accuracy and efficiency of cultivated land information extraction. As a new means of monitoring the ground in recent years, unmanned aerial vehicle (UAV) low-height remote sensing technique, which is flexible, efficient with low cost and with high resolution, is widely applied to investing various resources. Based on this, a novel extraction method for cultivated land information based on Deep Convolutional Neural Network and Transfer Learning (DTCLE) was proposed. First, linear features (roads and ridges etc.) were excluded based on Deep Convolutional Neural Network (DCNN). Next, feature extraction method learned from DCNN was used to cultivated land information extraction by introducing transfer learning mechanism. Last, cultivated land information extraction results were completed by the DTCLE and eCognifion for cultivated land information extraction (ECLE). The location of the Pengzhou County and Guanghan County, Sichuan Province were selected for the experimental purpose. The experimental results showed that the overall precision for the experimental image 1, 2 and 3 (of extracting cultivated land) with the DTCLE method was 91.7%, 88.1% and 88.2% respectively, and the overall precision of ECLE is 9o.7%, 90.5% and 87.0%, respectively. Accuracy of DTCLE was equivalent to that of ECLE, and also outperformed ECLE in terms of integrity and continuity.展开更多
Compressive strength of concrete is a significant factor to assess building structure health and safety.Therefore,various methods have been developed to evaluate the compressive strength of concrete structures.However...Compressive strength of concrete is a significant factor to assess building structure health and safety.Therefore,various methods have been developed to evaluate the compressive strength of concrete structures.However,previous methods have several challenges in costly,time-consuming,and unsafety.To address these drawbacks,this paper proposed a digital vision based concrete compressive strength evaluating model using deep convolutional neural network(DCNN).The proposed model presented an alternative approach to evaluating the concrete strength and contributed to improving efficiency and accuracy.The model was developed with 4,000 digital images and 61,996 images extracted from video recordings collected from concrete samples.The experimental results indicated a root mean square error(RMSE)value of 3.56(MPa),demonstrating a strong feasibility that the proposed model can be utilized to predict the concrete strength with digital images of their surfaces and advantages to overcome the previous limitations.This experiment contributed to provide the basis that could be extended to future research with image analysis technique and artificial neural network in the diagnosis of concrete building structures.展开更多
In underground mining,the belt is a critical component,as its state directly affects the safe and stable operation of the conveyor.Most of the existing non-contact detection methods based on machine vision can only de...In underground mining,the belt is a critical component,as its state directly affects the safe and stable operation of the conveyor.Most of the existing non-contact detection methods based on machine vision can only detect a single type of damage and they require pre-processing operations.This tends to cause a large amount of calculation and low detection precision.To solve these problems,in the work described in this paper a belt tear detection method based on a multi-class conditional deep convolutional generative adversarial network(CDCGAN)was designed.In the traditional DCGAN,the image generated by the generator has a certain degree of randomness.Here,a small number of labeled belt images are taken as conditions and added them to the generator and discriminator,so the generator can generate images with the characteristics of belt damage under the aforementioned conditions.Moreover,because the discriminator cannot identify multiple types of damage,the multi-class softmax function is used as the output function of the discriminator to output a vector of class probabilities,and it can accurately classify cracks,scratches,and tears.To avoid the features learned incompletely,skiplayer connection is adopted in the generator and discriminator.This not only can minimize the loss of features,but also improves the convergence speed.Compared with other algorithms,experimental results show that the loss value of the generator and discriminator is the least.Moreover,its convergence speed is faster,and the mean average precision of the proposed algorithm is up to 96.2%,which is at least 6%higher than that of other algorithms.展开更多
In this study,we examined the efficacy of a deep convolutional neural network(DCNN)in recognizing concrete surface images and predicting the compressive strength of concrete.A digital single-lens reflex(DSLR)camera an...In this study,we examined the efficacy of a deep convolutional neural network(DCNN)in recognizing concrete surface images and predicting the compressive strength of concrete.A digital single-lens reflex(DSLR)camera and microscope were simultaneously used to obtain concrete surface images used as the input data for the DCNN.Thereafter,training,validation,and testing of the DCNNs were performed based on the DSLR camera and microscope image data.Results of the analysis indicated that the DCNN employing DSLR image data achieved a relatively higher accuracy.The accuracy of the DSLR-derived image data was attributed to the relatively wider range of the DSLR camera,which was beneficial for extracting a larger number of features.Moreover,the DSLR camera procured more realistic images than the microscope.Thus,when the compressive strength of concrete was evaluated using the DCNN employing a DSLR camera,time and cost were reduced,whereas the usefulness increased.Furthermore,an indirect comparison of the accuracy of the DCNN with that of existing non-destructive methods for evaluating the strength of concrete proved the reliability of DCNN-derived concrete strength predictions.In addition,it was determined that the DCNN used for concrete strength evaluations in this study can be further expanded to detect and evaluate various deteriorative factors that affect the durability of structures,such as salt damage,carbonation,sulfation,corrosion,and freezing-thawing.展开更多
Assessing the age of an individual via bones serves as a fool proof method in true determination of individual skills.Several attempts are reported in the past for assessment of chronological age of an individual base...Assessing the age of an individual via bones serves as a fool proof method in true determination of individual skills.Several attempts are reported in the past for assessment of chronological age of an individual based on variety of discriminative features found in wrist radiograph images.The permutation and combination of these features realized satisfactory accuracies for a set of limited groups.In this paper,assessment of gender for individuals of chronological age between 1-17 years is performed using left hand wrist radiograph images.A fully automated approach is proposed for removal of noise persisted due to non-uniform illumination during the process of radiograph acquisition process.Subsequent to this a computational technique for extraction of wrist region is proposed using operations on specific bit planes of image.A framework called GeNet of deep convolutional neural network is applied for classification of extracted wrist regions into male and female.The experimentations are conducted on the datasets of Radiological Society of North America(RSNA)of about 12442 images.Efficiency of preprocessing and segmentation techniques resulted into a correlation of about 99.09%.Performance of GeNet is evaluated on the extracted wrist regions resulting into an accuracy of 82.18%.展开更多
In this paper,we utilized the deep convolutional neural network D-LinkNet,a model for semantic segmentation,to analyze the Himawari-8 satellite data captured from 16 channels at a spatial resolution of 0.5 km,with a f...In this paper,we utilized the deep convolutional neural network D-LinkNet,a model for semantic segmentation,to analyze the Himawari-8 satellite data captured from 16 channels at a spatial resolution of 0.5 km,with a focus on the area over the Yellow Sea and the Bohai Sea(32°-42°N,117°-127°E).The objective was to develop an algorithm for fusing and segmenting multi-channel images from geostationary meteorological satellites,specifically for monitoring sea fog in this region.Firstly,the extreme gradient boosting algorithm was adopted to evaluate the data from the 16 channels of the Himawari-8 satellite for sea fog detection,and we found that the top three channels in order of importance were channels 3,4,and 14,which were fused into false color daytime images,while channels 7,13,and 15 were fused into false color nighttime images.Secondly,the simple linear iterative super-pixel clustering algorithm was used for the pixel-level segmentation of false color images,and based on super-pixel blocks,manual sea-fog annotation was performed to obtain fine-grained annotation labels.The deep convolutional neural network D-LinkNet was built on the ResNet backbone and the dilated convolutional layers with direct connections were added in the central part to form a string-and-combine structure with five branches having different depths and receptive fields.Results show that the accuracy rate of fog area(proportion of detected real fog to detected fog)was 66.5%,the recognition rate of fog zone(proportion of detected real fog to real fog or cloud cover)was 51.9%,and the detection accuracy rate(proportion of samples detected correctly to total samples)was 93.2%.展开更多
This paper proposes a cascade deep convolutional neural network to address the loosening detection problem of bolts on axlebox covers.Firstly,an SSD network based on ResNet50 and CBAM module by improving bolt image fe...This paper proposes a cascade deep convolutional neural network to address the loosening detection problem of bolts on axlebox covers.Firstly,an SSD network based on ResNet50 and CBAM module by improving bolt image features is proposed for locating bolts on axlebox covers.And then,theA2-PFN is proposed according to the slender features of the marker lines for extracting more accurate marker lines regions of the bolts.Finally,a rectangular approximationmethod is proposed to regularize themarker line regions asaway tocalculate the angle of themarker line and plot all the angle values into an angle table,according to which the criteria of the angle table can determine whether the bolt with the marker line is in danger of loosening.Meanwhile,our improved algorithm is compared with the pre-improved algorithmin the object localization stage.The results show that our proposed method has a significant improvement in both detection accuracy and detection speed,where ourmAP(IoU=0.75)reaches 0.77 and fps reaches 16.6.And in the saliency detection stage,after qualitative comparison and quantitative comparison,our method significantly outperforms other state-of-the-art methods,where our MAE reaches 0.092,F-measure reaches 0.948 and AUC reaches 0.943.Ultimately,according to the angle table,out of 676 bolt samples,a total of 60 bolts are loose,69 bolts are at risk of loosening,and 547 bolts are tightened.展开更多
Enabling high mobility applications in millimeter wave(mmWave)based systems opens up a slew of new possibilities,including vehicle communi-cations in addition to wireless virtual/augmented reality.The narrow beam usag...Enabling high mobility applications in millimeter wave(mmWave)based systems opens up a slew of new possibilities,including vehicle communi-cations in addition to wireless virtual/augmented reality.The narrow beam usage in addition to the millimeter waves sensitivity might block the coverage along with the reliability of the mobile links.In this research work,the improvement in the quality of experience faced by the user for multimedia-related applications over the millimeter-wave band is investigated.The high attenuation loss in high frequencies is compensated with a massive array structure named Multiple Input and Multiple Output(MIMO)which is utilized in a hyperdense environment called heterogeneous networks(HetNet).The optimization problem which arises while maximizing the Mean Opinion Score(MOS)is analyzed along with the QoE(Quality of Experience)metric by considering the Base Station(BS)powers in addition to the needed Quality of Service(QoS).Most of the approaches related to wireless network communication are not suitable for the millimeter-wave band because of its problems due to high complexity and its dynamic nature.Hence a deep reinforcement learning framework is developed for tackling the same opti-mization problem.In this work,a Fuzzy-based Deep Convolutional Neural Net-work(FDCNN)is proposed in addition to a Deep Reinforcing Learning Framework(DRLF)for extracting the features of highly correlated data.The investigational results prove that the proposed method yields the highest satisfac-tion to the user by increasing the number of antennas in addition with the small-scale antennas at the base stations.The proposed work outperforms in terms of MOS with multiple antennas.展开更多
Graph embedding aims to map the high-dimensional nodes to a low-dimensional space and learns the graph relationship from its latent representations.Most existing graph embedding methods focus on the topological struct...Graph embedding aims to map the high-dimensional nodes to a low-dimensional space and learns the graph relationship from its latent representations.Most existing graph embedding methods focus on the topological structure of graph data,but ignore the semantic information of graph data,which results in the unsatisfied performance in practical applications.To overcome the problem,this paper proposes a novel deep convolutional adversarial graph autoencoder(GAE)model.To embed the semantic information between nodes in the graph data,the random walk strategy is first used to construct the positive pointwise mutual information(PPMI)matrix,then,graph convolutional net-work(GCN)is employed to encode the PPMI matrix and node content into the latent representation.Finally,the learned latent representation is used to reconstruct the topological structure of the graph data by decoder.Furthermore,the deep convolutional adversarial training algorithm is introduced to make the learned latent representation conform to the prior distribution better.The state-of-the-art experimental results on the graph data validate the effectiveness of the proposed model in the link prediction,node clustering and graph visualization tasks for three standard datasets,Cora,Citeseer and Pubmed.展开更多
Combining both visible and infrared object information, multispectral data is a promising source data for automatic maritime ship recognition. In this paper, in order to take advantage of deep convolutional neural net...Combining both visible and infrared object information, multispectral data is a promising source data for automatic maritime ship recognition. In this paper, in order to take advantage of deep convolutional neural network and multispectral data, we model multispectral ship recognition task into a convolutional feature fusion problem, and propose a feature fusion architecture called Hybrid Fusion. We fine-tune the VGG-16 model pre-trained on ImageNet through three channels single spectral image and four channels multispectral images, and use existing regularization techniques to avoid over-fitting problem. Hybrid Fusion as well as the other three feature fusion architectures is investigated. Each fusion architecture consists of visible image and infrared image feature extraction branches, in which the pre-trained and fine-tuned VGG-16 models are taken as feature extractor. In each fusion architecture, image features of two branches are firstly extracted from the same layer or different layers of VGG-16 model. Subsequently, the features extracted from the two branches are flattened and concatenated to produce a multispectral feature vector, which is finally fed into a classifier to achieve ship recognition task. Furthermore, based on these fusion architectures, we also evaluate recognition performance of a feature vector normalization method and three combinations of feature extractors. Experimental results on the visible and infrared ship (VAIS) dataset show that the best Hybrid Fusion achieves 89.6% mean per-class recognition accuracy on daytime paired images and 64.9% on nighttime infrared images, and outperforms the state-of-the-art method by 1.4% and 3.9%, respectively.展开更多
The novel coronavirus 2019(COVID-19)rapidly spreading around the world and turns into a pandemic situation,consequently,detecting the coronavirus(COVID-19)affected patients are now the most critical task for medical s...The novel coronavirus 2019(COVID-19)rapidly spreading around the world and turns into a pandemic situation,consequently,detecting the coronavirus(COVID-19)affected patients are now the most critical task for medical specialists.The deficiency of medical testing kits leading to huge complexity in detecting COVID-19 patients worldwide,resulting in the number of infected cases is expanding.Therefore,a significant study is necessary about detecting COVID-19 patients using an automated diagnosis method,which hinders the spreading of coronavirus.In this paper,the study suggests a Deep Convolutional Neural Network-based multi-classification framework(COV-MCNet)using eight different pre-trained architectures such as VGG16,VGG19,ResNet50V2,DenseNet201,InceptionV3,MobileNet,InceptionResNetV2,Xception which are trained and tested on the X-ray images of COVID-19,Normal,Viral Pneumonia,and Bacterial Pneumonia.The results from 4-class(Normal vs.COVID-19 vs.Viral Pneumonia vs.Bacterial Pneumonia)demonstrated that the pre-trained model DenseNet201 provides the highest classification performance(accuracy:92.54%,precision:93.05%,recall:92.81%,F1-score:92.83%,specificity:97.47%).Notably,the DenseNet201(4-class classification)pre-trained model in the proposed COV-MCNet framework showed higher accuracy compared to the rest seven models.Important to mention that the proposed COV-MCNet model showed comparatively higher classification accuracy based on the small number of pre-processed datasets that specifies the designed system can produce superior results when more data become available.The proposed multi-classification network(COV-MCNet)significantly speeds up the existing radiology based method which will be helpful for the medical community and clinical specialists to early diagnosis the COVID-19 cases during this pandemic.展开更多
Residual magnetic error remains after standard levelling process.The weak non-geological effect,manifesting itself as streaky noise along flight lines,creates a challenge for airborne geophysical data processing and i...Residual magnetic error remains after standard levelling process.The weak non-geological effect,manifesting itself as streaky noise along flight lines,creates a challenge for airborne geophysical data processing and interpretation.Microleveling is the process to eliminate this residual noise and is now a standard areogeophysical data processing step.In this paper,we propose a two-step procedure for single aerogeophysical data microleveling:a deep convolutional network is first adopted as approximator to map the original data into a low-level part with nature geological structures and a corrugated residual which still contains high-level detail geological structures;second,the mixture of Gaussian robust principal component analysis(MoG-RPCA)is then used to separate the weak energy fine structures from the residual.The final microleveling result is the addition of low-level structures from deep convolutional network and fine structures from MoG-RPCA.The deep convolutional network does not need dataset for training and the handcrafted network serves as prior(deep image prior)to capture the low-level nature geological structures in the areogeophysical data.Experiments on synthetic data and field data demonstrate that the combination of deep convolutional network and MoG-RPCA is an effective framework for single areogeophysical data microleveling.展开更多
Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attentio...Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attention, challenges remain, especially when dealing with diverse data types. In this study, we introduce a novel data imputation method based on a modified convolutional neural network, specifically, a Deep Residual-Convolutional Neural Network (DRes-CNN) architecture designed to handle missing values across various datasets. Our approach demonstrates substantial improvements over existing imputation techniques by leveraging residual connections and optimized convolutional layers to capture complex data patterns. We evaluated the model on publicly available datasets, including Medical Information Mart for Intensive Care (MIMIC-III and MIMIC-IV), which contain critical care patient data, and the Beijing Multi-Site Air Quality dataset, which measures environmental air quality. The proposed DRes-CNN method achieved a root mean square error (RMSE) of 0.00006, highlighting its high accuracy and robustness. We also compared with Low Light-Convolutional Neural Network (LL-CNN) and U-Net methods, which had RMSE values of 0.00075 and 0.00073, respectively. This represented an improvement of approximately 92% over LL-CNN and 91% over U-Net. The results showed that this DRes-CNN-based imputation method outperforms current state-of-the-art models. These results established DRes-CNN as a reliable solution for addressing missing data.展开更多
The automatic individual identification of Amur tigers(Panthera tigris altaica)is important for population monitoring and making effective conservation strategies.Most existing research primarily relies on manual iden...The automatic individual identification of Amur tigers(Panthera tigris altaica)is important for population monitoring and making effective conservation strategies.Most existing research primarily relies on manual identifi-cation,which does not scale well to large datasets.In this paper,the deep convolution neural networks algorithm is constructed to implement the automatic individual identification for large numbers of Amur tiger images.The experimental data were obtained from 40 Amur tigers in Tieling Guaipo Tiger Park,China.The number of images collected from each tiger was approximately 200,and a total of 8277 images were obtained.The experiments were carried out on both the left and right side of body.Our results suggested that the recognition accuracy rate of left and right sides are 90.48%and 93.5%,respectively.The accuracy of our network has achieved the similar level compared to other state of the art networks like LeNet,ResNet34,and ZF_Net.The running time is much shorter than that of other networks.Consequently,this study can provide a new approach on automatic individual identification technology in the case of the Amur tiger.展开更多
Skin lesions are in a category of disease that is both common in humans and a major cause of death.The classification accuracy of skin lesions is a crucial determinant of the success rate of curing lethal diseases.Dee...Skin lesions are in a category of disease that is both common in humans and a major cause of death.The classification accuracy of skin lesions is a crucial determinant of the success rate of curing lethal diseases.Deep Convolutional Neural Networks(CNNs)are now the most prevalent computer algorithms for the purpose of disease classification.As with all algorithms,CNNs are sensitive to noise from imaging devices,which often contaminates the quality of the images that are fed into them.In this paper,a deep CNN(Inception-v3)is used to study the effect of image noise on the classification of skin lesions.Gaussian noise,impulse noise,and noise made up of a compound of the two are added to an image dataset,namely the Dermofit Image Library from the University of Edinburgh.Evaluations,based on t-distributed Stochastic Neighbor Embedding(t-SNE)visualization,Receiver Operating Characteristic(ROC)analysis,and saliency maps,demonstrate the reliability of the Inception-v3 deep CNN in classifying noisy skin lesion images.展开更多
The use of architectural morphological analysis and generative design is an important strategy to interpret current designs and to propose novel ones.Conventional morphological features are defined based on qualitativ...The use of architectural morphological analysis and generative design is an important strategy to interpret current designs and to propose novel ones.Conventional morphological features are defined based on qualitative descriptions or manually selected indicators,which include subjective bias,thus limiting generalizability.The lack of public architectural morphological datasets also leads to setbacks in data-driven morphological analysis.This study proposed a new method for generating topology-based synthetic data via a rule-based system and for encoding morphological information to promote morphological classification via deep learning.A deep convolution network,LeNet,which was modified in the output layer,was trained with synthetic data,including five spatial prototypes(central,linear,radial,cluster,and grid).The performance of the proposed method was validated on 40 practical architectural layouts.Compared to the ground truth,the proposed method provided an encouraging accuracy of 97.5%(39/40).Interestingly,the most possible mistakes of the LeNet were also understandable according to the architects intuitive perception.The proposed method considered the statistical and overall characteristics of the training samples.This work demonstrated the feasibility and effectiveness of the deep learning network trained with synthetic architectural patterns for morphological classification in practical architectural layouts.The findings of this work could serve as a basis for further morpho-topology studies and other social,building energy,and building structure studies related to spatial morphology.展开更多
Head pose estimation has been considered an important and challenging task in computer vision. In this paper we propose a novel method to estimate head pose based on a deep convolutional neural network (DCNN) for 2D...Head pose estimation has been considered an important and challenging task in computer vision. In this paper we propose a novel method to estimate head pose based on a deep convolutional neural network (DCNN) for 2D face images. We design an effective and simple method to roughly crop the face from the input image, maintaining the individual-relative facial features ratio. The method can be used in various poses. Then two convolutional neural networks are set up to train the head pose classifier and then compared with each other. The simpler one has six layers. It performs well on seven yaw poses but is somewhat unsatisfactory when mixed in two pitch poses. The other has eight layers and more pixels in input layers. It has better performance on more poses and more training samples. Before training the network, two reasonable strategies including shift and zoom are executed to prepare training samples. Finally, feature extraction filters are optimized together with the weight of the classification component through training, to minimize the classification error. Our method has been evaluated on the CAS-PEAL-R1, CMU PIE, and CUBIC FacePix databases. It has better performance than state-of-the-art methods for head pose estimation.展开更多
Deformable microcapsules are widely used in industries and also serve as a mechanical model of living biological cells.In this study,we develop a novel method,by integrating a deep convolutional neural network(DCNN)wi...Deformable microcapsules are widely used in industries and also serve as a mechanical model of living biological cells.In this study,we develop a novel method,by integrating a deep convolutional neural network(DCNN)with high-fidelity mechanistic capsule modelling,to identify the membrane constitutive law and estimate associated parameters of a microcapsule from its steady deformed profile in a capillary tube.Compared with conventional inverse methods,the present approach is more accurate and can increase the prediction throughput rate by a few orders of magnitude.It can process capsules with large deformation in inertial flows.Furthermore,the method can predict the capsule membrane shear elasticity,area dilatation modulus and initial inflation from a single steady capsule profile.We explore the mechanism that the DCNN makes decisions by considering its feature maps,and discuss their potential implication on the development of inverse methods.The present method provides a promising tool which may enable high-throughput mechanical characterisation of microcapsules and biological cells in microfluidic flows.展开更多
基金supported by the Chinese Academy of Science"Light of West China"Program(2022-XBQNXZ-015)the National Natural Science Foundation of China(11903071)the Operation,Maintenance and Upgrading Fund for Astronomical Telescopes and Facility Instruments,budgeted from the Ministry of Finance of China and administered by the Chinese Academy of Sciences。
文摘This paper addresses the performance degradation issue in a fast radio burst search pipeline based on deep learning.This issue is caused by the class imbalance of the radio frequency interference samples in the training dataset,and one solution is applied to improve the distribution of the training data by augmenting minority class samples using a deep convolutional generative adversarial network.Experi.mental results demonstrate that retraining the deep learning model with the newly generated dataset leads to a new fast radio burst classifier,which effectively reduces false positives caused by periodic wide-band impulsive radio frequency interference,thereby enhancing the performance of the search pipeline.
基金funded by the Deanship of Scientific Research at Princess Nourah bint Abdulrahman University,through the Research Funding Program,Grant No.(FRP-1443-15).
文摘The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the continuously advancing level of sophistication.To resolve this problem,efficient and flexible malware detection tools are needed.This work examines the possibility of employing deep CNNs to detect Android malware by transforming network traffic into image data representations.Moreover,the dataset used in this study is the CIC-AndMal2017,which contains 20,000 instances of network traffic across five distinct malware categories:a.Trojan,b.Adware,c.Ransomware,d.Spyware,e.Worm.These network traffic features are then converted to image formats for deep learning,which is applied in a CNN framework,including the VGG16 pre-trained model.In addition,our approach yielded high performance,yielding an accuracy of 0.92,accuracy of 99.1%,precision of 98.2%,recall of 99.5%,and F1 score of 98.7%.Subsequent improvements to the classification model through changes within the VGG19 framework improved the classification rate to 99.25%.Through the results obtained,it is clear that CNNs are a very effective way to classify Android malware,providing greater accuracy than conventional techniques.The success of this approach also shows the applicability of deep learning in mobile security along with the direction for the future advancement of the real-time detection system and other deeper learning techniques to counter the increasing number of threats emerging in the future.
基金supported by the Fundamental Research Funds for the Central Universities of China(Grant No.2013SCU11006)the Key Laboratory of Digital Mapping and Land Information Application of National Administration of Surveying,Mapping and Geoinformation of China(Grant NO.DM2014SC02)the Key Laboratory of Geospecial Information Technology,Ministry of Land and Resources of China(Grant NO.KLGSIT201504)
文摘The development of precision agriculture demands high accuracy and efficiency of cultivated land information extraction. As a new means of monitoring the ground in recent years, unmanned aerial vehicle (UAV) low-height remote sensing technique, which is flexible, efficient with low cost and with high resolution, is widely applied to investing various resources. Based on this, a novel extraction method for cultivated land information based on Deep Convolutional Neural Network and Transfer Learning (DTCLE) was proposed. First, linear features (roads and ridges etc.) were excluded based on Deep Convolutional Neural Network (DCNN). Next, feature extraction method learned from DCNN was used to cultivated land information extraction by introducing transfer learning mechanism. Last, cultivated land information extraction results were completed by the DTCLE and eCognifion for cultivated land information extraction (ECLE). The location of the Pengzhou County and Guanghan County, Sichuan Province were selected for the experimental purpose. The experimental results showed that the overall precision for the experimental image 1, 2 and 3 (of extracting cultivated land) with the DTCLE method was 91.7%, 88.1% and 88.2% respectively, and the overall precision of ECLE is 9o.7%, 90.5% and 87.0%, respectively. Accuracy of DTCLE was equivalent to that of ECLE, and also outperformed ECLE in terms of integrity and continuity.
基金This work was supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(NRF-2018R1A2B6007333).
文摘Compressive strength of concrete is a significant factor to assess building structure health and safety.Therefore,various methods have been developed to evaluate the compressive strength of concrete structures.However,previous methods have several challenges in costly,time-consuming,and unsafety.To address these drawbacks,this paper proposed a digital vision based concrete compressive strength evaluating model using deep convolutional neural network(DCNN).The proposed model presented an alternative approach to evaluating the concrete strength and contributed to improving efficiency and accuracy.The model was developed with 4,000 digital images and 61,996 images extracted from video recordings collected from concrete samples.The experimental results indicated a root mean square error(RMSE)value of 3.56(MPa),demonstrating a strong feasibility that the proposed model can be utilized to predict the concrete strength with digital images of their surfaces and advantages to overcome the previous limitations.This experiment contributed to provide the basis that could be extended to future research with image analysis technique and artificial neural network in the diagnosis of concrete building structures.
基金This work was supported by the Shanxi Province Applied Basic Research Project,China(Grant No.201901D111100).Xiaoli Hao received the grant,and the URL of the sponsors’website is http://kjt.shanxi.gov.cn/.
文摘In underground mining,the belt is a critical component,as its state directly affects the safe and stable operation of the conveyor.Most of the existing non-contact detection methods based on machine vision can only detect a single type of damage and they require pre-processing operations.This tends to cause a large amount of calculation and low detection precision.To solve these problems,in the work described in this paper a belt tear detection method based on a multi-class conditional deep convolutional generative adversarial network(CDCGAN)was designed.In the traditional DCGAN,the image generated by the generator has a certain degree of randomness.Here,a small number of labeled belt images are taken as conditions and added them to the generator and discriminator,so the generator can generate images with the characteristics of belt damage under the aforementioned conditions.Moreover,because the discriminator cannot identify multiple types of damage,the multi-class softmax function is used as the output function of the discriminator to output a vector of class probabilities,and it can accurately classify cracks,scratches,and tears.To avoid the features learned incompletely,skiplayer connection is adopted in the generator and discriminator.This not only can minimize the loss of features,but also improves the convergence speed.Compared with other algorithms,experimental results show that the loss value of the generator and discriminator is the least.Moreover,its convergence speed is faster,and the mean average precision of the proposed algorithm is up to 96.2%,which is at least 6%higher than that of other algorithms.
基金This work was supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(NRF-2018R1A2B6007333)This study was supported by 2018 Research Grant from Kangwon National University.
文摘In this study,we examined the efficacy of a deep convolutional neural network(DCNN)in recognizing concrete surface images and predicting the compressive strength of concrete.A digital single-lens reflex(DSLR)camera and microscope were simultaneously used to obtain concrete surface images used as the input data for the DCNN.Thereafter,training,validation,and testing of the DCNNs were performed based on the DSLR camera and microscope image data.Results of the analysis indicated that the DCNN employing DSLR image data achieved a relatively higher accuracy.The accuracy of the DSLR-derived image data was attributed to the relatively wider range of the DSLR camera,which was beneficial for extracting a larger number of features.Moreover,the DSLR camera procured more realistic images than the microscope.Thus,when the compressive strength of concrete was evaluated using the DCNN employing a DSLR camera,time and cost were reduced,whereas the usefulness increased.Furthermore,an indirect comparison of the accuracy of the DCNN with that of existing non-destructive methods for evaluating the strength of concrete proved the reliability of DCNN-derived concrete strength predictions.In addition,it was determined that the DCNN used for concrete strength evaluations in this study can be further expanded to detect and evaluate various deteriorative factors that affect the durability of structures,such as salt damage,carbonation,sulfation,corrosion,and freezing-thawing.
文摘Assessing the age of an individual via bones serves as a fool proof method in true determination of individual skills.Several attempts are reported in the past for assessment of chronological age of an individual based on variety of discriminative features found in wrist radiograph images.The permutation and combination of these features realized satisfactory accuracies for a set of limited groups.In this paper,assessment of gender for individuals of chronological age between 1-17 years is performed using left hand wrist radiograph images.A fully automated approach is proposed for removal of noise persisted due to non-uniform illumination during the process of radiograph acquisition process.Subsequent to this a computational technique for extraction of wrist region is proposed using operations on specific bit planes of image.A framework called GeNet of deep convolutional neural network is applied for classification of extracted wrist regions into male and female.The experimentations are conducted on the datasets of Radiological Society of North America(RSNA)of about 12442 images.Efficiency of preprocessing and segmentation techniques resulted into a correlation of about 99.09%.Performance of GeNet is evaluated on the extracted wrist regions resulting into an accuracy of 82.18%.
基金National Key R&D Program of China(2021YFC3000905)Open Research Program of the State Key Laboratory of Severe Weather(2022LASW-B09)National Natural Science Foundation of China(42375010)。
文摘In this paper,we utilized the deep convolutional neural network D-LinkNet,a model for semantic segmentation,to analyze the Himawari-8 satellite data captured from 16 channels at a spatial resolution of 0.5 km,with a focus on the area over the Yellow Sea and the Bohai Sea(32°-42°N,117°-127°E).The objective was to develop an algorithm for fusing and segmenting multi-channel images from geostationary meteorological satellites,specifically for monitoring sea fog in this region.Firstly,the extreme gradient boosting algorithm was adopted to evaluate the data from the 16 channels of the Himawari-8 satellite for sea fog detection,and we found that the top three channels in order of importance were channels 3,4,and 14,which were fused into false color daytime images,while channels 7,13,and 15 were fused into false color nighttime images.Secondly,the simple linear iterative super-pixel clustering algorithm was used for the pixel-level segmentation of false color images,and based on super-pixel blocks,manual sea-fog annotation was performed to obtain fine-grained annotation labels.The deep convolutional neural network D-LinkNet was built on the ResNet backbone and the dilated convolutional layers with direct connections were added in the central part to form a string-and-combine structure with five branches having different depths and receptive fields.Results show that the accuracy rate of fog area(proportion of detected real fog to detected fog)was 66.5%,the recognition rate of fog zone(proportion of detected real fog to real fog or cloud cover)was 51.9%,and the detection accuracy rate(proportion of samples detected correctly to total samples)was 93.2%.
文摘This paper proposes a cascade deep convolutional neural network to address the loosening detection problem of bolts on axlebox covers.Firstly,an SSD network based on ResNet50 and CBAM module by improving bolt image features is proposed for locating bolts on axlebox covers.And then,theA2-PFN is proposed according to the slender features of the marker lines for extracting more accurate marker lines regions of the bolts.Finally,a rectangular approximationmethod is proposed to regularize themarker line regions asaway tocalculate the angle of themarker line and plot all the angle values into an angle table,according to which the criteria of the angle table can determine whether the bolt with the marker line is in danger of loosening.Meanwhile,our improved algorithm is compared with the pre-improved algorithmin the object localization stage.The results show that our proposed method has a significant improvement in both detection accuracy and detection speed,where ourmAP(IoU=0.75)reaches 0.77 and fps reaches 16.6.And in the saliency detection stage,after qualitative comparison and quantitative comparison,our method significantly outperforms other state-of-the-art methods,where our MAE reaches 0.092,F-measure reaches 0.948 and AUC reaches 0.943.Ultimately,according to the angle table,out of 676 bolt samples,a total of 60 bolts are loose,69 bolts are at risk of loosening,and 547 bolts are tightened.
文摘Enabling high mobility applications in millimeter wave(mmWave)based systems opens up a slew of new possibilities,including vehicle communi-cations in addition to wireless virtual/augmented reality.The narrow beam usage in addition to the millimeter waves sensitivity might block the coverage along with the reliability of the mobile links.In this research work,the improvement in the quality of experience faced by the user for multimedia-related applications over the millimeter-wave band is investigated.The high attenuation loss in high frequencies is compensated with a massive array structure named Multiple Input and Multiple Output(MIMO)which is utilized in a hyperdense environment called heterogeneous networks(HetNet).The optimization problem which arises while maximizing the Mean Opinion Score(MOS)is analyzed along with the QoE(Quality of Experience)metric by considering the Base Station(BS)powers in addition to the needed Quality of Service(QoS).Most of the approaches related to wireless network communication are not suitable for the millimeter-wave band because of its problems due to high complexity and its dynamic nature.Hence a deep reinforcement learning framework is developed for tackling the same opti-mization problem.In this work,a Fuzzy-based Deep Convolutional Neural Net-work(FDCNN)is proposed in addition to a Deep Reinforcing Learning Framework(DRLF)for extracting the features of highly correlated data.The investigational results prove that the proposed method yields the highest satisfac-tion to the user by increasing the number of antennas in addition with the small-scale antennas at the base stations.The proposed work outperforms in terms of MOS with multiple antennas.
基金Supported by the Strategy Priority Research Program of Chinese Academy of Sciences(No.XDC02070600).
文摘Graph embedding aims to map the high-dimensional nodes to a low-dimensional space and learns the graph relationship from its latent representations.Most existing graph embedding methods focus on the topological structure of graph data,but ignore the semantic information of graph data,which results in the unsatisfied performance in practical applications.To overcome the problem,this paper proposes a novel deep convolutional adversarial graph autoencoder(GAE)model.To embed the semantic information between nodes in the graph data,the random walk strategy is first used to construct the positive pointwise mutual information(PPMI)matrix,then,graph convolutional net-work(GCN)is employed to encode the PPMI matrix and node content into the latent representation.Finally,the learned latent representation is used to reconstruct the topological structure of the graph data by decoder.Furthermore,the deep convolutional adversarial training algorithm is introduced to make the learned latent representation conform to the prior distribution better.The state-of-the-art experimental results on the graph data validate the effectiveness of the proposed model in the link prediction,node clustering and graph visualization tasks for three standard datasets,Cora,Citeseer and Pubmed.
文摘Combining both visible and infrared object information, multispectral data is a promising source data for automatic maritime ship recognition. In this paper, in order to take advantage of deep convolutional neural network and multispectral data, we model multispectral ship recognition task into a convolutional feature fusion problem, and propose a feature fusion architecture called Hybrid Fusion. We fine-tune the VGG-16 model pre-trained on ImageNet through three channels single spectral image and four channels multispectral images, and use existing regularization techniques to avoid over-fitting problem. Hybrid Fusion as well as the other three feature fusion architectures is investigated. Each fusion architecture consists of visible image and infrared image feature extraction branches, in which the pre-trained and fine-tuned VGG-16 models are taken as feature extractor. In each fusion architecture, image features of two branches are firstly extracted from the same layer or different layers of VGG-16 model. Subsequently, the features extracted from the two branches are flattened and concatenated to produce a multispectral feature vector, which is finally fed into a classifier to achieve ship recognition task. Furthermore, based on these fusion architectures, we also evaluate recognition performance of a feature vector normalization method and three combinations of feature extractors. Experimental results on the visible and infrared ship (VAIS) dataset show that the best Hybrid Fusion achieves 89.6% mean per-class recognition accuracy on daytime paired images and 64.9% on nighttime infrared images, and outperforms the state-of-the-art method by 1.4% and 3.9%, respectively.
文摘The novel coronavirus 2019(COVID-19)rapidly spreading around the world and turns into a pandemic situation,consequently,detecting the coronavirus(COVID-19)affected patients are now the most critical task for medical specialists.The deficiency of medical testing kits leading to huge complexity in detecting COVID-19 patients worldwide,resulting in the number of infected cases is expanding.Therefore,a significant study is necessary about detecting COVID-19 patients using an automated diagnosis method,which hinders the spreading of coronavirus.In this paper,the study suggests a Deep Convolutional Neural Network-based multi-classification framework(COV-MCNet)using eight different pre-trained architectures such as VGG16,VGG19,ResNet50V2,DenseNet201,InceptionV3,MobileNet,InceptionResNetV2,Xception which are trained and tested on the X-ray images of COVID-19,Normal,Viral Pneumonia,and Bacterial Pneumonia.The results from 4-class(Normal vs.COVID-19 vs.Viral Pneumonia vs.Bacterial Pneumonia)demonstrated that the pre-trained model DenseNet201 provides the highest classification performance(accuracy:92.54%,precision:93.05%,recall:92.81%,F1-score:92.83%,specificity:97.47%).Notably,the DenseNet201(4-class classification)pre-trained model in the proposed COV-MCNet framework showed higher accuracy compared to the rest seven models.Important to mention that the proposed COV-MCNet model showed comparatively higher classification accuracy based on the small number of pre-processed datasets that specifies the designed system can produce superior results when more data become available.The proposed multi-classification network(COV-MCNet)significantly speeds up the existing radiology based method which will be helpful for the medical community and clinical specialists to early diagnosis the COVID-19 cases during this pandemic.
文摘Residual magnetic error remains after standard levelling process.The weak non-geological effect,manifesting itself as streaky noise along flight lines,creates a challenge for airborne geophysical data processing and interpretation.Microleveling is the process to eliminate this residual noise and is now a standard areogeophysical data processing step.In this paper,we propose a two-step procedure for single aerogeophysical data microleveling:a deep convolutional network is first adopted as approximator to map the original data into a low-level part with nature geological structures and a corrugated residual which still contains high-level detail geological structures;second,the mixture of Gaussian robust principal component analysis(MoG-RPCA)is then used to separate the weak energy fine structures from the residual.The final microleveling result is the addition of low-level structures from deep convolutional network and fine structures from MoG-RPCA.The deep convolutional network does not need dataset for training and the handcrafted network serves as prior(deep image prior)to capture the low-level nature geological structures in the areogeophysical data.Experiments on synthetic data and field data demonstrate that the combination of deep convolutional network and MoG-RPCA is an effective framework for single areogeophysical data microleveling.
基金supported by the Intelligent System Research Group(ISysRG)supported by Universitas Sriwijaya funded by the Competitive Research 2024.
文摘Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attention, challenges remain, especially when dealing with diverse data types. In this study, we introduce a novel data imputation method based on a modified convolutional neural network, specifically, a Deep Residual-Convolutional Neural Network (DRes-CNN) architecture designed to handle missing values across various datasets. Our approach demonstrates substantial improvements over existing imputation techniques by leveraging residual connections and optimized convolutional layers to capture complex data patterns. We evaluated the model on publicly available datasets, including Medical Information Mart for Intensive Care (MIMIC-III and MIMIC-IV), which contain critical care patient data, and the Beijing Multi-Site Air Quality dataset, which measures environmental air quality. The proposed DRes-CNN method achieved a root mean square error (RMSE) of 0.00006, highlighting its high accuracy and robustness. We also compared with Low Light-Convolutional Neural Network (LL-CNN) and U-Net methods, which had RMSE values of 0.00075 and 0.00073, respectively. This represented an improvement of approximately 92% over LL-CNN and 91% over U-Net. The results showed that this DRes-CNN-based imputation method outperforms current state-of-the-art models. These results established DRes-CNN as a reliable solution for addressing missing data.
基金the Fundamental Research Funds for the Central Universities(2572018BC07,2572017PZ14)the Heilongjiang postdoctoral project fund project(LBH-Z18003)+2 种基金Biodiversity Survey,Monitoring and Assessment Project of Ministry of Ecology and Environment,China(2019HB2096001006)the National Natural Science Foundation of China(NSFC 31872241,31572285)the Individual Identification Technological Research on Camera-trapping images of Amur tigers(NFGA 2017).
文摘The automatic individual identification of Amur tigers(Panthera tigris altaica)is important for population monitoring and making effective conservation strategies.Most existing research primarily relies on manual identifi-cation,which does not scale well to large datasets.In this paper,the deep convolution neural networks algorithm is constructed to implement the automatic individual identification for large numbers of Amur tiger images.The experimental data were obtained from 40 Amur tigers in Tieling Guaipo Tiger Park,China.The number of images collected from each tiger was approximately 200,and a total of 8277 images were obtained.The experiments were carried out on both the left and right side of body.Our results suggested that the recognition accuracy rate of left and right sides are 90.48%and 93.5%,respectively.The accuracy of our network has achieved the similar level compared to other state of the art networks like LeNet,ResNet34,and ZF_Net.The running time is much shorter than that of other networks.Consequently,this study can provide a new approach on automatic individual identification technology in the case of the Amur tiger.
文摘Skin lesions are in a category of disease that is both common in humans and a major cause of death.The classification accuracy of skin lesions is a crucial determinant of the success rate of curing lethal diseases.Deep Convolutional Neural Networks(CNNs)are now the most prevalent computer algorithms for the purpose of disease classification.As with all algorithms,CNNs are sensitive to noise from imaging devices,which often contaminates the quality of the images that are fed into them.In this paper,a deep CNN(Inception-v3)is used to study the effect of image noise on the classification of skin lesions.Gaussian noise,impulse noise,and noise made up of a compound of the two are added to an image dataset,namely the Dermofit Image Library from the University of Edinburgh.Evaluations,based on t-distributed Stochastic Neighbor Embedding(t-SNE)visualization,Receiver Operating Characteristic(ROC)analysis,and saliency maps,demonstrate the reliability of the Inception-v3 deep CNN in classifying noisy skin lesion images.
基金This research was funded by National Key REtD Program of China(2017YFC0702302 in 2017YFC0702300)National Natural Science Foundation of China(Project No.51978139)Fundamental Research Funds for the Central Universities and Postgraduate Research Innovation Program of Jiangsu Province(Project No.KYCX20_0110).
文摘The use of architectural morphological analysis and generative design is an important strategy to interpret current designs and to propose novel ones.Conventional morphological features are defined based on qualitative descriptions or manually selected indicators,which include subjective bias,thus limiting generalizability.The lack of public architectural morphological datasets also leads to setbacks in data-driven morphological analysis.This study proposed a new method for generating topology-based synthetic data via a rule-based system and for encoding morphological information to promote morphological classification via deep learning.A deep convolution network,LeNet,which was modified in the output layer,was trained with synthetic data,including five spatial prototypes(central,linear,radial,cluster,and grid).The performance of the proposed method was validated on 40 practical architectural layouts.Compared to the ground truth,the proposed method provided an encouraging accuracy of 97.5%(39/40).Interestingly,the most possible mistakes of the LeNet were also understandable according to the architects intuitive perception.The proposed method considered the statistical and overall characteristics of the training samples.This work demonstrated the feasibility and effectiveness of the deep learning network trained with synthetic architectural patterns for morphological classification in practical architectural layouts.The findings of this work could serve as a basis for further morpho-topology studies and other social,building energy,and building structure studies related to spatial morphology.
基金Project supported by the National Key Scientific Instrument and Equipment Development Project of China(No.2013YQ49087903)the National Natural Science Foundation of China(No.61402307)the Educational Commission of Sichuan Province,China(No.15ZA0007)
文摘Head pose estimation has been considered an important and challenging task in computer vision. In this paper we propose a novel method to estimate head pose based on a deep convolutional neural network (DCNN) for 2D face images. We design an effective and simple method to roughly crop the face from the input image, maintaining the individual-relative facial features ratio. The method can be used in various poses. Then two convolutional neural networks are set up to train the head pose classifier and then compared with each other. The simpler one has six layers. It performs well on seven yaw poses but is somewhat unsatisfactory when mixed in two pitch poses. The other has eight layers and more pixels in input layers. It has better performance on more poses and more training samples. Before training the network, two reasonable strategies including shift and zoom are executed to prepare training samples. Finally, feature extraction filters are optimized together with the weight of the classification component through training, to minimize the classification error. Our method has been evaluated on the CAS-PEAL-R1, CMU PIE, and CUBIC FacePix databases. It has better performance than state-of-the-art methods for head pose estimation.
基金supported by the UK Engineering and Physical Science Research Council(EP/K000128/1)and the China Scholarship Council.
文摘Deformable microcapsules are widely used in industries and also serve as a mechanical model of living biological cells.In this study,we develop a novel method,by integrating a deep convolutional neural network(DCNN)with high-fidelity mechanistic capsule modelling,to identify the membrane constitutive law and estimate associated parameters of a microcapsule from its steady deformed profile in a capillary tube.Compared with conventional inverse methods,the present approach is more accurate and can increase the prediction throughput rate by a few orders of magnitude.It can process capsules with large deformation in inertial flows.Furthermore,the method can predict the capsule membrane shear elasticity,area dilatation modulus and initial inflation from a single steady capsule profile.We explore the mechanism that the DCNN makes decisions by considering its feature maps,and discuss their potential implication on the development of inverse methods.The present method provides a promising tool which may enable high-throughput mechanical characterisation of microcapsules and biological cells in microfluidic flows.