In this paper,we utilized the deep convolutional neural network D-LinkNet,a model for semantic segmentation,to analyze the Himawari-8 satellite data captured from 16 channels at a spatial resolution of 0.5 km,with a f...In this paper,we utilized the deep convolutional neural network D-LinkNet,a model for semantic segmentation,to analyze the Himawari-8 satellite data captured from 16 channels at a spatial resolution of 0.5 km,with a focus on the area over the Yellow Sea and the Bohai Sea(32°-42°N,117°-127°E).The objective was to develop an algorithm for fusing and segmenting multi-channel images from geostationary meteorological satellites,specifically for monitoring sea fog in this region.Firstly,the extreme gradient boosting algorithm was adopted to evaluate the data from the 16 channels of the Himawari-8 satellite for sea fog detection,and we found that the top three channels in order of importance were channels 3,4,and 14,which were fused into false color daytime images,while channels 7,13,and 15 were fused into false color nighttime images.Secondly,the simple linear iterative super-pixel clustering algorithm was used for the pixel-level segmentation of false color images,and based on super-pixel blocks,manual sea-fog annotation was performed to obtain fine-grained annotation labels.The deep convolutional neural network D-LinkNet was built on the ResNet backbone and the dilated convolutional layers with direct connections were added in the central part to form a string-and-combine structure with five branches having different depths and receptive fields.Results show that the accuracy rate of fog area(proportion of detected real fog to detected fog)was 66.5%,the recognition rate of fog zone(proportion of detected real fog to real fog or cloud cover)was 51.9%,and the detection accuracy rate(proportion of samples detected correctly to total samples)was 93.2%.展开更多
The development of precision agriculture demands high accuracy and efficiency of cultivated land information extraction. As a new means of monitoring the ground in recent years, unmanned aerial vehicle (UAV) low-hei...The development of precision agriculture demands high accuracy and efficiency of cultivated land information extraction. As a new means of monitoring the ground in recent years, unmanned aerial vehicle (UAV) low-height remote sensing technique, which is flexible, efficient with low cost and with high resolution, is widely applied to investing various resources. Based on this, a novel extraction method for cultivated land information based on Deep Convolutional Neural Network and Transfer Learning (DTCLE) was proposed. First, linear features (roads and ridges etc.) were excluded based on Deep Convolutional Neural Network (DCNN). Next, feature extraction method learned from DCNN was used to cultivated land information extraction by introducing transfer learning mechanism. Last, cultivated land information extraction results were completed by the DTCLE and eCognifion for cultivated land information extraction (ECLE). The location of the Pengzhou County and Guanghan County, Sichuan Province were selected for the experimental purpose. The experimental results showed that the overall precision for the experimental image 1, 2 and 3 (of extracting cultivated land) with the DTCLE method was 91.7%, 88.1% and 88.2% respectively, and the overall precision of ECLE is 9o.7%, 90.5% and 87.0%, respectively. Accuracy of DTCLE was equivalent to that of ECLE, and also outperformed ECLE in terms of integrity and continuity.展开更多
Compressive strength of concrete is a significant factor to assess building structure health and safety.Therefore,various methods have been developed to evaluate the compressive strength of concrete structures.However...Compressive strength of concrete is a significant factor to assess building structure health and safety.Therefore,various methods have been developed to evaluate the compressive strength of concrete structures.However,previous methods have several challenges in costly,time-consuming,and unsafety.To address these drawbacks,this paper proposed a digital vision based concrete compressive strength evaluating model using deep convolutional neural network(DCNN).The proposed model presented an alternative approach to evaluating the concrete strength and contributed to improving efficiency and accuracy.The model was developed with 4,000 digital images and 61,996 images extracted from video recordings collected from concrete samples.The experimental results indicated a root mean square error(RMSE)value of 3.56(MPa),demonstrating a strong feasibility that the proposed model can be utilized to predict the concrete strength with digital images of their surfaces and advantages to overcome the previous limitations.This experiment contributed to provide the basis that could be extended to future research with image analysis technique and artificial neural network in the diagnosis of concrete building structures.展开更多
In this study,we examined the efficacy of a deep convolutional neural network(DCNN)in recognizing concrete surface images and predicting the compressive strength of concrete.A digital single-lens reflex(DSLR)camera an...In this study,we examined the efficacy of a deep convolutional neural network(DCNN)in recognizing concrete surface images and predicting the compressive strength of concrete.A digital single-lens reflex(DSLR)camera and microscope were simultaneously used to obtain concrete surface images used as the input data for the DCNN.Thereafter,training,validation,and testing of the DCNNs were performed based on the DSLR camera and microscope image data.Results of the analysis indicated that the DCNN employing DSLR image data achieved a relatively higher accuracy.The accuracy of the DSLR-derived image data was attributed to the relatively wider range of the DSLR camera,which was beneficial for extracting a larger number of features.Moreover,the DSLR camera procured more realistic images than the microscope.Thus,when the compressive strength of concrete was evaluated using the DCNN employing a DSLR camera,time and cost were reduced,whereas the usefulness increased.Furthermore,an indirect comparison of the accuracy of the DCNN with that of existing non-destructive methods for evaluating the strength of concrete proved the reliability of DCNN-derived concrete strength predictions.In addition,it was determined that the DCNN used for concrete strength evaluations in this study can be further expanded to detect and evaluate various deteriorative factors that affect the durability of structures,such as salt damage,carbonation,sulfation,corrosion,and freezing-thawing.展开更多
This paper proposes a cascade deep convolutional neural network to address the loosening detection problem of bolts on axlebox covers.Firstly,an SSD network based on ResNet50 and CBAM module by improving bolt image fe...This paper proposes a cascade deep convolutional neural network to address the loosening detection problem of bolts on axlebox covers.Firstly,an SSD network based on ResNet50 and CBAM module by improving bolt image features is proposed for locating bolts on axlebox covers.And then,theA2-PFN is proposed according to the slender features of the marker lines for extracting more accurate marker lines regions of the bolts.Finally,a rectangular approximationmethod is proposed to regularize themarker line regions asaway tocalculate the angle of themarker line and plot all the angle values into an angle table,according to which the criteria of the angle table can determine whether the bolt with the marker line is in danger of loosening.Meanwhile,our improved algorithm is compared with the pre-improved algorithmin the object localization stage.The results show that our proposed method has a significant improvement in both detection accuracy and detection speed,where ourmAP(IoU=0.75)reaches 0.77 and fps reaches 16.6.And in the saliency detection stage,after qualitative comparison and quantitative comparison,our method significantly outperforms other state-of-the-art methods,where our MAE reaches 0.092,F-measure reaches 0.948 and AUC reaches 0.943.Ultimately,according to the angle table,out of 676 bolt samples,a total of 60 bolts are loose,69 bolts are at risk of loosening,and 547 bolts are tightened.展开更多
Enabling high mobility applications in millimeter wave(mmWave)based systems opens up a slew of new possibilities,including vehicle communi-cations in addition to wireless virtual/augmented reality.The narrow beam usag...Enabling high mobility applications in millimeter wave(mmWave)based systems opens up a slew of new possibilities,including vehicle communi-cations in addition to wireless virtual/augmented reality.The narrow beam usage in addition to the millimeter waves sensitivity might block the coverage along with the reliability of the mobile links.In this research work,the improvement in the quality of experience faced by the user for multimedia-related applications over the millimeter-wave band is investigated.The high attenuation loss in high frequencies is compensated with a massive array structure named Multiple Input and Multiple Output(MIMO)which is utilized in a hyperdense environment called heterogeneous networks(HetNet).The optimization problem which arises while maximizing the Mean Opinion Score(MOS)is analyzed along with the QoE(Quality of Experience)metric by considering the Base Station(BS)powers in addition to the needed Quality of Service(QoS).Most of the approaches related to wireless network communication are not suitable for the millimeter-wave band because of its problems due to high complexity and its dynamic nature.Hence a deep reinforcement learning framework is developed for tackling the same opti-mization problem.In this work,a Fuzzy-based Deep Convolutional Neural Net-work(FDCNN)is proposed in addition to a Deep Reinforcing Learning Framework(DRLF)for extracting the features of highly correlated data.The investigational results prove that the proposed method yields the highest satisfac-tion to the user by increasing the number of antennas in addition with the small-scale antennas at the base stations.The proposed work outperforms in terms of MOS with multiple antennas.展开更多
The novel coronavirus 2019(COVID-19)rapidly spreading around the world and turns into a pandemic situation,consequently,detecting the coronavirus(COVID-19)affected patients are now the most critical task for medical s...The novel coronavirus 2019(COVID-19)rapidly spreading around the world and turns into a pandemic situation,consequently,detecting the coronavirus(COVID-19)affected patients are now the most critical task for medical specialists.The deficiency of medical testing kits leading to huge complexity in detecting COVID-19 patients worldwide,resulting in the number of infected cases is expanding.Therefore,a significant study is necessary about detecting COVID-19 patients using an automated diagnosis method,which hinders the spreading of coronavirus.In this paper,the study suggests a Deep Convolutional Neural Network-based multi-classification framework(COV-MCNet)using eight different pre-trained architectures such as VGG16,VGG19,ResNet50V2,DenseNet201,InceptionV3,MobileNet,InceptionResNetV2,Xception which are trained and tested on the X-ray images of COVID-19,Normal,Viral Pneumonia,and Bacterial Pneumonia.The results from 4-class(Normal vs.COVID-19 vs.Viral Pneumonia vs.Bacterial Pneumonia)demonstrated that the pre-trained model DenseNet201 provides the highest classification performance(accuracy:92.54%,precision:93.05%,recall:92.81%,F1-score:92.83%,specificity:97.47%).Notably,the DenseNet201(4-class classification)pre-trained model in the proposed COV-MCNet framework showed higher accuracy compared to the rest seven models.Important to mention that the proposed COV-MCNet model showed comparatively higher classification accuracy based on the small number of pre-processed datasets that specifies the designed system can produce superior results when more data become available.The proposed multi-classification network(COV-MCNet)significantly speeds up the existing radiology based method which will be helpful for the medical community and clinical specialists to early diagnosis the COVID-19 cases during this pandemic.展开更多
Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attentio...Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attention, challenges remain, especially when dealing with diverse data types. In this study, we introduce a novel data imputation method based on a modified convolutional neural network, specifically, a Deep Residual-Convolutional Neural Network (DRes-CNN) architecture designed to handle missing values across various datasets. Our approach demonstrates substantial improvements over existing imputation techniques by leveraging residual connections and optimized convolutional layers to capture complex data patterns. We evaluated the model on publicly available datasets, including Medical Information Mart for Intensive Care (MIMIC-III and MIMIC-IV), which contain critical care patient data, and the Beijing Multi-Site Air Quality dataset, which measures environmental air quality. The proposed DRes-CNN method achieved a root mean square error (RMSE) of 0.00006, highlighting its high accuracy and robustness. We also compared with Low Light-Convolutional Neural Network (LL-CNN) and U-Net methods, which had RMSE values of 0.00075 and 0.00073, respectively. This represented an improvement of approximately 92% over LL-CNN and 91% over U-Net. The results showed that this DRes-CNN-based imputation method outperforms current state-of-the-art models. These results established DRes-CNN as a reliable solution for addressing missing data.展开更多
The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the ...The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the continuously advancing level of sophistication.To resolve this problem,efficient and flexible malware detection tools are needed.This work examines the possibility of employing deep CNNs to detect Android malware by transforming network traffic into image data representations.Moreover,the dataset used in this study is the CIC-AndMal2017,which contains 20,000 instances of network traffic across five distinct malware categories:a.Trojan,b.Adware,c.Ransomware,d.Spyware,e.Worm.These network traffic features are then converted to image formats for deep learning,which is applied in a CNN framework,including the VGG16 pre-trained model.In addition,our approach yielded high performance,yielding an accuracy of 0.92,accuracy of 99.1%,precision of 98.2%,recall of 99.5%,and F1 score of 98.7%.Subsequent improvements to the classification model through changes within the VGG19 framework improved the classification rate to 99.25%.Through the results obtained,it is clear that CNNs are a very effective way to classify Android malware,providing greater accuracy than conventional techniques.The success of this approach also shows the applicability of deep learning in mobile security along with the direction for the future advancement of the real-time detection system and other deeper learning techniques to counter the increasing number of threats emerging in the future.展开更多
Background:Myopic maculopathy(MM)has become a major cause of visual impairment and blindness worldwide,especially in East Asian countries.Deep learning approaches such as deep convolutional neural networks(DCNN)have b...Background:Myopic maculopathy(MM)has become a major cause of visual impairment and blindness worldwide,especially in East Asian countries.Deep learning approaches such as deep convolutional neural networks(DCNN)have been successfully applied to identify some common retinal diseases and show great potential for the intelligent analysis of MM.This study aimed to build a reliable approach for automated detection of MM from retinal fundus images using DCNN models.Methods:A dual-stream DCNN(DCNN-DS)model that perceives features from both original images and corresponding processed images by color histogram distribution optimization method was designed for classification of no MM,tessellated fundus(TF),and pathologic myopia(PM).A total of 36,515 gradable images from four hospitals were used for DCNN model development,and 14,986 gradable images from the other two hospitals for external testing.We also compared the performance of the DCNN-DS model and four ophthalmologists on 3000 randomly sampledfundus images.Results:The DCNN-DS model achieved sensitivities of 93.3%and 91.0%,specificities of 99.6%and 98.7%,areas under the receiver operating characteristic curves(AUCs)of 0.998 and 0.994 for detecting PM,whereas sensitivities of 98.8%and 92.8%,specificities of 95.6%and 94.1%,AUCs of 0.986 and 0.970 for detecting TF in two external testing datasets.In the sampled testing dataset,the sensitivities of four ophthalmologists ranged from 88.3%to 95.8%and 81.1%to 89.1%,and the specificities ranged from 95.9%to 99.2%and 77.8%to 97.3%for detecting PM and TF,respectively.Meanwhile,the DCNN-DS model achieved sensitivities of 90.8%and 97.9%and specificities of 99.1%and 94.0%for detecting PMand T,respectively.Conclusions:The proposed DCNN-DS approach demonstrated reliable performance with high sensitivity,specificity,and AUC to classify different MM levels on fundus photographs sourced from clinics.It can help identify MM automatically among the large myopic groups and show great potential for real-life applications.展开更多
With the development of artificial intelligence,emotion recognition has become a hot topic in the field of humancomputer interaction.This paper focuses on the application and optimization of deep convolutional neural ...With the development of artificial intelligence,emotion recognition has become a hot topic in the field of humancomputer interaction.This paper focuses on the application and optimization of deep convolutional neural networks(CNNs)in multimodal emotion recognition.Multimodal emotion recognition involves analyzing data from different sources—such as voice,facial expressions,and text—to more accurately identify and interpret human emotional states.This paper first reviews the basic theories and methods of multimodal data processing,then details the structure and function of deep convolutional neural networks,particularly their advantages in handling various types of data.By innovating and optimizing network structures,loss functions,and training strategies,we have improved the model's accuracy in emotion recognition.Ultimately,experimental results show that the optimized CNN model demonstrates superior performance in multimodal emotion recognition tasks.展开更多
The combination of fingerprint positioning and 5G(the 5th Generation Mobile Communication Technology)offers broader application prospects for indoor positioning technology,but also brings challenges in real-time perfo...The combination of fingerprint positioning and 5G(the 5th Generation Mobile Communication Technology)offers broader application prospects for indoor positioning technology,but also brings challenges in real-time performance.In this paper,we propose a fingerprint positioning method based on a deep convolutional neural network(DCNN)using a classification approach in a single-base station scenario for massive multiple input multiple outputorthogonal frequency division multiplexing(MIMO-OFDM)systems.We introduce an angle-delay domain fingerprint matrix that simplifies the computation process and increases the location differentiation.The cosine distance is chosen as the fingerprint similarity criterion due to its sensitivity to angular differences.First,the DCNN model is used to determine the sub-area to which the mobile terminal belongs,and then the weighted K-nearest neighbor(WKNN)matching algorithm is used to estimate the position within the sub-area.The positioning performance is simulated in a DeepMIMO indoor environment,showing that the classification DCNN method reduces the positioning time by 77.05%compared to the non-classification method,with only a 1.08%increase in average positioning error.展开更多
Graph Convolutional Neural Networks(GCNs)have been widely used in various fields due to their powerful capabilities in processing graph-structured data.However,GCNs encounter significant challenges when applied to sca...Graph Convolutional Neural Networks(GCNs)have been widely used in various fields due to their powerful capabilities in processing graph-structured data.However,GCNs encounter significant challenges when applied to scale-free graphs with power-law distributions,resulting in substantial distortions.Moreover,most of the existing GCN models are shallow structures,which restricts their ability to capture dependencies among distant nodes and more refined high-order node features in scale-free graphs with hierarchical structures.To more broadly and precisely apply GCNs to real-world graphs exhibiting scale-free or hierarchical structures and utilize multi-level aggregation of GCNs for capturing high-level information in local representations,we propose the Hyperbolic Deep Graph Convolutional Neural Network(HDGCNN),an end-to-end deep graph representation learning framework that can map scale-free graphs from Euclidean space to hyperbolic space.In HDGCNN,we define the fundamental operations of deep graph convolutional neural networks in hyperbolic space.Additionally,we introduce a hyperbolic feature transformation method based on identity mapping and a dense connection scheme based on a novel non-local message passing framework.In addition,we present a neighborhood aggregation method that combines initial structural featureswith hyperbolic attention coefficients.Through the above methods,HDGCNN effectively leverages both the structural features and node features of graph data,enabling enhanced exploration of non-local structural features and more refined node features in scale-free or hierarchical graphs.Experimental results demonstrate that HDGCNN achieves remarkable performance improvements over state-ofthe-art GCNs in node classification and link prediction tasks,even when utilizing low-dimensional embedding representations.Furthermore,when compared to shallow hyperbolic graph convolutional neural network models,HDGCNN exhibits notable advantages and performance enhancements.展开更多
The fifth-generation (5G) communication requires a highly accurate estimation of the channel state information (CSI)to take advantage of the massive multiple-input multiple-output(MIMO) system. However, traditional ch...The fifth-generation (5G) communication requires a highly accurate estimation of the channel state information (CSI)to take advantage of the massive multiple-input multiple-output(MIMO) system. However, traditional channel estimation methods do not always yield reliable estimates. The methodology of this paper consists of deep residual shrinkage network (DRSN)neural network-based method that is used to solve this problem.Thus, the channel estimation approach, based on DRSN with its learning ability of noise-containing data, is first introduced. Then,the DRSN is used to train the noise reduction process based on the results of the least square (LS) channel estimation while applying the pilot frequency subcarriers, where the initially estimated subcarrier channel matrix is considered as a three-dimensional tensor of the DRSN input. Afterward, a mixed signal to noise ratio (SNR) training data strategy is proposed based on the learning ability of DRSN under different SNRs. Moreover, a joint mixed scenario training strategy is carried out to test the multi scenarios robustness of DRSN. As for the findings, the numerical results indicate that the DRSN method outperforms the spatial-frequency-temporal convolutional neural networks (SF-CNN)with similar computational complexity and achieves better advantages in the full SNR range than the minimum mean squared error (MMSE) estimator with a limited dataset. Moreover, the DRSN approach shows robustness in different propagation environments.展开更多
The automatic individual identification of Amur tigers(Panthera tigris altaica)is important for population monitoring and making effective conservation strategies.Most existing research primarily relies on manual iden...The automatic individual identification of Amur tigers(Panthera tigris altaica)is important for population monitoring and making effective conservation strategies.Most existing research primarily relies on manual identifi-cation,which does not scale well to large datasets.In this paper,the deep convolution neural networks algorithm is constructed to implement the automatic individual identification for large numbers of Amur tiger images.The experimental data were obtained from 40 Amur tigers in Tieling Guaipo Tiger Park,China.The number of images collected from each tiger was approximately 200,and a total of 8277 images were obtained.The experiments were carried out on both the left and right side of body.Our results suggested that the recognition accuracy rate of left and right sides are 90.48%and 93.5%,respectively.The accuracy of our network has achieved the similar level compared to other state of the art networks like LeNet,ResNet34,and ZF_Net.The running time is much shorter than that of other networks.Consequently,this study can provide a new approach on automatic individual identification technology in the case of the Amur tiger.展开更多
Skin lesions are in a category of disease that is both common in humans and a major cause of death.The classification accuracy of skin lesions is a crucial determinant of the success rate of curing lethal diseases.Dee...Skin lesions are in a category of disease that is both common in humans and a major cause of death.The classification accuracy of skin lesions is a crucial determinant of the success rate of curing lethal diseases.Deep Convolutional Neural Networks(CNNs)are now the most prevalent computer algorithms for the purpose of disease classification.As with all algorithms,CNNs are sensitive to noise from imaging devices,which often contaminates the quality of the images that are fed into them.In this paper,a deep CNN(Inception-v3)is used to study the effect of image noise on the classification of skin lesions.Gaussian noise,impulse noise,and noise made up of a compound of the two are added to an image dataset,namely the Dermofit Image Library from the University of Edinburgh.Evaluations,based on t-distributed Stochastic Neighbor Embedding(t-SNE)visualization,Receiver Operating Characteristic(ROC)analysis,and saliency maps,demonstrate the reliability of the Inception-v3 deep CNN in classifying noisy skin lesion images.展开更多
Deformable microcapsules are widely used in industries and also serve as a mechanical model of living biological cells.In this study,we develop a novel method,by integrating a deep convolutional neural network(DCNN)wi...Deformable microcapsules are widely used in industries and also serve as a mechanical model of living biological cells.In this study,we develop a novel method,by integrating a deep convolutional neural network(DCNN)with high-fidelity mechanistic capsule modelling,to identify the membrane constitutive law and estimate associated parameters of a microcapsule from its steady deformed profile in a capillary tube.Compared with conventional inverse methods,the present approach is more accurate and can increase the prediction throughput rate by a few orders of magnitude.It can process capsules with large deformation in inertial flows.Furthermore,the method can predict the capsule membrane shear elasticity,area dilatation modulus and initial inflation from a single steady capsule profile.We explore the mechanism that the DCNN makes decisions by considering its feature maps,and discuss their potential implication on the development of inverse methods.The present method provides a promising tool which may enable high-throughput mechanical characterisation of microcapsules and biological cells in microfluidic flows.展开更多
Existing learning-based super-resolution (SR) reconstruction algorithms are mainly designed for single image, which ignore the spatio-temporal relationship between video frames. Aiming at applying the advantages of ...Existing learning-based super-resolution (SR) reconstruction algorithms are mainly designed for single image, which ignore the spatio-temporal relationship between video frames. Aiming at applying the advantages of learning-based algorithms to video SR field, a novel video SR reconstruction algorithm based on deep convolutional neural network (CNN) and spatio-temporal similarity (STCNN-SR) was proposed in this paper. It is a deep learning method for video SR reconstruction, which considers not onlv the mapping relationship among associated low-resolution (LR) and high-resolution (HR) image blocks, but also the spatio-temporal non-local complementary and redundant information between adjacent low-resolution video frames. The reconstruction speed can be improved obviously with the pre-trained end-to-end reconstructed coefficients. Moreover, the performance of video SR will be further improved by the optimization process with spatio-temporal similarity. Experimental results demonstrated that the proposed algorithm achieves a competitive SR quality on both subjective and objective evaluations, when compared to other state-of-the-art algorithms.展开更多
Watermarking is the advanced technology utilized to secure digital data by integrating ownership or copyright protection.Most of the traditional extracting processes in audio watermarking have some restrictions due to...Watermarking is the advanced technology utilized to secure digital data by integrating ownership or copyright protection.Most of the traditional extracting processes in audio watermarking have some restrictions due to low reliability to various attacks.Hence,a deep learning-based audio watermarking system is proposed in this research to overcome the restriction in the traditional methods.The implication of the research relies on enhancing the performance of the watermarking system using the Discrete Wavelet Transform(DWT)and the optimized deep learning technique.The selection of optimal embedding location is the research contribution that is carried out by the deep convolutional neural network(DCNN).The hyperparameter tuning is performed by the so-called search location optimization,which minimizes the errors in the classifier.The experimental result reveals that the proposed digital audio watermarking system provides better robustness and performance in terms of Bit Error Rate(BER),Mean Square Error(MSE),and Signal-to-noise ratio.The BER,MSE,and SNR of the proposed audio watermarking model without the noise are 0.082,0.099,and 45.363 respectively,which is found to be better performance than the existing watermarking models.展开更多
Since of the scale and the various shapes of down in the image,it is difficult for traditional image recognition method to correctly recognize the type of down image and get the required recognition accuracy,even for ...Since of the scale and the various shapes of down in the image,it is difficult for traditional image recognition method to correctly recognize the type of down image and get the required recognition accuracy,even for the Traditional Convolutional Neural Network(TCNN).To deal with the above problems,a Deep Convolutional Neural Network(DCNN)for down image classification is constructed,and a new weight initialization method is proposed.Firstly,the salient regions of a down image were cut from the image using the visual saliency model.Then,these salient regions of the image were used to train a sparse autoencoder and get a collection of convolutional filters,which accord with the statistical characteristics of dataset.At last,a DCNN with Inception module and its variants was constructed.To improve the recognition accuracy,the depth of the network is deepened.The experiment results indicate that the constructed DCNN increases the recognition accuracy by 2.7% compared to TCNN,when recognizing the down in the images.The convergence rate of the proposed DCNN with the new weight initialization method is improved by 25.5% compared to TCNN.展开更多
基金National Key R&D Program of China(2021YFC3000905)Open Research Program of the State Key Laboratory of Severe Weather(2022LASW-B09)National Natural Science Foundation of China(42375010)。
文摘In this paper,we utilized the deep convolutional neural network D-LinkNet,a model for semantic segmentation,to analyze the Himawari-8 satellite data captured from 16 channels at a spatial resolution of 0.5 km,with a focus on the area over the Yellow Sea and the Bohai Sea(32°-42°N,117°-127°E).The objective was to develop an algorithm for fusing and segmenting multi-channel images from geostationary meteorological satellites,specifically for monitoring sea fog in this region.Firstly,the extreme gradient boosting algorithm was adopted to evaluate the data from the 16 channels of the Himawari-8 satellite for sea fog detection,and we found that the top three channels in order of importance were channels 3,4,and 14,which were fused into false color daytime images,while channels 7,13,and 15 were fused into false color nighttime images.Secondly,the simple linear iterative super-pixel clustering algorithm was used for the pixel-level segmentation of false color images,and based on super-pixel blocks,manual sea-fog annotation was performed to obtain fine-grained annotation labels.The deep convolutional neural network D-LinkNet was built on the ResNet backbone and the dilated convolutional layers with direct connections were added in the central part to form a string-and-combine structure with five branches having different depths and receptive fields.Results show that the accuracy rate of fog area(proportion of detected real fog to detected fog)was 66.5%,the recognition rate of fog zone(proportion of detected real fog to real fog or cloud cover)was 51.9%,and the detection accuracy rate(proportion of samples detected correctly to total samples)was 93.2%.
基金supported by the Fundamental Research Funds for the Central Universities of China(Grant No.2013SCU11006)the Key Laboratory of Digital Mapping and Land Information Application of National Administration of Surveying,Mapping and Geoinformation of China(Grant NO.DM2014SC02)the Key Laboratory of Geospecial Information Technology,Ministry of Land and Resources of China(Grant NO.KLGSIT201504)
文摘The development of precision agriculture demands high accuracy and efficiency of cultivated land information extraction. As a new means of monitoring the ground in recent years, unmanned aerial vehicle (UAV) low-height remote sensing technique, which is flexible, efficient with low cost and with high resolution, is widely applied to investing various resources. Based on this, a novel extraction method for cultivated land information based on Deep Convolutional Neural Network and Transfer Learning (DTCLE) was proposed. First, linear features (roads and ridges etc.) were excluded based on Deep Convolutional Neural Network (DCNN). Next, feature extraction method learned from DCNN was used to cultivated land information extraction by introducing transfer learning mechanism. Last, cultivated land information extraction results were completed by the DTCLE and eCognifion for cultivated land information extraction (ECLE). The location of the Pengzhou County and Guanghan County, Sichuan Province were selected for the experimental purpose. The experimental results showed that the overall precision for the experimental image 1, 2 and 3 (of extracting cultivated land) with the DTCLE method was 91.7%, 88.1% and 88.2% respectively, and the overall precision of ECLE is 9o.7%, 90.5% and 87.0%, respectively. Accuracy of DTCLE was equivalent to that of ECLE, and also outperformed ECLE in terms of integrity and continuity.
基金This work was supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(NRF-2018R1A2B6007333).
文摘Compressive strength of concrete is a significant factor to assess building structure health and safety.Therefore,various methods have been developed to evaluate the compressive strength of concrete structures.However,previous methods have several challenges in costly,time-consuming,and unsafety.To address these drawbacks,this paper proposed a digital vision based concrete compressive strength evaluating model using deep convolutional neural network(DCNN).The proposed model presented an alternative approach to evaluating the concrete strength and contributed to improving efficiency and accuracy.The model was developed with 4,000 digital images and 61,996 images extracted from video recordings collected from concrete samples.The experimental results indicated a root mean square error(RMSE)value of 3.56(MPa),demonstrating a strong feasibility that the proposed model can be utilized to predict the concrete strength with digital images of their surfaces and advantages to overcome the previous limitations.This experiment contributed to provide the basis that could be extended to future research with image analysis technique and artificial neural network in the diagnosis of concrete building structures.
基金This work was supported by the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(NRF-2018R1A2B6007333)This study was supported by 2018 Research Grant from Kangwon National University.
文摘In this study,we examined the efficacy of a deep convolutional neural network(DCNN)in recognizing concrete surface images and predicting the compressive strength of concrete.A digital single-lens reflex(DSLR)camera and microscope were simultaneously used to obtain concrete surface images used as the input data for the DCNN.Thereafter,training,validation,and testing of the DCNNs were performed based on the DSLR camera and microscope image data.Results of the analysis indicated that the DCNN employing DSLR image data achieved a relatively higher accuracy.The accuracy of the DSLR-derived image data was attributed to the relatively wider range of the DSLR camera,which was beneficial for extracting a larger number of features.Moreover,the DSLR camera procured more realistic images than the microscope.Thus,when the compressive strength of concrete was evaluated using the DCNN employing a DSLR camera,time and cost were reduced,whereas the usefulness increased.Furthermore,an indirect comparison of the accuracy of the DCNN with that of existing non-destructive methods for evaluating the strength of concrete proved the reliability of DCNN-derived concrete strength predictions.In addition,it was determined that the DCNN used for concrete strength evaluations in this study can be further expanded to detect and evaluate various deteriorative factors that affect the durability of structures,such as salt damage,carbonation,sulfation,corrosion,and freezing-thawing.
文摘This paper proposes a cascade deep convolutional neural network to address the loosening detection problem of bolts on axlebox covers.Firstly,an SSD network based on ResNet50 and CBAM module by improving bolt image features is proposed for locating bolts on axlebox covers.And then,theA2-PFN is proposed according to the slender features of the marker lines for extracting more accurate marker lines regions of the bolts.Finally,a rectangular approximationmethod is proposed to regularize themarker line regions asaway tocalculate the angle of themarker line and plot all the angle values into an angle table,according to which the criteria of the angle table can determine whether the bolt with the marker line is in danger of loosening.Meanwhile,our improved algorithm is compared with the pre-improved algorithmin the object localization stage.The results show that our proposed method has a significant improvement in both detection accuracy and detection speed,where ourmAP(IoU=0.75)reaches 0.77 and fps reaches 16.6.And in the saliency detection stage,after qualitative comparison and quantitative comparison,our method significantly outperforms other state-of-the-art methods,where our MAE reaches 0.092,F-measure reaches 0.948 and AUC reaches 0.943.Ultimately,according to the angle table,out of 676 bolt samples,a total of 60 bolts are loose,69 bolts are at risk of loosening,and 547 bolts are tightened.
文摘Enabling high mobility applications in millimeter wave(mmWave)based systems opens up a slew of new possibilities,including vehicle communi-cations in addition to wireless virtual/augmented reality.The narrow beam usage in addition to the millimeter waves sensitivity might block the coverage along with the reliability of the mobile links.In this research work,the improvement in the quality of experience faced by the user for multimedia-related applications over the millimeter-wave band is investigated.The high attenuation loss in high frequencies is compensated with a massive array structure named Multiple Input and Multiple Output(MIMO)which is utilized in a hyperdense environment called heterogeneous networks(HetNet).The optimization problem which arises while maximizing the Mean Opinion Score(MOS)is analyzed along with the QoE(Quality of Experience)metric by considering the Base Station(BS)powers in addition to the needed Quality of Service(QoS).Most of the approaches related to wireless network communication are not suitable for the millimeter-wave band because of its problems due to high complexity and its dynamic nature.Hence a deep reinforcement learning framework is developed for tackling the same opti-mization problem.In this work,a Fuzzy-based Deep Convolutional Neural Net-work(FDCNN)is proposed in addition to a Deep Reinforcing Learning Framework(DRLF)for extracting the features of highly correlated data.The investigational results prove that the proposed method yields the highest satisfac-tion to the user by increasing the number of antennas in addition with the small-scale antennas at the base stations.The proposed work outperforms in terms of MOS with multiple antennas.
文摘The novel coronavirus 2019(COVID-19)rapidly spreading around the world and turns into a pandemic situation,consequently,detecting the coronavirus(COVID-19)affected patients are now the most critical task for medical specialists.The deficiency of medical testing kits leading to huge complexity in detecting COVID-19 patients worldwide,resulting in the number of infected cases is expanding.Therefore,a significant study is necessary about detecting COVID-19 patients using an automated diagnosis method,which hinders the spreading of coronavirus.In this paper,the study suggests a Deep Convolutional Neural Network-based multi-classification framework(COV-MCNet)using eight different pre-trained architectures such as VGG16,VGG19,ResNet50V2,DenseNet201,InceptionV3,MobileNet,InceptionResNetV2,Xception which are trained and tested on the X-ray images of COVID-19,Normal,Viral Pneumonia,and Bacterial Pneumonia.The results from 4-class(Normal vs.COVID-19 vs.Viral Pneumonia vs.Bacterial Pneumonia)demonstrated that the pre-trained model DenseNet201 provides the highest classification performance(accuracy:92.54%,precision:93.05%,recall:92.81%,F1-score:92.83%,specificity:97.47%).Notably,the DenseNet201(4-class classification)pre-trained model in the proposed COV-MCNet framework showed higher accuracy compared to the rest seven models.Important to mention that the proposed COV-MCNet model showed comparatively higher classification accuracy based on the small number of pre-processed datasets that specifies the designed system can produce superior results when more data become available.The proposed multi-classification network(COV-MCNet)significantly speeds up the existing radiology based method which will be helpful for the medical community and clinical specialists to early diagnosis the COVID-19 cases during this pandemic.
基金supported by the Intelligent System Research Group(ISysRG)supported by Universitas Sriwijaya funded by the Competitive Research 2024.
文摘Handling missing data accurately is critical in clinical research, where data quality directly impacts decision-making and patient outcomes. While deep learning (DL) techniques for data imputation have gained attention, challenges remain, especially when dealing with diverse data types. In this study, we introduce a novel data imputation method based on a modified convolutional neural network, specifically, a Deep Residual-Convolutional Neural Network (DRes-CNN) architecture designed to handle missing values across various datasets. Our approach demonstrates substantial improvements over existing imputation techniques by leveraging residual connections and optimized convolutional layers to capture complex data patterns. We evaluated the model on publicly available datasets, including Medical Information Mart for Intensive Care (MIMIC-III and MIMIC-IV), which contain critical care patient data, and the Beijing Multi-Site Air Quality dataset, which measures environmental air quality. The proposed DRes-CNN method achieved a root mean square error (RMSE) of 0.00006, highlighting its high accuracy and robustness. We also compared with Low Light-Convolutional Neural Network (LL-CNN) and U-Net methods, which had RMSE values of 0.00075 and 0.00073, respectively. This represented an improvement of approximately 92% over LL-CNN and 91% over U-Net. The results showed that this DRes-CNN-based imputation method outperforms current state-of-the-art models. These results established DRes-CNN as a reliable solution for addressing missing data.
基金funded by the Deanship of Scientific Research at Princess Nourah bint Abdulrahman University,through the Research Funding Program,Grant No.(FRP-1443-15).
文摘The analysis of Android malware shows that this threat is constantly increasing and is a real threat to mobile devices since traditional approaches,such as signature-based detection,are no longer effective due to the continuously advancing level of sophistication.To resolve this problem,efficient and flexible malware detection tools are needed.This work examines the possibility of employing deep CNNs to detect Android malware by transforming network traffic into image data representations.Moreover,the dataset used in this study is the CIC-AndMal2017,which contains 20,000 instances of network traffic across five distinct malware categories:a.Trojan,b.Adware,c.Ransomware,d.Spyware,e.Worm.These network traffic features are then converted to image formats for deep learning,which is applied in a CNN framework,including the VGG16 pre-trained model.In addition,our approach yielded high performance,yielding an accuracy of 0.92,accuracy of 99.1%,precision of 98.2%,recall of 99.5%,and F1 score of 98.7%.Subsequent improvements to the classification model through changes within the VGG19 framework improved the classification rate to 99.25%.Through the results obtained,it is clear that CNNs are a very effective way to classify Android malware,providing greater accuracy than conventional techniques.The success of this approach also shows the applicability of deep learning in mobile security along with the direction for the future advancement of the real-time detection system and other deeper learning techniques to counter the increasing number of threats emerging in the future.
基金The research has been supported by the Qingdao Science and Technology Demonstration and Guidance Project(Grant No.20-3-4-45-nsh)Academic Promotion Plan of Shandong First Medical University&Shandong Academy of Medical Sciences(Grant No.2019ZL001)National Science and Technology Major Project of China(Grant No.2017ZX09304010).
文摘Background:Myopic maculopathy(MM)has become a major cause of visual impairment and blindness worldwide,especially in East Asian countries.Deep learning approaches such as deep convolutional neural networks(DCNN)have been successfully applied to identify some common retinal diseases and show great potential for the intelligent analysis of MM.This study aimed to build a reliable approach for automated detection of MM from retinal fundus images using DCNN models.Methods:A dual-stream DCNN(DCNN-DS)model that perceives features from both original images and corresponding processed images by color histogram distribution optimization method was designed for classification of no MM,tessellated fundus(TF),and pathologic myopia(PM).A total of 36,515 gradable images from four hospitals were used for DCNN model development,and 14,986 gradable images from the other two hospitals for external testing.We also compared the performance of the DCNN-DS model and four ophthalmologists on 3000 randomly sampledfundus images.Results:The DCNN-DS model achieved sensitivities of 93.3%and 91.0%,specificities of 99.6%and 98.7%,areas under the receiver operating characteristic curves(AUCs)of 0.998 and 0.994 for detecting PM,whereas sensitivities of 98.8%and 92.8%,specificities of 95.6%and 94.1%,AUCs of 0.986 and 0.970 for detecting TF in two external testing datasets.In the sampled testing dataset,the sensitivities of four ophthalmologists ranged from 88.3%to 95.8%and 81.1%to 89.1%,and the specificities ranged from 95.9%to 99.2%and 77.8%to 97.3%for detecting PM and TF,respectively.Meanwhile,the DCNN-DS model achieved sensitivities of 90.8%and 97.9%and specificities of 99.1%and 94.0%for detecting PMand T,respectively.Conclusions:The proposed DCNN-DS approach demonstrated reliable performance with high sensitivity,specificity,and AUC to classify different MM levels on fundus photographs sourced from clinics.It can help identify MM automatically among the large myopic groups and show great potential for real-life applications.
文摘With the development of artificial intelligence,emotion recognition has become a hot topic in the field of humancomputer interaction.This paper focuses on the application and optimization of deep convolutional neural networks(CNNs)in multimodal emotion recognition.Multimodal emotion recognition involves analyzing data from different sources—such as voice,facial expressions,and text—to more accurately identify and interpret human emotional states.This paper first reviews the basic theories and methods of multimodal data processing,then details the structure and function of deep convolutional neural networks,particularly their advantages in handling various types of data.By innovating and optimizing network structures,loss functions,and training strategies,we have improved the model's accuracy in emotion recognition.Ultimately,experimental results show that the optimized CNN model demonstrates superior performance in multimodal emotion recognition tasks.
基金supported by the National Key Research and Development Program of China(No.2022YFC3801000)the Fundamental Research Funds for the Central Universities(No.2242022k60001,2242023K40015).
文摘The combination of fingerprint positioning and 5G(the 5th Generation Mobile Communication Technology)offers broader application prospects for indoor positioning technology,but also brings challenges in real-time performance.In this paper,we propose a fingerprint positioning method based on a deep convolutional neural network(DCNN)using a classification approach in a single-base station scenario for massive multiple input multiple outputorthogonal frequency division multiplexing(MIMO-OFDM)systems.We introduce an angle-delay domain fingerprint matrix that simplifies the computation process and increases the location differentiation.The cosine distance is chosen as the fingerprint similarity criterion due to its sensitivity to angular differences.First,the DCNN model is used to determine the sub-area to which the mobile terminal belongs,and then the weighted K-nearest neighbor(WKNN)matching algorithm is used to estimate the position within the sub-area.The positioning performance is simulated in a DeepMIMO indoor environment,showing that the classification DCNN method reduces the positioning time by 77.05%compared to the non-classification method,with only a 1.08%increase in average positioning error.
基金supported by the National Natural Science Foundation of China-China State Railway Group Co.,Ltd.Railway Basic Research Joint Fund (Grant No.U2268217)the Scientific Funding for China Academy of Railway Sciences Corporation Limited (No.2021YJ183).
文摘Graph Convolutional Neural Networks(GCNs)have been widely used in various fields due to their powerful capabilities in processing graph-structured data.However,GCNs encounter significant challenges when applied to scale-free graphs with power-law distributions,resulting in substantial distortions.Moreover,most of the existing GCN models are shallow structures,which restricts their ability to capture dependencies among distant nodes and more refined high-order node features in scale-free graphs with hierarchical structures.To more broadly and precisely apply GCNs to real-world graphs exhibiting scale-free or hierarchical structures and utilize multi-level aggregation of GCNs for capturing high-level information in local representations,we propose the Hyperbolic Deep Graph Convolutional Neural Network(HDGCNN),an end-to-end deep graph representation learning framework that can map scale-free graphs from Euclidean space to hyperbolic space.In HDGCNN,we define the fundamental operations of deep graph convolutional neural networks in hyperbolic space.Additionally,we introduce a hyperbolic feature transformation method based on identity mapping and a dense connection scheme based on a novel non-local message passing framework.In addition,we present a neighborhood aggregation method that combines initial structural featureswith hyperbolic attention coefficients.Through the above methods,HDGCNN effectively leverages both the structural features and node features of graph data,enabling enhanced exploration of non-local structural features and more refined node features in scale-free or hierarchical graphs.Experimental results demonstrate that HDGCNN achieves remarkable performance improvements over state-ofthe-art GCNs in node classification and link prediction tasks,even when utilizing low-dimensional embedding representations.Furthermore,when compared to shallow hyperbolic graph convolutional neural network models,HDGCNN exhibits notable advantages and performance enhancements.
基金supported by the National Key Scientific Instrument and Equipment Development Project(61827801).
文摘The fifth-generation (5G) communication requires a highly accurate estimation of the channel state information (CSI)to take advantage of the massive multiple-input multiple-output(MIMO) system. However, traditional channel estimation methods do not always yield reliable estimates. The methodology of this paper consists of deep residual shrinkage network (DRSN)neural network-based method that is used to solve this problem.Thus, the channel estimation approach, based on DRSN with its learning ability of noise-containing data, is first introduced. Then,the DRSN is used to train the noise reduction process based on the results of the least square (LS) channel estimation while applying the pilot frequency subcarriers, where the initially estimated subcarrier channel matrix is considered as a three-dimensional tensor of the DRSN input. Afterward, a mixed signal to noise ratio (SNR) training data strategy is proposed based on the learning ability of DRSN under different SNRs. Moreover, a joint mixed scenario training strategy is carried out to test the multi scenarios robustness of DRSN. As for the findings, the numerical results indicate that the DRSN method outperforms the spatial-frequency-temporal convolutional neural networks (SF-CNN)with similar computational complexity and achieves better advantages in the full SNR range than the minimum mean squared error (MMSE) estimator with a limited dataset. Moreover, the DRSN approach shows robustness in different propagation environments.
基金the Fundamental Research Funds for the Central Universities(2572018BC07,2572017PZ14)the Heilongjiang postdoctoral project fund project(LBH-Z18003)+2 种基金Biodiversity Survey,Monitoring and Assessment Project of Ministry of Ecology and Environment,China(2019HB2096001006)the National Natural Science Foundation of China(NSFC 31872241,31572285)the Individual Identification Technological Research on Camera-trapping images of Amur tigers(NFGA 2017).
文摘The automatic individual identification of Amur tigers(Panthera tigris altaica)is important for population monitoring and making effective conservation strategies.Most existing research primarily relies on manual identifi-cation,which does not scale well to large datasets.In this paper,the deep convolution neural networks algorithm is constructed to implement the automatic individual identification for large numbers of Amur tiger images.The experimental data were obtained from 40 Amur tigers in Tieling Guaipo Tiger Park,China.The number of images collected from each tiger was approximately 200,and a total of 8277 images were obtained.The experiments were carried out on both the left and right side of body.Our results suggested that the recognition accuracy rate of left and right sides are 90.48%and 93.5%,respectively.The accuracy of our network has achieved the similar level compared to other state of the art networks like LeNet,ResNet34,and ZF_Net.The running time is much shorter than that of other networks.Consequently,this study can provide a new approach on automatic individual identification technology in the case of the Amur tiger.
文摘Skin lesions are in a category of disease that is both common in humans and a major cause of death.The classification accuracy of skin lesions is a crucial determinant of the success rate of curing lethal diseases.Deep Convolutional Neural Networks(CNNs)are now the most prevalent computer algorithms for the purpose of disease classification.As with all algorithms,CNNs are sensitive to noise from imaging devices,which often contaminates the quality of the images that are fed into them.In this paper,a deep CNN(Inception-v3)is used to study the effect of image noise on the classification of skin lesions.Gaussian noise,impulse noise,and noise made up of a compound of the two are added to an image dataset,namely the Dermofit Image Library from the University of Edinburgh.Evaluations,based on t-distributed Stochastic Neighbor Embedding(t-SNE)visualization,Receiver Operating Characteristic(ROC)analysis,and saliency maps,demonstrate the reliability of the Inception-v3 deep CNN in classifying noisy skin lesion images.
基金supported by the UK Engineering and Physical Science Research Council(EP/K000128/1)and the China Scholarship Council.
文摘Deformable microcapsules are widely used in industries and also serve as a mechanical model of living biological cells.In this study,we develop a novel method,by integrating a deep convolutional neural network(DCNN)with high-fidelity mechanistic capsule modelling,to identify the membrane constitutive law and estimate associated parameters of a microcapsule from its steady deformed profile in a capillary tube.Compared with conventional inverse methods,the present approach is more accurate and can increase the prediction throughput rate by a few orders of magnitude.It can process capsules with large deformation in inertial flows.Furthermore,the method can predict the capsule membrane shear elasticity,area dilatation modulus and initial inflation from a single steady capsule profile.We explore the mechanism that the DCNN makes decisions by considering its feature maps,and discuss their potential implication on the development of inverse methods.The present method provides a promising tool which may enable high-throughput mechanical characterisation of microcapsules and biological cells in microfluidic flows.
基金supported by the National Natural Science Foundation of China (61320106006, 61532006, 61502042)
文摘Existing learning-based super-resolution (SR) reconstruction algorithms are mainly designed for single image, which ignore the spatio-temporal relationship between video frames. Aiming at applying the advantages of learning-based algorithms to video SR field, a novel video SR reconstruction algorithm based on deep convolutional neural network (CNN) and spatio-temporal similarity (STCNN-SR) was proposed in this paper. It is a deep learning method for video SR reconstruction, which considers not onlv the mapping relationship among associated low-resolution (LR) and high-resolution (HR) image blocks, but also the spatio-temporal non-local complementary and redundant information between adjacent low-resolution video frames. The reconstruction speed can be improved obviously with the pre-trained end-to-end reconstructed coefficients. Moreover, the performance of video SR will be further improved by the optimization process with spatio-temporal similarity. Experimental results demonstrated that the proposed algorithm achieves a competitive SR quality on both subjective and objective evaluations, when compared to other state-of-the-art algorithms.
文摘Watermarking is the advanced technology utilized to secure digital data by integrating ownership or copyright protection.Most of the traditional extracting processes in audio watermarking have some restrictions due to low reliability to various attacks.Hence,a deep learning-based audio watermarking system is proposed in this research to overcome the restriction in the traditional methods.The implication of the research relies on enhancing the performance of the watermarking system using the Discrete Wavelet Transform(DWT)and the optimized deep learning technique.The selection of optimal embedding location is the research contribution that is carried out by the deep convolutional neural network(DCNN).The hyperparameter tuning is performed by the so-called search location optimization,which minimizes the errors in the classifier.The experimental result reveals that the proposed digital audio watermarking system provides better robustness and performance in terms of Bit Error Rate(BER),Mean Square Error(MSE),and Signal-to-noise ratio.The BER,MSE,and SNR of the proposed audio watermarking model without the noise are 0.082,0.099,and 45.363 respectively,which is found to be better performance than the existing watermarking models.
基金supported by the Natural Science Foundation of Hebei Provence[grant numbers:F2015201033,F2017201069]the foundation of H3C[grant number:2017A20004]。
文摘Since of the scale and the various shapes of down in the image,it is difficult for traditional image recognition method to correctly recognize the type of down image and get the required recognition accuracy,even for the Traditional Convolutional Neural Network(TCNN).To deal with the above problems,a Deep Convolutional Neural Network(DCNN)for down image classification is constructed,and a new weight initialization method is proposed.Firstly,the salient regions of a down image were cut from the image using the visual saliency model.Then,these salient regions of the image were used to train a sparse autoencoder and get a collection of convolutional filters,which accord with the statistical characteristics of dataset.At last,a DCNN with Inception module and its variants was constructed.To improve the recognition accuracy,the depth of the network is deepened.The experiment results indicate that the constructed DCNN increases the recognition accuracy by 2.7% compared to TCNN,when recognizing the down in the images.The convergence rate of the proposed DCNN with the new weight initialization method is improved by 25.5% compared to TCNN.