Multi-target tracking faces the difficulty of modeling uncertain motion and observation noise. Traditional tracking algorithms are limited by specific models and priors that may not match real-world scenarios. In this paper, aiming at a model-free approach, we present an online Multi-Target Intelligent Tracking (MTIT) algorithm based on a Deep Long Short-Term Memory (DLSTM) network for complex tracking requirements, named the MTIT-DLSTM algorithm. First, to distinguish trajectories and link the tracking task across the time sequence, we define a target tuple set that is a labeled Random Finite Set (RFS). Then, prediction and update blocks based on the DLSTM network are constructed to predict and estimate the state of targets, respectively. The prediction block learns the movement trend from the historical state sequence, while the update block captures the noise characteristics from the historical measurement sequence. Finally, a data association scheme based on the Hungarian algorithm and a heuristic track management strategy are employed to assign measurements to targets and to handle target births and deaths. Experimental results show that, compared with existing tracking algorithms, the proposed MTIT-DLSTM algorithm effectively improves the accuracy and robustness of state estimation for targets appearing at random positions, and can be applied to both linear and nonlinear multi-target tracking scenarios.
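The data association step mentioned in the abstract above can be illustrated with a small sketch. It is not the paper's implementation: it solves the same minimum-cost assignment problem that the Hungarian algorithm solves, but by brute-force enumeration, which is only practical for a handful of targets. The function name `associate`, the 2-D point representation, and the `gate` distance are assumptions made for illustration.

```python
from itertools import permutations
from math import hypot, inf


def associate(tracks, measurements, gate=5.0):
    """Return {track_index: measurement_index} with minimum total distance.

    Brute-force stand-in for the Hungarian algorithm (same optimum on
    small inputs). `gate` is a hypothetical maximum association distance;
    it is also used as the cost of leaving a track unmatched.
    """
    n_tracks = len(tracks)
    # Real measurement indices plus one "no measurement" slot per track.
    columns = list(range(len(measurements))) + [None] * n_tracks
    best_cost, best_pairs = inf, {}
    for choice in permutations(columns, n_tracks):
        cost, pairs = 0.0, {}
        for t, m in enumerate(choice):
            if m is None:
                cost += gate  # track stays unmatched in this candidate
                continue
            dist = hypot(tracks[t][0] - measurements[m][0],
                         tracks[t][1] - measurements[m][1])
            if dist > gate:
                cost = inf  # pairing outside the gate is not allowed
                break
            cost += dist
            pairs[t] = m
        if cost < best_cost:
            best_cost, best_pairs = cost, pairs
    return best_pairs


if __name__ == "__main__":
    predicted = [(0.0, 0.0), (10.0, 10.0)]
    observed = [(9.5, 10.2), (0.3, -0.1), (50.0, 50.0)]
    print(associate(predicted, observed))
```

Measurements left out of the returned mapping (index 2 in the usage example) would be candidates for new tracks, and tracks that stay unmatched for several steps would be candidates for deletion; the exact birth and death rules used in the paper are not given in the abstract.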
The unloading relaxation caused by excavation during the construction of high arch dams is an important factor influencing the integrity and strength of the foundation. To evaluate the degree of unloading relaxation, a long short-term memory (LSTM) network was used to estimate the depth of unloading relaxation zones on the left bank foundation of the Baihetan Arch Dam. Principal component analysis indicates that rock characteristics, the structural plane, the protection layer, lithology, and time are the main factors. The LSTM network results demonstrate the unloading relaxation characteristics of the left bank, and the relationships with these factors were also analyzed. The structural plane has the most significant influence on the distribution of unloading relaxation zones. Compared with massive basalt, the columnar jointed basalt experiences a more pronounced unloading relaxation phenomenon with a clear time effect, with an average unloading relaxation period of 50 d. The protection layer can effectively reduce the unloading relaxation depth by approximately 20%.
The constitutive models of shape memory alloys (SMAs) play an important role in facilitating the widespread application of these alloys in various engineering fields. However, to accurately describe the deformation behaviors of SMAs, existing constitutive models employ concepts from classical plasticity and involve a series of complex mathematical equations. Such complexity makes the construction, implementation, and application of the constitutive models inconvenient. To overcome these shortcomings, a data-driven constitutive model of SMAs is developed in this work based on an artificial neural network (ANN). In the proposed model, the components of the strain tensor in principal space, the ambient temperature, and the maximum equivalent strain in the deformation history from the initial state to the current loading state are chosen as the input features, and the components of the stress tensor in principal space are set as the output. The proposed ANN-based constitutive model is implemented in the finite element program ABAQUS by deriving its consistent tangent modulus and writing a user-defined material subroutine. The stress-strain responses of SMA material under various loading paths and at different ambient temperatures, generated from an existing constitutive model (numerical experiments), are used to train the ANN model. To validate the capability of the proposed model, the predicted stress-strain responses of SMA material and the global and local responses of two typical SMA structures are compared with the corresponding numerical experiments. This work demonstrates good potential for obtaining the constitutive model of SMAs purely from data, avoiding the need for extensive prior knowledge in the construction of constitutive models.
The increasingly severe state of coal burst disasters has emerged as a critical factor constraining safe coal mine production, and enhancing the accuracy of coal burst prediction has become a challenging task. To address the insufficient exploration of the spatio-temporal characteristics of microseismic data and the difficulty of selecting the optimal time window size in spatio-temporal prediction, this paper integrates deep learning methods and theory to propose a novel coal burst spatio-temporal prediction method based on a Bidirectional Long Short-Term Memory (Bi-LSTM) network. The method involves three main modules: construction of microseismic spatio-temporal characteristic indicators, a temporal prediction model, and a spatial prediction model. To validate the effectiveness of the proposed method, engineering application tests are conducted at a high-risk working face in the Ordos mining area of Inner Mongolia, focusing on 13 high-energy microseismic events with energy levels greater than 10^5 J. In terms of temporal prediction, the results consist of 10 strong predictions and 3 medium predictions, and no false alarm is detected throughout the entire testing period. Moreover, compared with the traditional threshold-based coal burst temporal prediction method, the accuracy of the proposed method is increased by 38.5%. In terms of spatial prediction, the results for the high-energy events comprise 6 strong hazard predictions, 3 medium hazard predictions, and 4 weak hazard predictions.
Developing efficient neural network (NN) computing systems is crucial in the era of artificial intelligence (AI). Traditional von Neumann architectures suffer from both the "memory wall" and the "power wall", which limit data transfer between memory and processing units [1,2]. Compute-in-memory (CIM) technologies, particularly analogue CIM with memristor crossbars, are promising because of their high energy efficiency, computational parallelism, and integration density for NN computations [3]. In practical applications, analogue CIM excels in tasks such as speech recognition and image classification. For instance, it efficiently processes vast amounts of audio data in speech recognition, achieving high accuracy with minimal power consumption. In image classification, the high parallelism of analogue CIM significantly speeds up feature extraction and reduces processing time. With the rapid development of AI applications, the demands for computational accuracy and task complexity are rising continually. However, analogue CIM systems are limited in handling complex regression tasks that require precise floating-point (FP) calculations. They are primarily suited to classification tasks with low data precision and a limited dynamic range [4].
Remaining useful life prediction of rolling bearings is vital for guaranteeing safety and reliability. In engineering scenarios, only a small amount of bearing performance degradation data can be obtained through accelerated life testing. In the absence of lifetime data, the hidden long-term correlation in the performance degradation data is difficult to mine effectively, which is the main factor restricting the prediction precision and engineering application of residual life prediction methods. To address this problem, a novel method based on a multi-layer perceptron neural network and a bidirectional long short-term memory network is proposed. First, a nonlinear health indicator (HI) calculation method based on kernel principal component analysis (KPCA) and the exponential weighted moving average (EWMA) is designed. Then, using the raw vibration data and the HI, a multi-layer perceptron (MLP) neural network is trained to calculate the HI of the online bearing in real time. Furthermore, a bidirectional long short-term memory (BiLSTM) model optimized by particle swarm optimization (PSO) is used to mine the time series features of the HI and predict the remaining service life. Performance verification experiments and comparative experiments are carried out on the XJTU-SY bearing open dataset. The results indicate that this method predicts future HI values and remaining life well.
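The EWMA smoothing named in the abstract above follows a standard recursion, s_t = alpha * x_t + (1 - alpha) * s_(t-1). The sketch below shows that recursion only; the smoothing factor `alpha`, the initialisation with the first sample, and the function name `ewma` are illustrative assumptions, and the KPCA step that produces the raw indicator in the paper is not reproduced.

```python
def ewma(values, alpha=0.3):
    """Exponentially weighted moving average of a sequence.

    alpha is an assumed smoothing factor in (0, 1]; larger values follow
    the raw signal more closely. The first output equals the first input.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = []
    for x in values:
        if not smoothed:
            smoothed.append(float(x))  # initialise with the first sample
        else:
            smoothed.append(alpha * x + (1.0 - alpha) * smoothed[-1])
    return smoothed


if __name__ == "__main__":
    raw_indicator = [0.0, 2.0, 4.0, 3.0, 5.0]  # made-up values
    print(ewma(raw_indicator, alpha=0.5))
```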
The complicated loads encountered by floating offshore wind turbines (FOWTs) in real sea conditions are crucial for future design optimization, but obtaining data on them directly is challenging. To address this issue, we applied machine learning techniques to obtain the hydrodynamic and aerodynamic loads of FOWTs from measured platform motion responses and wave-elevation sequences. First, a computational fluid dynamics (CFD) simulation model of the floating platform was established based on the dynamic fluid body interaction technique and overset grid technology. Then, a long short-term memory (LSTM) neural network model was constructed and trained to learn the nonlinear relationship between the wave and platform-motion inputs and the hydrodynamic-load outputs. The optimal model was determined after analyzing the sensitivity to parameters such as sample characteristics, the number of network layers, and the number of neurons. Subsequently, the effectiveness of the hydrodynamic load model was validated under different simulation conditions, and the aerodynamic load calculation was completed based on D'Alembert's principle. Finally, we built a hybrid-scale FOWT model, based on the software-in-the-loop strategy, in which the wind turbine was replaced by an actuation system. Model tests were carried out in a wave basin, and the results demonstrated that the root mean square errors of the hydrodynamic and aerodynamic load measurements were 4.20% and 10.68%, respectively.
Long Short-Term Memory (LSTM) Recurrent Neural Networks (RNNs) have driven tremendous improvements over acoustic models based on the Gaussian Mixture Model (GMM). However, models based on the hybrid method require a forced-aligned Hidden Markov Model (HMM) state sequence obtained from a GMM-based acoustic model. Training therefore takes a long time, because both the GMM-based acoustic model and the deep learning-based acoustic model must be trained. To solve this problem, an acoustic model using the Connectionist Temporal Classification (CTC) algorithm is proposed. The CTC algorithm does not require the GMM-based acoustic model because it does not use the forced-aligned HMM state sequence. However, previous work on LSTM RNN-based acoustic models using CTC relied on small-scale training corpora. In this paper, an LSTM RNN-based acoustic model using CTC is trained on a large-scale training corpus and its performance is evaluated. The implemented acoustic model achieves a Word Error Rate (WER) of 6.18% for clean speech and 15.01% for noisy speech, which is similar to the performance of the acoustic model based on the hybrid method.
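The Word Error Rate quoted above is conventionally the word-level edit distance (substitutions, deletions, insertions) between the reference transcript and the recognizer output, divided by the number of reference words. The sketch below shows that conventional definition; the whitespace tokenisation and the function name `word_error_rate` are assumptions, and the paper's own scoring tool is not described in the abstract.

```python
def word_error_rate(reference, hypothesis):
    """WER = word-level Levenshtein distance / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen so
    # far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,                           # deletion
                cur[j - 1] + 1,                        # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution or match
            ))
        prev = cur
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```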
In this paper, the global asymptotic stability of more general two-layer nonlinear feedback associative memory neural networks with time delays is examined. Sufficient conditions for the existence, uniqueness, and global asymptotic stability of the equilibrium position are given. Finally, two examples are given to illustrate the theory.
Double network (DN) hydrogels, as one kind of tough gel, have attracted extensive attention for their potential applications in biomedical and load-bearing fields. Herein, we introduce additional functions such as shape memory into the conventional tough DN hydrogel system. We synthesize PEG-PDAC/P(AAm-co-AAc) DN hydrogels, in which the first network is a well-defined PEG (polyethylene glycol) network loaded with PDAC (poly(acryloyloxyethyltrimethyl ammonium chloride)) strands, while the second network is formed by copolymerizing AAm (acrylamide) with AAc (acrylic acid) and the cross-linker MBAA (N,N′-methylenebisacrylamide). The PEG-PDAC/P(AAm-co-AAc) DN gels exhibit high mechanical strength. The fracture stress and toughness of the DN gels reach up to 0.9 MPa and 3.8 MJ/m^3, respectively. In contrast to conventional double network hydrogels, which use neutral polymers as the soft and ductile second network, the PEG-PDAC/P(AAm-co-AAc) DN hydrogels use P(AAm-co-AAc), a weak polyelectrolyte, as the second network. The AAc units serve as coordination points for Fe^3+ ions and physically crosslink the second network, which yields a shape memory property activated by the reducing ability of ascorbic acid. Our results indicate that high mechanical strength and shape memory, probably the two most important characteristics for the potential application of the hydrogels, can be introduced simultaneously into DN hydrogels if the functional monomer is suitably integrated into the network.
In this paper, the global exponential stability of an equilibrium position for general bidirectional associative memory neural networks is studied. Sufficient conditions for the existence and uniqueness of the equilibrium position are given. The energy function method is examined. Two examples are given to illustrate the theory.
The resistive switching characteristics of TiO_2 nanowire networks grown directly on Ti foil by a single-step hydrothermal technique are discussed in this paper. The Ti foil serves as the supply of Ti atoms for growth of the TiO_2 nanowires, making the preparation straightforward. It also acts as the bottom electrode of the device. A top Al electrode was fabricated by an e-beam evaporation process. The Al/TiO_2 nanowire networks/Ti device fabricated in this way displayed highly repeatable, electroforming-free bipolar resistive switching behavior with retention for more than 10^4 s and an OFF/ON ratio of approximately 70. The switching mechanism of this Al/TiO_2 nanowire networks/Ti device is suggested to arise from the migration of oxygen vacancies under an applied electric field. This provides a facile way to obtain metal oxide nanowire-based ReRAM devices in the future.
To explore new operational forecasting methods for waves, a forecasting model for wave heights at three stations in the Bohai Sea has been developed. The model is based on a long short-term memory (LSTM) neural network, with sea surface wind and wave heights as training samples. The prediction performance of the model is evaluated, and the error analysis shows that, when the same set of numerically predicted sea surface wind is used as input, the prediction error of the proposed LSTM model at Sta. N01 is 20%, 18%, and 23% lower than that of conventional numerical wave models in terms of the total root mean square error (RMSE), scatter index (SI), and mean absolute error (MAE), respectively. In particular, for significant wave heights in the range of 3–5 m, the prediction accuracy of the LSTM model improves the most, with RMSE, SI, and MAE all decreasing by 24%. The number of hidden neurons, the number of buoys used, and the time length of the training samples all affect the prediction accuracy. However, the prediction does not necessarily improve as the number of hidden neurons or the number of buoys increases. The experiment trained on data with the longest time length performs best overall compared with experiments using shorter training periods. Overall, the LSTM neural network proved to be a very promising method for future development and application in wave forecasting.
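The three error measures compared in the abstract above can be written down compactly. RMSE and MAE follow their standard definitions; for the scatter index the sketch assumes one common convention, RMSE divided by the mean observed value, which may differ from the exact definition used in the paper. Function names and the sample numbers are illustrative.

```python
from math import sqrt


def rmse(predicted, observed):
    """Root mean square error between two equally long sequences."""
    return sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed))
                / len(observed))


def mae(predicted, observed):
    """Mean absolute error between two equally long sequences."""
    return sum(abs(p - o) for p, o in zip(predicted, observed)) / len(observed)


def scatter_index(predicted, observed):
    """Assumed convention: RMSE normalised by the mean observation."""
    return rmse(predicted, observed) / (sum(observed) / len(observed))


def relative_reduction(baseline_error, model_error):
    """Fractional error reduction of a model relative to a baseline."""
    return (baseline_error - model_error) / baseline_error


if __name__ == "__main__":
    observed_hs = [1.0, 2.0, 5.0]   # made-up significant wave heights (m)
    predicted_hs = [1.0, 2.0, 3.0]
    print(rmse(predicted_hs, observed_hs),
          mae(predicted_hs, observed_hs),
          scatter_index(predicted_hs, observed_hs))
```

A statement such as "20% lower RMSE" then corresponds to `relative_reduction(rmse_numerical, rmse_lstm)` being 0.20, where both arguments are errors computed against the same observations.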
Correct and timely fault diagnosis is important for improving the safety and reliability of chemical processes. With the advancement of big data technology, data-driven fault diagnosis methods are being used extensively and still have considerable potential. In recent years, methods based on deep neural networks have made significant breakthroughs, and deep learning-based fault diagnosis methods for industrial processes have attracted considerable research attention. Therefore, we propose a fusion deep learning algorithm based on a fully convolutional neural network (FCN) to extract features and build models that correctly diagnose all types of faults. We use long short-term memory (LSTM) units to extend the proposed FCN so that the model can better extract the time-domain features of chemical process data. We also introduce the attention mechanism into the model to highlight the importance of individual features, which is significant for fault diagnosis of chemical processes with many features. When applied to the benchmark Tennessee Eastman process, the proposed model exhibits impressive performance, demonstrating the effectiveness of the attention-based LSTM-FCN in chemical process fault diagnosis.
A Holter monitor usually records electrocardiogram (ECG) signals for more than 24 hours to capture short-lived cardiac abnormalities. Given the large amount of Holter data and the fact that normal segments account for the majority, it is reasonable to design an algorithm that automatically eliminates as many normal data segments as possible without missing any abnormal segment, and then passes the remaining segments to doctors or computer programs for further diagnosis. In this paper, we propose a preliminary abnormal segment screening method for Holter data. Based on long short-term memory (LSTM) networks, a prediction model is established and trained with the normal data of a monitored subject. Then, using kernel density estimation, we learn the distribution of prediction errors obtained by applying the trained LSTM model to the regular data. On this basis, the preliminary abnormal ECG segment screening is carried out without R wave detection. Experiments on the MIT-BIH arrhythmia database show that, under the condition that no abnormal point is missed, 53.89% of normal segments can be eliminated. This work can greatly reduce the workload of subsequent processing.
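The screening idea in the abstract above (fit a density to prediction errors on normal data, then keep only segments containing improbable errors) can be sketched with a plain Gaussian kernel density estimate. The bandwidth, the density threshold `min_density`, the function names, and the sample errors are all assumptions for illustration; the LSTM predictor itself is not reproduced, and the paper's decision rule may differ.

```python
from math import exp, pi, sqrt


def gaussian_kde(samples, bandwidth):
    """Return a function estimating the density of `samples` at a point."""
    norm = 1.0 / (len(samples) * bandwidth * sqrt(2.0 * pi))

    def density(x):
        return norm * sum(exp(-0.5 * ((x - s) / bandwidth) ** 2)
                          for s in samples)

    return density


def screen_segments(segment_errors, density, min_density):
    """Indices of segments with at least one improbable prediction error.

    Segments not returned are treated as normal and can be discarded.
    """
    return [i for i, errors in enumerate(segment_errors)
            if any(density(e) < min_density for e in errors)]


if __name__ == "__main__":
    # Prediction errors of a (hypothetical) trained model on normal ECG data.
    normal_errors = [-0.1, 0.0, 0.05, 0.1, -0.05]
    density = gaussian_kde(normal_errors, bandwidth=0.1)
    segments = [[0.0, 0.05], [0.02, 1.5]]  # errors per candidate segment
    print(screen_segments(segments, density, min_density=0.5))
```

Lowering `min_density` discards more segments as normal at the risk of missing abnormal ones; the abstract's constraint that no abnormal point be missed corresponds to choosing the threshold conservatively.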
Traditional recurrent neural networks are composed of capacitors, inductors, resistors, and operational amplifiers. Memristive neural networks are constructed by replacing the resistors with memristors. This paper focuses on the memory analysis, i.e., the initial value computation, of memristors. First, we present the memory analysis for a single memristor based on mathematical memristor models with linear and nonlinear drift. Second, we present the memory analysis for two memristors in series and in parallel. Third, we point out the difference between traditional neural networks and memristive ones. Based on the current-voltage relationship of memristors, we use mathematical analysis and SPICE simulations to demonstrate the validity of our methods.
There are two technical challenges in predicting slope deformation. The first is the random displacement, which cannot be decomposed and predicted by numerically resolving the observed accumulated displacement and time series of a landslide. The second is the dynamic evolution of a landslide, which cannot be feasibly simulated by traditional prediction models alone. In this paper, a dynamic displacement prediction model is introduced for composite landslides, based on a combination of empirical mode decomposition with soft screening stop criteria (SSSC-EMD) and a deep bidirectional long short-term memory (DBi-LSTM) neural network. In the proposed model, time series analysis and SSSC-EMD are used to decompose the observed accumulated displacements of a slope into three components: trend displacement, periodic displacement, and random displacement. Then, by analyzing the evolution pattern of a landslide and the key factors that trigger it, appropriate influencing factors are selected for each displacement component, and the DBi-LSTM neural network is used to carry out multi-data-driven dynamic prediction for each component. The accumulated displacement prediction is obtained by summing the component predictions. To verify the accuracy and engineering practicability of the model, field observations from two known landslides in China, the Xintan landslide and the Bazimen landslide, were collected for comparison and evaluation. The case study verified that the proposed model better characterizes the "stepwise" deformation of a slope. Compared with the long short-term memory (LSTM) neural network, the support vector machine (SVM), and the autoregressive integrated moving average (ARIMA) model, the DBi-LSTM neural network has higher accuracy in predicting the periodic displacement of slope deformation, with the mean absolute percentage error reduced by 3.063%, 14.913%, and 13.960%, respectively, and the root mean square error reduced by 1.951 mm, 8.954 mm, and 7.790 mm, respectively. In conclusion, the model not only has high prediction accuracy but is also more stable, and it can provide new insight for practical landslide prevention and control engineering.
Based on current research on applications of chaotic neuron networks for information processing, the stability and convergence of the chaotic neuron network are proved from the viewpoint of the energy function. Moreover, a new auto-associative matrix is devised for an artificial neural network composed of chaotic neurons; thus, an improved chaotic neuron network for associative memory is built. Finally, the associative recall process of the network is analyzed in detail and the improvements are explained.
A novel learning method for multi-valued associative memory networks is introduced, which is based on the Hebb rule but utilizes more information. The connection weight matrix can be chosen dynamically according to the current probe vector. Both double-valued and multi-valued associative memories are realized in our simulation experiment. The experimental results show that the method can enhance the associative success rate.
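For context, the classical Hebb rule that the method above builds on can be sketched for a two-valued (bipolar) auto-associative memory. This is the textbook baseline, not the paper's dynamic, probe-dependent weight selection or its multi-valued extension; the function names, the synchronous update, and the stored patterns are illustrative assumptions.

```python
def hebb_weights(patterns):
    """Hebbian weight matrix for bipolar (+1/-1) patterns, zero diagonal."""
    n = len(patterns[0])
    weights = [[0] * n for _ in range(n)]
    for pattern in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    # Units that are active together get a stronger link.
                    weights[i][j] += pattern[i] * pattern[j]
    return weights


def recall(weights, probe, max_steps=10):
    """Synchronously update the probe until it stops changing."""
    n = len(probe)
    state = list(probe)
    for _ in range(max_steps):
        new_state = [
            1 if sum(weights[i][j] * state[j] for j in range(n)) >= 0 else -1
            for i in range(n)
        ]
        if new_state == state:
            break
        state = new_state
    return state


if __name__ == "__main__":
    stored = [[1, -1, 1, -1, 1, -1], [1, 1, 1, -1, -1, -1]]
    w = hebb_weights(stored)
    noisy = [1, -1, 1, -1, 1, 1]  # first pattern with the last bit flipped
    print(recall(w, noisy))
```

In this baseline the weight matrix is fixed once the patterns are stored, whereas the abstract describes choosing the weights dynamically from the current probe vector.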
In this paper, a novel design procedure is proposed for synthesizing high-capacity auto-associative memories based on complex-valued neural networks with real-imaginary-type activation functions and constant delays. Stability criteria that depend on the external inputs of the neural networks are derived. The designed networks can retrieve the stored patterns from external inputs rather than from initial conditions. The resulting design can memorize the desired patterns with lower-dimensional networks than real-valued neural networks require, and it eliminates spurious equilibria of complex-valued neural networks. One numerical example is provided to show the effectiveness and superiority of the presented results.
Funding (MTIT-DLSTM multi-target tracking): Supported by the National Natural Science Foundation of China (No. 62276204), the Open Foundation of Science and Technology on Electronic Information Control Laboratory, the Natural Science Basic Research Program of Shanxi, China (Nos. 2022JM-340 and 2023-JC-QN-0710), and the China Postdoctoral Science Foundation (Nos. 2020T130494 and 2018M633470).
Funding (Baihetan Arch Dam unloading relaxation): This work was supported by the National Key Research and Development Program of China (Grant No. 2018YFC0407004) and the Natural Science Foundation of China (Grants No. 51939004 and 11772116).
Funding (ANN-based constitutive model of SMAs): Supported by the National Natural Science Foundation of China (NSFC) (Grant No. 12322203).
Funding: Supported by the National Research and Development Program (2022YFC3004603), the Jiangsu Province International Collaboration Program-Key National Industrial Technology Research and Development Cooperation Projects (BZ2023050), the Natural Science Foundation of Jiangsu Province (BK20221109), and the National Natural Science Foundation of China (52274098).
Abstract: The increasingly severe state of coal burst disasters has emerged as a critical factor constraining coal mine safety production, and enhancing the accuracy of coal burst prediction has become a challenging task. To address the insufficient exploration of the spatio-temporal characteristics of microseismic data and the difficulty of selecting the optimal time window size in spatio-temporal prediction, this paper integrates deep learning methods and theory to propose a novel coal burst spatio-temporal prediction method based on a Bidirectional Long Short-Term Memory (Bi-LSTM) network. The method involves three main modules: construction of microseismic spatio-temporal characteristic indicators, a temporal prediction model, and a spatial prediction model. To validate the effectiveness of the proposed method, engineering application tests are conducted at a high-risk working face in the Ordos mining area of Inner Mongolia, focusing on 13 high-energy microseismic events with energy levels greater than 10^5 J. In terms of temporal prediction, the results consist of 10 strong predictions and 3 medium predictions, and no false alarm is detected throughout the entire testing period. Moreover, compared to the traditional threshold-based coal burst temporal prediction method, the accuracy of the proposed method is increased by 38.5%. In terms of spatial prediction, the results for the high-energy events comprise 6 strong hazard predictions, 3 medium hazard predictions, and 4 weak hazard predictions.
Abstract: Developing efficient neural network (NN) computing systems is crucial in the era of artificial intelligence (AI). Traditional von Neumann architectures suffer from both the "memory wall" and the "power wall", which limit data transfer between memory and processing units [1,2]. Compute-in-memory (CIM) technologies, particularly analogue CIM with memristor crossbars, are promising because of their high energy efficiency, computational parallelism, and integration density for NN computations [3]. In practical applications, analogue CIM excels in tasks like speech recognition and image classification, revealing its unique advantages. For instance, it efficiently processes vast amounts of audio data in speech recognition, achieving high accuracy with minimal power consumption. In image classification, the high parallelism of analogue CIM significantly speeds up feature extraction and reduces processing time. With the rapid development of AI applications, the demands for computational accuracy and task complexity are rising continually. However, analogue CIM systems are limited in handling complex regression tasks that need precise floating-point (FP) calculations. They are primarily suited to classification tasks with low data precision and a limited dynamic range [4].
Funding: Supported by the National Key Research and Development Project (Grant Number 2023YFB3709601), the National Natural Science Foundation of China (Grant Numbers 62373215, 62373219, 62073193), the Key Research and Development Plan of Shandong Province (Grant Numbers 2021CXGC010204, 2022CXGC020902), the Fundamental Research Funds of Shandong University (Grant Number 2021JCG008), and the Natural Science Foundation of Shandong Province (Grant Number ZR2023MF100).
Abstract: Remaining useful life prediction for rolling bearings is vital for guaranteeing safety and reliability. In engineering scenarios, only a small amount of bearing performance degradation data can be obtained through accelerated life testing. In the absence of lifetime data, the hidden long-term correlation within performance degradation data is difficult to mine effectively, which is the main factor restricting the prediction precision and engineering application of residual life prediction methods. To address this problem, a novel method based on a multi-layer perceptron neural network and a bidirectional long short-term memory network is proposed. Firstly, a nonlinear health indicator (HI) calculation method based on kernel principal component analysis (KPCA) and exponential weighted moving average (EWMA) is designed. Then, using the raw vibration data and HI, a multi-layer perceptron (MLP) neural network is trained to calculate the HI of the online bearing in real time. Furthermore, a bidirectional long short-term memory (BiLSTM) model optimized by particle swarm optimization (PSO) is used to mine the time series features of the HI and predict the remaining service life. Performance verification experiments and comparative experiments are carried out on the XJTU-SY bearing open dataset. The results indicate that this method has an excellent ability to predict future HI and remaining life.
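The EWMA smoothing step named in this abstract can be illustrated with a minimal sketch. It assumes the standard recursion s_t = alpha * x_t + (1 - alpha) * s_{t-1}, initialized with the first sample; the function name `ewma` and the smoothing factor `alpha=0.3` are hypothetical choices, since the abstract gives neither the authors' parameters nor the details of the KPCA step that precedes the smoothing.

```python
def ewma(values, alpha=0.3):
    """Exponentially weighted moving average of a sequence.

    alpha is a hypothetical smoothing factor (0 < alpha <= 1); the abstract
    does not state the value used by the authors.
    """
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = []
    for x in values:
        if not smoothed:
            # Initialize the recursion with the first sample.
            smoothed.append(float(x))
        else:
            # s_t = alpha * x_t + (1 - alpha) * s_{t-1}
            smoothed.append(alpha * x + (1 - alpha) * smoothed[-1])
    return smoothed
```

A smaller `alpha` weights history more heavily and gives a smoother indicator, at the cost of reacting more slowly to sudden degradation.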
Funding: This work is supported by the National Key Research and Development Program of China (No. 2023YFB4203000) and the National Natural Science Foundation of China (No. U22A20178).
Abstract: The complicated loads encountered by floating offshore wind turbines (FOWTs) in real sea conditions are crucial for future design optimization, but obtaining data on them directly poses a challenge. To address this issue, we applied machine learning techniques to obtain the hydrodynamic and aerodynamic loads of FOWTs from measured platform motion responses and wave-elevation sequences. First, a computational fluid dynamics (CFD) simulation model of the floating platform was established based on the dynamic fluid body interaction technique and overset grid technology. Then, a long short-term memory (LSTM) neural network model was constructed and trained to learn the nonlinear relationship between the wave and platform-motion inputs and the hydrodynamic-load outputs. The optimal model was determined after analyzing the sensitivity of parameters such as sample characteristics, network layers, and neuron numbers. Subsequently, the effectiveness of the hydrodynamic load model was validated under different simulation conditions, and the aerodynamic load calculation was completed based on the D'Alembert principle. Finally, we built a hybrid-scale FOWT model, based on the software-in-the-loop strategy, in which the wind turbine was replaced by an actuation system. Model tests were carried out in a wave basin, and the results demonstrated that the root mean square errors of the hydrodynamic and aerodynamic load measurements were 4.20% and 10.68%, respectively.
Funding: Supported by the Ministry of Trade, Industry & Energy (MOTIE, Korea) under the Industrial Technology Innovation Program (No. 10063424, 'development of distant speech recognition and multi-task dialog processing technologies for in-door conversational robots').
Abstract: The Long Short-Term Memory (LSTM) Recurrent Neural Network (RNN) has driven tremendous improvements over acoustic models based on the Gaussian Mixture Model (GMM). However, these hybrid models require a forced-aligned Hidden Markov Model (HMM) state sequence obtained from the GMM-based acoustic model. Training therefore takes a long time, because both the GMM-based acoustic model and the deep learning-based acoustic model must be trained. To solve this problem, an acoustic model using the Connectionist Temporal Classification (CTC) algorithm is proposed. The CTC algorithm does not require the GMM-based acoustic model because it does not use the forced-aligned HMM state sequence. However, previous work on LSTM RNN-based acoustic models using CTC used small-scale training corpora. In this paper, the LSTM RNN-based acoustic model using CTC is trained on a large-scale training corpus and its performance is evaluated. The implemented acoustic model achieves a Word Error Rate (WER) of 6.18% for clean speech and 15.01% for noisy speech, which is similar to the performance of the acoustic model based on the hybrid method.
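For readers unfamiliar with the Word Error Rate reported in this abstract, the sketch below shows the conventional definition: the word-level edit distance (substitutions, deletions, and insertions) divided by the number of reference words. The function name `word_error_rate` and whitespace tokenization are illustrative assumptions; the abstract does not describe the authors' scoring tool or text normalization.

```python
def word_error_rate(reference, hypothesis):
    """WER = word-level Levenshtein distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.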
Abstract: In this paper, the global asymptotic stability of more general two-layer nonlinear feedback associative memory neural networks with time delays is examined. Sufficient conditions for the existence, uniqueness, and global asymptotic stability of the equilibrium position are given. Finally, two examples are given to illustrate the theory.
Funding: Supported by the National Natural Science Foundation of China (No. 51273189) and the National Science and Technology Major Project of the Ministry of Science and Technology of China (Nos. 2016ZX05016 and 2016ZX05046).
Abstract: Double network (DN) hydrogels, one kind of tough gel, have attracted extensive attention for their potential applications in biomedical and load-bearing fields. Herein, we introduce additional functions such as shape memory into the conventional tough DN hydrogel system. We synthesize PEG-PDAC/P(AAm-co-AAc) DN hydrogels, in which the first network is a well-defined PEG (polyethylene glycol) network loaded with PDAC (poly(acryloyloxyethyltrimethyl ammonium chloride)) strands, while the second network is formed by copolymerizing AAm (acrylamide) with AAc (acrylic acid) and the cross-linker MBAA (N,N′-methylenebisacrylamide). The PEG-PDAC/P(AAm-co-AAc) DN gels exhibit high mechanical strength. The fracture stress and toughness of the DN gels reach up to 0.9 MPa and 3.8 MJ/m^3, respectively. Unlike conventional double network hydrogels, which use neutral polymers as the soft and ductile second network, the PEG-PDAC/P(AAm-co-AAc) DN hydrogels use P(AAm-co-AAc), a weak polyelectrolyte, as the second network. The AAc units serve as coordination points for Fe^3+ ions and physically crosslink the second network, which realizes a shape memory property activated by the reducing ability of ascorbic acid. Our results indicate that high mechanical strength and shape memory, probably the two most important properties for the potential application of the hydrogels, can be introduced simultaneously into DN hydrogels if the functional monomer is suitably integrated into the network.
Funding: Supported by the National Natural Science Foundation of China.
Abstract: In this paper, the global exponential stability of the equilibrium position of general bidirectional associative memory neural networks is studied. Sufficient conditions for the existence and uniqueness of the equilibrium position are given. The energy function method is examined. Two examples are given to illustrate the theory.
Funding: Supported by the Natural Sciences and Engineering Research Council (NSERC) of Canada and by the State Scholarship Fund of China (No. 201506160061).
Abstract: The resistive switching characteristics of TiO2 nanowire networks directly grown on Ti foil by a single-step hydrothermal technique are discussed in this paper. The Ti foil serves as the supply of Ti atoms for growth of the TiO2 nanowires, making the preparation straightforward. It also acts as the bottom electrode of the device. A top Al electrode was fabricated by an e-beam evaporation process. The Al/TiO2 nanowire networks/Ti device fabricated in this way displayed highly repeatable, electroforming-free bipolar resistive behavior with retention for more than 10^4 s and an OFF/ON ratio of approximately 70. The switching mechanism of this Al/TiO2 nanowire networks/Ti device is suggested to arise from the migration of oxygen vacancies under an applied electric field. This provides a facile route to metal oxide nanowire-based ReRAM devices in the future.
Funding: The National Key R&D Program of China under contract No. 2016YFC1402103.
Abstract: To explore new operational methods for wave forecasting, a forecasting model for wave heights at three stations in the Bohai Sea has been developed. The model is based on a long short-term memory (LSTM) neural network, with sea surface wind and wave heights as training samples. The prediction performance of the model is evaluated, and the error analysis shows that, when using the same set of numerically predicted sea surface wind as input, the prediction error produced by the proposed LSTM model at Sta. N01 is 20%, 18%, and 23% lower than that of conventional numerical wave models in terms of the total root mean square error (RMSE), scatter index (SI), and mean absolute error (MAE), respectively. In particular, for significant wave heights in the range of 3–5 m, the prediction accuracy of the LSTM model improves most remarkably, with RMSE, SI, and MAE all decreasing by 24%. It is also evident that the number of hidden neurons, the number of buoys used, and the time length of the training samples all affect the prediction accuracy. However, the prediction does not necessarily improve as the number of hidden neurons or the number of buoys increases. The experiment trained on data with the longest time length is found to perform best overall compared with experiments using shorter training periods. Overall, the long short-term memory neural network proved to be a very promising method for future development and applications in wave forecasting.
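The three error measures compared in this abstract can be sketched as follows. RMSE and MAE follow their standard definitions; the scatter index is taken here as RMSE divided by the mean of the observations, which is one common convention and an assumption, since the abstract does not give the authors' formula. The function names are illustrative.

```python
import math


def rmse(observed, predicted):
    """Root mean square error between paired observations and predictions."""
    return math.sqrt(
        sum((p - o) ** 2 for o, p in zip(observed, predicted)) / len(observed)
    )


def mae(observed, predicted):
    """Mean absolute error between paired observations and predictions."""
    return sum(abs(p - o) for o, p in zip(observed, predicted)) / len(observed)


def scatter_index(observed, predicted):
    """Assumed convention: RMSE normalized by the mean observed value."""
    return rmse(observed, predicted) / (sum(observed) / len(observed))
```

For example, with observed wave heights `[2.0, 2.0]` and predictions `[1.0, 3.0]`, both RMSE and MAE equal 1.0 and the scatter index is 0.5.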
Abstract: Correct and timely fault diagnosis is important for improving the safety and reliability of chemical processes. With the advancement of big data technology, data-driven fault diagnosis methods are being extensively used and still have considerable potential. In recent years, methods based on deep neural networks have made significant breakthroughs, and fault diagnosis methods for industrial processes based on deep learning have attracted considerable research attention. Therefore, we propose a fusion deep learning algorithm based on a fully convolutional neural network (FCN) to extract features and build models that correctly diagnose all types of faults. We use long short-term memory (LSTM) units to extend the proposed FCN so that the deep learning model can better extract the time-domain features of chemical process data. We also introduce the attention mechanism into the model to highlight the importance of individual features, which is significant for the fault diagnosis of chemical processes with many features. When applied to the benchmark Tennessee Eastman process, the proposed model exhibits impressive performance, demonstrating the effectiveness of the attention-based LSTM FCN in chemical process fault diagnosis.
Abstract: A Holter monitor usually records electrocardiogram (ECG) signals for more than 24 hours to capture short-lived cardiac abnormalities. Given the large amount of Holter data and the fact that normal segments account for the majority, it is reasonable to design an algorithm that automatically eliminates as many normal data segments as possible without missing any abnormal data segments, and then passes the remaining segments to doctors or computer programs for further diagnosis. In this paper, we propose a preliminary abnormal segment screening method for Holter data. A prediction model based on long short-term memory (LSTM) networks is established and trained with the normal data of a monitored subject. Then, using kernel density estimation, we learn the distribution of prediction errors obtained by applying the trained LSTM model to the regular data. On this basis, the preliminary abnormal ECG segment screening analysis is carried out without R wave detection. Experiments on the MIT-BIH arrhythmia database show that, under the condition that no abnormal point is missed, 53.89% of normal segments can be effectively eliminated. This work can greatly reduce the workload of subsequent processing.
Funding: Supported by the National Natural Science Foundation of China (61876097, 61673188, 61761130081), the National Key Research and Development Program of China (2016YFB0800402), the Foundation for Innovative Research Groups of Hubei Province of China (2017CFA005), and the Fundamental Research Funds for the Central Universities (2017KFXKJC002).
Abstract: Traditional recurrent neural networks are composed of capacitors, inductors, resistors, and operational amplifiers. Memristive neural networks are constructed by replacing resistors with memristors. This paper focuses on the memory analysis, i.e., the initial value computation, of memristors. Firstly, we present the memory analysis for a single memristor based on mathematical models of memristors with linear and nonlinear drift. Secondly, we present the memory analysis for two memristors in series and in parallel. Thirdly, we point out the difference between traditional neural networks and memristive ones. Based on the current-voltage relationship of memristors, we use mathematical analysis and SPICE simulations to demonstrate the validity of our methods.
Abstract: There are two technical challenges in predicting slope deformation. The first is the random displacement, which cannot be decomposed and predicted by numerically resolving the observed accumulated displacement and time series of a landslide. The second is the dynamic evolution of a landslide, which cannot be feasibly simulated by traditional prediction models alone. In this paper, a dynamic displacement prediction model for composite landslides is introduced, based on a combination of empirical mode decomposition with soft screening stop criteria (SSSC-EMD) and a deep bidirectional long short-term memory (DBi-LSTM) neural network. In the proposed model, time series analysis and SSSC-EMD are used to decompose the observed accumulated displacement of a slope into three components, viz. trend displacement, periodic displacement, and random displacement. Then, by analyzing the evolution pattern of a landslide and the key factors that trigger it, appropriate influencing factors are selected for each displacement component, and the DBi-LSTM neural network is used to carry out multi-data-driven dynamic prediction for each component. The accumulated displacement prediction is obtained by summing the component predictions. To verify the accuracy and engineering practicability of the model, field observations from two known landslides in China, the Xintan landslide and the Bazimen landslide, were collected for comparison and evaluation. The case study verified that the proposed model better characterizes the "stepwise" deformation of a slope. Compared with the long short-term memory (LSTM) neural network, support vector machine (SVM), and autoregressive integrated moving average (ARIMA) model, the DBi-LSTM neural network predicts the periodic displacement of slope deformation more accurately, with the mean absolute percentage error reduced by 3.063%, 14.913%, and 13.960%, respectively, and the root mean square error reduced by 1.951 mm, 8.954 mm, and 7.790 mm, respectively. In conclusion, the model not only has high prediction accuracy but is also more stable, and can provide new insight for practical landslide prevention and control engineering.
Funding: National Natural Science Foundation of P.R. China (No. 69735101).
Abstract: Based on current research on applications of chaotic neuron networks for information processing, the stability and convergence of the chaotic neuron network are proved from the viewpoint of the energy function. Moreover, a new auto-associative matrix is devised for an artificial neural network composed of chaotic neurons, and thus an improved chaotic neuron network for associative memory is built. Finally, the associative recall process of the network is analyzed in detail and the improvement is explained.
Abstract: A novel learning method for multi-valued associative memory networks is introduced, which is based on the Hebb rule but utilizes more information. The connection weight matrix can be chosen dynamically according to the current probe vector. Both double-valued and multi-valued associative memories are realized in our simulation experiment. The experimental results show that the method can enhance the associative success rate.
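As background for the Hebb rule this abstract builds on, the sketch below shows the classic outer-product form for bipolar (+1/-1) patterns, with synchronous sign-threshold recall. It is the textbook two-valued baseline only, not the authors' method: the dynamic, probe-dependent choice of the weight matrix and the multi-valued extension are not specified in the abstract, and the names `hebb_weights` and `recall` are hypothetical.

```python
def hebb_weights(patterns):
    """Classic Hebbian outer-product weights for bipolar (+1/-1) patterns."""
    n = len(patterns[0])
    w = [[0.0] * n for _ in range(n)]
    for p in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:  # no self-connections
                    w[i][j] += p[i] * p[j]
    return w


def recall(w, probe, max_steps=10):
    """Synchronously update the state until it stops changing."""
    state = list(probe)
    for _ in range(max_steps):
        new_state = []
        for i, row in enumerate(w):
            field = sum(wij * sj for wij, sj in zip(row, state))
            # Keep the current value when the local field is exactly zero.
            new_state.append(1 if field > 0 else -1 if field < 0 else state[i])
        if new_state == state:
            break
        state = new_state
    return state
```

With a single stored pattern such as `[1, -1, 1, -1, 1, -1]`, a probe with one flipped element is restored to the stored pattern in one update.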
Funding: Project supported by the National Natural Science Foundation of China (Grant Nos. 61503338, 61573316, 61374152, and 11302195) and the Natural Science Foundation of Zhejiang Province, China (Grant No. LQ15F030005).
Abstract: In this paper, a novel design procedure is proposed for synthesizing high-capacity auto-associative memories based on complex-valued neural networks with real-imaginary-type activation functions and constant delays. Stability criteria that depend on the external inputs of the neural networks are derived. The designed networks can retrieve the stored patterns from external inputs rather than from initial conditions. The resulting design can memorize the desired patterns with lower-dimensional networks than real-valued neural networks require, and it eliminates spurious equilibria of complex-valued neural networks. One numerical example is provided to show the effectiveness and superiority of the presented results.