In dense pedestrian tracking,frequent object occlusions and close distances between objects cause difficulty when accurately estimating object trajectories.In this study,a conditional random field tracking model is es...In dense pedestrian tracking,frequent object occlusions and close distances between objects cause difficulty when accurately estimating object trajectories.In this study,a conditional random field tracking model is established by using a visual long short term memory network in the three-dimensional(3D)space and the motion estimations jointly performed on object trajectory segments.Object visual field information is added to the long short term memory network to improve the accuracy of the motion related object pair selection and motion estimation.To address the uncertainty of the length and interval of trajectory segments,a multimode long short term memory network is proposed for the object motion estimation.The tracking performance is evaluated using the PETS2009 dataset.The experimental results show that the proposed method achieves better performance than the tracking methods based on the independent motion estimation.展开更多
Speaker separation in complex acoustic environment is one of challenging tasks in speech separation.In practice,speakers are very often unmoving or moving slowly in normal communication.In this case,the spatial featur...Speaker separation in complex acoustic environment is one of challenging tasks in speech separation.In practice,speakers are very often unmoving or moving slowly in normal communication.In this case,the spatial features among the consecutive speech frames become highly correlated such that it is helpful for speaker separation by providing additional spatial information.To fully exploit this information,we design a separation system on Recurrent Neural Network(RNN)with long short-term memory(LSTM)which effectively learns the temporal dynamics of spatial features.In detail,a LSTM-based speaker separation algorithm is proposed to extract the spatial features in each time-frequency(TF)unit and form the corresponding feature vector.Then,we treat speaker separation as a supervised learning problem,where a modified ideal ratio mask(IRM)is defined as the training function during LSTM learning.Simulations show that the proposed system achieves attractive separation performance in noisy and reverberant environments.Specifically,during the untrained acoustic test with limited priors,e.g.,unmatched signal to noise ratio(SNR)and reverberation,the proposed LSTM based algorithm can still outperforms the existing DNN based method in the measures of PESQ and STOI.It indicates our method is more robust in untrained conditions.展开更多
Predicting wind speed accurately is essential to ensure the stability of the wind power system and improve the utilization rate of wind energy.However,owing to the stochastic and intermittent of wind speed,predicting ...Predicting wind speed accurately is essential to ensure the stability of the wind power system and improve the utilization rate of wind energy.However,owing to the stochastic and intermittent of wind speed,predicting wind speed accurately is difficult.A new hybrid deep learning model based on empirical wavelet transform,recurrent neural network and error correction for short-term wind speed prediction is proposed in this paper.The empirical wavelet transformation is applied to decompose the original wind speed series.The long short term memory network and the Elman neural network are adopted to predict low-frequency and high-frequency wind speed sub-layers respectively to balance the calculation efficiency and prediction accuracy.The error correction strategy based on deep long short term memory network is developed to modify the prediction errors.Four actual wind speed series are utilized to verify the effectiveness of the proposed model.The empirical results indicate that the method proposed in this paper has satisfactory performance in wind speed prediction.展开更多
In today’s world, there are many people suffering from mentalhealth problems such as depression and anxiety. If these conditions are notidentified and treated early, they can get worse quickly and have far-reachingne...In today’s world, there are many people suffering from mentalhealth problems such as depression and anxiety. If these conditions are notidentified and treated early, they can get worse quickly and have far-reachingnegative effects. Unfortunately, many people suffering from these conditions,especially depression and hypertension, are unaware of their existence until theconditions become chronic. Thus, this paper proposes a novel approach usingBi-directional Long Short-Term Memory (Bi-LSTM) algorithm and GlobalVector (GloVe) algorithm for the prediction and treatment of these conditions.Smartwatches and fitness bands can be equipped with these algorithms whichcan share data with a variety of IoT devices and smart systems to betterunderstand and analyze the user’s condition. We compared the accuracy andloss of the training dataset and the validation dataset of the two modelsnamely, Bi-LSTM without a global vector layer and with a global vector layer.It was observed that the model of Bi-LSTM without a global vector layer hadan accuracy of 83%,while Bi-LSTMwith a global vector layer had an accuracyof 86% with a precision of 86.4%, and an F1 score of 0.861. In addition toproviding basic therapies for the treatment of identified cases, our model alsohelps prevent the deterioration of associated conditions, making our methoda real-world solution.展开更多
Mental workload plays a vital role in cognitive impairment. The impairment refers to a person’s difficulty in remembering, receiving new information, learning new things, concentrating, or making decisions that serio...Mental workload plays a vital role in cognitive impairment. The impairment refers to a person’s difficulty in remembering, receiving new information, learning new things, concentrating, or making decisions that seriously affect everyday life. In this paper, the simultaneous capacity (SIMKAP) experiment-based EEG workload analysis was presented using 45 subjects for multitasking mental workload estimation with subject wise attention loss calculation as well as short term memory loss measurement. Using an open access preprocessed EEG dataset, Discrete wavelet transforms (DWT) was utilized for feature extraction and Minimum redundancy and maximum relevancy (MRMR) technique was used to select most relevance features. Wavelet decomposition technique was also used for decomposing EEG signals into five sub bands. Fourteen statistical features were calculated from each sub band signal to form a 5 × 14 window size. The Neural Network (Narrow) classification algorithm was used to classify dataset for low and high workload conditions and comparison was made using some other machine learning models. The results show the classifier’s accuracy of 86.7%, precision of 84.4%, F1 score of 86.33%, and recall of 88.37% that crosses the state-of-the art methodologies in the literature. This prediction is expected to greatly facilitate the improved way in memory and attention loss impairments assessment.展开更多
1 A study shows that music lessons obviously enhance children's cognitive abilities,including short⁃term memory and planning,which lead to improving academic performance.The research is the first large⁃scale and l...1 A study shows that music lessons obviously enhance children's cognitive abilities,including short⁃term memory and planning,which lead to improving academic performance.The research is the first large⁃scale and long⁃term study to be adapted into the regular school curriculum.Visual arts lessons were also found to significantly improve children's visual memory.展开更多
Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing de...Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing details about the speaker’s goals and desires, as well as their internal condition. Certain vocal characteristics reveal the speaker’s mood, intention, and motivation, while word study assists the speaker’s demand to be understood. Voice emotion recognition has become an essential component of modern HCC networks. Integrating findings from the various disciplines involved in identifying vocal emotions is also challenging. Many sound analysis techniques were developed in the past. Learning about the development of artificial intelligence (AI), and especially Deep Learning (DL) technology, research incorporating real data is becoming increasingly common these days. Thus, this research presents a novel selfish herd optimization-tuned long/short-term memory (SHO-LSTM) strategy to identify vocal emotions in human communication. The RAVDESS public dataset is used to train the suggested SHO-LSTM technique. Mel-frequency cepstral coefficient (MFCC) and wiener filter (WF) techniques are used, respectively, to remove noise and extract features from the data. LSTM and SHO are applied to the extracted data to optimize the LSTM network’s parameters for effective emotion recognition. Python Software was used to execute our proposed framework. In the finding assessment phase, Numerous metrics are used to evaluate the proposed model’s detection capability, Such as F1-score (95%), precision (95%), recall (96%), and accuracy (97%). The suggested approach is tested on a Python platform, and the SHO-LSTM’s outcomes are contrasted with those of other previously conducted research. Based on comparative assessments, our suggested approach outperforms the current approaches in vocal emotion recognition.展开更多
Aiming at the problem that some existing traffic flow prediction models are only for a single road segment and the model input data are not pre-processed,a heuristic threshold algorithm is used to de-noise the origina...Aiming at the problem that some existing traffic flow prediction models are only for a single road segment and the model input data are not pre-processed,a heuristic threshold algorithm is used to de-noise the original traffic flow data after wavelet decomposition.The correlation coefficients of road traffic flow data are calculated and the data compression matrix of road traffic flow is constructed.Data de-noising minimizes the interference of data to the model,while the correlation analysis of road network data realizes the prediction at the road network level.Utilizing the advantages of long short term memory(LSTM)network in time series data processing,the compression matrix is input into the constructed LSTM model for short-term traffic flow prediction.The LSTM-1 and LSTM-2 models were respectively trained by de-noising processed data and original data.Through simulation experiments,different prediction times were set,and the prediction results of the prediction model proposed in this paper were compared with those of other methods.It is found that the accuracy of the LSTM-2 model proposed in this paper increases by 10.278%on average compared with other prediction methods,and the prediction accuracy reaches 95.58%,which proves that the short-term traffic flow prediction method proposed in this paper is efficient.展开更多
Traffic flow prediction in urban areas is essential in the IntelligentTransportation System (ITS). Short Term Traffic Flow (STTF) predictionimpacts traffic flow series, where an estimation of the number of vehicleswil...Traffic flow prediction in urban areas is essential in the IntelligentTransportation System (ITS). Short Term Traffic Flow (STTF) predictionimpacts traffic flow series, where an estimation of the number of vehicleswill appear during the next instance of time per hour. Precise STTF iscritical in Intelligent Transportation System. Various extinct systems aim forshort-term traffic forecasts, ensuring a good precision outcome which was asignificant task over the past few years. The main objective of this paper is topropose a new model to predict STTF for every hour of a day. In this paper,we have proposed a novel hybrid algorithm utilizing Principal ComponentAnalysis (PCA), Stacked Auto-Encoder (SAE), Long Short Term Memory(LSTM), and K-Nearest Neighbors (KNN) named PALKNN. Firstly, PCAremoves unwanted information from the dataset and selects essential features.Secondly, SAE is used to reduce the dimension of input data using onehotencoding so the model can be trained with better speed. Thirdly, LSTMtakes the input from SAE, where the data is sorted in ascending orderbased on the important features and generates the derived value. Finally,KNN Regressor takes information from LSTM to predict traffic flow. Theforecasting performance of the PALKNN model is investigated with OpenRoad Traffic Statistics dataset, Great Britain, UK. This paper enhanced thetraffic flow prediction for every hour of a day with a minimal error value.An extensive experimental analysis was performed on the benchmark dataset.The evaluated results indicate the significant improvement of the proposedPALKNN model over the recent approaches such as KNN, SARIMA, LogisticRegression, RNN, and LSTM in terms of root mean square error (RMSE)of 2.07%, mean square error (MSE) of 4.1%, and mean absolute error (MAE)of 2.04%.展开更多
Wind power volatility not only limits the large-scale grid connection but also poses many challenges to safe grid operation.Accurate wind power prediction can mitigate the adverse effects of wind power volatility on w...Wind power volatility not only limits the large-scale grid connection but also poses many challenges to safe grid operation.Accurate wind power prediction can mitigate the adverse effects of wind power volatility on wind power grid connections.For the characteristics of wind power antecedent data and precedent data jointly to determine the prediction accuracy of the prediction model,the short-term prediction of wind power based on a combined neural network is proposed.First,the Bi-directional Long Short Term Memory(BiLSTM)network prediction model is constructed,and the bi-directional nature of the BiLSTM network is used to deeply mine the wind power data information and find the correlation information within the data.Secondly,to avoid the limitation of a single prediction model when the wind power changes abruptly,the Wavelet Transform-Improved Adaptive Genetic Algorithm-Back Propagation(WT-IAGA-BP)neural network based on the combination of the WT-IAGA-BP neural network and BiLSTM network is constructed for the short-term prediction of wind power.Finally,comparing with LSTM,BiLSTM,WT-LSTM,WT-BiLSTM,WT-IAGA-BP,and WT-IAGA-BP&LSTM prediction models,it is verified that the wind power short-term prediction model based on the combination of WT-IAGA-BP neural network and BiLSTM network has higher prediction accuracy.展开更多
In the electricity market,fluctuations in real-time prices are unstable,and changes in short-term load are determined by many factors.By studying the timing of charging and discharging,as well as the economic benefits...In the electricity market,fluctuations in real-time prices are unstable,and changes in short-term load are determined by many factors.By studying the timing of charging and discharging,as well as the economic benefits of energy storage in the process of participating in the power market,this paper takes energy storage scheduling as merely one factor affecting short-term power load,which affects short-term load time series along with time-of-use price,holidays,and temperature.A deep learning network is used to predict the short-term load,a convolutional neural network(CNN)is used to extract the features,and a long short-term memory(LSTM)network is used to learn the temporal characteristics of the load value,which can effectively improve prediction accuracy.Taking the load data of a certain region as an example,the CNN-LSTM prediction model is compared with the single LSTM prediction model.The experimental results show that the CNN-LSTM deep learning network with the participation of energy storage in dispatching can have high prediction accuracy for short-term power load forecasting.展开更多
This paper proposes a deformation evolution and perceptual prediction methodology for additive manufacturing of lightweight composite driven by hybrid digital twins(HDT).In order to improve manufacturing quality of ir...This paper proposes a deformation evolution and perceptual prediction methodology for additive manufacturing of lightweight composite driven by hybrid digital twins(HDT).In order to improve manufacturing quality of irregular lightweight composite through boosting conceptual design in aeronautic and aerospace engineering,the HDT meaning hybridization of physical and digital domains,including deformation and energy efficiency can be built,where the essential parameters can be perceptually predicted in advance,by virtue of the fusion of physical sensors and digital information.The long short term memory(LSTM)can be employed to void vanishing gradient problem and improve predicting precision via Recurrent Neural Networks,thereby laying a foundation for the HDT.The diverse manufacturing requirements of different regions are integrated into the parameters designing phase by attaching region weights confirmed via empiricism and in-service simulation.The effects of slicing strategy and external support structures on manufacturing quality are considered from the perspective of improving dimensional accuracy.The manufacturing efficiency and comprehensive costs are accounted as consideration factors,which are perceptually predicted via LSTM.The designed manufacturing parameters through HDT were virtually examined by evaluating the deformation and equivalent stress distributions of fabricated lightweight component with composite material through AM process simulation.The physical experiments were conducted to verify the HDT-based pre-designing and optimization method of manufacturing parameters via fused deposition modeling(FDM).The energy consumption of actual manufacturing process was measured via digital power meter and applied to evaluate accuracy of perceptual prediction outcomes.The dimensional accuracy and distortion distribution of the manufactured lightweight prototype made with composite material were measured through the coordinate measuring machine(CMM)and 3D optical scanner.The proposed method demonstrates effectiveness in improving manufacturing quality and accurately predicting energy consumption,which have been verified with a three-way solenoid valve element,in which the maximum deformation was reduced by 39.78%and the mean absolute percentage error for perceptual prediction was 3.76%.展开更多
Aiming at the problems of low accuracy and slow convergence speed of current intrusion detection models,SpiralConvolution is combined with Long Short-Term Memory Network to construct a new intrusion detection model.Th...Aiming at the problems of low accuracy and slow convergence speed of current intrusion detection models,SpiralConvolution is combined with Long Short-Term Memory Network to construct a new intrusion detection model.The dataset is first preprocessed using solo thermal encoding and normalization functions.Then the spiral convolution-Long Short-Term Memory Network model is constructed,which consists of spiral convolution,a two-layer long short-term memory network,and a classifier.It is shown through experiments that the model is characterized by high accuracy,small model computation,and fast convergence speed relative to previous deep learning models.The model uses a new neural network to achieve fast and accurate network traffic intrusion detection.The model in this paper achieves 0.9706 and 0.8432 accuracy rates on the NSL-KDD dataset and the UNSWNB-15 dataset under five classifications and ten classes,respectively.展开更多
Time series prediction has always been an important problem in the field of machine learning.Among them,power load forecasting plays a crucial role in identifying the behavior of photovoltaic power plants and regulati...Time series prediction has always been an important problem in the field of machine learning.Among them,power load forecasting plays a crucial role in identifying the behavior of photovoltaic power plants and regulating their control strategies.Traditional power load forecasting often has poor feature extraction performance for long time series.In this paper,a new deep learning framework Residual Stacked Temporal Long Short-Term Memory(RST-LSTM)is proposed,which combines wavelet decomposition and time convolutional memory network to solve the problem of feature extraction for long sequences.The network framework of RST-LSTM consists of two parts:one is a stacked time convolutional memory unit module for global and local feature extraction,and the other is a residual combination optimization module to reduce model redundancy.Finally,this paper demonstrates through various experimental indicators that RST-LSTM achieves significant performance improvements in both overall and local prediction accuracy compared to some state-of-the-art baseline methods.展开更多
Deep learning for time series sequence individual data instance classification can revolutionize computer assisted navigation by providing surgeons with accurate, real-time instrument locality through automatic instru...Deep learning for time series sequence individual data instance classification can revolutionize computer assisted navigation by providing surgeons with accurate, real-time instrument locality through automatic instrument localization. This paper presents an evaluation of Deep Learning models to perform individual data instance classification of time series data. The models explored include convolution and recurrent networks, as well as state-of-the-art residual and inception architectures. The time series data used to evaluate the models consists of depth and force measurements from a drill. Four recurrent neural network models using long short-term memory and gated recurrent units, known as baseline models, and four models using 1D convolution with ResNet and Inception architectures, known as advanced models, were evaluated by determining the data instance membership of the four classes. The four classes represent four distinct regions in a bone traversed by the drill bit during a surgical procedure. First, the time series data is preprocessed, identifying the four classes or regions of the bone. Next, the paper presents a discussion of the network architecture and modifications of both the basic and advanced deep learning models, followed by the training process and hyperparameters tuning. The performance of the models was evaluated using the precision and recall performance parameters. Out of the eight models evaluated, the recurrent neural network with gated recurrent units has the best performance. The paper also demonstrates the importance of the feature depth over the feature force in classifying the data instances, followed by the effects of the imbalanced dataset on the performance of the models.展开更多
Traditional algorithms do not fully utilize the timing information of non-cooperative targets,and setting too many motion parameters can lead to complex dynamic model calculations.This paper proposes a long short-term...Traditional algorithms do not fully utilize the timing information of non-cooperative targets,and setting too many motion parameters can lead to complex dynamic model calculations.This paper proposes a long short-term memory(LSTM)network-based method for estimating the parameters of the active segment of the non-cooperative target under single-satellite observation.Based on the simulation training set of the active segment of the non-cooperative target,the network parameters of the LSTM network are designed,the motion characteristics of the active segment of the non-cooperative target are fully excavated through data-driven methods,and the candidate cutting trajectories are screened and predicted to realize the estimation of the motion parameters of the active segment of the non-cooperative target under the condition of single-satellite observation.The experimental results show that the estimation method proposed in this paper can effectively deal with the inaccurate problem with the non-cooperative target’s active segment motion model established under the condition of single-satellite observation,obtain more accurate active segment motion parameters,and provide a feasible new idea and method for the parameter estimation of the active segment of the non-cooperative target under the single-satellite observation.展开更多
Text extraction from images using the traditional techniques of image collecting,and pattern recognition using machine learning consume time due to the amount of extracted features from the images.Deep Neural Networks...Text extraction from images using the traditional techniques of image collecting,and pattern recognition using machine learning consume time due to the amount of extracted features from the images.Deep Neural Networks introduce effective solutions to extract text features from images using a few techniques and the ability to train large datasets of images with significant results.This study proposes using Dual Maxpooling and concatenating convolution Neural Networks(CNN)layers with the activation functions Relu and the Optimized Leaky Relu(OLRelu).The proposed method works by dividing the word image into slices that contain characters.Then pass them to deep learning layers to extract feature maps and reform the predicted words.Bidirectional Short Memory(BiLSTM)layers extractmore compelling features and link the time sequence fromforward and backward directions during the training phase.The Connectionist Temporal Classification(CTC)function calcifies the training and validation loss rates.In addition to decoding the extracted feature to reform characters again and linking them according to their time sequence.The proposed model performance is evaluated using training and validation loss errors on the Mjsynth and Integrated Argument Mining Tasks(IAM)datasets.The result of IAM was 2.09%for the average loss errors with the proposed dualMaxpooling and OLRelu.In the Mjsynth dataset,the best validation loss rate shrunk to 2.2%by applying concatenating CNN layers,and Relu.展开更多
目的 2D姿态估计的误差是导致3D人体姿态估计产生误差的主要原因,如何在2D误差或噪声干扰下从2D姿态映射到最优、最合理的3D姿态,是提高3D人体姿态估计的关键。本文提出了一种稀疏表示与深度模型联合的3D姿态估计方法,以将3D姿态空间几...目的 2D姿态估计的误差是导致3D人体姿态估计产生误差的主要原因,如何在2D误差或噪声干扰下从2D姿态映射到最优、最合理的3D姿态,是提高3D人体姿态估计的关键。本文提出了一种稀疏表示与深度模型联合的3D姿态估计方法,以将3D姿态空间几何先验与时间信息相结合,达到提高3D姿态估计精度的目的。方法利用融合稀疏表示的3D可变形状模型得到单帧图像可靠的3D初始值。构建多通道长短时记忆MLSTM(multichannel long short term memory)降噪编/解码器,将获得的单帧3D初始值以时间序列形式输入到其中,利用MLSTM降噪编/解码器学习相邻帧之间人物姿态的时间依赖关系,并施加时间平滑约束,得到最终优化的3D姿态。结果在Human3.6M数据集上进行了对比实验。对于两种输入数据:数据集给出的2D坐标和通过卷积神经网络获得的2D估计坐标,相比于单帧估计,通过MLSTM降噪编/解码器优化后的视频序列平均重构误差分别下降了12.6%,13%;相比于现有的基于视频的稀疏模型方法,本文方法对视频的平均重构误差下降了6.4%,9.1%。对于2D估计坐标数据,相比于现有的深度模型方法,本文方法对视频的平均重构误差下降了12.8%。结论本文提出的基于时间信息的MLSTM降噪编/解码器与稀疏模型相结合,有效利用了3D姿态先验知识,视频帧间人物姿态连续变化的时间和空间依赖性,一定程度上提高了单目视频3D姿态估计的精度。展开更多
Short-term traffic flow forecasting is a significant part of intelligent transportation system.In some traffic control scenarios,obtaining future traffic flow in advance is conducive to highway management department t...Short-term traffic flow forecasting is a significant part of intelligent transportation system.In some traffic control scenarios,obtaining future traffic flow in advance is conducive to highway management department to have sufficient time to formulate corresponding traffic flow control measures.In hence,it is meaningful to establish an accurate short-term traffic flow method and provide reference for peak traffic flow warning.This paper proposed a new hybrid model for traffic flow forecasting,which is composed of the variational mode decomposition(VMD)method,the group method of data handling(GMDH)neural network,bi-directional long and short term memory(BILSTM)network and ELMAN network,and is optimized by the imperialist competitive algorithm(ICA)method.To illustrate the performance of the proposed model,there are several comparative experiments between the proposed model and other models.The experiment results show that 1)BILSTM network,GMDH network and ELMAN network have better predictive performance than other single models;2)VMD can significantly improve the predictive performance of the ICA-GMDH-BILSTM-ELMAN model.The effect of VMD method is better than that of EEMD method and FEEMD method.To conclude,the proposed model which is made up of the VMD method,the ICA method,the BILSTM network,the GMDH network and the ELMAN network has excellent predictive ability for traffic flow series.展开更多
Rotating machinery is important to industrial production. Any failure of rotating machinery, especially the failure of rolling bearings, can lead to equipment shutdown and even more serious incidents. Therefore, accur...Rotating machinery is important to industrial production. Any failure of rotating machinery, especially the failure of rolling bearings, can lead to equipment shutdown and even more serious incidents. Therefore, accurate residual life prediction plays a crucial role in guaranteeing machine operation safety and reliability and reducing maintenance cost. In order to increase the forecasting precision of the remaining useful life(RUL) of the rolling bearing, an advanced approach combining elastic net with long short-time memory network(LSTM) is proposed, and the new approach is referred to as E-LSTM. The E-LSTM algorithm consists of an elastic mesh and LSTM, taking temporal-spatial correlation into consideration to forecast the RUL through the LSTM. To solve the over-fitting problem of the LSTM neural network during the training process, the elastic net based regularization term is introduced to the LSTM structure.In this way, the change of the output can be well characterized to express the bearing degradation mode. Experimental results from the real-world data demonstrate that the proposed E-LSTM method can obtain higher stability and relevant values that are useful for the RUL forecasting of bearing. Furthermore, these results also indicate that E-LSTM can achieve better performance.展开更多
文摘In dense pedestrian tracking,frequent object occlusions and close distances between objects cause difficulty when accurately estimating object trajectories.In this study,a conditional random field tracking model is established by using a visual long short term memory network in the three-dimensional(3D)space and the motion estimations jointly performed on object trajectory segments.Object visual field information is added to the long short term memory network to improve the accuracy of the motion related object pair selection and motion estimation.To address the uncertainty of the length and interval of trajectory segments,a multimode long short term memory network is proposed for the object motion estimation.The tracking performance is evaluated using the PETS2009 dataset.The experimental results show that the proposed method achieves better performance than the tracking methods based on the independent motion estimation.
基金This work is supported by the National Nature Science Foundation of China(NSFC)under Grant Nos.61571106,61501169,41706103the Fundamental Research Funds for the Central Universities under Grant No.2242013K30010.
文摘Speaker separation in complex acoustic environment is one of challenging tasks in speech separation.In practice,speakers are very often unmoving or moving slowly in normal communication.In this case,the spatial features among the consecutive speech frames become highly correlated such that it is helpful for speaker separation by providing additional spatial information.To fully exploit this information,we design a separation system on Recurrent Neural Network(RNN)with long short-term memory(LSTM)which effectively learns the temporal dynamics of spatial features.In detail,a LSTM-based speaker separation algorithm is proposed to extract the spatial features in each time-frequency(TF)unit and form the corresponding feature vector.Then,we treat speaker separation as a supervised learning problem,where a modified ideal ratio mask(IRM)is defined as the training function during LSTM learning.Simulations show that the proposed system achieves attractive separation performance in noisy and reverberant environments.Specifically,during the untrained acoustic test with limited priors,e.g.,unmatched signal to noise ratio(SNR)and reverberation,the proposed LSTM based algorithm can still outperforms the existing DNN based method in the measures of PESQ and STOI.It indicates our method is more robust in untrained conditions.
基金the Gansu Province Soft Scientific Research Projects(No.2015GS06516)the Funds for Distinguished Young Scientists of Lanzhou University of Technology,China(No.J201304)。
文摘Predicting wind speed accurately is essential to ensure the stability of the wind power system and improve the utilization rate of wind energy.However,owing to the stochastic and intermittent of wind speed,predicting wind speed accurately is difficult.A new hybrid deep learning model based on empirical wavelet transform,recurrent neural network and error correction for short-term wind speed prediction is proposed in this paper.The empirical wavelet transformation is applied to decompose the original wind speed series.The long short term memory network and the Elman neural network are adopted to predict low-frequency and high-frequency wind speed sub-layers respectively to balance the calculation efficiency and prediction accuracy.The error correction strategy based on deep long short term memory network is developed to modify the prediction errors.Four actual wind speed series are utilized to verify the effectiveness of the proposed model.The empirical results indicate that the method proposed in this paper has satisfactory performance in wind speed prediction.
基金This research is funded by Vellore Institute of Technology,Chennai,India.
文摘In today’s world, there are many people suffering from mentalhealth problems such as depression and anxiety. If these conditions are notidentified and treated early, they can get worse quickly and have far-reachingnegative effects. Unfortunately, many people suffering from these conditions,especially depression and hypertension, are unaware of their existence until theconditions become chronic. Thus, this paper proposes a novel approach usingBi-directional Long Short-Term Memory (Bi-LSTM) algorithm and GlobalVector (GloVe) algorithm for the prediction and treatment of these conditions.Smartwatches and fitness bands can be equipped with these algorithms whichcan share data with a variety of IoT devices and smart systems to betterunderstand and analyze the user’s condition. We compared the accuracy andloss of the training dataset and the validation dataset of the two modelsnamely, Bi-LSTM without a global vector layer and with a global vector layer.It was observed that the model of Bi-LSTM without a global vector layer hadan accuracy of 83%,while Bi-LSTMwith a global vector layer had an accuracyof 86% with a precision of 86.4%, and an F1 score of 0.861. In addition toproviding basic therapies for the treatment of identified cases, our model alsohelps prevent the deterioration of associated conditions, making our methoda real-world solution.
文摘Mental workload plays a vital role in cognitive impairment. The impairment refers to a person’s difficulty in remembering, receiving new information, learning new things, concentrating, or making decisions that seriously affect everyday life. In this paper, the simultaneous capacity (SIMKAP) experiment-based EEG workload analysis was presented using 45 subjects for multitasking mental workload estimation with subject wise attention loss calculation as well as short term memory loss measurement. Using an open access preprocessed EEG dataset, Discrete wavelet transforms (DWT) was utilized for feature extraction and Minimum redundancy and maximum relevancy (MRMR) technique was used to select most relevance features. Wavelet decomposition technique was also used for decomposing EEG signals into five sub bands. Fourteen statistical features were calculated from each sub band signal to form a 5 × 14 window size. The Neural Network (Narrow) classification algorithm was used to classify dataset for low and high workload conditions and comparison was made using some other machine learning models. The results show the classifier’s accuracy of 86.7%, precision of 84.4%, F1 score of 86.33%, and recall of 88.37% that crosses the state-of-the art methodologies in the literature. This prediction is expected to greatly facilitate the improved way in memory and attention loss impairments assessment.
文摘1 A study shows that music lessons obviously enhance children's cognitive abilities,including short⁃term memory and planning,which lead to improving academic performance.The research is the first large⁃scale and long⁃term study to be adapted into the regular school curriculum.Visual arts lessons were also found to significantly improve children's visual memory.
基金The author Dr.Arshiya S.Ansari extends the appreciation to the Deanship of Postgraduate Studies and Scientific Research at Majmaah University for funding this research work through the project number(R-2025-1538).
文摘Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing details about the speaker’s goals and desires, as well as their internal condition. Certain vocal characteristics reveal the speaker’s mood, intention, and motivation, while word study assists the speaker’s demand to be understood. Voice emotion recognition has become an essential component of modern HCC networks. Integrating findings from the various disciplines involved in identifying vocal emotions is also challenging. Many sound analysis techniques were developed in the past. Learning about the development of artificial intelligence (AI), and especially Deep Learning (DL) technology, research incorporating real data is becoming increasingly common these days. Thus, this research presents a novel selfish herd optimization-tuned long/short-term memory (SHO-LSTM) strategy to identify vocal emotions in human communication. The RAVDESS public dataset is used to train the suggested SHO-LSTM technique. Mel-frequency cepstral coefficient (MFCC) and wiener filter (WF) techniques are used, respectively, to remove noise and extract features from the data. LSTM and SHO are applied to the extracted data to optimize the LSTM network’s parameters for effective emotion recognition. Python Software was used to execute our proposed framework. In the finding assessment phase, Numerous metrics are used to evaluate the proposed model’s detection capability, Such as F1-score (95%), precision (95%), recall (96%), and accuracy (97%). The suggested approach is tested on a Python platform, and the SHO-LSTM’s outcomes are contrasted with those of other previously conducted research. Based on comparative assessments, our suggested approach outperforms the current approaches in vocal emotion recognition.
基金National Natural Science Foundation of China(No.71961016)Planning Fund for the Humanities and Social Sciences of the Ministry of Education(Nos.15XJAZH002,18YJAZH148)Natural Science Foundation of Gansu Province(No.18JR3RA125)。
文摘Aiming at the problem that some existing traffic flow prediction models are only for a single road segment and the model input data are not pre-processed,a heuristic threshold algorithm is used to de-noise the original traffic flow data after wavelet decomposition.The correlation coefficients of road traffic flow data are calculated and the data compression matrix of road traffic flow is constructed.Data de-noising minimizes the interference of data to the model,while the correlation analysis of road network data realizes the prediction at the road network level.Utilizing the advantages of long short term memory(LSTM)network in time series data processing,the compression matrix is input into the constructed LSTM model for short-term traffic flow prediction.The LSTM-1 and LSTM-2 models were respectively trained by de-noising processed data and original data.Through simulation experiments,different prediction times were set,and the prediction results of the prediction model proposed in this paper were compared with those of other methods.It is found that the accuracy of the LSTM-2 model proposed in this paper increases by 10.278%on average compared with other prediction methods,and the prediction accuracy reaches 95.58%,which proves that the short-term traffic flow prediction method proposed in this paper is efficient.
文摘Traffic flow prediction in urban areas is essential in the IntelligentTransportation System (ITS). Short Term Traffic Flow (STTF) predictionimpacts traffic flow series, where an estimation of the number of vehicleswill appear during the next instance of time per hour. Precise STTF iscritical in Intelligent Transportation System. Various extinct systems aim forshort-term traffic forecasts, ensuring a good precision outcome which was asignificant task over the past few years. The main objective of this paper is topropose a new model to predict STTF for every hour of a day. In this paper,we have proposed a novel hybrid algorithm utilizing Principal ComponentAnalysis (PCA), Stacked Auto-Encoder (SAE), Long Short Term Memory(LSTM), and K-Nearest Neighbors (KNN) named PALKNN. Firstly, PCAremoves unwanted information from the dataset and selects essential features.Secondly, SAE is used to reduce the dimension of input data using onehotencoding so the model can be trained with better speed. Thirdly, LSTMtakes the input from SAE, where the data is sorted in ascending orderbased on the important features and generates the derived value. Finally,KNN Regressor takes information from LSTM to predict traffic flow. Theforecasting performance of the PALKNN model is investigated with OpenRoad Traffic Statistics dataset, Great Britain, UK. This paper enhanced thetraffic flow prediction for every hour of a day with a minimal error value.An extensive experimental analysis was performed on the benchmark dataset.The evaluated results indicate the significant improvement of the proposedPALKNN model over the recent approaches such as KNN, SARIMA, LogisticRegression, RNN, and LSTM in terms of root mean square error (RMSE)of 2.07%, mean square error (MSE) of 4.1%, and mean absolute error (MAE)of 2.04%.
基金support of national natural science foundation of China(No.52067021)natural science foundation of Xinjiang(2022D01C35)+1 种基金excellent youth scientific and technological talents plan of Xinjiang(No.2019Q012)major science&technology special project of Xinjiang Uygur Autonomous Region(2022A01002-2)。
文摘Wind power volatility not only limits the large-scale grid connection but also poses many challenges to safe grid operation.Accurate wind power prediction can mitigate the adverse effects of wind power volatility on wind power grid connections.For the characteristics of wind power antecedent data and precedent data jointly to determine the prediction accuracy of the prediction model,the short-term prediction of wind power based on a combined neural network is proposed.First,the Bi-directional Long Short Term Memory(BiLSTM)network prediction model is constructed,and the bi-directional nature of the BiLSTM network is used to deeply mine the wind power data information and find the correlation information within the data.Secondly,to avoid the limitation of a single prediction model when the wind power changes abruptly,the Wavelet Transform-Improved Adaptive Genetic Algorithm-Back Propagation(WT-IAGA-BP)neural network based on the combination of the WT-IAGA-BP neural network and BiLSTM network is constructed for the short-term prediction of wind power.Finally,comparing with LSTM,BiLSTM,WT-LSTM,WT-BiLSTM,WT-IAGA-BP,and WT-IAGA-BP&LSTM prediction models,it is verified that the wind power short-term prediction model based on the combination of WT-IAGA-BP neural network and BiLSTM network has higher prediction accuracy.
基金supported by a State Grid Zhejiang Electric Power Co.,Ltd.Economic and Technical Research Institute Project(Key Technologies and Empirical Research of Diversified Integrated Operation of User-Side Energy Storage in Power Market Environment,No.5211JY19000W)supported by the National Natural Science Foundation of China(Research on Power Market Management to Promote Large-Scale New Energy Consumption,No.71804045).
文摘In the electricity market,fluctuations in real-time prices are unstable,and changes in short-term load are determined by many factors.By studying the timing of charging and discharging,as well as the economic benefits of energy storage in the process of participating in the power market,this paper takes energy storage scheduling as merely one factor affecting short-term power load,which affects short-term load time series along with time-of-use price,holidays,and temperature.A deep learning network is used to predict the short-term load,a convolutional neural network(CNN)is used to extract the features,and a long short-term memory(LSTM)network is used to learn the temporal characteristics of the load value,which can effectively improve prediction accuracy.Taking the load data of a certain region as an example,the CNN-LSTM prediction model is compared with the single LSTM prediction model.The experimental results show that the CNN-LSTM deep learning network with the participation of energy storage in dispatching can have high prediction accuracy for short-term power load forecasting.
基金Supported by National Key Research and Development Project of China(Grant No.2022YFB3303303)Zhejiang Provincial Research and Development Project of China(Grant No.LGG22E050010)Key Open Fund of State Key Laboratory of Materials Processing and Die and Mould Technology of China(Grant No.P2024-001).
文摘This paper proposes a deformation evolution and perceptual prediction methodology for additive manufacturing of lightweight composite driven by hybrid digital twins(HDT).In order to improve manufacturing quality of irregular lightweight composite through boosting conceptual design in aeronautic and aerospace engineering,the HDT meaning hybridization of physical and digital domains,including deformation and energy efficiency can be built,where the essential parameters can be perceptually predicted in advance,by virtue of the fusion of physical sensors and digital information.The long short term memory(LSTM)can be employed to void vanishing gradient problem and improve predicting precision via Recurrent Neural Networks,thereby laying a foundation for the HDT.The diverse manufacturing requirements of different regions are integrated into the parameters designing phase by attaching region weights confirmed via empiricism and in-service simulation.The effects of slicing strategy and external support structures on manufacturing quality are considered from the perspective of improving dimensional accuracy.The manufacturing efficiency and comprehensive costs are accounted as consideration factors,which are perceptually predicted via LSTM.The designed manufacturing parameters through HDT were virtually examined by evaluating the deformation and equivalent stress distributions of fabricated lightweight component with composite material through AM process simulation.The physical experiments were conducted to verify the HDT-based pre-designing and optimization method of manufacturing parameters via fused deposition modeling(FDM).The energy consumption of actual manufacturing process was measured via digital power meter and applied to evaluate accuracy of perceptual prediction outcomes.The dimensional accuracy and distortion distribution of the manufactured lightweight prototype made with composite material were measured through the coordinate measuring machine(CMM)and 3D optical scanner.The proposed method demonstrates effectiveness in improving manufacturing quality and accurately predicting energy consumption,which have been verified with a three-way solenoid valve element,in which the maximum deformation was reduced by 39.78%and the mean absolute percentage error for perceptual prediction was 3.76%.
基金the Gansu University of Political Science and Law Key Research Funding Project in 2018(GZF2018XZDLW20)Gansu Provincial Science and Technology Plan Project(Technology Innovation Guidance Plan)(20CX9ZA072).
文摘Aiming at the problems of low accuracy and slow convergence speed of current intrusion detection models,SpiralConvolution is combined with Long Short-Term Memory Network to construct a new intrusion detection model.The dataset is first preprocessed using solo thermal encoding and normalization functions.Then the spiral convolution-Long Short-Term Memory Network model is constructed,which consists of spiral convolution,a two-layer long short-term memory network,and a classifier.It is shown through experiments that the model is characterized by high accuracy,small model computation,and fast convergence speed relative to previous deep learning models.The model uses a new neural network to achieve fast and accurate network traffic intrusion detection.The model in this paper achieves 0.9706 and 0.8432 accuracy rates on the NSL-KDD dataset and the UNSWNB-15 dataset under five classifications and ten classes,respectively.
基金funded by NARI Group’s Independent Project of China(Granted No.524609230125)the foundation of NARI-TECH Nanjing Control System Ltd.of China(Granted No.0914202403120020).
文摘Time series prediction has always been an important problem in the field of machine learning.Among them,power load forecasting plays a crucial role in identifying the behavior of photovoltaic power plants and regulating their control strategies.Traditional power load forecasting often has poor feature extraction performance for long time series.In this paper,a new deep learning framework Residual Stacked Temporal Long Short-Term Memory(RST-LSTM)is proposed,which combines wavelet decomposition and time convolutional memory network to solve the problem of feature extraction for long sequences.The network framework of RST-LSTM consists of two parts:one is a stacked time convolutional memory unit module for global and local feature extraction,and the other is a residual combination optimization module to reduce model redundancy.Finally,this paper demonstrates through various experimental indicators that RST-LSTM achieves significant performance improvements in both overall and local prediction accuracy compared to some state-of-the-art baseline methods.
文摘Deep learning for time series sequence individual data instance classification can revolutionize computer assisted navigation by providing surgeons with accurate, real-time instrument locality through automatic instrument localization. This paper presents an evaluation of Deep Learning models to perform individual data instance classification of time series data. The models explored include convolution and recurrent networks, as well as state-of-the-art residual and inception architectures. The time series data used to evaluate the models consists of depth and force measurements from a drill. Four recurrent neural network models using long short-term memory and gated recurrent units, known as baseline models, and four models using 1D convolution with ResNet and Inception architectures, known as advanced models, were evaluated by determining the data instance membership of the four classes. The four classes represent four distinct regions in a bone traversed by the drill bit during a surgical procedure. First, the time series data is preprocessed, identifying the four classes or regions of the bone. Next, the paper presents a discussion of the network architecture and modifications of both the basic and advanced deep learning models, followed by the training process and hyperparameters tuning. The performance of the models was evaluated using the precision and recall performance parameters. Out of the eight models evaluated, the recurrent neural network with gated recurrent units has the best performance. The paper also demonstrates the importance of the feature depth over the feature force in classifying the data instances, followed by the effects of the imbalanced dataset on the performance of the models.
基金supported by the National Natural Science Foundation of China under grant 42271448by the Key Laboratory of Land satellite Remote sensing Application,Ministry of Natural Resources of the People’s Republic of China(Grant No.KLSMNR-G202317).
文摘Traditional algorithms do not fully utilize the timing information of non-cooperative targets,and setting too many motion parameters can lead to complex dynamic model calculations.This paper proposes a long short-term memory(LSTM)network-based method for estimating the parameters of the active segment of the non-cooperative target under single-satellite observation.Based on the simulation training set of the active segment of the non-cooperative target,the network parameters of the LSTM network are designed,the motion characteristics of the active segment of the non-cooperative target are fully excavated through data-driven methods,and the candidate cutting trajectories are screened and predicted to realize the estimation of the motion parameters of the active segment of the non-cooperative target under the condition of single-satellite observation.The experimental results show that the estimation method proposed in this paper can effectively deal with the inaccurate problem with the non-cooperative target’s active segment motion model established under the condition of single-satellite observation,obtain more accurate active segment motion parameters,and provide a feasible new idea and method for the parameter estimation of the active segment of the non-cooperative target under the single-satellite observation.
基金supported this project under the Fundamental Research Grant Scheme(FRGS)FRGS/1/2019/ICT02/UKM/02/9 entitled“Convolution Neural Network Enhancement Based on Adaptive Convexity and Regularization Functions for Fake Video Analytics”.This grant was received by Prof.Assis.Dr.S.N.H.Sheikh Abdullah,https://www.ukm.my/spifper/research_news/instrumentfunds.
文摘Text extraction from images using the traditional techniques of image collecting,and pattern recognition using machine learning consume time due to the amount of extracted features from the images.Deep Neural Networks introduce effective solutions to extract text features from images using a few techniques and the ability to train large datasets of images with significant results.This study proposes using Dual Maxpooling and concatenating convolution Neural Networks(CNN)layers with the activation functions Relu and the Optimized Leaky Relu(OLRelu).The proposed method works by dividing the word image into slices that contain characters.Then pass them to deep learning layers to extract feature maps and reform the predicted words.Bidirectional Short Memory(BiLSTM)layers extractmore compelling features and link the time sequence fromforward and backward directions during the training phase.The Connectionist Temporal Classification(CTC)function calcifies the training and validation loss rates.In addition to decoding the extracted feature to reform characters again and linking them according to their time sequence.The proposed model performance is evaluated using training and validation loss errors on the Mjsynth and Integrated Argument Mining Tasks(IAM)datasets.The result of IAM was 2.09%for the average loss errors with the proposed dualMaxpooling and OLRelu.In the Mjsynth dataset,the best validation loss rate shrunk to 2.2%by applying concatenating CNN layers,and Relu.
文摘目的 2D姿态估计的误差是导致3D人体姿态估计产生误差的主要原因,如何在2D误差或噪声干扰下从2D姿态映射到最优、最合理的3D姿态,是提高3D人体姿态估计的关键。本文提出了一种稀疏表示与深度模型联合的3D姿态估计方法,以将3D姿态空间几何先验与时间信息相结合,达到提高3D姿态估计精度的目的。方法利用融合稀疏表示的3D可变形状模型得到单帧图像可靠的3D初始值。构建多通道长短时记忆MLSTM(multichannel long short term memory)降噪编/解码器,将获得的单帧3D初始值以时间序列形式输入到其中,利用MLSTM降噪编/解码器学习相邻帧之间人物姿态的时间依赖关系,并施加时间平滑约束,得到最终优化的3D姿态。结果在Human3.6M数据集上进行了对比实验。对于两种输入数据:数据集给出的2D坐标和通过卷积神经网络获得的2D估计坐标,相比于单帧估计,通过MLSTM降噪编/解码器优化后的视频序列平均重构误差分别下降了12.6%,13%;相比于现有的基于视频的稀疏模型方法,本文方法对视频的平均重构误差下降了6.4%,9.1%。对于2D估计坐标数据,相比于现有的深度模型方法,本文方法对视频的平均重构误差下降了12.8%。结论本文提出的基于时间信息的MLSTM降噪编/解码器与稀疏模型相结合,有效利用了3D姿态先验知识,视频帧间人物姿态连续变化的时间和空间依赖性,一定程度上提高了单目视频3D姿态估计的精度。
基金Project(61873283)supported by the National Natural Science Foundation of ChinaProject(KQ1707017)supported by the Changsha Science&Technology Project,ChinaProject(2019CX005)supported by the Innovation Driven Project of the Central South University,China。
文摘Short-term traffic flow forecasting is a significant part of intelligent transportation system.In some traffic control scenarios,obtaining future traffic flow in advance is conducive to highway management department to have sufficient time to formulate corresponding traffic flow control measures.In hence,it is meaningful to establish an accurate short-term traffic flow method and provide reference for peak traffic flow warning.This paper proposed a new hybrid model for traffic flow forecasting,which is composed of the variational mode decomposition(VMD)method,the group method of data handling(GMDH)neural network,bi-directional long and short term memory(BILSTM)network and ELMAN network,and is optimized by the imperialist competitive algorithm(ICA)method.To illustrate the performance of the proposed model,there are several comparative experiments between the proposed model and other models.The experiment results show that 1)BILSTM network,GMDH network and ELMAN network have better predictive performance than other single models;2)VMD can significantly improve the predictive performance of the ICA-GMDH-BILSTM-ELMAN model.The effect of VMD method is better than that of EEMD method and FEEMD method.To conclude,the proposed model which is made up of the VMD method,the ICA method,the BILSTM network,the GMDH network and the ELMAN network has excellent predictive ability for traffic flow series.
基金by National Natural Science Foundation of China(No.61972443)National Key Research and Development Plan Program of China(No.2019YFE0105300)+1 种基金Hunan Provincial Hu-Xiang Young Talents Project of China(No.2018RS3095)Hunan Provincial Natural Science Foundation of China(No.2020JJ5199).
文摘Rotating machinery is important to industrial production. Any failure of rotating machinery, especially the failure of rolling bearings, can lead to equipment shutdown and even more serious incidents. Therefore, accurate residual life prediction plays a crucial role in guaranteeing machine operation safety and reliability and reducing maintenance cost. In order to increase the forecasting precision of the remaining useful life(RUL) of the rolling bearing, an advanced approach combining elastic net with long short-time memory network(LSTM) is proposed, and the new approach is referred to as E-LSTM. The E-LSTM algorithm consists of an elastic mesh and LSTM, taking temporal-spatial correlation into consideration to forecast the RUL through the LSTM. To solve the over-fitting problem of the LSTM neural network during the training process, the elastic net based regularization term is introduced to the LSTM structure.In this way, the change of the output can be well characterized to express the bearing degradation mode. Experimental results from the real-world data demonstrate that the proposed E-LSTM method can obtain higher stability and relevant values that are useful for the RUL forecasting of bearing. Furthermore, these results also indicate that E-LSTM can achieve better performance.