Periodicity is common in natural processes, however, extraction tools are typically difficult and cumbersome to use. Here we report a computational method developed in MATLAB through a function called Periods with the...Periodicity is common in natural processes, however, extraction tools are typically difficult and cumbersome to use. Here we report a computational method developed in MATLAB through a function called Periods with the aim to find the main harmonic components of time series data. This function is designed to obtain the period, amplitude and lag phase of the main harmonic components in a time series (Periods and lag phase components can be related to climate, social or economic events). It is based on methods of periodic regression with cyclic descent and includes statistical significance testing. The proposed method is very easy to use. Furthermore, it does not require full understanding of time series theory, nor require many inputs from the user. However, it is sufficiently flexible to undertake more complex tasks such as forecasting. Additionally, based on previous knowledge, specific periods can be included or excluded easily. The output results are organized into two groups. One contains the parameters of the adjusted model and their F statistics. The other consists of the harmonic parameters that best fit the original series according to their importance and the summarized statistics of the comparisons between successive models in the cyclic descent process. Periods is tested with both, simulated and actual sunspot and Multivariate ENSO Index data to show its performance and accuracy.展开更多
In today’s rapidly evolving internet landscape,prominent companies across various industries face increasingly complex business operations,leading to significant cluster-scale growth.However,this growth brings about ...In today’s rapidly evolving internet landscape,prominent companies across various industries face increasingly complex business operations,leading to significant cluster-scale growth.However,this growth brings about challenges in cluster management and the inefficient utilization of vast amounts of data due to its low value density.This paper,based on the large-scale cluster virtualization and monitoring system of the data center of the Bureau of Geophysical Prospecting(BGP),utilizes time series data of host resources from the monitoring system’s time series database to propose a multivariate multi-step time series forecasting model,MUL-CNN-BiGRU-Attention,for forecasting CPU load on virtual cluster hosts.The model undergoes extensive offline training using a large volume of time series data,followed by deployment using TensorFlow Serving.Recent small-batch data are employed for fine-tuning model parameters to better adapt to current data patterns.Comparative experiments are conducted between the proposed model and other baseline models,demonstrating notable improvements in Mean Absolute Error(MAE),Mean Squared Error(MSE),Root Mean Squared Error(RMSE),and R2 metrics by up to 35.2%,56.1%,32.5%,and 10.3%,respectively.Additionally,ablation experiments are designed to investigate the impact of different factors on the performance of the forecasting model,providing valuable insights for parameter optimization based on experimental results.展开更多
Long-term multivariate time series forecasting is an important task in engineering applications. It helps grasp the future development trend of data in real-time, which is of great significance for a wide variety of f...Long-term multivariate time series forecasting is an important task in engineering applications. It helps grasp the future development trend of data in real-time, which is of great significance for a wide variety of fields. Due to the non-linear and unstable characteristics of multivariate time series, the existing methods encounter difficulties in analyzing complex high-dimensional data and capturing latent relationships between multivariates in time series, thus affecting the performance of long-term prediction. In this paper, we propose a novel time series forecasting model based on multilayer perceptron that combines spatio-temporal decomposition and doubly residual stacking, namely Spatio-Temporal Decomposition Neural Network (STDNet). We decompose the originally complex and unstable time series into two parts, temporal term and spatial term. We design temporal module based on auto-correlation mechanism to discover temporal dependencies at the sub-series level, and spatial module based on convolutional neural network and self-attention mechanism to integrate multivariate information from two dimensions, global and local, respectively. Then we integrate the results obtained from the different modules to get the final forecast. Extensive experiments on four real-world datasets show that STDNet significantly outperforms other state-of-the-art methods, which provides an effective solution for long-term time series forecasting.展开更多
Medium-term forecasting is an important category of electric load forecasting that covers a time span of up to one year ahead. It suits outage and maintenance planning, as well as load switching operation. We propose ...Medium-term forecasting is an important category of electric load forecasting that covers a time span of up to one year ahead. It suits outage and maintenance planning, as well as load switching operation. We propose a new methodol-ogy that uses hourly daily loads to predict the next year hourly loads, and hence predict the peak loads expected to be reached in the next coming year. The technique is based on implementing multivariable regression on previous year's hourly loads. Three regression models are investigated in this research: the linear, the polynomial, and the exponential power. The proposed models are applied to real loads of the Jordanian power system. Results obtained using the pro-posed methods showed that their performance is close and they outperform results obtained using the widely used ex-ponential regression technique. Moreover, peak load prediction has about 90% accuracy using the proposed method-ology. The methods are generic and simple and can be implemented to hourly loads of any power system. No extra in-formation other than the hourly loads is required.展开更多
Multivariate Time Series(MTS)forecasting is an essential problem in many fields.Accurate forecasting results can effectively help in making decisions.To date,many MTS forecasting methods have been proposed and widely ...Multivariate Time Series(MTS)forecasting is an essential problem in many fields.Accurate forecasting results can effectively help in making decisions.To date,many MTS forecasting methods have been proposed and widely applied.However,these methods assume that the predicted value of a single variable is affected by all other variables,ignoring the causal relationship among variables.To address the above issue,we propose a novel end-to-end deep learning model,termed graph neural network with neural Granger causality,namely CauGNN,in this paper.To characterize the causal information among variables,we introduce the neural Granger causality graph in our model.Each variable is regarded as a graph node,and each edge represents the casual relationship between variables.In addition,convolutional neural network filters with different perception scales are used for time series feature extraction,to generate the feature of each node.Finally,the graph neural network is adopted to tackle the forecasting problem of the graph structure generated by the MTS.Three benchmark datasets from the real world are used to evaluate the proposed CauGNN,and comprehensive experiments show that the proposed method achieves state-of-the-art results in the MTS forecasting task.展开更多
Water is a vital resource.It supports a multitude of industries,civilizations,and agriculture.However,climatic conditions impact water availability,particularly in desert areas where the temperature is high,and rain i...Water is a vital resource.It supports a multitude of industries,civilizations,and agriculture.However,climatic conditions impact water availability,particularly in desert areas where the temperature is high,and rain is scarce.Therefore,it is crucial to forecast water demand to provide it to sectors either on regular or emergency days.The study aims to develop an accurate model to forecast daily water demand under the impact of climatic conditions.This forecasting is known as a multivariate time series because it uses both the historical data of water demand and climatic conditions to forecast the future.Focusing on the collected data of Jeddah city,Saudi Arabia in the period between 2004 and 2018,we develop a hybrid approach that uses Artificial Neural Networks(ANN)for forecasting and Particle Swarm Optimization algorithm(PSO)for tuning ANNs’hyperparameters.Based on the Root Mean Square Error(RMSE)metric,results show that the(PSO-ANN)is an accurate model for multivariate time series forecasting.Also,the first day is the most difficult day for prediction(highest error rate),while the second day is the easiest to predict(lowest error rate).Finally,correlation analysis shows that the dew point is the most climatic factor affecting water demand.展开更多
文摘Periodicity is common in natural processes, however, extraction tools are typically difficult and cumbersome to use. Here we report a computational method developed in MATLAB through a function called Periods with the aim to find the main harmonic components of time series data. This function is designed to obtain the period, amplitude and lag phase of the main harmonic components in a time series (Periods and lag phase components can be related to climate, social or economic events). It is based on methods of periodic regression with cyclic descent and includes statistical significance testing. The proposed method is very easy to use. Furthermore, it does not require full understanding of time series theory, nor require many inputs from the user. However, it is sufficiently flexible to undertake more complex tasks such as forecasting. Additionally, based on previous knowledge, specific periods can be included or excluded easily. The output results are organized into two groups. One contains the parameters of the adjusted model and their F statistics. The other consists of the harmonic parameters that best fit the original series according to their importance and the summarized statistics of the comparisons between successive models in the cyclic descent process. Periods is tested with both, simulated and actual sunspot and Multivariate ENSO Index data to show its performance and accuracy.
文摘In today’s rapidly evolving internet landscape,prominent companies across various industries face increasingly complex business operations,leading to significant cluster-scale growth.However,this growth brings about challenges in cluster management and the inefficient utilization of vast amounts of data due to its low value density.This paper,based on the large-scale cluster virtualization and monitoring system of the data center of the Bureau of Geophysical Prospecting(BGP),utilizes time series data of host resources from the monitoring system’s time series database to propose a multivariate multi-step time series forecasting model,MUL-CNN-BiGRU-Attention,for forecasting CPU load on virtual cluster hosts.The model undergoes extensive offline training using a large volume of time series data,followed by deployment using TensorFlow Serving.Recent small-batch data are employed for fine-tuning model parameters to better adapt to current data patterns.Comparative experiments are conducted between the proposed model and other baseline models,demonstrating notable improvements in Mean Absolute Error(MAE),Mean Squared Error(MSE),Root Mean Squared Error(RMSE),and R2 metrics by up to 35.2%,56.1%,32.5%,and 10.3%,respectively.Additionally,ablation experiments are designed to investigate the impact of different factors on the performance of the forecasting model,providing valuable insights for parameter optimization based on experimental results.
基金supported by the National Key Research and Development Program of China (No. 2021YFB3300503)Regional Innovation and Development Joint Fund of National Natural Science Foundation of China (No. U22A20167)National Natural Science Foundation of China (No. 61872260).
文摘Long-term multivariate time series forecasting is an important task in engineering applications. It helps grasp the future development trend of data in real-time, which is of great significance for a wide variety of fields. Due to the non-linear and unstable characteristics of multivariate time series, the existing methods encounter difficulties in analyzing complex high-dimensional data and capturing latent relationships between multivariates in time series, thus affecting the performance of long-term prediction. In this paper, we propose a novel time series forecasting model based on multilayer perceptron that combines spatio-temporal decomposition and doubly residual stacking, namely Spatio-Temporal Decomposition Neural Network (STDNet). We decompose the originally complex and unstable time series into two parts, temporal term and spatial term. We design temporal module based on auto-correlation mechanism to discover temporal dependencies at the sub-series level, and spatial module based on convolutional neural network and self-attention mechanism to integrate multivariate information from two dimensions, global and local, respectively. Then we integrate the results obtained from the different modules to get the final forecast. Extensive experiments on four real-world datasets show that STDNet significantly outperforms other state-of-the-art methods, which provides an effective solution for long-term time series forecasting.
文摘Medium-term forecasting is an important category of electric load forecasting that covers a time span of up to one year ahead. It suits outage and maintenance planning, as well as load switching operation. We propose a new methodol-ogy that uses hourly daily loads to predict the next year hourly loads, and hence predict the peak loads expected to be reached in the next coming year. The technique is based on implementing multivariable regression on previous year's hourly loads. Three regression models are investigated in this research: the linear, the polynomial, and the exponential power. The proposed models are applied to real loads of the Jordanian power system. Results obtained using the pro-posed methods showed that their performance is close and they outperform results obtained using the widely used ex-ponential regression technique. Moreover, peak load prediction has about 90% accuracy using the proposed method-ology. The methods are generic and simple and can be implemented to hourly loads of any power system. No extra in-formation other than the hourly loads is required.
基金supported in part by the National Natural Science Foundation of China (No.62002035)the Natural Science Foundation of Chongqing (No.cstc2020jcyj-bshX0034).
文摘Multivariate Time Series(MTS)forecasting is an essential problem in many fields.Accurate forecasting results can effectively help in making decisions.To date,many MTS forecasting methods have been proposed and widely applied.However,these methods assume that the predicted value of a single variable is affected by all other variables,ignoring the causal relationship among variables.To address the above issue,we propose a novel end-to-end deep learning model,termed graph neural network with neural Granger causality,namely CauGNN,in this paper.To characterize the causal information among variables,we introduce the neural Granger causality graph in our model.Each variable is regarded as a graph node,and each edge represents the casual relationship between variables.In addition,convolutional neural network filters with different perception scales are used for time series feature extraction,to generate the feature of each node.Finally,the graph neural network is adopted to tackle the forecasting problem of the graph structure generated by the MTS.Three benchmark datasets from the real world are used to evaluate the proposed CauGNN,and comprehensive experiments show that the proposed method achieves state-of-the-art results in the MTS forecasting task.
文摘Water is a vital resource.It supports a multitude of industries,civilizations,and agriculture.However,climatic conditions impact water availability,particularly in desert areas where the temperature is high,and rain is scarce.Therefore,it is crucial to forecast water demand to provide it to sectors either on regular or emergency days.The study aims to develop an accurate model to forecast daily water demand under the impact of climatic conditions.This forecasting is known as a multivariate time series because it uses both the historical data of water demand and climatic conditions to forecast the future.Focusing on the collected data of Jeddah city,Saudi Arabia in the period between 2004 and 2018,we develop a hybrid approach that uses Artificial Neural Networks(ANN)for forecasting and Particle Swarm Optimization algorithm(PSO)for tuning ANNs’hyperparameters.Based on the Root Mean Square Error(RMSE)metric,results show that the(PSO-ANN)is an accurate model for multivariate time series forecasting.Also,the first day is the most difficult day for prediction(highest error rate),while the second day is the easiest to predict(lowest error rate).Finally,correlation analysis shows that the dew point is the most climatic factor affecting water demand.