This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear...This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear forecasting techniques. One metric redefines the distance in k-nearest neighbors based on the coefficients of autoregression (AR) in time series. Meanwhile, an improvement to Kulesh's adaptive metrics in the nearest neighbors is also presented. To evaluate the performance of the two proposed metrics, three types of time-series data, namely deterministic synthetic data, chaotic time-series data and real time-series data, are predicted. Experimental results show the superiority of the proposed AR-enhanced k-nearest neighbors methods to the traditional k-nearest neighbors metric and Kulesh's adaptive metrics.展开更多
-In this paper, monthly mean SST data in a large area are used. After the spacial average of the data is carried out and the secular monthly means are substracted, a time series (Jan. 1951-Dec. 1985) of SST anomalies ...-In this paper, monthly mean SST data in a large area are used. After the spacial average of the data is carried out and the secular monthly means are substracted, a time series (Jan. 1951-Dec. 1985) of SST anomalies of the cold tongue water area in the eastern tropical Pacific Ocean is obtained. On the basis of the time series, an autoregression model, a self-exciting threshold autoregression model and an open loop autoregression model are developed respectively. The interannual variations are simulated by means of those models. The simulation results show that all the three models have made very good hindcasting for the nine El Nino events since 1951. In order to test the reliability of the open loop threshold model, extrapolated forecast was made for the period of Jan. 1986-Feb. 1987. It can be seen from the forecasting that the model could forecast well the beginning and strengthening stages of the recent El Nino event (1986-1987). Correlation coefficients of the estimations to observations are respectively 0. 84, 0. 88 and 0. 89. It is obvious that all the models work well and the open loop threshold one is the best. So the open loop threshold autoregression model is a useful tool for monitoring the SSTinterannual variation of the cold tongue water area in the Eastern Equatorial Pacific Ocean and for estimating the El Nino strength.展开更多
Network autoregression and factor model are effective methods for modeling network time series data.In this study,we propose a network autoregression model with a factor structure that incorporates a latent group stru...Network autoregression and factor model are effective methods for modeling network time series data.In this study,we propose a network autoregression model with a factor structure that incorporates a latent group structure to address nodal heterogeneity within the network.An iterative algorithm is employed to minimize a least-squares objective function,allowing for simultaneous estimation of both the parameters and the group structure.To determine the unknown number of groups and factors,a PIC criterion is introduced.Additionally,statistical inference of the estimated parameters is presented.To assess the validity of the proposed estimation and inference procedures,we conduct extensive numerical studies.We also demonstrate the utility of our model using a stock dataset obtained from the Chinese A-Share stock market.展开更多
Forecasting solar irradiance is a critical task in the renewable energy sector, as it provides essential information regarding the potential energy production from solar panels. This study aims to utilize the Vector A...Forecasting solar irradiance is a critical task in the renewable energy sector, as it provides essential information regarding the potential energy production from solar panels. This study aims to utilize the Vector Autoregression (VAR) model to forecast solar irradiance levels and weather characteristics in the San Francisco Bay Area. The results demonstrate a correlation between predicted and actual solar irradiance, indicating the effectiveness of the VAR model for this task. However, the model may not be sufficient for this region due to the requirement of additional weather features to reduce disparities between predictions and actual observations. Additionally, the current lag order in the model is relatively low, limiting its ability to capture all relevant information from past observations. As a result, the model’s forecasting capability is limited to short-term horizons, with a maximum horizon of four hours.展开更多
Weather forecasting is crucial to both the demand and supply sides of electricity systems. Temperature has a great effect on the demand side. Moreover, solar and wind are very promising renewable energy sources and ar...Weather forecasting is crucial to both the demand and supply sides of electricity systems. Temperature has a great effect on the demand side. Moreover, solar and wind are very promising renewable energy sources and are, thus, important on the supply side. In this paper, a large vector autoregression(VAR) model is built to forecast three important weather variables for 61 cities around the United States. The three variables at all locations are modeled as response variables. Lag terms are used to capture the relationship between observations in adjacent periods and daily and annual seasonality are modeled to consider the correlation between the same periods in adjacent days and years. We estimate the VAR model with16 years of hourly historical data and use two additional years of data for out-of-sample validation. Forecasts of up to six-hours-ahead are generated with good forecasting performance based on mean absolute error, root mean square error, relative root mean square error, and skill scores. Our VAR model gives forecasts with skill scoresthat are more than double the skill scores of other forecasting models in the literature. Our model also provides forecasts that outperform persistence forecasts by between6% and 80% in terms of mean absolute error. Our results show that the proposed time series approach is appropriate for very short-term forecasting of hourly solar radiation,temperature, and wind speed.展开更多
This paper studies the autoregression models of order one,in a general time series setting that allows for weakly dependent innovations.Let{Xt}be a linear process defined by Xt=∑k=0^∞ψkεt-k,where{ψk,k≥0}is a seq...This paper studies the autoregression models of order one,in a general time series setting that allows for weakly dependent innovations.Let{Xt}be a linear process defined by Xt=∑k=0^∞ψkεt-k,where{ψk,k≥0}is a sequence of real numbers and{εk,k=0,±1,±2,...}is a sequence of random variables.Two results are proved in this paper.In the first result,assuming that{εk,k≥1}is a sequence of asymptotically linear negative quadrant dependent(ALNQD)random variables,the authors find the limiting distributions of the least squares estimator and the associated regression t statistic.It is interesting that the limiting distributions are similar to the one found in earlier work under the assumption of i.i.d,innovations.In the second result the authors prove that the least squares estimator is not a strong consistency estimator of the autoregressive parameter a when{εk,k≥1}is a sequence of negatively associated(NA)random variables,andψ0=1,ψk=0,k≥1.展开更多
China's astonishing economic growth implies a necessity to understand its inflation. The present paper employs threshold nonrecursive structural vector autoregression analysis to explore the asymmetric effects of mac...China's astonishing economic growth implies a necessity to understand its inflation. The present paper employs threshold nonrecursive structural vector autoregression analysis to explore the asymmetric effects of macro-variables on inflation in low and high inflation regimes. The empirical evidence demonstrates, first, that the reactions of inflation to various shocks are inflation-regime-dependent and asymmetric. Second, monetary policy influences China "s high inflation and adjusting the domestic interest rate in China may be an effective way to control inflation in a high inflation regime, but not in a low inflation regime. In a high inflation regime, a high inflation rate may cause the macro-policy authorities to increase the domestic interest rate, in an attempt to stabilize high inflation. Third, contrary to expectations, the world oil price is not a strong cost-push factor in a low inflation regime. Oil price increases may increase inflation in a high inflation regime, but there is no such obvious effect in a low inflation regime. Finally, China "s nominal effective exchange rate influences inflation in both low and high inflation regimes. A nominal effeetive exchange rate appreciation might be effective in controlling domestic inflation in both regimes.展开更多
Through extension of canonical correlation to the analysis of meteorological element fields (MEF), a concept from combination of canonical autocorrelation with canonical autoregression (CAR) is developed for short-ter...Through extension of canonical correlation to the analysis of meteorological element fields (MEF), a concept from combination of canonical autocorrelation with canonical autoregression (CAR) is developed for short-term climatic prediction of MEFs with a formulated scheme. Experi- mental results suggest that the scheme is of encouraging usefulness to a weak persistence MEF, i.e., rainfall field and, in particular, to a strong persistance one like a SST field.展开更多
Since 2002,the People's Bank of China has frequently used both quantity-based direct monetary instruments and price-based indirect monetary instruments to promote economic growth and stabilize price level.Specific...Since 2002,the People's Bank of China has frequently used both quantity-based direct monetary instruments and price-based indirect monetary instruments to promote economic growth and stabilize price level.Specifically,this study estimates 13 three-variable factor-augmented vector autoregression (FAVAR) models to explore how two types of monetary instruments affect China's economy and price level.Overall,we find that monetary policy has positive effects on China's economy and price level.Second,this study clearly states that the effectiveness of China's monetary policy on the economy has depended on China's quantity-based direct monetary instruments since 2002.Third,the effectiveness of quantity-based direct monetary instruments on China's economy and price level is dependent on the significant and positive effects of quantity-based direct monetary instruments after the 2008 financial crisis.Fourth,the significant and positive effects of price-based indirect monetary instruments on China's economy and price level before 2008 cannot fundamentally change their current insignificant effects on China's economy and price level.展开更多
In the parameter tracking of time-varying systems, the ordinary method is weighted least squares with the rectangular window or the exponential window. In this paper we propose a new kind of sliding window called the ...In the parameter tracking of time-varying systems, the ordinary method is weighted least squares with the rectangular window or the exponential window. In this paper we propose a new kind of sliding window called the multiple exponential window, and then use it to fit time-varying Gaussian vector autoregressive models. The asymptotic bias and covariance of the estimator of the parameter for time-invariant models are also derived. Simulation results show that the multiple exponential windows have better parameter tracking effect than rectangular windows and exponential ones.展开更多
Although the popular multi-fidelity surrogate models,stochastic collocation and nonlinear autoregression have been applied successfully to multiple benchmark problems in different areas of science and engineering,they...Although the popular multi-fidelity surrogate models,stochastic collocation and nonlinear autoregression have been applied successfully to multiple benchmark problems in different areas of science and engineering,they have certain limitations.We propose a uniform Bayesian framework that connects these two methods allowing us to combine the strengths of both.To this end,we introduce Greedy-NAR,a nonlinear Bayesian autoregressive model that can handle complex between-fidelity correlations and involves a sequential construction that allows for significant improvements in performance given a limited computational budget.The proposed enhanced nonlinear autoregressive method is applied to three benchmark problems that are typical of energy applications,namely molecular dynamics and computational fluid dynamics.The results indicate an increase in both prediction stability and accuracy when compared to those of the standard multi-fidelity autoregression implementations.The results also reveal the advantages over the stochastic collocation approach in terms of accuracy and computational cost.Generally speaking,the proposed enhancement provides a straightforward and easily implemented approach for boosting the accuracy and efficiency of concatenated structure multi-fidelity simulation methods,e.g.,the nonlinear autoregressive model,with a negligible additional computational cost.展开更多
This study investigates the determinants that drive the volatility of the credit default swaps(CDS)of BRICIT(Brazil,Russia,India,China,Indonesia,and Turkey)nations as a proxy measure for sovereign risk.On the existenc...This study investigates the determinants that drive the volatility of the credit default swaps(CDS)of BRICIT(Brazil,Russia,India,China,Indonesia,and Turkey)nations as a proxy measure for sovereign risk.On the existence of cointegration,an unrestricted error correction model integrated with the autoregressive distributed lag(ARDL)model is applied to measure the short-run and long-run dynamics empirically.The study utilizes the Bayesian global vector autoregression methodology for cross-border spillover estimation.The study also suggests a strategy for policymakers for quadrant categorization to mitigate risk arising from cross-border spillover.The result of ARDL indicates that the global macroeconomic variables affect the BRICIT CDS more than domestic macroeconomic determinants,with Indian CDS being the most sensitive to Fed tapering.Notably,China’s CDS is the most sensitive to shocks,with the CDS volatility primarily driven by China’s geopolitical risk.Russian CDS is more sensitive to real effective exchange rates due to severe ruble depreciation than crude oil,despite Russia being a major oil exporter.The quadrant categorization indicates that the Indonesian stock market index is most interconnected with BRICIT CDS,while the Turkish long-term interest rates send the highest intensity spillover across BRICIT nations.展开更多
As one of the main characteristics of atmospheric pollutants,PM_(2.5) severely affects human health and has received widespread attention in recent years.How to predict the variations of PM_(2.5) concentrations with h...As one of the main characteristics of atmospheric pollutants,PM_(2.5) severely affects human health and has received widespread attention in recent years.How to predict the variations of PM_(2.5) concentrations with high accuracy is an important topic.The PM_(2.5) monitoring stations in Xinjiang Uygur Autonomous Region,China,are unevenly distributed,which makes it challenging to conduct comprehensive analyses and predictions.Therefore,this study primarily addresses the limitations mentioned above and the poor generalization ability of PM_(2.5) concentration prediction models across different monitoring stations.We chose the northern slope of the Tianshan Mountains as the study area and took the January−December in 2019 as the research period.On the basis of data from 21 PM_(2.5) monitoring stations as well as meteorological data(temperature,instantaneous wind speed,and pressure),we developed an improved model,namely GCN−TCN−AR(where GCN is the graph convolution network,TCN is the temporal convolutional network,and AR is the autoregression),for predicting PM_(2.5) concentrations on the northern slope of the Tianshan Mountains.The GCN−TCN−AR model is composed of an improved GCN model,a TCN model,and an AR model.The results revealed that the R2 values predicted by the GCN−TCN−AR model at the four monitoring stations(Urumqi,Wujiaqu,Shihezi,and Changji)were 0.93,0.91,0.93,and 0.92,respectively,and the RMSE(root mean square error)values were 6.85,7.52,7.01,and 7.28μg/m^(3),respectively.The performance of the GCN−TCN−AR model was also compared with the currently neural network models,including the GCN−TCN,GCN,TCN,Support Vector Regression(SVR),and AR.The GCN−TCN−AR outperformed the other current neural network models,with high prediction accuracy and good stability,making it especially suitable for the predictions of PM_(2.5)concentrations.This study revealed the significant spatiotemporal variations of PM_(2.5)concentrations.First,the PM_(2.5) concentrations exhibited clear seasonal fluctuations,with higher levels typically observed in winter and differences presented between months.Second,the spatial distribution analysis revealed that cities such as Urumqi and Wujiaqu have high PM_(2.5) concentrations,with a noticeable geographical clustering of pollutions.Understanding the variations in PM_(2.5) concentrations is highly important for the sustainable development of ecological environment in arid areas.展开更多
Objective:Near vision loss(NVL)is one of the leading causes of visual impairment worldwide,exerting a profound impact on individual quality of life and socio-economic development.This study aims to analyze the burden ...Objective:Near vision loss(NVL)is one of the leading causes of visual impairment worldwide,exerting a profound impact on individual quality of life and socio-economic development.This study aims to analyze the burden of NVL in China by sex and age groups from 1990 to 2021 and to project trends over the next 15 years.Methods:Using data from the Global Burden of Disease(GBD)2021 database,we conducted descriptive analyses of NVL prevalence in China,calculated age-standardized prevalence rates(ASPR)and age-standardized disability-adjusted life years rates(ASDR)to compare burden differences between sexes and age groups,and applied an autoregressive integrated moving average(ARIMA)model to predict NVL trends for the next 15 years.The model selection was based on best-fit criteria to ensure reliable projections.Results:From 1990 to 2021,China’s ASPR of NVL rose from 10096.24/100000 to 15624.54/100000,and ASDR increased from 101.75/100000 to 158.75/100000.In 2021,ASPR(16551.70/100000)and ASDR(167.69/100000)were higher among females than males(14686.21/100000 and 149.76/100000,respectively).China ranked highest globally in both NVL cases and disability-adjusted life years(DALYs),with female burden significantly exceeding male burden.Projections indicated this trend and sex gap will persist until 2036.Compared with 1990,the prevalence cases and DALYs increased by 239.20%and 238.82%,respectively in 2021,with the highest burden among females and the 55−59 age group.The ARIMA model predicted continued increases in prevalence and DALYs by 2036,with females maintaining a higher burden than males.Conclusion:This study reveals a marked increase in the NVL burden in China and predicts continued growth in the coming years.Public health policies should prioritize NVL prevention and control,with special attention to women and middle-aged populations to mitigate long-term societal and health impacts.展开更多
The rapid expansion of tobacco farming poses a significant threat to biodiversity in Yunnan Province,China,a region known for its rich biodiversity.This study aims to understand the trade-offs between tobacco farming ...The rapid expansion of tobacco farming poses a significant threat to biodiversity in Yunnan Province,China,a region known for its rich biodiversity.This study aims to understand the trade-offs between tobacco farming and higher plant species diversity,and to identify priority counties for conservation.We employed an integrated approach combining species distribution modeling,GIS overlay analysis,and empirical spatial regression to em pirically assess the impact of tobacco farming intensity on biodiversity risk.Our findings reveal a compelling negative spatial correlation between tobacco farming expansion and higher plant species diversity.Specifically,southern counties in Wenshan and Honghe prefectures are major priority areas of conservation that exhibit signif icant spatial correlations between biodiversity risks and high tobacco farming intensity.Quantitatively,at county level,a 1%increase in tobacco farming area corresponds to a 0.094%decrease in endemic higher plant species richness across the entire province.These results underscore the need for targeted and region-specific regulations to mitigate biodiversity loss and promote sustainable development in Yunnan Province.The integrated approach used in this study provides a comprehensive assessment of the tobacco-biodiversity trade-offs,offering actionable insights for policymaking.展开更多
Aquila Optimizer(AO)is a recently proposed population-based optimization technique inspired by Aquila’s behavior in catching prey.AO is applied in various applications and its numerous variants were proposed in the l...Aquila Optimizer(AO)is a recently proposed population-based optimization technique inspired by Aquila’s behavior in catching prey.AO is applied in various applications and its numerous variants were proposed in the literature.However,chaos theory has not been extensively investigated in AO.Moreover,it is still not applied in the parameter estimation of electro-hydraulic systems.In this work,ten well-defined chaotic maps were integrated into a narrowed exploitation of AO for the development of a robust chaotic optimization technique.An extensive investigation of twenty-three mathematical benchmarks and ten IEEE Congress on Evolutionary Computation(CEC)functions shows that chaotic Aquila optimization techniques perform better than the baseline technique.The investigation is further conducted on parameter estimation of an electro-hydraulic control system,which is performed on various noise levels and shows that the proposed chaotic AO with Piecewise map(CAO6)achieves the best fitness values of and at noise levels and respectively.Friedman test 2.873E-05,1.014E-04,8.728E-031.300E-03,1.300E-02,1.300E-01,for repeated measures,computational analysis,and Taguchi test reflect the superiority of CAO6 against the state of the arts,demonstrating its potential for addressing various engineering optimization problems.However,the sensitivity to parameter tuning may limit its direct application to complex optimization scenarios.展开更多
In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as tem...In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as template-based retrosynthesis,have high accuracy and interpretability in specific types of reactions but are limited by the scope of the template library,making it difficult to adapt to new or uncommon reaction types.Moreover,sequence-to-sequence retrosynthetic prediction methods,although they enhance the flexibility of prediction,often overlook the complexity of molecular graph structures and the actual interactions between atoms,which limits the accuracy and reliability of the predictions.To address these limitations,this paper proposes a Molecular Retrosynthesis Top-k Prediction based on the Latent Generation Process(MRLGP)that uses latent variables from graph neural networks to model the generation process and produce diverse set of reactants.Utilising an encoding method based on Graphormer,the authors have also introduced topology-aware positional encoding to better capture the interactions between atomic nodes in the molecular graph structure,thereby more accurately simulating the retrosynthetic process.The MRLGP model significantly enhances the accuracy and diversity of predictions by correlating discrete latent variables with the reactant generation process and progressively constructing molecular graphs using a variational autoregressive decoder.Experimental results on benchmark datasets such as USPTO-50k,USPTO-Full,and USPTO-DIVERSE demonstrate that MRLGP outperforms baseline models on multiple Top-k evaluation metrics.Additionally,ablation experiments conducted on the USPTO-50K dataset further validate the effectiveness of the methods used in the encoder and decoder parts of the model.展开更多
Kernel-based slow feature analysis(SFA)methods have been successfully applied in the industrial process fault detection field.However,kernel-based SFA methods have high computational complexity as dealing with nonline...Kernel-based slow feature analysis(SFA)methods have been successfully applied in the industrial process fault detection field.However,kernel-based SFA methods have high computational complexity as dealing with nonlinearity,leading to delays in detecting time-varying data features.Additionally,the uncertain kernel function and kernel parameters limit the ability of the extracted features to express process characteristics,resulting in poor fault detection performance.To alleviate the above problems,a novel randomized auto-regressive dynamic slow feature analysis(RRDSFA)method is proposed to simultaneously monitor the operating point deviations and process dynamic faults,enabling real-time monitoring of data features in industrial processes.Firstly,the proposed Random Fourier mappingbased method achieves more effective nonlinear transformation,contrasting with the current kernelbased RDSFA algorithm that may lead to significant computational complexity.Secondly,a randomized RDSFA model is developed to extract nonlinear dynamic slow features.Furthermore,a Bayesian inference-based overall fault monitoring model including all RRDSFA sub-models is developed to overcome the randomness of random Fourier mapping.Finally,the superiority and effectiveness of the proposed monitoring method are demonstrated through a numerical case and a simulation of continuous stirred tank reactor.展开更多
基金the National Natural Science Foundation of China(No.61203337)the Natural Science Foundation of Shanghai(No.12ZR1440200)
文摘This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear forecasting techniques. One metric redefines the distance in k-nearest neighbors based on the coefficients of autoregression (AR) in time series. Meanwhile, an improvement to Kulesh's adaptive metrics in the nearest neighbors is also presented. To evaluate the performance of the two proposed metrics, three types of time-series data, namely deterministic synthetic data, chaotic time-series data and real time-series data, are predicted. Experimental results show the superiority of the proposed AR-enhanced k-nearest neighbors methods to the traditional k-nearest neighbors metric and Kulesh's adaptive metrics.
文摘-In this paper, monthly mean SST data in a large area are used. After the spacial average of the data is carried out and the secular monthly means are substracted, a time series (Jan. 1951-Dec. 1985) of SST anomalies of the cold tongue water area in the eastern tropical Pacific Ocean is obtained. On the basis of the time series, an autoregression model, a self-exciting threshold autoregression model and an open loop autoregression model are developed respectively. The interannual variations are simulated by means of those models. The simulation results show that all the three models have made very good hindcasting for the nine El Nino events since 1951. In order to test the reliability of the open loop threshold model, extrapolated forecast was made for the period of Jan. 1986-Feb. 1987. It can be seen from the forecasting that the model could forecast well the beginning and strengthening stages of the recent El Nino event (1986-1987). Correlation coefficients of the estimations to observations are respectively 0. 84, 0. 88 and 0. 89. It is obvious that all the models work well and the open loop threshold one is the best. So the open loop threshold autoregression model is a useful tool for monitoring the SSTinterannual variation of the cold tongue water area in the Eastern Equatorial Pacific Ocean and for estimating the El Nino strength.
基金Supported by National Natural Science Foundation of China(72222009,71991472)。
文摘Network autoregression and factor model are effective methods for modeling network time series data.In this study,we propose a network autoregression model with a factor structure that incorporates a latent group structure to address nodal heterogeneity within the network.An iterative algorithm is employed to minimize a least-squares objective function,allowing for simultaneous estimation of both the parameters and the group structure.To determine the unknown number of groups and factors,a PIC criterion is introduced.Additionally,statistical inference of the estimated parameters is presented.To assess the validity of the proposed estimation and inference procedures,we conduct extensive numerical studies.We also demonstrate the utility of our model using a stock dataset obtained from the Chinese A-Share stock market.
文摘Forecasting solar irradiance is a critical task in the renewable energy sector, as it provides essential information regarding the potential energy production from solar panels. This study aims to utilize the Vector Autoregression (VAR) model to forecast solar irradiance levels and weather characteristics in the San Francisco Bay Area. The results demonstrate a correlation between predicted and actual solar irradiance, indicating the effectiveness of the VAR model for this task. However, the model may not be sufficient for this region due to the requirement of additional weather features to reduce disparities between predictions and actual observations. Additionally, the current lag order in the model is relatively low, limiting its ability to capture all relevant information from past observations. As a result, the model’s forecasting capability is limited to short-term horizons, with a maximum horizon of four hours.
基金supported by the National Science Foundation (No: 1029337)supported by an allocation of computing time from the Ohio Supercomputer Center
文摘Weather forecasting is crucial to both the demand and supply sides of electricity systems. Temperature has a great effect on the demand side. Moreover, solar and wind are very promising renewable energy sources and are, thus, important on the supply side. In this paper, a large vector autoregression(VAR) model is built to forecast three important weather variables for 61 cities around the United States. The three variables at all locations are modeled as response variables. Lag terms are used to capture the relationship between observations in adjacent periods and daily and annual seasonality are modeled to consider the correlation between the same periods in adjacent days and years. We estimate the VAR model with16 years of hourly historical data and use two additional years of data for out-of-sample validation. Forecasts of up to six-hours-ahead are generated with good forecasting performance based on mean absolute error, root mean square error, relative root mean square error, and skill scores. Our VAR model gives forecasts with skill scoresthat are more than double the skill scores of other forecasting models in the literature. Our model also provides forecasts that outperform persistence forecasts by between6% and 80% in terms of mean absolute error. Our results show that the proposed time series approach is appropriate for very short-term forecasting of hourly solar radiation,temperature, and wind speed.
基金supported by the National Natural Science Foundation of China under Grant Nos.10971081 and 11001104985 Project of Jilin University
文摘This paper studies the autoregression models of order one,in a general time series setting that allows for weakly dependent innovations.Let{Xt}be a linear process defined by Xt=∑k=0^∞ψkεt-k,where{ψk,k≥0}is a sequence of real numbers and{εk,k=0,±1,±2,...}is a sequence of random variables.Two results are proved in this paper.In the first result,assuming that{εk,k≥1}is a sequence of asymptotically linear negative quadrant dependent(ALNQD)random variables,the authors find the limiting distributions of the least squares estimator and the associated regression t statistic.It is interesting that the limiting distributions are similar to the one found in earlier work under the assumption of i.i.d,innovations.In the second result the authors prove that the least squares estimator is not a strong consistency estimator of the autoregressive parameter a when{εk,k≥1}is a sequence of negatively associated(NA)random variables,andψ0=1,ψk=0,k≥1.
文摘China's astonishing economic growth implies a necessity to understand its inflation. The present paper employs threshold nonrecursive structural vector autoregression analysis to explore the asymmetric effects of macro-variables on inflation in low and high inflation regimes. The empirical evidence demonstrates, first, that the reactions of inflation to various shocks are inflation-regime-dependent and asymmetric. Second, monetary policy influences China "s high inflation and adjusting the domestic interest rate in China may be an effective way to control inflation in a high inflation regime, but not in a low inflation regime. In a high inflation regime, a high inflation rate may cause the macro-policy authorities to increase the domestic interest rate, in an attempt to stabilize high inflation. Third, contrary to expectations, the world oil price is not a strong cost-push factor in a low inflation regime. Oil price increases may increase inflation in a high inflation regime, but there is no such obvious effect in a low inflation regime. Finally, China "s nominal effective exchange rate influences inflation in both low and high inflation regimes. A nominal effeetive exchange rate appreciation might be effective in controlling domestic inflation in both regimes.
文摘Through extension of canonical correlation to the analysis of meteorological element fields (MEF), a concept from combination of canonical autocorrelation with canonical autoregression (CAR) is developed for short-term climatic prediction of MEFs with a formulated scheme. Experi- mental results suggest that the scheme is of encouraging usefulness to a weak persistence MEF, i.e., rainfall field and, in particular, to a strong persistance one like a SST field.
基金The anthors thank the support from Tianjin Philosophy and Social Science Planning Project(No.TJYYQN19-004)Project of National and Regional Research Center,Ministry of Education(No.ZX20170183)National Natural Science Foundation Youth Project(No.71803089).
文摘Since 2002,the People's Bank of China has frequently used both quantity-based direct monetary instruments and price-based indirect monetary instruments to promote economic growth and stabilize price level.Specifically,this study estimates 13 three-variable factor-augmented vector autoregression (FAVAR) models to explore how two types of monetary instruments affect China's economy and price level.Overall,we find that monetary policy has positive effects on China's economy and price level.Second,this study clearly states that the effectiveness of China's monetary policy on the economy has depended on China's quantity-based direct monetary instruments since 2002.Third,the effectiveness of quantity-based direct monetary instruments on China's economy and price level is dependent on the significant and positive effects of quantity-based direct monetary instruments after the 2008 financial crisis.Fourth,the significant and positive effects of price-based indirect monetary instruments on China's economy and price level before 2008 cannot fundamentally change their current insignificant effects on China's economy and price level.
文摘In the parameter tracking of time-varying systems, the ordinary method is weighted least squares with the rectangular window or the exponential window. In this paper we propose a new kind of sliding window called the multiple exponential window, and then use it to fit time-varying Gaussian vector autoregressive models. The asymptotic bias and covariance of the estimator of the parameter for time-invariant models are also derived. Simulation results show that the multiple exponential windows have better parameter tracking effect than rectangular windows and exponential ones.
基金This work has been supported by DARPA TRADES Award HR0011-17-2-0016.
文摘Although the popular multi-fidelity surrogate models,stochastic collocation and nonlinear autoregression have been applied successfully to multiple benchmark problems in different areas of science and engineering,they have certain limitations.We propose a uniform Bayesian framework that connects these two methods allowing us to combine the strengths of both.To this end,we introduce Greedy-NAR,a nonlinear Bayesian autoregressive model that can handle complex between-fidelity correlations and involves a sequential construction that allows for significant improvements in performance given a limited computational budget.The proposed enhanced nonlinear autoregressive method is applied to three benchmark problems that are typical of energy applications,namely molecular dynamics and computational fluid dynamics.The results indicate an increase in both prediction stability and accuracy when compared to those of the standard multi-fidelity autoregression implementations.The results also reveal the advantages over the stochastic collocation approach in terms of accuracy and computational cost.Generally speaking,the proposed enhancement provides a straightforward and easily implemented approach for boosting the accuracy and efficiency of concatenated structure multi-fidelity simulation methods,e.g.,the nonlinear autoregressive model,with a negligible additional computational cost.
文摘This study investigates the determinants that drive the volatility of the credit default swaps(CDS)of BRICIT(Brazil,Russia,India,China,Indonesia,and Turkey)nations as a proxy measure for sovereign risk.On the existence of cointegration,an unrestricted error correction model integrated with the autoregressive distributed lag(ARDL)model is applied to measure the short-run and long-run dynamics empirically.The study utilizes the Bayesian global vector autoregression methodology for cross-border spillover estimation.The study also suggests a strategy for policymakers for quadrant categorization to mitigate risk arising from cross-border spillover.The result of ARDL indicates that the global macroeconomic variables affect the BRICIT CDS more than domestic macroeconomic determinants,with Indian CDS being the most sensitive to Fed tapering.Notably,China’s CDS is the most sensitive to shocks,with the CDS volatility primarily driven by China’s geopolitical risk.Russian CDS is more sensitive to real effective exchange rates due to severe ruble depreciation than crude oil,despite Russia being a major oil exporter.The quadrant categorization indicates that the Indonesian stock market index is most interconnected with BRICIT CDS,while the Turkish long-term interest rates send the highest intensity spillover across BRICIT nations.
基金supported by the Program of Support Xinjiang by Technology(2024E02028,B2-2024-0359)Xinjiang Tianchi Talent Program of 2024,the Foundation of Chinese Academy of Sciences(B2-2023-0239)the Youth Foundation of Shandong Natural Science(ZR2023QD070).
文摘As one of the main characteristics of atmospheric pollutants,PM_(2.5) severely affects human health and has received widespread attention in recent years.How to predict the variations of PM_(2.5) concentrations with high accuracy is an important topic.The PM_(2.5) monitoring stations in Xinjiang Uygur Autonomous Region,China,are unevenly distributed,which makes it challenging to conduct comprehensive analyses and predictions.Therefore,this study primarily addresses the limitations mentioned above and the poor generalization ability of PM_(2.5) concentration prediction models across different monitoring stations.We chose the northern slope of the Tianshan Mountains as the study area and took the January−December in 2019 as the research period.On the basis of data from 21 PM_(2.5) monitoring stations as well as meteorological data(temperature,instantaneous wind speed,and pressure),we developed an improved model,namely GCN−TCN−AR(where GCN is the graph convolution network,TCN is the temporal convolutional network,and AR is the autoregression),for predicting PM_(2.5) concentrations on the northern slope of the Tianshan Mountains.The GCN−TCN−AR model is composed of an improved GCN model,a TCN model,and an AR model.The results revealed that the R2 values predicted by the GCN−TCN−AR model at the four monitoring stations(Urumqi,Wujiaqu,Shihezi,and Changji)were 0.93,0.91,0.93,and 0.92,respectively,and the RMSE(root mean square error)values were 6.85,7.52,7.01,and 7.28μg/m^(3),respectively.The performance of the GCN−TCN−AR model was also compared with the currently neural network models,including the GCN−TCN,GCN,TCN,Support Vector Regression(SVR),and AR.The GCN−TCN−AR outperformed the other current neural network models,with high prediction accuracy and good stability,making it especially suitable for the predictions of PM_(2.5)concentrations.This study revealed the significant spatiotemporal variations of PM_(2.5)concentrations.First,the PM_(2.5) concentrations exhibited clear seasonal fluctuations,with higher levels typically observed in winter and differences presented between months.Second,the spatial distribution analysis revealed that cities such as Urumqi and Wujiaqu have high PM_(2.5) concentrations,with a noticeable geographical clustering of pollutions.Understanding the variations in PM_(2.5) concentrations is highly important for the sustainable development of ecological environment in arid areas.
基金supported by the Natural Science Foundation of Hunan Province(2023JJ30817)Hunan Provincial Natural Science Foundation-Hengyang City Joint Fund Project(2025JJ70129)+1 种基金Changsha Natural Science Foundation(kq2403057)China。
文摘Objective:Near vision loss(NVL)is one of the leading causes of visual impairment worldwide,exerting a profound impact on individual quality of life and socio-economic development.This study aims to analyze the burden of NVL in China by sex and age groups from 1990 to 2021 and to project trends over the next 15 years.Methods:Using data from the Global Burden of Disease(GBD)2021 database,we conducted descriptive analyses of NVL prevalence in China,calculated age-standardized prevalence rates(ASPR)and age-standardized disability-adjusted life years rates(ASDR)to compare burden differences between sexes and age groups,and applied an autoregressive integrated moving average(ARIMA)model to predict NVL trends for the next 15 years.The model selection was based on best-fit criteria to ensure reliable projections.Results:From 1990 to 2021,China’s ASPR of NVL rose from 10096.24/100000 to 15624.54/100000,and ASDR increased from 101.75/100000 to 158.75/100000.In 2021,ASPR(16551.70/100000)and ASDR(167.69/100000)were higher among females than males(14686.21/100000 and 149.76/100000,respectively).China ranked highest globally in both NVL cases and disability-adjusted life years(DALYs),with female burden significantly exceeding male burden.Projections indicated this trend and sex gap will persist until 2036.Compared with 1990,the prevalence cases and DALYs increased by 239.20%and 238.82%,respectively in 2021,with the highest burden among females and the 55−59 age group.The ARIMA model predicted continued increases in prevalence and DALYs by 2036,with females maintaining a higher burden than males.Conclusion:This study reveals a marked increase in the NVL burden in China and predicts continued growth in the coming years.Public health policies should prioritize NVL prevention and control,with special attention to women and middle-aged populations to mitigate long-term societal and health impacts.
文摘The rapid expansion of tobacco farming poses a significant threat to biodiversity in Yunnan Province,China,a region known for its rich biodiversity.This study aims to understand the trade-offs between tobacco farming and higher plant species diversity,and to identify priority counties for conservation.We employed an integrated approach combining species distribution modeling,GIS overlay analysis,and empirical spatial regression to em pirically assess the impact of tobacco farming intensity on biodiversity risk.Our findings reveal a compelling negative spatial correlation between tobacco farming expansion and higher plant species diversity.Specifically,southern counties in Wenshan and Honghe prefectures are major priority areas of conservation that exhibit signif icant spatial correlations between biodiversity risks and high tobacco farming intensity.Quantitatively,at county level,a 1%increase in tobacco farming area corresponds to a 0.094%decrease in endemic higher plant species richness across the entire province.These results underscore the need for targeted and region-specific regulations to mitigate biodiversity loss and promote sustainable development in Yunnan Province.The integrated approach used in this study provides a comprehensive assessment of the tobacco-biodiversity trade-offs,offering actionable insights for policymaking.
基金funded by Taif University,Saudi Arabia,Project No.(TU-DSPP-2024-52).
文摘Aquila Optimizer(AO)is a recently proposed population-based optimization technique inspired by Aquila’s behavior in catching prey.AO is applied in various applications and its numerous variants were proposed in the literature.However,chaos theory has not been extensively investigated in AO.Moreover,it is still not applied in the parameter estimation of electro-hydraulic systems.In this work,ten well-defined chaotic maps were integrated into a narrowed exploitation of AO for the development of a robust chaotic optimization technique.An extensive investigation of twenty-three mathematical benchmarks and ten IEEE Congress on Evolutionary Computation(CEC)functions shows that chaotic Aquila optimization techniques perform better than the baseline technique.The investigation is further conducted on parameter estimation of an electro-hydraulic control system,which is performed on various noise levels and shows that the proposed chaotic AO with Piecewise map(CAO6)achieves the best fitness values of and at noise levels and respectively.Friedman test 2.873E-05,1.014E-04,8.728E-031.300E-03,1.300E-02,1.300E-01,for repeated measures,computational analysis,and Taguchi test reflect the superiority of CAO6 against the state of the arts,demonstrating its potential for addressing various engineering optimization problems.However,the sensitivity to parameter tuning may limit its direct application to complex optimization scenarios.
基金supported by the China Postdoctoral Science Foundation(No.2014m561331)Science and Technology Research Project of Heilongjiang Provincial Education Department(No.12521073)National Natural Science Youth Fund(No.61300115).
文摘In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as template-based retrosynthesis,have high accuracy and interpretability in specific types of reactions but are limited by the scope of the template library,making it difficult to adapt to new or uncommon reaction types.Moreover,sequence-to-sequence retrosynthetic prediction methods,although they enhance the flexibility of prediction,often overlook the complexity of molecular graph structures and the actual interactions between atoms,which limits the accuracy and reliability of the predictions.To address these limitations,this paper proposes a Molecular Retrosynthesis Top-k Prediction based on the Latent Generation Process(MRLGP)that uses latent variables from graph neural networks to model the generation process and produce diverse set of reactants.Utilising an encoding method based on Graphormer,the authors have also introduced topology-aware positional encoding to better capture the interactions between atomic nodes in the molecular graph structure,thereby more accurately simulating the retrosynthetic process.The MRLGP model significantly enhances the accuracy and diversity of predictions by correlating discrete latent variables with the reactant generation process and progressively constructing molecular graphs using a variational autoregressive decoder.Experimental results on benchmark datasets such as USPTO-50k,USPTO-Full,and USPTO-DIVERSE demonstrate that MRLGP outperforms baseline models on multiple Top-k evaluation metrics.Additionally,ablation experiments conducted on the USPTO-50K dataset further validate the effectiveness of the methods used in the encoder and decoder parts of the model.
基金supported by the Program of National Natural Science Foundation of China(U23A20329,62163036)Youth Academic and Technical Leaders Reserve Talent Training project(202105AC160094)Industrial Innovation Talent Special Project of Xingdian Talent Support Program(XDYC-CYCX-2022-0010).
文摘Kernel-based slow feature analysis(SFA)methods have been successfully applied in the industrial process fault detection field.However,kernel-based SFA methods have high computational complexity as dealing with nonlinearity,leading to delays in detecting time-varying data features.Additionally,the uncertain kernel function and kernel parameters limit the ability of the extracted features to express process characteristics,resulting in poor fault detection performance.To alleviate the above problems,a novel randomized auto-regressive dynamic slow feature analysis(RRDSFA)method is proposed to simultaneously monitor the operating point deviations and process dynamic faults,enabling real-time monitoring of data features in industrial processes.Firstly,the proposed Random Fourier mappingbased method achieves more effective nonlinear transformation,contrasting with the current kernelbased RDSFA algorithm that may lead to significant computational complexity.Secondly,a randomized RDSFA model is developed to extract nonlinear dynamic slow features.Furthermore,a Bayesian inference-based overall fault monitoring model including all RRDSFA sub-models is developed to overcome the randomness of random Fourier mapping.Finally,the superiority and effectiveness of the proposed monitoring method are demonstrated through a numerical case and a simulation of continuous stirred tank reactor.