This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear...This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear forecasting techniques. One metric redefines the distance in k-nearest neighbors based on the coefficients of autoregression (AR) in time series. Meanwhile, an improvement to Kulesh's adaptive metrics in the nearest neighbors is also presented. To evaluate the performance of the two proposed metrics, three types of time-series data, namely deterministic synthetic data, chaotic time-series data and real time-series data, are predicted. Experimental results show the superiority of the proposed AR-enhanced k-nearest neighbors methods to the traditional k-nearest neighbors metric and Kulesh's adaptive metrics.展开更多
-In this paper, monthly mean SST data in a large area are used. After the spacial average of the data is carried out and the secular monthly means are substracted, a time series (Jan. 1951-Dec. 1985) of SST anomalies ...-In this paper, monthly mean SST data in a large area are used. After the spacial average of the data is carried out and the secular monthly means are substracted, a time series (Jan. 1951-Dec. 1985) of SST anomalies of the cold tongue water area in the eastern tropical Pacific Ocean is obtained. On the basis of the time series, an autoregression model, a self-exciting threshold autoregression model and an open loop autoregression model are developed respectively. The interannual variations are simulated by means of those models. The simulation results show that all the three models have made very good hindcasting for the nine El Nino events since 1951. In order to test the reliability of the open loop threshold model, extrapolated forecast was made for the period of Jan. 1986-Feb. 1987. It can be seen from the forecasting that the model could forecast well the beginning and strengthening stages of the recent El Nino event (1986-1987). Correlation coefficients of the estimations to observations are respectively 0. 84, 0. 88 and 0. 89. It is obvious that all the models work well and the open loop threshold one is the best. So the open loop threshold autoregression model is a useful tool for monitoring the SSTinterannual variation of the cold tongue water area in the Eastern Equatorial Pacific Ocean and for estimating the El Nino strength.展开更多
Network autoregression and factor model are effective methods for modeling network time series data.In this study,we propose a network autoregression model with a factor structure that incorporates a latent group stru...Network autoregression and factor model are effective methods for modeling network time series data.In this study,we propose a network autoregression model with a factor structure that incorporates a latent group structure to address nodal heterogeneity within the network.An iterative algorithm is employed to minimize a least-squares objective function,allowing for simultaneous estimation of both the parameters and the group structure.To determine the unknown number of groups and factors,a PIC criterion is introduced.Additionally,statistical inference of the estimated parameters is presented.To assess the validity of the proposed estimation and inference procedures,we conduct extensive numerical studies.We also demonstrate the utility of our model using a stock dataset obtained from the Chinese A-Share stock market.展开更多
Forecasting solar irradiance is a critical task in the renewable energy sector, as it provides essential information regarding the potential energy production from solar panels. This study aims to utilize the Vector A...Forecasting solar irradiance is a critical task in the renewable energy sector, as it provides essential information regarding the potential energy production from solar panels. This study aims to utilize the Vector Autoregression (VAR) model to forecast solar irradiance levels and weather characteristics in the San Francisco Bay Area. The results demonstrate a correlation between predicted and actual solar irradiance, indicating the effectiveness of the VAR model for this task. However, the model may not be sufficient for this region due to the requirement of additional weather features to reduce disparities between predictions and actual observations. Additionally, the current lag order in the model is relatively low, limiting its ability to capture all relevant information from past observations. As a result, the model’s forecasting capability is limited to short-term horizons, with a maximum horizon of four hours.展开更多
As one of the main characteristics of atmospheric pollutants,PM_(2.5) severely affects human health and has received widespread attention in recent years.How to predict the variations of PM_(2.5) concentrations with h...As one of the main characteristics of atmospheric pollutants,PM_(2.5) severely affects human health and has received widespread attention in recent years.How to predict the variations of PM_(2.5) concentrations with high accuracy is an important topic.The PM_(2.5) monitoring stations in Xinjiang Uygur Autonomous Region,China,are unevenly distributed,which makes it challenging to conduct comprehensive analyses and predictions.Therefore,this study primarily addresses the limitations mentioned above and the poor generalization ability of PM_(2.5) concentration prediction models across different monitoring stations.We chose the northern slope of the Tianshan Mountains as the study area and took the January−December in 2019 as the research period.On the basis of data from 21 PM_(2.5) monitoring stations as well as meteorological data(temperature,instantaneous wind speed,and pressure),we developed an improved model,namely GCN−TCN−AR(where GCN is the graph convolution network,TCN is the temporal convolutional network,and AR is the autoregression),for predicting PM_(2.5) concentrations on the northern slope of the Tianshan Mountains.The GCN−TCN−AR model is composed of an improved GCN model,a TCN model,and an AR model.The results revealed that the R2 values predicted by the GCN−TCN−AR model at the four monitoring stations(Urumqi,Wujiaqu,Shihezi,and Changji)were 0.93,0.91,0.93,and 0.92,respectively,and the RMSE(root mean square error)values were 6.85,7.52,7.01,and 7.28μg/m^(3),respectively.The performance of the GCN−TCN−AR model was also compared with the currently neural network models,including the GCN−TCN,GCN,TCN,Support Vector Regression(SVR),and AR.The GCN−TCN−AR outperformed the other current neural network models,with high prediction accuracy and good stability,making it especially suitable for the predictions of PM_(2.5)concentrations.This study revealed the significant spatiotemporal variations of PM_(2.5)concentrations.First,the PM_(2.5) concentrations exhibited clear seasonal fluctuations,with higher levels typically observed in winter and differences presented between months.Second,the spatial distribution analysis revealed that cities such as Urumqi and Wujiaqu have high PM_(2.5) concentrations,with a noticeable geographical clustering of pollutions.Understanding the variations in PM_(2.5) concentrations is highly important for the sustainable development of ecological environment in arid areas.展开更多
Weather forecasting is crucial to both the demand and supply sides of electricity systems. Temperature has a great effect on the demand side. Moreover, solar and wind are very promising renewable energy sources and ar...Weather forecasting is crucial to both the demand and supply sides of electricity systems. Temperature has a great effect on the demand side. Moreover, solar and wind are very promising renewable energy sources and are, thus, important on the supply side. In this paper, a large vector autoregression(VAR) model is built to forecast three important weather variables for 61 cities around the United States. The three variables at all locations are modeled as response variables. Lag terms are used to capture the relationship between observations in adjacent periods and daily and annual seasonality are modeled to consider the correlation between the same periods in adjacent days and years. We estimate the VAR model with16 years of hourly historical data and use two additional years of data for out-of-sample validation. Forecasts of up to six-hours-ahead are generated with good forecasting performance based on mean absolute error, root mean square error, relative root mean square error, and skill scores. Our VAR model gives forecasts with skill scoresthat are more than double the skill scores of other forecasting models in the literature. Our model also provides forecasts that outperform persistence forecasts by between6% and 80% in terms of mean absolute error. Our results show that the proposed time series approach is appropriate for very short-term forecasting of hourly solar radiation,temperature, and wind speed.展开更多
Objective:Near vision loss(NVL)is one of the leading causes of visual impairment worldwide,exerting a profound impact on individual quality of life and socio-economic development.This study aims to analyze the burden ...Objective:Near vision loss(NVL)is one of the leading causes of visual impairment worldwide,exerting a profound impact on individual quality of life and socio-economic development.This study aims to analyze the burden of NVL in China by sex and age groups from 1990 to 2021 and to project trends over the next 15 years.Methods:Using data from the Global Burden of Disease(GBD)2021 database,we conducted descriptive analyses of NVL prevalence in China,calculated age-standardized prevalence rates(ASPR)and age-standardized disability-adjusted life years rates(ASDR)to compare burden differences between sexes and age groups,and applied an autoregressive integrated moving average(ARIMA)model to predict NVL trends for the next 15 years.The model selection was based on best-fit criteria to ensure reliable projections.Results:From 1990 to 2021,China’s ASPR of NVL rose from 10096.24/100000 to 15624.54/100000,and ASDR increased from 101.75/100000 to 158.75/100000.In 2021,ASPR(16551.70/100000)and ASDR(167.69/100000)were higher among females than males(14686.21/100000 and 149.76/100000,respectively).China ranked highest globally in both NVL cases and disability-adjusted life years(DALYs),with female burden significantly exceeding male burden.Projections indicated this trend and sex gap will persist until 2036.Compared with 1990,the prevalence cases and DALYs increased by 239.20%and 238.82%,respectively in 2021,with the highest burden among females and the 55−59 age group.The ARIMA model predicted continued increases in prevalence and DALYs by 2036,with females maintaining a higher burden than males.Conclusion:This study reveals a marked increase in the NVL burden in China and predicts continued growth in the coming years.Public health policies should prioritize NVL prevention and control,with special attention to women and middle-aged populations to mitigate long-term societal and health impacts.展开更多
This paper studies the autoregression models of order one, in a general time series setting that allows for weakly dependent innovations. Let {Xt} be a linear process defined by Xt =∑k=0^∞ψ kεt-k, where {ψk, k ≥...This paper studies the autoregression models of order one, in a general time series setting that allows for weakly dependent innovations. Let {Xt} be a linear process defined by Xt =∑k=0^∞ψ kεt-k, where {ψk, k ≥ 0} is a sequence of real numbers and {εk, k = 0, ±1, ±2,...} is a sequence of random variables. Two results are proved in this paper. In the first result, assuming that {εk, k ≥ 1} is a sequence of asymptotically linear negative quadrant dependent (ALNQD) random variables, the authors find the limiting distributions of the least squares estimator and the associated regression t statistic. It is interesting that the limiting distributions are similar to the one found in earlier work under the assumption of i.i.d, innovations. In the second result the authors prove that the least squares estimator is not a strong consistency estimator of the autoregressive parameter a when {εk, k ≥ 1} is a sequence of negatively associated (NA) random variables, and ψ0 = 1, ψk = 0, k ≥ 1.展开更多
China's astonishing economic growth implies a necessity to understand its inflation. The present paper employs threshold nonrecursive structural vector autoregression analysis to explore the asymmetric effects of mac...China's astonishing economic growth implies a necessity to understand its inflation. The present paper employs threshold nonrecursive structural vector autoregression analysis to explore the asymmetric effects of macro-variables on inflation in low and high inflation regimes. The empirical evidence demonstrates, first, that the reactions of inflation to various shocks are inflation-regime-dependent and asymmetric. Second, monetary policy influences China "s high inflation and adjusting the domestic interest rate in China may be an effective way to control inflation in a high inflation regime, but not in a low inflation regime. In a high inflation regime, a high inflation rate may cause the macro-policy authorities to increase the domestic interest rate, in an attempt to stabilize high inflation. Third, contrary to expectations, the world oil price is not a strong cost-push factor in a low inflation regime. Oil price increases may increase inflation in a high inflation regime, but there is no such obvious effect in a low inflation regime. Finally, China "s nominal effective exchange rate influences inflation in both low and high inflation regimes. A nominal effeetive exchange rate appreciation might be effective in controlling domestic inflation in both regimes.展开更多
Through extension of canonical correlation to the analysis of meteorological element fields (MEF), a concept from combination of canonical autocorrelation with canonical autoregression (CAR) is developed for short-ter...Through extension of canonical correlation to the analysis of meteorological element fields (MEF), a concept from combination of canonical autocorrelation with canonical autoregression (CAR) is developed for short-term climatic prediction of MEFs with a formulated scheme. Experi- mental results suggest that the scheme is of encouraging usefulness to a weak persistence MEF, i.e., rainfall field and, in particular, to a strong persistance one like a SST field.展开更多
Since 2002,the People's Bank of China has frequently used both quantity-based direct monetary instruments and price-based indirect monetary instruments to promote economic growth and stabilize price level.Specific...Since 2002,the People's Bank of China has frequently used both quantity-based direct monetary instruments and price-based indirect monetary instruments to promote economic growth and stabilize price level.Specifically,this study estimates 13 three-variable factor-augmented vector autoregression (FAVAR) models to explore how two types of monetary instruments affect China's economy and price level.Overall,we find that monetary policy has positive effects on China's economy and price level.Second,this study clearly states that the effectiveness of China's monetary policy on the economy has depended on China's quantity-based direct monetary instruments since 2002.Third,the effectiveness of quantity-based direct monetary instruments on China's economy and price level is dependent on the significant and positive effects of quantity-based direct monetary instruments after the 2008 financial crisis.Fourth,the significant and positive effects of price-based indirect monetary instruments on China's economy and price level before 2008 cannot fundamentally change their current insignificant effects on China's economy and price level.展开更多
The rapid expansion of tobacco farming poses a significant threat to biodiversity in Yunnan Province,China,a region known for its rich biodiversity.This study aims to understand the trade-offs between tobacco farming ...The rapid expansion of tobacco farming poses a significant threat to biodiversity in Yunnan Province,China,a region known for its rich biodiversity.This study aims to understand the trade-offs between tobacco farming and higher plant species diversity,and to identify priority counties for conservation.We employed an integrated approach combining species distribution modeling,GIS overlay analysis,and empirical spatial regression to em pirically assess the impact of tobacco farming intensity on biodiversity risk.Our findings reveal a compelling negative spatial correlation between tobacco farming expansion and higher plant species diversity.Specifically,southern counties in Wenshan and Honghe prefectures are major priority areas of conservation that exhibit signif icant spatial correlations between biodiversity risks and high tobacco farming intensity.Quantitatively,at county level,a 1%increase in tobacco farming area corresponds to a 0.094%decrease in endemic higher plant species richness across the entire province.These results underscore the need for targeted and region-specific regulations to mitigate biodiversity loss and promote sustainable development in Yunnan Province.The integrated approach used in this study provides a comprehensive assessment of the tobacco-biodiversity trade-offs,offering actionable insights for policymaking.展开更多
Aquila Optimizer(AO)is a recently proposed population-based optimization technique inspired by Aquila’s behavior in catching prey.AO is applied in various applications and its numerous variants were proposed in the l...Aquila Optimizer(AO)is a recently proposed population-based optimization technique inspired by Aquila’s behavior in catching prey.AO is applied in various applications and its numerous variants were proposed in the literature.However,chaos theory has not been extensively investigated in AO.Moreover,it is still not applied in the parameter estimation of electro-hydraulic systems.In this work,ten well-defined chaotic maps were integrated into a narrowed exploitation of AO for the development of a robust chaotic optimization technique.An extensive investigation of twenty-three mathematical benchmarks and ten IEEE Congress on Evolutionary Computation(CEC)functions shows that chaotic Aquila optimization techniques perform better than the baseline technique.The investigation is further conducted on parameter estimation of an electro-hydraulic control system,which is performed on various noise levels and shows that the proposed chaotic AO with Piecewise map(CAO6)achieves the best fitness values of and at noise levels and respectively.Friedman test 2.873E-05,1.014E-04,8.728E-031.300E-03,1.300E-02,1.300E-01,for repeated measures,computational analysis,and Taguchi test reflect the superiority of CAO6 against the state of the arts,demonstrating its potential for addressing various engineering optimization problems.However,the sensitivity to parameter tuning may limit its direct application to complex optimization scenarios.展开更多
In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as tem...In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as template-based retrosynthesis,have high accuracy and interpretability in specific types of reactions but are limited by the scope of the template library,making it difficult to adapt to new or uncommon reaction types.Moreover,sequence-to-sequence retrosynthetic prediction methods,although they enhance the flexibility of prediction,often overlook the complexity of molecular graph structures and the actual interactions between atoms,which limits the accuracy and reliability of the predictions.To address these limitations,this paper proposes a Molecular Retrosynthesis Top-k Prediction based on the Latent Generation Process(MRLGP)that uses latent variables from graph neural networks to model the generation process and produce diverse set of reactants.Utilising an encoding method based on Graphormer,the authors have also introduced topology-aware positional encoding to better capture the interactions between atomic nodes in the molecular graph structure,thereby more accurately simulating the retrosynthetic process.The MRLGP model significantly enhances the accuracy and diversity of predictions by correlating discrete latent variables with the reactant generation process and progressively constructing molecular graphs using a variational autoregressive decoder.Experimental results on benchmark datasets such as USPTO-50k,USPTO-Full,and USPTO-DIVERSE demonstrate that MRLGP outperforms baseline models on multiple Top-k evaluation metrics.Additionally,ablation experiments conducted on the USPTO-50K dataset further validate the effectiveness of the methods used in the encoder and decoder parts of the model.展开更多
Kernel-based slow feature analysis(SFA)methods have been successfully applied in the industrial process fault detection field.However,kernel-based SFA methods have high computational complexity as dealing with nonline...Kernel-based slow feature analysis(SFA)methods have been successfully applied in the industrial process fault detection field.However,kernel-based SFA methods have high computational complexity as dealing with nonlinearity,leading to delays in detecting time-varying data features.Additionally,the uncertain kernel function and kernel parameters limit the ability of the extracted features to express process characteristics,resulting in poor fault detection performance.To alleviate the above problems,a novel randomized auto-regressive dynamic slow feature analysis(RRDSFA)method is proposed to simultaneously monitor the operating point deviations and process dynamic faults,enabling real-time monitoring of data features in industrial processes.Firstly,the proposed Random Fourier mappingbased method achieves more effective nonlinear transformation,contrasting with the current kernelbased RDSFA algorithm that may lead to significant computational complexity.Secondly,a randomized RDSFA model is developed to extract nonlinear dynamic slow features.Furthermore,a Bayesian inference-based overall fault monitoring model including all RRDSFA sub-models is developed to overcome the randomness of random Fourier mapping.Finally,the superiority and effectiveness of the proposed monitoring method are demonstrated through a numerical case and a simulation of continuous stirred tank reactor.展开更多
High-resolution spatiotemporal simulations effectively capture the complexities of atmospheric plume sion disper-in complex terrain.However,their high computational cost makes them impractical for applications requiri...High-resolution spatiotemporal simulations effectively capture the complexities of atmospheric plume sion disper-in complex terrain.However,their high computational cost makes them impractical for applications requiring rapid responses or iterative processes,such as optimization,uncertainty quantification,or inverse modeling.To address this challenge,this work introduces the Dual-Stage Temporal Three-dimensional UNet Super-resolution(DST3D-UNet-SR)model,a highly efficient deep learning model for plume dispersion predictions.DST3D-UNet-SR is composed of two sequential modules:the temporal module(TM),which predicts the transient evolution of a plume in complex terrain from low-resolution temporal data,and the spatial refinement module(SRM),which subsequently enhances the spatial resolution of the TM predictions.We train DST3D-UNet-SR using a comprehensive dataset derived from high-resolution large eddy simulations(LES)of plume transport.We propose the DST3D-UNet-SR model to significantly accelerate LES of three-dimensional(3D)plume dispersion by three orders of magnitude.Additionally,the model demonstrates the ability to dynamically adapt to evolving conditions through the incorporation of new observational data,substantially improving prediction accuracy in high-concentration regions near the source.展开更多
Climate change may have detrimental effects on different sectoral growth in global economy and according to the Intergovernmental Panel on Climate Change,the impacts of climate change will be more vigorous in the comi...Climate change may have detrimental effects on different sectoral growth in global economy and according to the Intergovernmental Panel on Climate Change,the impacts of climate change will be more vigorous in the coming years.The climatic and non-climatic driving forces behind the economic sectoral performances involve short-and long-run interconnections among variables.This study attempts to investigate the effect of climatic factors(temperature and precipitation)along with non-climatic factors,including foreign direct investment(FDI),human capital index(HCI),natural capital(NC),and information and communication technology(ICT)on three major sectors of the economy(agricultural sector,industrial sector,and service sector)through non-linear model framework by employing cross-sectionally augmented autoregressive distributed lag(CS-ARDL)estimation technique.It considers a panel of 56 selected countries from different income groups,including high-income countries,upper-middle-income countries,lower-middle-income countries,and low-income countries,covering the period 1985-2022.The confirmation of slope heterogeneity,cross-sectional dependence,stationarity,and cointegration among variables lends support to the robustness of results.The augmented mean group(AMG)robustness test was applied to check robustness and the results were found mostly consistent with estimation method.The results revealed that upper-middle-income countries are more vulnerable to extreme temperatures compared to high-income countries.The results also confirmed an inverted U-shaped relationship between each sector’s output and precipitation in upper-middle-income countries.In contrast,for upper-middle-income,lower-middle-income,and low-income countries,this relation exists in industrial sector only in long run.This indicates that precipitation is initially beneficial for production activities.However,beyond a certain threshold of precipitation,this trend reverses,i.e.,the output of the economic sectors tends to decline.Furthermore,there is no supporting evidence that confirms a short-run non-linear relation between precipitation and agricultural yields.In upper-middle-income countries,the results confirmed that FDI is a driving factor behind both agricultural sector and service sector in long run while short-run results indicated a negative association but insignificant.This study also showed that in long run,an increase in HCI contributes to improving the output of the three sectors for high-income countries.The empirical findings provide valuable insights for policy-makers and governments to formulate coherent adaptation and mitigation strategies,thereby accelerating the transition of sectoral productivity from low to high levels in the sample countries.展开更多
The use ofrenewable energyisan important way toachieve sustainable agriculturalandeconomic development.However,there are differences in accessto renewable energy between the Global North and Global South.This study ut...The use ofrenewable energyisan important way toachieve sustainable agriculturalandeconomic development.However,there are differences in accessto renewable energy between the Global North and Global South.This study utilisedan autoregressive distributed lag-error correctionmodel and thedata spanning from 1991to 2021 to comparatively analyse the dynamic relationship amongrenewable energy consumption,the value of agricultural production,gross domestic product(GDP),economic diversificationindex,urban population,the total water extraction for agricultural withdrawal,and trade balancein the Netherlands and South Africa.In the shortrun,renewable energy consumption was increased by the value of agricultural productionbut decreased by GDPin South Africa.In the longrun,renewable energy consumption and GDP increased the value of agricultural production,while the value of agricultural production also increased GDP in South Africa.However,in the Netherlands,there was no short-and long-run relationship betweenrenewable energy consumption and agricultural and economic development.The results revealedthat there was a short-and long-run relationship in South Africa.Moreover,in the Netherlands,the adjustment speed was-1.46 forrenewable energy consumption with an error correction of 0.68 a(8.22 months).In South Africa,the adjustment speedwas-1.28 forrenewable energy consumption with an error correction of 0.78 a(9.38 months).Therefore,compared to South Africa,renewable energy consumptionin the Netherlands takes less time to return to balance after a shock.Thesefindings signify different trajectories on sectoral and economic transition initiatives spurred usingrenewable energy between the Netherlands and South Africa.Policy relating to initiatives such as“agro-energy communities”in Global South countries such as South Africa should be emphasised to promote the use of renewable energy in the agricultural sector.展开更多
Motivated by a significant impact of price volatility on food security and economic stability inCameroon,this study aims to understand the factors influencing agricultural product price volatility(APPV)and formulateef...Motivated by a significant impact of price volatility on food security and economic stability inCameroon,this study aims to understand the factors influencing agricultural product price volatility(APPV)and formulateeffective policies for mitigating its negative impactand promoting sustainable economic growth.Specifically,this research used theautoregressive distributed lag-error correction model(ARDL-ECM)to analyse the impact of agricultural productivity,agricultural product imports,population,temperature variation,gross domestic product(GDP)per capita,and government expenditure on APPV based on the annual data from 2000 to 2021.The ARDL-ECM estimation results revealed that agricultural productivity(β=4.901),agricultural product imports(β=1.012),population(β=13.635),and GDP per capita(β=2.794)were positively related toAPPV,while temperature variation(β=-0.990)and government expenditure(β=-8.585)were negatively related toAPPVin the long term.However,temperature variation had a positive relationship with APPV in the short term.Moreover,the Granger causality test showed that there werebidirectional causality of APPV with agricultural productivityandagricultural product imports,and unidirectional causality of APPVwith population,temperature variation,GDP per capita,and government expenditure.The findings highlight the importance of public policies in stabilizing agricultural product prices by investing in agricultural research,improving access to agricultural inputs,strengthening farmer capacities,implementing climate adaptation measures,and enhancing rural infrastructure.Thesepolicies can reduce APPV,improve food security,and promote inclusive economic growth in Cameroon.展开更多
In the field of railway traction drive systems,voltage sensor intermittent faults can significantly impact the reliability and safety of the entire system.This paper proposes an online diagnosis method for detecting s...In the field of railway traction drive systems,voltage sensor intermittent faults can significantly impact the reliability and safety of the entire system.This paper proposes an online diagnosis method for detecting such faults using an Artificial Intelligence(AI)predictor based on a Nonlinear Autoregressive with eXogenous inputs(NARX)data structure.The model is trained efficiently using the Extreme Learning Machine(ELM)algorithm.The NARX model captures the dynamic characteristics of the voltage sensor data,enabling the AI predictor to learn complex nonlinear relationships.The ELM training method ensures rapid convergence and high accuracy.Through extensive experimental validation,the proposed method demonstrates high sensitivity to voltage sensor intermittent faults and robust performance under varying operating conditions.This approach offers a promising solution for enhancing the diagnostic capabilities of railway traction systems,ensuring timely fault detection and improving overall system reliability.展开更多
基金the National Natural Science Foundation of China(No.61203337)the Natural Science Foundation of Shanghai(No.12ZR1440200)
文摘This study proposes two metrics using the nearest neighbors method to improve the accuracy of time-series forecasting. These two metrics can be treated as a hybrid forecasting approach to combine linear and non-linear forecasting techniques. One metric redefines the distance in k-nearest neighbors based on the coefficients of autoregression (AR) in time series. Meanwhile, an improvement to Kulesh's adaptive metrics in the nearest neighbors is also presented. To evaluate the performance of the two proposed metrics, three types of time-series data, namely deterministic synthetic data, chaotic time-series data and real time-series data, are predicted. Experimental results show the superiority of the proposed AR-enhanced k-nearest neighbors methods to the traditional k-nearest neighbors metric and Kulesh's adaptive metrics.
文摘-In this paper, monthly mean SST data in a large area are used. After the spacial average of the data is carried out and the secular monthly means are substracted, a time series (Jan. 1951-Dec. 1985) of SST anomalies of the cold tongue water area in the eastern tropical Pacific Ocean is obtained. On the basis of the time series, an autoregression model, a self-exciting threshold autoregression model and an open loop autoregression model are developed respectively. The interannual variations are simulated by means of those models. The simulation results show that all the three models have made very good hindcasting for the nine El Nino events since 1951. In order to test the reliability of the open loop threshold model, extrapolated forecast was made for the period of Jan. 1986-Feb. 1987. It can be seen from the forecasting that the model could forecast well the beginning and strengthening stages of the recent El Nino event (1986-1987). Correlation coefficients of the estimations to observations are respectively 0. 84, 0. 88 and 0. 89. It is obvious that all the models work well and the open loop threshold one is the best. So the open loop threshold autoregression model is a useful tool for monitoring the SSTinterannual variation of the cold tongue water area in the Eastern Equatorial Pacific Ocean and for estimating the El Nino strength.
基金Supported by National Natural Science Foundation of China(72222009,71991472)。
文摘Network autoregression and factor model are effective methods for modeling network time series data.In this study,we propose a network autoregression model with a factor structure that incorporates a latent group structure to address nodal heterogeneity within the network.An iterative algorithm is employed to minimize a least-squares objective function,allowing for simultaneous estimation of both the parameters and the group structure.To determine the unknown number of groups and factors,a PIC criterion is introduced.Additionally,statistical inference of the estimated parameters is presented.To assess the validity of the proposed estimation and inference procedures,we conduct extensive numerical studies.We also demonstrate the utility of our model using a stock dataset obtained from the Chinese A-Share stock market.
文摘Forecasting solar irradiance is a critical task in the renewable energy sector, as it provides essential information regarding the potential energy production from solar panels. This study aims to utilize the Vector Autoregression (VAR) model to forecast solar irradiance levels and weather characteristics in the San Francisco Bay Area. The results demonstrate a correlation between predicted and actual solar irradiance, indicating the effectiveness of the VAR model for this task. However, the model may not be sufficient for this region due to the requirement of additional weather features to reduce disparities between predictions and actual observations. Additionally, the current lag order in the model is relatively low, limiting its ability to capture all relevant information from past observations. As a result, the model’s forecasting capability is limited to short-term horizons, with a maximum horizon of four hours.
基金supported by the Program of Support Xinjiang by Technology(2024E02028,B2-2024-0359)Xinjiang Tianchi Talent Program of 2024,the Foundation of Chinese Academy of Sciences(B2-2023-0239)the Youth Foundation of Shandong Natural Science(ZR2023QD070).
文摘As one of the main characteristics of atmospheric pollutants,PM_(2.5) severely affects human health and has received widespread attention in recent years.How to predict the variations of PM_(2.5) concentrations with high accuracy is an important topic.The PM_(2.5) monitoring stations in Xinjiang Uygur Autonomous Region,China,are unevenly distributed,which makes it challenging to conduct comprehensive analyses and predictions.Therefore,this study primarily addresses the limitations mentioned above and the poor generalization ability of PM_(2.5) concentration prediction models across different monitoring stations.We chose the northern slope of the Tianshan Mountains as the study area and took the January−December in 2019 as the research period.On the basis of data from 21 PM_(2.5) monitoring stations as well as meteorological data(temperature,instantaneous wind speed,and pressure),we developed an improved model,namely GCN−TCN−AR(where GCN is the graph convolution network,TCN is the temporal convolutional network,and AR is the autoregression),for predicting PM_(2.5) concentrations on the northern slope of the Tianshan Mountains.The GCN−TCN−AR model is composed of an improved GCN model,a TCN model,and an AR model.The results revealed that the R2 values predicted by the GCN−TCN−AR model at the four monitoring stations(Urumqi,Wujiaqu,Shihezi,and Changji)were 0.93,0.91,0.93,and 0.92,respectively,and the RMSE(root mean square error)values were 6.85,7.52,7.01,and 7.28μg/m^(3),respectively.The performance of the GCN−TCN−AR model was also compared with the currently neural network models,including the GCN−TCN,GCN,TCN,Support Vector Regression(SVR),and AR.The GCN−TCN−AR outperformed the other current neural network models,with high prediction accuracy and good stability,making it especially suitable for the predictions of PM_(2.5)concentrations.This study revealed the significant spatiotemporal variations of PM_(2.5)concentrations.First,the PM_(2.5) concentrations exhibited clear seasonal fluctuations,with higher levels typically observed in winter and differences presented between months.Second,the spatial distribution analysis revealed that cities such as Urumqi and Wujiaqu have high PM_(2.5) concentrations,with a noticeable geographical clustering of pollutions.Understanding the variations in PM_(2.5) concentrations is highly important for the sustainable development of ecological environment in arid areas.
基金supported by the National Science Foundation (No: 1029337)supported by an allocation of computing time from the Ohio Supercomputer Center
文摘Weather forecasting is crucial to both the demand and supply sides of electricity systems. Temperature has a great effect on the demand side. Moreover, solar and wind are very promising renewable energy sources and are, thus, important on the supply side. In this paper, a large vector autoregression(VAR) model is built to forecast three important weather variables for 61 cities around the United States. The three variables at all locations are modeled as response variables. Lag terms are used to capture the relationship between observations in adjacent periods and daily and annual seasonality are modeled to consider the correlation between the same periods in adjacent days and years. We estimate the VAR model with16 years of hourly historical data and use two additional years of data for out-of-sample validation. Forecasts of up to six-hours-ahead are generated with good forecasting performance based on mean absolute error, root mean square error, relative root mean square error, and skill scores. Our VAR model gives forecasts with skill scoresthat are more than double the skill scores of other forecasting models in the literature. Our model also provides forecasts that outperform persistence forecasts by between6% and 80% in terms of mean absolute error. Our results show that the proposed time series approach is appropriate for very short-term forecasting of hourly solar radiation,temperature, and wind speed.
基金supported by the Natural Science Foundation of Hunan Province(2023JJ30817)Hunan Provincial Natural Science Foundation-Hengyang City Joint Fund Project(2025JJ70129)+1 种基金Changsha Natural Science Foundation(kq2403057)China。
文摘Objective:Near vision loss(NVL)is one of the leading causes of visual impairment worldwide,exerting a profound impact on individual quality of life and socio-economic development.This study aims to analyze the burden of NVL in China by sex and age groups from 1990 to 2021 and to project trends over the next 15 years.Methods:Using data from the Global Burden of Disease(GBD)2021 database,we conducted descriptive analyses of NVL prevalence in China,calculated age-standardized prevalence rates(ASPR)and age-standardized disability-adjusted life years rates(ASDR)to compare burden differences between sexes and age groups,and applied an autoregressive integrated moving average(ARIMA)model to predict NVL trends for the next 15 years.The model selection was based on best-fit criteria to ensure reliable projections.Results:From 1990 to 2021,China’s ASPR of NVL rose from 10096.24/100000 to 15624.54/100000,and ASDR increased from 101.75/100000 to 158.75/100000.In 2021,ASPR(16551.70/100000)and ASDR(167.69/100000)were higher among females than males(14686.21/100000 and 149.76/100000,respectively).China ranked highest globally in both NVL cases and disability-adjusted life years(DALYs),with female burden significantly exceeding male burden.Projections indicated this trend and sex gap will persist until 2036.Compared with 1990,the prevalence cases and DALYs increased by 239.20%and 238.82%,respectively in 2021,with the highest burden among females and the 55−59 age group.The ARIMA model predicted continued increases in prevalence and DALYs by 2036,with females maintaining a higher burden than males.Conclusion:This study reveals a marked increase in the NVL burden in China and predicts continued growth in the coming years.Public health policies should prioritize NVL prevention and control,with special attention to women and middle-aged populations to mitigate long-term societal and health impacts.
基金supported by the National Natural Science Foundation of China under Grant Nos.10971081 and 11001104985 Project of Jilin University
文摘This paper studies the autoregression models of order one, in a general time series setting that allows for weakly dependent innovations. Let {Xt} be a linear process defined by Xt =∑k=0^∞ψ kεt-k, where {ψk, k ≥ 0} is a sequence of real numbers and {εk, k = 0, ±1, ±2,...} is a sequence of random variables. Two results are proved in this paper. In the first result, assuming that {εk, k ≥ 1} is a sequence of asymptotically linear negative quadrant dependent (ALNQD) random variables, the authors find the limiting distributions of the least squares estimator and the associated regression t statistic. It is interesting that the limiting distributions are similar to the one found in earlier work under the assumption of i.i.d, innovations. In the second result the authors prove that the least squares estimator is not a strong consistency estimator of the autoregressive parameter a when {εk, k ≥ 1} is a sequence of negatively associated (NA) random variables, and ψ0 = 1, ψk = 0, k ≥ 1.
文摘China's astonishing economic growth implies a necessity to understand its inflation. The present paper employs threshold nonrecursive structural vector autoregression analysis to explore the asymmetric effects of macro-variables on inflation in low and high inflation regimes. The empirical evidence demonstrates, first, that the reactions of inflation to various shocks are inflation-regime-dependent and asymmetric. Second, monetary policy influences China "s high inflation and adjusting the domestic interest rate in China may be an effective way to control inflation in a high inflation regime, but not in a low inflation regime. In a high inflation regime, a high inflation rate may cause the macro-policy authorities to increase the domestic interest rate, in an attempt to stabilize high inflation. Third, contrary to expectations, the world oil price is not a strong cost-push factor in a low inflation regime. Oil price increases may increase inflation in a high inflation regime, but there is no such obvious effect in a low inflation regime. Finally, China "s nominal effective exchange rate influences inflation in both low and high inflation regimes. A nominal effeetive exchange rate appreciation might be effective in controlling domestic inflation in both regimes.
文摘Through extension of canonical correlation to the analysis of meteorological element fields (MEF), a concept from combination of canonical autocorrelation with canonical autoregression (CAR) is developed for short-term climatic prediction of MEFs with a formulated scheme. Experi- mental results suggest that the scheme is of encouraging usefulness to a weak persistence MEF, i.e., rainfall field and, in particular, to a strong persistance one like a SST field.
基金The anthors thank the support from Tianjin Philosophy and Social Science Planning Project(No.TJYYQN19-004)Project of National and Regional Research Center,Ministry of Education(No.ZX20170183)National Natural Science Foundation Youth Project(No.71803089).
文摘Since 2002,the People's Bank of China has frequently used both quantity-based direct monetary instruments and price-based indirect monetary instruments to promote economic growth and stabilize price level.Specifically,this study estimates 13 three-variable factor-augmented vector autoregression (FAVAR) models to explore how two types of monetary instruments affect China's economy and price level.Overall,we find that monetary policy has positive effects on China's economy and price level.Second,this study clearly states that the effectiveness of China's monetary policy on the economy has depended on China's quantity-based direct monetary instruments since 2002.Third,the effectiveness of quantity-based direct monetary instruments on China's economy and price level is dependent on the significant and positive effects of quantity-based direct monetary instruments after the 2008 financial crisis.Fourth,the significant and positive effects of price-based indirect monetary instruments on China's economy and price level before 2008 cannot fundamentally change their current insignificant effects on China's economy and price level.
文摘The rapid expansion of tobacco farming poses a significant threat to biodiversity in Yunnan Province,China,a region known for its rich biodiversity.This study aims to understand the trade-offs between tobacco farming and higher plant species diversity,and to identify priority counties for conservation.We employed an integrated approach combining species distribution modeling,GIS overlay analysis,and empirical spatial regression to em pirically assess the impact of tobacco farming intensity on biodiversity risk.Our findings reveal a compelling negative spatial correlation between tobacco farming expansion and higher plant species diversity.Specifically,southern counties in Wenshan and Honghe prefectures are major priority areas of conservation that exhibit signif icant spatial correlations between biodiversity risks and high tobacco farming intensity.Quantitatively,at county level,a 1%increase in tobacco farming area corresponds to a 0.094%decrease in endemic higher plant species richness across the entire province.These results underscore the need for targeted and region-specific regulations to mitigate biodiversity loss and promote sustainable development in Yunnan Province.The integrated approach used in this study provides a comprehensive assessment of the tobacco-biodiversity trade-offs,offering actionable insights for policymaking.
基金funded by Taif University,Saudi Arabia,Project No.(TU-DSPP-2024-52).
文摘Aquila Optimizer(AO)is a recently proposed population-based optimization technique inspired by Aquila’s behavior in catching prey.AO is applied in various applications and its numerous variants were proposed in the literature.However,chaos theory has not been extensively investigated in AO.Moreover,it is still not applied in the parameter estimation of electro-hydraulic systems.In this work,ten well-defined chaotic maps were integrated into a narrowed exploitation of AO for the development of a robust chaotic optimization technique.An extensive investigation of twenty-three mathematical benchmarks and ten IEEE Congress on Evolutionary Computation(CEC)functions shows that chaotic Aquila optimization techniques perform better than the baseline technique.The investigation is further conducted on parameter estimation of an electro-hydraulic control system,which is performed on various noise levels and shows that the proposed chaotic AO with Piecewise map(CAO6)achieves the best fitness values of and at noise levels and respectively.Friedman test 2.873E-05,1.014E-04,8.728E-031.300E-03,1.300E-02,1.300E-01,for repeated measures,computational analysis,and Taguchi test reflect the superiority of CAO6 against the state of the arts,demonstrating its potential for addressing various engineering optimization problems.However,the sensitivity to parameter tuning may limit its direct application to complex optimization scenarios.
基金supported by the China Postdoctoral Science Foundation(No.2014m561331)Science and Technology Research Project of Heilongjiang Provincial Education Department(No.12521073)National Natural Science Youth Fund(No.61300115).
文摘In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as template-based retrosynthesis,have high accuracy and interpretability in specific types of reactions but are limited by the scope of the template library,making it difficult to adapt to new or uncommon reaction types.Moreover,sequence-to-sequence retrosynthetic prediction methods,although they enhance the flexibility of prediction,often overlook the complexity of molecular graph structures and the actual interactions between atoms,which limits the accuracy and reliability of the predictions.To address these limitations,this paper proposes a Molecular Retrosynthesis Top-k Prediction based on the Latent Generation Process(MRLGP)that uses latent variables from graph neural networks to model the generation process and produce diverse set of reactants.Utilising an encoding method based on Graphormer,the authors have also introduced topology-aware positional encoding to better capture the interactions between atomic nodes in the molecular graph structure,thereby more accurately simulating the retrosynthetic process.The MRLGP model significantly enhances the accuracy and diversity of predictions by correlating discrete latent variables with the reactant generation process and progressively constructing molecular graphs using a variational autoregressive decoder.Experimental results on benchmark datasets such as USPTO-50k,USPTO-Full,and USPTO-DIVERSE demonstrate that MRLGP outperforms baseline models on multiple Top-k evaluation metrics.Additionally,ablation experiments conducted on the USPTO-50K dataset further validate the effectiveness of the methods used in the encoder and decoder parts of the model.
基金supported by the Program of National Natural Science Foundation of China(U23A20329,62163036)Youth Academic and Technical Leaders Reserve Talent Training project(202105AC160094)Industrial Innovation Talent Special Project of Xingdian Talent Support Program(XDYC-CYCX-2022-0010).
文摘Kernel-based slow feature analysis(SFA)methods have been successfully applied in the industrial process fault detection field.However,kernel-based SFA methods have high computational complexity as dealing with nonlinearity,leading to delays in detecting time-varying data features.Additionally,the uncertain kernel function and kernel parameters limit the ability of the extracted features to express process characteristics,resulting in poor fault detection performance.To alleviate the above problems,a novel randomized auto-regressive dynamic slow feature analysis(RRDSFA)method is proposed to simultaneously monitor the operating point deviations and process dynamic faults,enabling real-time monitoring of data features in industrial processes.Firstly,the proposed Random Fourier mappingbased method achieves more effective nonlinear transformation,contrasting with the current kernelbased RDSFA algorithm that may lead to significant computational complexity.Secondly,a randomized RDSFA model is developed to extract nonlinear dynamic slow features.Furthermore,a Bayesian inference-based overall fault monitoring model including all RRDSFA sub-models is developed to overcome the randomness of random Fourier mapping.Finally,the superiority and effectiveness of the proposed monitoring method are demonstrated through a numerical case and a simulation of continuous stirred tank reactor.
文摘High-resolution spatiotemporal simulations effectively capture the complexities of atmospheric plume sion disper-in complex terrain.However,their high computational cost makes them impractical for applications requiring rapid responses or iterative processes,such as optimization,uncertainty quantification,or inverse modeling.To address this challenge,this work introduces the Dual-Stage Temporal Three-dimensional UNet Super-resolution(DST3D-UNet-SR)model,a highly efficient deep learning model for plume dispersion predictions.DST3D-UNet-SR is composed of two sequential modules:the temporal module(TM),which predicts the transient evolution of a plume in complex terrain from low-resolution temporal data,and the spatial refinement module(SRM),which subsequently enhances the spatial resolution of the TM predictions.We train DST3D-UNet-SR using a comprehensive dataset derived from high-resolution large eddy simulations(LES)of plume transport.We propose the DST3D-UNet-SR model to significantly accelerate LES of three-dimensional(3D)plume dispersion by three orders of magnitude.Additionally,the model demonstrates the ability to dynamically adapt to evolving conditions through the incorporation of new observational data,substantially improving prediction accuracy in high-concentration regions near the source.
文摘Climate change may have detrimental effects on different sectoral growth in global economy and according to the Intergovernmental Panel on Climate Change,the impacts of climate change will be more vigorous in the coming years.The climatic and non-climatic driving forces behind the economic sectoral performances involve short-and long-run interconnections among variables.This study attempts to investigate the effect of climatic factors(temperature and precipitation)along with non-climatic factors,including foreign direct investment(FDI),human capital index(HCI),natural capital(NC),and information and communication technology(ICT)on three major sectors of the economy(agricultural sector,industrial sector,and service sector)through non-linear model framework by employing cross-sectionally augmented autoregressive distributed lag(CS-ARDL)estimation technique.It considers a panel of 56 selected countries from different income groups,including high-income countries,upper-middle-income countries,lower-middle-income countries,and low-income countries,covering the period 1985-2022.The confirmation of slope heterogeneity,cross-sectional dependence,stationarity,and cointegration among variables lends support to the robustness of results.The augmented mean group(AMG)robustness test was applied to check robustness and the results were found mostly consistent with estimation method.The results revealed that upper-middle-income countries are more vulnerable to extreme temperatures compared to high-income countries.The results also confirmed an inverted U-shaped relationship between each sector’s output and precipitation in upper-middle-income countries.In contrast,for upper-middle-income,lower-middle-income,and low-income countries,this relation exists in industrial sector only in long run.This indicates that precipitation is initially beneficial for production activities.However,beyond a certain threshold of precipitation,this trend reverses,i.e.,the output of the economic sectors tends to decline.Furthermore,there is no supporting evidence that confirms a short-run non-linear relation between precipitation and agricultural yields.In upper-middle-income countries,the results confirmed that FDI is a driving factor behind both agricultural sector and service sector in long run while short-run results indicated a negative association but insignificant.This study also showed that in long run,an increase in HCI contributes to improving the output of the three sectors for high-income countries.The empirical findings provide valuable insights for policy-makers and governments to formulate coherent adaptation and mitigation strategies,thereby accelerating the transition of sectoral productivity from low to high levels in the sample countries.
基金research supported wholly by the National Research Foundation (NRF) of South Africathe Dutch Research Council (NWO) Project (UID 129352)
文摘The use ofrenewable energyisan important way toachieve sustainable agriculturalandeconomic development.However,there are differences in accessto renewable energy between the Global North and Global South.This study utilisedan autoregressive distributed lag-error correctionmodel and thedata spanning from 1991to 2021 to comparatively analyse the dynamic relationship amongrenewable energy consumption,the value of agricultural production,gross domestic product(GDP),economic diversificationindex,urban population,the total water extraction for agricultural withdrawal,and trade balancein the Netherlands and South Africa.In the shortrun,renewable energy consumption was increased by the value of agricultural productionbut decreased by GDPin South Africa.In the longrun,renewable energy consumption and GDP increased the value of agricultural production,while the value of agricultural production also increased GDP in South Africa.However,in the Netherlands,there was no short-and long-run relationship betweenrenewable energy consumption and agricultural and economic development.The results revealedthat there was a short-and long-run relationship in South Africa.Moreover,in the Netherlands,the adjustment speed was-1.46 forrenewable energy consumption with an error correction of 0.68 a(8.22 months).In South Africa,the adjustment speedwas-1.28 forrenewable energy consumption with an error correction of 0.78 a(9.38 months).Therefore,compared to South Africa,renewable energy consumptionin the Netherlands takes less time to return to balance after a shock.Thesefindings signify different trajectories on sectoral and economic transition initiatives spurred usingrenewable energy between the Netherlands and South Africa.Policy relating to initiatives such as“agro-energy communities”in Global South countries such as South Africa should be emphasised to promote the use of renewable energy in the agricultural sector.
文摘Motivated by a significant impact of price volatility on food security and economic stability inCameroon,this study aims to understand the factors influencing agricultural product price volatility(APPV)and formulateeffective policies for mitigating its negative impactand promoting sustainable economic growth.Specifically,this research used theautoregressive distributed lag-error correction model(ARDL-ECM)to analyse the impact of agricultural productivity,agricultural product imports,population,temperature variation,gross domestic product(GDP)per capita,and government expenditure on APPV based on the annual data from 2000 to 2021.The ARDL-ECM estimation results revealed that agricultural productivity(β=4.901),agricultural product imports(β=1.012),population(β=13.635),and GDP per capita(β=2.794)were positively related toAPPV,while temperature variation(β=-0.990)and government expenditure(β=-8.585)were negatively related toAPPVin the long term.However,temperature variation had a positive relationship with APPV in the short term.Moreover,the Granger causality test showed that there werebidirectional causality of APPV with agricultural productivityandagricultural product imports,and unidirectional causality of APPVwith population,temperature variation,GDP per capita,and government expenditure.The findings highlight the importance of public policies in stabilizing agricultural product prices by investing in agricultural research,improving access to agricultural inputs,strengthening farmer capacities,implementing climate adaptation measures,and enhancing rural infrastructure.Thesepolicies can reduce APPV,improve food security,and promote inclusive economic growth in Cameroon.
文摘In the field of railway traction drive systems,voltage sensor intermittent faults can significantly impact the reliability and safety of the entire system.This paper proposes an online diagnosis method for detecting such faults using an Artificial Intelligence(AI)predictor based on a Nonlinear Autoregressive with eXogenous inputs(NARX)data structure.The model is trained efficiently using the Extreme Learning Machine(ELM)algorithm.The NARX model captures the dynamic characteristics of the voltage sensor data,enabling the AI predictor to learn complex nonlinear relationships.The ELM training method ensures rapid convergence and high accuracy.Through extensive experimental validation,the proposed method demonstrates high sensitivity to voltage sensor intermittent faults and robust performance under varying operating conditions.This approach offers a promising solution for enhancing the diagnostic capabilities of railway traction systems,ensuring timely fault detection and improving overall system reliability.