Accurate forecasting of time series is crucial across various domains.Many prediction tasks rely on effectively segmenting,matching,and time series data alignment.For instance,regardless of time series with the same g...Accurate forecasting of time series is crucial across various domains.Many prediction tasks rely on effectively segmenting,matching,and time series data alignment.For instance,regardless of time series with the same granularity,segmenting them into different granularity events can effectively mitigate the impact of varying time scales on prediction accuracy.However,these events of varying granularity frequently intersect with each other,which may possess unequal durations.Even minor differences can result in significant errors when matching time series with future trends.Besides,directly using matched events but unaligned events as state vectors in machine learning-based prediction models can lead to insufficient prediction accuracy.Therefore,this paper proposes a short-term forecasting method for time series based on a multi-granularity event,MGE-SP(multi-granularity event-based short-termprediction).First,amethodological framework for MGE-SP established guides the implementation steps.The framework consists of three key steps,including multi-granularity event matching based on the LTF(latest time first)strategy,multi-granularity event alignment using a piecewise aggregate approximation based on the compression ratio,and a short-term prediction model based on XGBoost.The data from a nationwide online car-hailing service in China ensures the method’s reliability.The average RMSE(root mean square error)and MAE(mean absolute error)of the proposed method are 3.204 and 2.360,lower than the respective values of 4.056 and 3.101 obtained using theARIMA(autoregressive integratedmoving average)method,as well as the values of 4.278 and 2.994 obtained using k-means-SVR(support vector regression)method.The other experiment is conducted on stock data froma public data set.The proposed method achieved an average RMSE and MAE of 0.836 and 0.696,lower than the respective values of 1.019 and 0.844 obtained using the ARIMA method,as well as the values of 1.350 and 1.172 obtained using the k-means-SVR method.展开更多
Anthropogenic ammonia emissions primarily originate from agriculture,especially field fertilization.These emissions represent nitrogen loss for farmers and contribute to air pollution,posing risks to human health and ...Anthropogenic ammonia emissions primarily originate from agriculture,especially field fertilization.These emissions represent nitrogen loss for farmers and contribute to air pollution,posing risks to human health and the environment.Estimating ammonia emissions is crucial for national inventories and policy-making.Various models exist for predicting emissions,including mechanistic,empirical,and semi-empirical approaches.While machine learning(ML)is widely used in environmental science,its application to ammonia emissions remains limited.In this study,we used 5939 ammonia emission data from 538 trials,extracted from the ALFAM2 database,to train three machine learning methods-random forest,gradient boosting,and lasso-for predicting cumulative ammonia emissions 72 h after manure application.These methods were compared to the semi-empirical ALFAM2 model using an independent test dataset.Random forest(RMSE=4.51,r=0.94,MAE=3.28,Bias=0.92)and gradient boosting(RMSE=6.19,r=0.89,MAE=4.10,Bias=0.51)showed the best performance,while the lasso log-linear model(RMSE=7.30,r=0.84,MAE=5.57,Bias=-1.38)performed worst.Both random forest and gradient boosting outperformed the semi-empirical ALFAM2 model,which showed performance comparable to the lasso model.We then used these models and the ALFAM2 model to compare five slurry management techniques,varying in application method(trailing hoses,trailing shoes,and open slot)and post-application incorporation,across 128 scenarios with different manure types and weather conditions.Compared to broadcast application,alternative techniques reduced emissions by a median of-13.6%to-61.7%.This study highlights the promise of ML models in assessing ammonia emission reduction methods,while emphasizing the importance of evaluating model sensitivity to algorithm choice.展开更多
Accurate prediction of remaining useful life serves as a reliable basis for maintenance strategies,effectively reducing both the frequency of failures and associated costs.As a core component of PHM,RUL prediction pla...Accurate prediction of remaining useful life serves as a reliable basis for maintenance strategies,effectively reducing both the frequency of failures and associated costs.As a core component of PHM,RUL prediction plays a crucial role in preventing equipment failures and optimizing maintenance decision-making.However,deep learning models often falter when processing raw,noisy temporal signals,fail to quantify prediction uncertainty,and face challenges in effectively capturing the nonlinear dynamics of equipment degradation.To address these issues,this study proposes a novel deep learning framework.First,a newbidirectional long short-termmemory network integrated with an attention mechanism is designed to enhance temporal feature extraction with improved noise robustness.Second,a probabilistic prediction framework based on kernel density estimation is constructed,incorporating residual connections and stochastic regularization to achieve precise RUL estimation.Finally,extensive experiments on the C-MAPSS dataset demonstrate that our method achieves competitive performance in terms of RMSE and Score metrics compared to state-of-the-artmodels.More importantly,the probabilistic output provides a quantifiablemeasure of prediction confidence,which is crucial for risk-informed maintenance planning,enabling managers to optimize maintenance strategies based on a quantifiable understanding of failure risk.展开更多
With the increasing depth and intensity of coal mining operations,high-energy mine tremors have become a major trigger for rockburst disasters,posing severe threats to mine safety.Conventional rockburst risk assessmen...With the increasing depth and intensity of coal mining operations,high-energy mine tremors have become a major trigger for rockburst disasters,posing severe threats to mine safety.Conventional rockburst risk assessment methods either lack real-time adaptability or rely heavily on qualitative microseismic data analysis,limiting their effectiveness in dynamic early warning.To address these limitations,this study proposed a predictive framework for rockburst risk assessment by integrating ensemble learning algorithms with Bayesian optimization.A dataset was constructed using a sliding time window approach,linking the highest MS energy in the subsequent days with predefined risk levels.Both undersampling and oversampling strategies were employed to mitigate class imbalance,and their performance was evaluated.Three ensemble models,i.e.CatBoost,Random Forest,and LightGBM,were developed,and their hyperparameters were optimized using Bayesian techniques to enhance predictive performance.The models were validated using MS data from the 6303 and 6306 working faces at the Dongtan Coal Mine.All three ensemble models outperformed conventional classification methods,particularly in accurately predicting high-risk categories.Among them,the CatBoost model exhibited the best performance,with an accuracy of 89.47%and an F1¯-score of 90.62%.Furthermore,SHapley Additive exPlanations analysis was used to enhance model interpretability,identifying key MS indicators influencing rockburst risk predictions.This study provides a systematic approach for leveraging MS data and machine learning to improve an early warning system for rockburst hazards,offering valuable insights for underground mining safety management.展开更多
BACKGROUND Colorectal polyps(CPs)are important precursor lesions of colorectal cancer,and endoscopic surgery remains the primary treatment option.However,the shortterm recurrence rate post-surgery is high,and the risk...BACKGROUND Colorectal polyps(CPs)are important precursor lesions of colorectal cancer,and endoscopic surgery remains the primary treatment option.However,the shortterm recurrence rate post-surgery is high,and the risk factors for recurrence remain unknown.AIM To comprehensively explore risk factors for short-term recurrence of CPs after endoscopic surgery and develop a nomogram prediction model.METHODS Overall,362 patients who underwent endoscopic polypectomy between January 2022 and January 2024 at Nanjing Jiangbei Hospital were included.We screened basic demographic data,clinical and polyp characteristics,surgery-related information,and independent risk factors for CPs recurrence using univariate and multivariate logistic regression analyses.The multivariate analysis results were used to construct a nomogram prediction model,internally validated using Bootstrapping,with performance evaluated using area under the curve(AUC),calibration curve,and decision curve analysis.RESULTS CP re-occurred in 166(45.86%)of the 362 patients within 1 year post-surgery.Multivariate logistic regression analysis showed that age(OR=1.04,P=0.002),alcohol consumption(OR=2.07,P=0.012),Helicobacter pylori infection(OR=2.34,P<0.001),polyp number>2(OR=1.98,P=0.005),sessile polyps(OR=2.10,P=0.006),and adenomatous pathological type(OR=3.02,P<0.001)were independent risk factors for post-surgery recurrence.The nomogram prediction model showed good discriminatory(AUC=0.73)and calibrating power,and decision curve analysis showed that the model had good clinical benefit at risk probabilities>20%.CONCLUSION We identified multiple independent risk factors for short-term recurrence after endoscopic surgery.The nomogram prediction model showed a certain degree of differentiation,calibration,and potential clinical applicability.展开更多
RNAs have important biological functions and the functions of RNAs are generally coupled to their structures, especiallytheir secondary structures. In this work, we have made a comprehensive evaluation of the performa...RNAs have important biological functions and the functions of RNAs are generally coupled to their structures, especiallytheir secondary structures. In this work, we have made a comprehensive evaluation of the performances of existingtop RNA secondary structure prediction methods, including five deep-learning (DL) based methods and five minimum freeenergy (MFE) based methods. First, we made a brief overview of these RNA secondary structure prediction methods.Afterwards, we built two rigorous test datasets consisting of RNAs with non-redundant sequences and comprehensivelyexamined the performances of the RNA secondary structure prediction methods through classifying the RNAs into differentlength ranges and different types. Our examination shows that the DL-based methods generally perform better thanthe MFE-based methods for RNAs with long lengths and complex structures, while the MFE-based methods can achievegood performance for small RNAs and some specialized MFE-based methods can achieve good prediction accuracy forpseudoknots. Finally, we provided some insights and perspectives in modeling RNA secondary structures.展开更多
Solar flares are violent solar outbursts which have a great influence on the space environment surrounding Earth,potentially causing disruption of the ionosphere and interference with the geomagnetic field,thus causin...Solar flares are violent solar outbursts which have a great influence on the space environment surrounding Earth,potentially causing disruption of the ionosphere and interference with the geomagnetic field,thus causing magnetic storms.Consequently,it is very important to accurately predict the time period of solar flares.This paper proposes a flare prediction model,based on physical images of active solar regions.We employ X-ray flux curves recorded directly by the Geostationary Operational Environmental Satellite,used as input data for the model,allowing us to largely avoid the influence of accidental errors,effectively improving the model prediction efficiency.A model based on the X-ray flux curve can predict whether there will be a flare event within 24 hours.The reverse can also be verified by the peak of the X-ray flux curve to see if a flare has occurred within the past 24 hours.The True Positive Rate and False Positive Rate of the prediction model,based on physical images of active regions are 0.6070 and 0.2410 respectively,and the accuracy and True Skill Statistics are 0.7590 and 0.5556.Our model can effectively improve prediction efficiency compared with models based on the physical parameters of active regions or magnetic field records,providing a simple method for solar flare prediction.展开更多
Deep learning plays a vital role in real-life applications, for example object identification, human face recognition, speech recognition, biometrics identification, and short and long-term forecasting of data. The ma...Deep learning plays a vital role in real-life applications, for example object identification, human face recognition, speech recognition, biometrics identification, and short and long-term forecasting of data. The main objective of our work is to predict the market performance of the Dhaka Stock Exchange (DSE) on day closing price using different Deep Learning techniques. In this study, we have used the LSTM (Long Short-Term Memory) network to forecast the data of DSE for the convenience of shareholders. We have enforced LSTM networks to train data as well as forecast the future time series that has differentiated with test data. We have computed the Root Mean Square Error (RMSE) value to scrutinize the error between the forecasted value and test data that diminished the error by updating the LSTM networks. As a consequence of the renovation of the network, the LSTM network provides tremendous performance which outperformed the existing works to predict stock market prices.展开更多
To tackle the difficulties of the point prediction in quantifying the reliability of landslide displacement prediction,a data-driven combination-interval prediction method(CIPM)based on copula and variational-mode-dec...To tackle the difficulties of the point prediction in quantifying the reliability of landslide displacement prediction,a data-driven combination-interval prediction method(CIPM)based on copula and variational-mode-decomposition associated with kernel-based-extreme-learningmachine optimized by the whale optimization algorithm(VMD-WOA-KELM)is proposed in this paper.Firstly,the displacement is decomposed by VMD to three IMF components and a residual component of different fluctuation characteristics.The key impact factors of each IMF component are selected according to Copula model,and the corresponding WOA-KELM is established to conduct point prediction.Subsequently,the parametric method(PM)and non-parametric method(NPM)are used to estimate the prediction error probability density distribution(PDF)of each component,whose prediction interval(PI)under the 95%confidence level is also obtained.By means of the differential evolution algorithm(DE),a weighted combination model based on the PIs is built to construct the combination-interval(CI).Finally,the CIs of each component are added to generate the total PI.A comparative case study shows that the CIPM performs better in constructing landslide displacement PI with high performance.展开更多
Ribonucleic Acid(RNA)contact prediction holds great significance for modeling RNA 3D structures and further understanding RNA biological functions.The rapid growth of RNA sequencing data has driven the development of ...Ribonucleic Acid(RNA)contact prediction holds great significance for modeling RNA 3D structures and further understanding RNA biological functions.The rapid growth of RNA sequencing data has driven the development of diverse computational methods for RNA contact prediction,and a benchmark evaluation of these methods remains essential.In this work,we first classified RNA contact prediction methods into statistical inference-based and neural networkbased ones.We then evaluated eight state-of-the-art methods on three test sets:a sequencediverse set,a structurally non-redundant set and a CASP RNA targets set.Our evaluation shows that for identifying non-local and long-range contacts,neural network-based methods outperform statistical inference-based ones,with SPOT-RNA-2D achieving the best performance,followed by CoCoNet and RNAcontact.However,for identifying the long-range tertiary contacts,which are vital for stabilizing RNA tertiary structure,statistical inference-based methods exhibit superior performance with GREMLIN emerging as the top performer.This work provides a comprehensive benchmarking of RNA contact prediction methods,highlighting their strengths and limitations to guide further methodological improvements and applications in RNA structure modeling.展开更多
The increasingly severe state of coal burst disaster has emerged as a critical factor constraining coal mine safety production,and it has become a challenging task to enhance the accuracy of coal burst disaster predic...The increasingly severe state of coal burst disaster has emerged as a critical factor constraining coal mine safety production,and it has become a challenging task to enhance the accuracy of coal burst disaster prediction.To address the issue of insufficient exploration of the spatio-temporal characteristic of microseismic data and the challenging selection of the optimal time window size in spatio-temporal prediction,this paper integrates deep learning methods and theory to propose a novel coal burst spatio-temporal prediction method based on Bidirectional Long Short-Term Memory(Bi-LSTM)network.The method involves three main modules,including microseismic spatio-temporal characteristic indicators construction,temporal prediction model,and spatial prediction model.To validate the effectiveness of the proposed method,engineering application tests are conducted at a high-risk working face in the Ordos mining area of Inner Mongolia,focusing on 13 high-energy microseismic events with energy levels greater than 105 J.In terms of temporal prediction,the analysis indicates that the temporal prediction results consist of 10 strong predictions and 3 medium predictions,and there is no false alarm detected throughout the entire testing period.Moreover,compared to the traditional threshold-based coal burst temporal prediction method,the accuracy of the proposed method is increased by 38.5%.In terms of spatial prediction,the distribution of spatial prediction results for high-energy events comprises 6 strong hazard predictions,3 medium hazard predictions,and 4 weak hazard predictions.展开更多
Wellbore instability is one of the significant challenges in the drilling engineering and during the development of carbonate reservoirs,especially with open-hole completion.The problems of wellbore instability such a...Wellbore instability is one of the significant challenges in the drilling engineering and during the development of carbonate reservoirs,especially with open-hole completion.The problems of wellbore instability such as downhole collapse and silt deposit in the fractured carbonate reservoir of Tarim Basin(Ordovician)are severe.Solid destabilization and production(SDP)was proposed to describe this engineering problem of carbonate reservoirs.To clarify the mechanism and mitigate potential borehole instability problems,we conducted particle size distribution(PSD)analysis,X-ray diffraction(XRD)analysis,triaxial compression tests,and micro-scale sand production tests based on data analysis.We found that the rock fragments and silt in the wellbore came from two sources:one from the wellbore collapse in the upper unplugged layers and the other from the production of sand particles carried by the fluid in the productive layers.Based on the experimental study,a novel method combining a geomechanical model and microscopic sand production model was proposed to predict wellbore instability and analyze its influencing factors.The critical condition and failure zone predicted by the prediction model fit well with the field observations.According to the prediction results,the management and prevention measures of wellbore instability in carbonate reservoirs were proposed.It is suggested to optimize the well track in new drilling wells while upgrading the production system in old wells.This study is of great guiding significance for the optimization of carbonate solid control and it improves the understanding of the sand production problems in carbonate reservoirs.展开更多
Accelerating urbanization and the rapid development of intelligent transportation systems have rendered shortterm traffic flow prediction an important research field.Accurate prediction of traffic flow is beneficial f...Accelerating urbanization and the rapid development of intelligent transportation systems have rendered shortterm traffic flow prediction an important research field.Accurate prediction of traffic flow is beneficial for the optimization of traffic planning,improvement of road utilization,reduction of traffic congestion,and reduction in the incidence of traffic accidents.However,data pertaining to traffic flow are typically influenced by a multitude of factors,resulting in data that exhibit a considerable degree of nonlinearity and complexity.To address the issue of noise in raw traffic flow data,this study proposes a hybrid model that combines variational mode decomposition(VMD),a bidirectional long short-term memory network(BiLSTM),and a gated recurrent unit(GRU)for short-term traffic flow prediction.To validate the effectiveness of the model,an experimental validation was conducted based on traffic flow data from UK highways,and the performance of the model was compared with common benchmark models.The experimental results demonstrate that the proposed method yields superior prediction results in terms of mean absolute error,coefficient of determination,and root-mean-square error compared to existing prediction techniques,thereby substantiating its efficacy in short-term traffic flow prediction.展开更多
The remaining useful life prediction of rolling bearing is vital in safety and reliability guarantee.In engineering scenarios,only a small amount of bearing performance degradation data can be obtained through acceler...The remaining useful life prediction of rolling bearing is vital in safety and reliability guarantee.In engineering scenarios,only a small amount of bearing performance degradation data can be obtained through accelerated life testing.In the absence of lifetime data,the hidden long-term correlation between performance degradation data is challenging to mine effectively,which is the main factor that restricts the prediction precision and engineering application of the residual life prediction method.To address this problem,a novel method based on the multi-layer perception neural network and bidirectional long short-term memory network is proposed.Firstly,a nonlinear health indicator(HI)calculation method based on kernel principal component analysis(KPCA)and exponential weighted moving average(EWMA)is designed.Then,using the raw vibration data and HI,a multi-layer perceptron(MLP)neural network is trained to further calculate the HI of the online bearing in real time.Furthermore,The bidirectional long short-term memory model(BiLSTM)optimized by particle swarm optimization(PSO)is used to mine the time series features of HI and predict the remaining service life.Performance verification experiments and comparative experiments are carried out on the XJTU-SY bearing open dataset.The research results indicate that this method has an excellent ability to predict future HI and remaining life.展开更多
Accurate prediction of coal and gas outburst(CGO)hazards is paramount in gas disaster prevention and control.This paper endeavors to overcome the constraints posed by traditional prediction indexes when dealing with C...Accurate prediction of coal and gas outburst(CGO)hazards is paramount in gas disaster prevention and control.This paper endeavors to overcome the constraints posed by traditional prediction indexes when dealing with CGO incidents under low gas pressure conditions.In pursuit of this objective,we have studied and established a mechanical model of the working face under abnormal stress and the excitation energy conditions of CGO,and proposed a method for predicting the risk of CGO under abnormal stress.On site application verification shows that when a strong outburst hazard level prediction is issued,there is a high possibility of outburst disasters occurring.In one of the three locations where we predicted strong outburst hazards,a small outburst occurred,and the accuracy of the prediction was higher than the traditional drilling cuttings index S and drilling cuttings gas desorption index q.Finally,we discuss the mechanism of CGO under the action of stress anomalies.Based on the analysis of stress distribution changes and energy accumulation characteristics of coal under abnormal stress,this article believes that the increase in outburst risk caused by high stress abnormal gradient is mainly due to two reasons:(1)The high stress abnormal gradient leads to an increase in the plastic zone of the coal seam.After the working face advances,it indirectly leads to an increase in the gas expansion energy that can be released from the coal seam before reaching a new stress equilibrium.(2)Abnormal stress leads to increased peak stress of coal body in front of working face.When coal body in elastic area transforms to plastic area,its failure speed is accelerated,which induces accelerated gas desorption and aggravates the risk of outburst.展开更多
[Objective]Accurate prediction of tomato growth height is crucial for optimizing production environments in smart farming.However,current prediction methods predominantly rely on empirical,mechanistic,or learning-base...[Objective]Accurate prediction of tomato growth height is crucial for optimizing production environments in smart farming.However,current prediction methods predominantly rely on empirical,mechanistic,or learning-based models that utilize either images data or environmental data.These methods fail to fully leverage multi-modal data to capture the diverse aspects of plant growth comprehensively.[Methods]To address this limitation,a two-stage phenotypic feature extraction(PFE)model based on deep learning algorithm of recurrent neural network(RNN)and long short-term memory(LSTM)was developed.The model integrated environment and plant information to provide a holistic understanding of the growth process,emploied phenotypic and temporal feature extractors to comprehensively capture both types of features,enabled a deeper understanding of the interaction between tomato plants and their environment,ultimately leading to highly accurate predictions of growth height.[Results and Discussions]The experimental results showed the model's ef‐fectiveness:When predicting the next two days based on the past five days,the PFE-based RNN and LSTM models achieved mean absolute percentage error(MAPE)of 0.81%and 0.40%,respectively,which were significantly lower than the 8.00%MAPE of the large language model(LLM)and 6.72%MAPE of the Transformer-based model.In longer-term predictions,the 10-day prediction for 4 days ahead and the 30-day prediction for 12 days ahead,the PFE-RNN model continued to outperform the other two baseline models,with MAPE of 2.66%and 14.05%,respectively.[Conclusions]The proposed method,which leverages phenotypic-temporal collaboration,shows great potential for intelligent,data-driven management of tomato cultivation,making it a promising approach for enhancing the efficiency and precision of smart tomato planting management.展开更多
The mathematical method of ZTD(zenith tropospheric delay)spatial prediction is important for precise ZTD derivation and real-time precise point positioning(PPP)augmentation.This paper analyses the performance of the p...The mathematical method of ZTD(zenith tropospheric delay)spatial prediction is important for precise ZTD derivation and real-time precise point positioning(PPP)augmentation.This paper analyses the performance of the popular optimal function coefficient(OFC),sphere cap harmonic analysis(SCHA),kriging and inverse distance weighting(IDW)interpolation in ZTD spatial prediction and Beidou satellite navigation system(BDS)-PPP augmentation over China.For ZTD spatial prediction,the average time consumption of the OFC,kriging,and IDW methods is less than 0.1 s,which is significantly better than that of the SCHA method(63.157 s).The overall ZTD precision of the OFC is 3.44 cm,which outperforms those of the SCHA(9.65 cm),Kriging(10.6 cm),and IDW(11.8 cm)methods.We confirmed that the low performance of kriging and IDW is caused by their weakness in modelling ZTD variation in the vertical direction.To mitigate such deficiencies,an elevation normalization factor(ENF)is introduced into the kriging and IDW models(kriging-ENF and IDW-ENF).The overall ZTD spatial prediction accuracies of IDW-ENF and kriging-ENF are 2.80 cm and 2.01 cm,respectively,which are both superior to those of the OFC and the widely used empirical model GPT3(4.92 cm).For BDS-PPP enhancement,the ZTD provided by the kriging-ENF,IDW-ENF and OFC as prior constraints can effectively reduce the convergence time.Compared with unconstrained BDS-PPP,our proposed kriging-ENF outperforms IDW-ENF and OFC by reducing the horizontal and vertical convergence times by approximately 13.2%and 5.8%in Ningxia and 30.4%and 7.84%in Guangdong,respectively.These results indicate that kriging-ENF is a promising method for ZTD spatial prediction and BDS-PPP enhancement over China.展开更多
Based on data from the Jilin Water Diversion Tunnels from the Songhua River(China),an improved and real-time prediction method optimized by multi-algorithm for tunnel boring machine(TBM)cutter-head torque is presented...Based on data from the Jilin Water Diversion Tunnels from the Songhua River(China),an improved and real-time prediction method optimized by multi-algorithm for tunnel boring machine(TBM)cutter-head torque is presented.Firstly,a function excluding invalid and abnormal data is established to distinguish TBM operating state,and a feature selection method based on the SelectKBest algorithm is proposed.Accordingly,ten features that are most closely related to the cutter-head torque are selected as input variables,which,in descending order of influence,include the sum of motor torque,cutter-head power,sum of motor power,sum of motor current,advance rate,cutter-head pressure,total thrust force,penetration rate,cutter-head rotational velocity,and field penetration index.Secondly,a real-time cutterhead torque prediction model’s structure is developed,based on the bidirectional long short-term memory(BLSTM)network integrating the dropout algorithm to prevent overfitting.Then,an algorithm to optimize hyperparameters of model based on Bayesian and cross-validation is proposed.Early stopping and checkpoint algorithms are integrated to optimize the training process.Finally,a BLSTMbased real-time cutter-head torque prediction model is developed,which fully utilizes the previous time-series tunneling information.The mean absolute percentage error(MAPE)of the model in the verification section is 7.3%,implying that the presented model is suitable for real-time cutter-head torque prediction.Furthermore,an incremental learning method based on the above base model is introduced to improve the adaptability of the model during the TBM tunneling.Comparison of the prediction performance between the base and incremental learning models in the same tunneling section shows that:(1)the MAPE of the predicted results of the BLSTM-based real-time cutter-head torque prediction model remains below 10%,and both the coefficient of determination(R^(2))and correlation coefficient(r)between measured and predicted values exceed 0.95;and(2)the incremental learning method is suitable for realtime cutter-head torque prediction and can effectively improve the prediction accuracy and generalization capacity of the model during the excavation process.展开更多
Short-term(up to 30 days)predictions of Earth Rotation Parameters(ERPs)such as Polar Motion(PM:PMX and PMY)play an essential role in real-time applications related to high-precision reference frame conversion.Currentl...Short-term(up to 30 days)predictions of Earth Rotation Parameters(ERPs)such as Polar Motion(PM:PMX and PMY)play an essential role in real-time applications related to high-precision reference frame conversion.Currently,least squares(LS)+auto-regressive(AR)hybrid method is one of the main techniques of PM prediction.Besides,the weighted LS+AR hybrid method performs well for PM short-term prediction.However,the corresponding covariance information of LS fitting residuals deserves further exploration in the AR model.In this study,we have derived a modified stochastic model for the LS+AR hybrid method,namely the weighted LS+weighted AR hybrid method.By using the PM data products of IERS EOP 14 C04,the numerical results indicate that for PM short-term forecasting,the proposed weighted LS+weighted AR hybrid method shows an advantage over both the LS+AR hybrid method and the weighted LS+AR hybrid method.Compared to the mean absolute errors(MAEs)of PMX/PMY sho rt-term prediction of the LS+AR hybrid method and the weighted LS+AR hybrid method,the weighted LS+weighted AR hybrid method shows average improvements of 6.61%/12.08%and 0.24%/11.65%,respectively.Besides,for the slopes of the linear regression lines fitted to the errors of each method,the growth of the prediction error of the proposed method is slower than that of the other two methods.展开更多
Short-term traffic flow prediction is one of the essential issues in intelligent transportation systems(ITS). A new two-stage traffic flow prediction method named AKNN-AVL method is presented, which combines an advanc...Short-term traffic flow prediction is one of the essential issues in intelligent transportation systems(ITS). A new two-stage traffic flow prediction method named AKNN-AVL method is presented, which combines an advanced k-nearest neighbor(AKNN)method and balanced binary tree(AVL) data structure to improve the prediction accuracy. The AKNN method uses pattern recognition two times in the searching process, which considers the previous sequences of traffic flow to forecast the future traffic state. Clustering method and balanced binary tree technique are introduced to build case database to reduce the searching time. To illustrate the effects of these developments, the accuracies performance of AKNN-AVL method, k-nearest neighbor(KNN) method and the auto-regressive and moving average(ARMA) method are compared. These methods are calibrated and evaluated by the real-time data from a freeway traffic detector near North 3rd Ring Road in Beijing under both normal and incident traffic conditions.The comparisons show that the AKNN-AVL method with the optimal neighbor and pattern size outperforms both KNN method and ARMA method under both normal and incident traffic conditions. In addition, the combinations of clustering method and balanced binary tree technique to the prediction method can increase the searching speed and respond rapidly to case database fluctuations.展开更多
基金funded by the Fujian Province Science and Technology Plan,China(Grant Number 2019H0017).
文摘Accurate forecasting of time series is crucial across various domains.Many prediction tasks rely on effectively segmenting,matching,and time series data alignment.For instance,regardless of time series with the same granularity,segmenting them into different granularity events can effectively mitigate the impact of varying time scales on prediction accuracy.However,these events of varying granularity frequently intersect with each other,which may possess unequal durations.Even minor differences can result in significant errors when matching time series with future trends.Besides,directly using matched events but unaligned events as state vectors in machine learning-based prediction models can lead to insufficient prediction accuracy.Therefore,this paper proposes a short-term forecasting method for time series based on a multi-granularity event,MGE-SP(multi-granularity event-based short-termprediction).First,amethodological framework for MGE-SP established guides the implementation steps.The framework consists of three key steps,including multi-granularity event matching based on the LTF(latest time first)strategy,multi-granularity event alignment using a piecewise aggregate approximation based on the compression ratio,and a short-term prediction model based on XGBoost.The data from a nationwide online car-hailing service in China ensures the method’s reliability.The average RMSE(root mean square error)and MAE(mean absolute error)of the proposed method are 3.204 and 2.360,lower than the respective values of 4.056 and 3.101 obtained using theARIMA(autoregressive integratedmoving average)method,as well as the values of 4.278 and 2.994 obtained using k-means-SVR(support vector regression)method.The other experiment is conducted on stock data froma public data set.The proposed method achieved an average RMSE and MAE of 0.836 and 0.696,lower than the respective values of 1.019 and 0.844 obtained using the ARIMA method,as well as the values of 1.350 and 1.172 obtained using the k-means-SVR method.
基金the French state aid managed by the ANR under the“Investissements d’avenir”programme with the reference ANR-16-CONV-0003from the AgroEcoSystem department of INRAE.We are grateful to the INRAE MIGALE bioinformatics facility(MIGALE,INRAE,2020.Migale bioinformatics Facility,doi:10.15454/1.5572390655343293E12)for providing help and/or computing and/or storage resources.We are also grateful to Sasha Hafner for his help in reproducing some of the results of Hafner et al.(2019).
文摘Anthropogenic ammonia emissions primarily originate from agriculture,especially field fertilization.These emissions represent nitrogen loss for farmers and contribute to air pollution,posing risks to human health and the environment.Estimating ammonia emissions is crucial for national inventories and policy-making.Various models exist for predicting emissions,including mechanistic,empirical,and semi-empirical approaches.While machine learning(ML)is widely used in environmental science,its application to ammonia emissions remains limited.In this study,we used 5939 ammonia emission data from 538 trials,extracted from the ALFAM2 database,to train three machine learning methods-random forest,gradient boosting,and lasso-for predicting cumulative ammonia emissions 72 h after manure application.These methods were compared to the semi-empirical ALFAM2 model using an independent test dataset.Random forest(RMSE=4.51,r=0.94,MAE=3.28,Bias=0.92)and gradient boosting(RMSE=6.19,r=0.89,MAE=4.10,Bias=0.51)showed the best performance,while the lasso log-linear model(RMSE=7.30,r=0.84,MAE=5.57,Bias=-1.38)performed worst.Both random forest and gradient boosting outperformed the semi-empirical ALFAM2 model,which showed performance comparable to the lasso model.We then used these models and the ALFAM2 model to compare five slurry management techniques,varying in application method(trailing hoses,trailing shoes,and open slot)and post-application incorporation,across 128 scenarios with different manure types and weather conditions.Compared to broadcast application,alternative techniques reduced emissions by a median of-13.6%to-61.7%.This study highlights the promise of ML models in assessing ammonia emission reduction methods,while emphasizing the importance of evaluating model sensitivity to algorithm choice.
基金funded by scientific research projects under Grant JY2024B011.
文摘Accurate prediction of remaining useful life serves as a reliable basis for maintenance strategies,effectively reducing both the frequency of failures and associated costs.As a core component of PHM,RUL prediction plays a crucial role in preventing equipment failures and optimizing maintenance decision-making.However,deep learning models often falter when processing raw,noisy temporal signals,fail to quantify prediction uncertainty,and face challenges in effectively capturing the nonlinear dynamics of equipment degradation.To address these issues,this study proposes a novel deep learning framework.First,a newbidirectional long short-termmemory network integrated with an attention mechanism is designed to enhance temporal feature extraction with improved noise robustness.Second,a probabilistic prediction framework based on kernel density estimation is constructed,incorporating residual connections and stochastic regularization to achieve precise RUL estimation.Finally,extensive experiments on the C-MAPSS dataset demonstrate that our method achieves competitive performance in terms of RMSE and Score metrics compared to state-of-the-artmodels.More importantly,the probabilistic output provides a quantifiablemeasure of prediction confidence,which is crucial for risk-informed maintenance planning,enabling managers to optimize maintenance strategies based on a quantifiable understanding of failure risk.
基金funded by the National Natural Science Foundation of China(Grant No.42477208)Natural Science Foundation of Hubei Province,China(Grant No.2024AFA072)Open Research Fund of State Key Laboratory of Geomechanics and Geotechnical Engineering Safety(Grant No.SKLGME-JBGS2402).
文摘With the increasing depth and intensity of coal mining operations,high-energy mine tremors have become a major trigger for rockburst disasters,posing severe threats to mine safety.Conventional rockburst risk assessment methods either lack real-time adaptability or rely heavily on qualitative microseismic data analysis,limiting their effectiveness in dynamic early warning.To address these limitations,this study proposed a predictive framework for rockburst risk assessment by integrating ensemble learning algorithms with Bayesian optimization.A dataset was constructed using a sliding time window approach,linking the highest MS energy in the subsequent days with predefined risk levels.Both undersampling and oversampling strategies were employed to mitigate class imbalance,and their performance was evaluated.Three ensemble models,i.e.CatBoost,Random Forest,and LightGBM,were developed,and their hyperparameters were optimized using Bayesian techniques to enhance predictive performance.The models were validated using MS data from the 6303 and 6306 working faces at the Dongtan Coal Mine.All three ensemble models outperformed conventional classification methods,particularly in accurately predicting high-risk categories.Among them,the CatBoost model exhibited the best performance,with an accuracy of 89.47%and an F1¯-score of 90.62%.Furthermore,SHapley Additive exPlanations analysis was used to enhance model interpretability,identifying key MS indicators influencing rockburst risk predictions.This study provides a systematic approach for leveraging MS data and machine learning to improve an early warning system for rockburst hazards,offering valuable insights for underground mining safety management.
文摘BACKGROUND Colorectal polyps(CPs)are important precursor lesions of colorectal cancer,and endoscopic surgery remains the primary treatment option.However,the shortterm recurrence rate post-surgery is high,and the risk factors for recurrence remain unknown.AIM To comprehensively explore risk factors for short-term recurrence of CPs after endoscopic surgery and develop a nomogram prediction model.METHODS Overall,362 patients who underwent endoscopic polypectomy between January 2022 and January 2024 at Nanjing Jiangbei Hospital were included.We screened basic demographic data,clinical and polyp characteristics,surgery-related information,and independent risk factors for CPs recurrence using univariate and multivariate logistic regression analyses.The multivariate analysis results were used to construct a nomogram prediction model,internally validated using Bootstrapping,with performance evaluated using area under the curve(AUC),calibration curve,and decision curve analysis.RESULTS CP re-occurred in 166(45.86%)of the 362 patients within 1 year post-surgery.Multivariate logistic regression analysis showed that age(OR=1.04,P=0.002),alcohol consumption(OR=2.07,P=0.012),Helicobacter pylori infection(OR=2.34,P<0.001),polyp number>2(OR=1.98,P=0.005),sessile polyps(OR=2.10,P=0.006),and adenomatous pathological type(OR=3.02,P<0.001)were independent risk factors for post-surgery recurrence.The nomogram prediction model showed good discriminatory(AUC=0.73)and calibrating power,and decision curve analysis showed that the model had good clinical benefit at risk probabilities>20%.CONCLUSION We identified multiple independent risk factors for short-term recurrence after endoscopic surgery.The nomogram prediction model showed a certain degree of differentiation,calibration,and potential clinical applicability.
基金supported by grants from the National Science Foundation of China(Grant Nos.12375038 and 12075171 to ZJT,and 12205223 to YLT).
文摘RNAs have important biological functions and the functions of RNAs are generally coupled to their structures, especiallytheir secondary structures. In this work, we have made a comprehensive evaluation of the performances of existingtop RNA secondary structure prediction methods, including five deep-learning (DL) based methods and five minimum freeenergy (MFE) based methods. First, we made a brief overview of these RNA secondary structure prediction methods.Afterwards, we built two rigorous test datasets consisting of RNAs with non-redundant sequences and comprehensivelyexamined the performances of the RNA secondary structure prediction methods through classifying the RNAs into differentlength ranges and different types. Our examination shows that the DL-based methods generally perform better thanthe MFE-based methods for RNAs with long lengths and complex structures, while the MFE-based methods can achievegood performance for small RNAs and some specialized MFE-based methods can achieve good prediction accuracy forpseudoknots. Finally, we provided some insights and perspectives in modeling RNA secondary structures.
基金partially supported by the National Key R&D Program of China (2022YFE0133700)the National Natural Science Foundation of China(12273007)+4 种基金the Guizhou Provincial Excellent Young Science and Technology Talent Program (YQK[2023]006)the National SKA Program of China (2020SKA0110300)the National Natural Science Foundation of China(11963003)the Guizhou Provincial Basic Research Program (Natural Science)(ZK[2022]143)the Cultivation project of Guizhou University ([2020]76).
文摘Solar flares are violent solar outbursts which have a great influence on the space environment surrounding Earth,potentially causing disruption of the ionosphere and interference with the geomagnetic field,thus causing magnetic storms.Consequently,it is very important to accurately predict the time period of solar flares.This paper proposes a flare prediction model,based on physical images of active solar regions.We employ X-ray flux curves recorded directly by the Geostationary Operational Environmental Satellite,used as input data for the model,allowing us to largely avoid the influence of accidental errors,effectively improving the model prediction efficiency.A model based on the X-ray flux curve can predict whether there will be a flare event within 24 hours.The reverse can also be verified by the peak of the X-ray flux curve to see if a flare has occurred within the past 24 hours.The True Positive Rate and False Positive Rate of the prediction model,based on physical images of active regions are 0.6070 and 0.2410 respectively,and the accuracy and True Skill Statistics are 0.7590 and 0.5556.Our model can effectively improve prediction efficiency compared with models based on the physical parameters of active regions or magnetic field records,providing a simple method for solar flare prediction.
文摘Deep learning plays a vital role in real-life applications, for example object identification, human face recognition, speech recognition, biometrics identification, and short and long-term forecasting of data. The main objective of our work is to predict the market performance of the Dhaka Stock Exchange (DSE) on day closing price using different Deep Learning techniques. In this study, we have used the LSTM (Long Short-Term Memory) network to forecast the data of DSE for the convenience of shareholders. We have enforced LSTM networks to train data as well as forecast the future time series that has differentiated with test data. We have computed the Root Mean Square Error (RMSE) value to scrutinize the error between the forecasted value and test data that diminished the error by updating the LSTM networks. As a consequence of the renovation of the network, the LSTM network provides tremendous performance which outperformed the existing works to predict stock market prices.
基金financially supported by the National Natural Science Foundation of China(Nos.42277149,41502299,41372306)the Research Planning of Sichuan Education Department,China(No.16ZB0105)+3 种基金the State Key Laboratory of Geohazard Prevention and Geoenvironment Protection Independent Research Project(Nos.SKLGP2016Z007,SKLGP2018Z017,SKLGP2020Z009)Chengdu University of Technology Young and Middle Aged Backbone Program(No.KYGG201720)Sichuan Provincial Science and Technology Department Program(No.19YYJC2087)China Scholarship Council。
文摘To tackle the difficulties of the point prediction in quantifying the reliability of landslide displacement prediction,a data-driven combination-interval prediction method(CIPM)based on copula and variational-mode-decomposition associated with kernel-based-extreme-learningmachine optimized by the whale optimization algorithm(VMD-WOA-KELM)is proposed in this paper.Firstly,the displacement is decomposed by VMD to three IMF components and a residual component of different fluctuation characteristics.The key impact factors of each IMF component are selected according to Copula model,and the corresponding WOA-KELM is established to conduct point prediction.Subsequently,the parametric method(PM)and non-parametric method(NPM)are used to estimate the prediction error probability density distribution(PDF)of each component,whose prediction interval(PI)under the 95%confidence level is also obtained.By means of the differential evolution algorithm(DE),a weighted combination model based on the PIs is built to construct the combination-interval(CI).Finally,the CIs of each component are added to generate the total PI.A comparative case study shows that the CIPM performs better in constructing landslide displacement PI with high performance.
基金supported by grants from the National Natural Science Foundation of China(Grant No.12205223 to YLT,Grant No.12375038 to ZJT and Grant No.11605125 to YZS)the Department of Education of Hubei Province(Grant No.Q20221705 to YLT)。
文摘Ribonucleic Acid(RNA)contact prediction holds great significance for modeling RNA 3D structures and further understanding RNA biological functions.The rapid growth of RNA sequencing data has driven the development of diverse computational methods for RNA contact prediction,and a benchmark evaluation of these methods remains essential.In this work,we first classified RNA contact prediction methods into statistical inference-based and neural networkbased ones.We then evaluated eight state-of-the-art methods on three test sets:a sequencediverse set,a structurally non-redundant set and a CASP RNA targets set.Our evaluation shows that for identifying non-local and long-range contacts,neural network-based methods outperform statistical inference-based ones,with SPOT-RNA-2D achieving the best performance,followed by CoCoNet and RNAcontact.However,for identifying the long-range tertiary contacts,which are vital for stabilizing RNA tertiary structure,statistical inference-based methods exhibit superior performance with GREMLIN emerging as the top performer.This work provides a comprehensive benchmarking of RNA contact prediction methods,highlighting their strengths and limitations to guide further methodological improvements and applications in RNA structure modeling.
基金supported by the National Research and Development Program(2022YFC3004603)the Jiangsu Province International Collaboration Program-Key National Industrial Technology Research and Development Cooperation Projects(BZ2023050)+1 种基金the Natural Science Foundation of Jiangsu Province(BK20221109)the National Natural Science Foundation of China(52274098).
文摘The increasingly severe state of coal burst disaster has emerged as a critical factor constraining coal mine safety production,and it has become a challenging task to enhance the accuracy of coal burst disaster prediction.To address the issue of insufficient exploration of the spatio-temporal characteristic of microseismic data and the challenging selection of the optimal time window size in spatio-temporal prediction,this paper integrates deep learning methods and theory to propose a novel coal burst spatio-temporal prediction method based on Bidirectional Long Short-Term Memory(Bi-LSTM)network.The method involves three main modules,including microseismic spatio-temporal characteristic indicators construction,temporal prediction model,and spatial prediction model.To validate the effectiveness of the proposed method,engineering application tests are conducted at a high-risk working face in the Ordos mining area of Inner Mongolia,focusing on 13 high-energy microseismic events with energy levels greater than 105 J.In terms of temporal prediction,the analysis indicates that the temporal prediction results consist of 10 strong predictions and 3 medium predictions,and there is no false alarm detected throughout the entire testing period.Moreover,compared to the traditional threshold-based coal burst temporal prediction method,the accuracy of the proposed method is increased by 38.5%.In terms of spatial prediction,the distribution of spatial prediction results for high-energy events comprises 6 strong hazard predictions,3 medium hazard predictions,and 4 weak hazard predictions.
基金financially supported by the National Natural Science Foundation of China(Grant No.52074331).
文摘Wellbore instability is one of the significant challenges in the drilling engineering and during the development of carbonate reservoirs,especially with open-hole completion.The problems of wellbore instability such as downhole collapse and silt deposit in the fractured carbonate reservoir of Tarim Basin(Ordovician)are severe.Solid destabilization and production(SDP)was proposed to describe this engineering problem of carbonate reservoirs.To clarify the mechanism and mitigate potential borehole instability problems,we conducted particle size distribution(PSD)analysis,X-ray diffraction(XRD)analysis,triaxial compression tests,and micro-scale sand production tests based on data analysis.We found that the rock fragments and silt in the wellbore came from two sources:one from the wellbore collapse in the upper unplugged layers and the other from the production of sand particles carried by the fluid in the productive layers.Based on the experimental study,a novel method combining a geomechanical model and microscopic sand production model was proposed to predict wellbore instability and analyze its influencing factors.The critical condition and failure zone predicted by the prediction model fit well with the field observations.According to the prediction results,the management and prevention measures of wellbore instability in carbonate reservoirs were proposed.It is suggested to optimize the well track in new drilling wells while upgrading the production system in old wells.This study is of great guiding significance for the optimization of carbonate solid control and it improves the understanding of the sand production problems in carbonate reservoirs.
基金supported by the Enterprise Innovation Consortium Project under the Major Special Science and Technology Project of Gansu Province(Grant No.22ZD6GA010).
文摘Accelerating urbanization and the rapid development of intelligent transportation systems have rendered shortterm traffic flow prediction an important research field.Accurate prediction of traffic flow is beneficial for the optimization of traffic planning,improvement of road utilization,reduction of traffic congestion,and reduction in the incidence of traffic accidents.However,data pertaining to traffic flow are typically influenced by a multitude of factors,resulting in data that exhibit a considerable degree of nonlinearity and complexity.To address the issue of noise in raw traffic flow data,this study proposes a hybrid model that combines variational mode decomposition(VMD),a bidirectional long short-term memory network(BiLSTM),and a gated recurrent unit(GRU)for short-term traffic flow prediction.To validate the effectiveness of the model,an experimental validation was conducted based on traffic flow data from UK highways,and the performance of the model was compared with common benchmark models.The experimental results demonstrate that the proposed method yields superior prediction results in terms of mean absolute error,coefficient of determination,and root-mean-square error compared to existing prediction techniques,thereby substantiating its efficacy in short-term traffic flow prediction.
基金supported by the National Key Research and Development Project(Grant Number 2023YFB3709601)the National Natural Science Foundation of China(Grant Numbers 62373215,62373219,62073193)+2 种基金the Key Research and Development Plan of Shandong Province(Grant Numbers 2021CXGC010204,2022CXGC020902)the Fundamental Research Funds of Shandong University(Grant Number 2021JCG008)the Natural Science Foundation of Shandong Province(Grant Number ZR2023MF100).
文摘The remaining useful life prediction of rolling bearing is vital in safety and reliability guarantee.In engineering scenarios,only a small amount of bearing performance degradation data can be obtained through accelerated life testing.In the absence of lifetime data,the hidden long-term correlation between performance degradation data is challenging to mine effectively,which is the main factor that restricts the prediction precision and engineering application of the residual life prediction method.To address this problem,a novel method based on the multi-layer perception neural network and bidirectional long short-term memory network is proposed.Firstly,a nonlinear health indicator(HI)calculation method based on kernel principal component analysis(KPCA)and exponential weighted moving average(EWMA)is designed.Then,using the raw vibration data and HI,a multi-layer perceptron(MLP)neural network is trained to further calculate the HI of the online bearing in real time.Furthermore,The bidirectional long short-term memory model(BiLSTM)optimized by particle swarm optimization(PSO)is used to mine the time series features of HI and predict the remaining service life.Performance verification experiments and comparative experiments are carried out on the XJTU-SY bearing open dataset.The research results indicate that this method has an excellent ability to predict future HI and remaining life.
基金supported by the National Natural Science Foundation of China(52174162)the Fundamental Research Funds for the Central Universities(FRF-TP-20-002A3).
文摘Accurate prediction of coal and gas outburst(CGO)hazards is paramount in gas disaster prevention and control.This paper endeavors to overcome the constraints posed by traditional prediction indexes when dealing with CGO incidents under low gas pressure conditions.In pursuit of this objective,we have studied and established a mechanical model of the working face under abnormal stress and the excitation energy conditions of CGO,and proposed a method for predicting the risk of CGO under abnormal stress.On site application verification shows that when a strong outburst hazard level prediction is issued,there is a high possibility of outburst disasters occurring.In one of the three locations where we predicted strong outburst hazards,a small outburst occurred,and the accuracy of the prediction was higher than the traditional drilling cuttings index S and drilling cuttings gas desorption index q.Finally,we discuss the mechanism of CGO under the action of stress anomalies.Based on the analysis of stress distribution changes and energy accumulation characteristics of coal under abnormal stress,this article believes that the increase in outburst risk caused by high stress abnormal gradient is mainly due to two reasons:(1)The high stress abnormal gradient leads to an increase in the plastic zone of the coal seam.After the working face advances,it indirectly leads to an increase in the gas expansion energy that can be released from the coal seam before reaching a new stress equilibrium.(2)Abnormal stress leads to increased peak stress of coal body in front of working face.When coal body in elastic area transforms to plastic area,its failure speed is accelerated,which induces accelerated gas desorption and aggravates the risk of outburst.
文摘[Objective]Accurate prediction of tomato growth height is crucial for optimizing production environments in smart farming.However,current prediction methods predominantly rely on empirical,mechanistic,or learning-based models that utilize either images data or environmental data.These methods fail to fully leverage multi-modal data to capture the diverse aspects of plant growth comprehensively.[Methods]To address this limitation,a two-stage phenotypic feature extraction(PFE)model based on deep learning algorithm of recurrent neural network(RNN)and long short-term memory(LSTM)was developed.The model integrated environment and plant information to provide a holistic understanding of the growth process,emploied phenotypic and temporal feature extractors to comprehensively capture both types of features,enabled a deeper understanding of the interaction between tomato plants and their environment,ultimately leading to highly accurate predictions of growth height.[Results and Discussions]The experimental results showed the model's ef‐fectiveness:When predicting the next two days based on the past five days,the PFE-based RNN and LSTM models achieved mean absolute percentage error(MAPE)of 0.81%and 0.40%,respectively,which were significantly lower than the 8.00%MAPE of the large language model(LLM)and 6.72%MAPE of the Transformer-based model.In longer-term predictions,the 10-day prediction for 4 days ahead and the 30-day prediction for 12 days ahead,the PFE-RNN model continued to outperform the other two baseline models,with MAPE of 2.66%and 14.05%,respectively.[Conclusions]The proposed method,which leverages phenotypic-temporal collaboration,shows great potential for intelligent,data-driven management of tomato cultivation,making it a promising approach for enhancing the efficiency and precision of smart tomato planting management.
基金co-supported by the National Nature Science Foundation of China(No.12303071)the Shanghai Science and Technology Plan Project,China(No.23YF1455500)+1 种基金the China Postdoctoral Science Foundation(No.2023M743653)Ministry of Industry and Information Technology of China through the High Precision Timing Service Project(No.TC220A04A-80)。
文摘The mathematical method of ZTD(zenith tropospheric delay)spatial prediction is important for precise ZTD derivation and real-time precise point positioning(PPP)augmentation.This paper analyses the performance of the popular optimal function coefficient(OFC),sphere cap harmonic analysis(SCHA),kriging and inverse distance weighting(IDW)interpolation in ZTD spatial prediction and Beidou satellite navigation system(BDS)-PPP augmentation over China.For ZTD spatial prediction,the average time consumption of the OFC,kriging,and IDW methods is less than 0.1 s,which is significantly better than that of the SCHA method(63.157 s).The overall ZTD precision of the OFC is 3.44 cm,which outperforms those of the SCHA(9.65 cm),Kriging(10.6 cm),and IDW(11.8 cm)methods.We confirmed that the low performance of kriging and IDW is caused by their weakness in modelling ZTD variation in the vertical direction.To mitigate such deficiencies,an elevation normalization factor(ENF)is introduced into the kriging and IDW models(kriging-ENF and IDW-ENF).The overall ZTD spatial prediction accuracies of IDW-ENF and kriging-ENF are 2.80 cm and 2.01 cm,respectively,which are both superior to those of the OFC and the widely used empirical model GPT3(4.92 cm).For BDS-PPP enhancement,the ZTD provided by the kriging-ENF,IDW-ENF and OFC as prior constraints can effectively reduce the convergence time.Compared with unconstrained BDS-PPP,our proposed kriging-ENF outperforms IDW-ENF and OFC by reducing the horizontal and vertical convergence times by approximately 13.2%and 5.8%in Ningxia and 30.4%and 7.84%in Guangdong,respectively.These results indicate that kriging-ENF is a promising method for ZTD spatial prediction and BDS-PPP enhancement over China.
基金financially supported by the National Natural Science Foundation of China (Grant Nos. 52074258, 41941018, and U21A20153)
文摘Based on data from the Jilin Water Diversion Tunnels from the Songhua River(China),an improved and real-time prediction method optimized by multi-algorithm for tunnel boring machine(TBM)cutter-head torque is presented.Firstly,a function excluding invalid and abnormal data is established to distinguish TBM operating state,and a feature selection method based on the SelectKBest algorithm is proposed.Accordingly,ten features that are most closely related to the cutter-head torque are selected as input variables,which,in descending order of influence,include the sum of motor torque,cutter-head power,sum of motor power,sum of motor current,advance rate,cutter-head pressure,total thrust force,penetration rate,cutter-head rotational velocity,and field penetration index.Secondly,a real-time cutterhead torque prediction model’s structure is developed,based on the bidirectional long short-term memory(BLSTM)network integrating the dropout algorithm to prevent overfitting.Then,an algorithm to optimize hyperparameters of model based on Bayesian and cross-validation is proposed.Early stopping and checkpoint algorithms are integrated to optimize the training process.Finally,a BLSTMbased real-time cutter-head torque prediction model is developed,which fully utilizes the previous time-series tunneling information.The mean absolute percentage error(MAPE)of the model in the verification section is 7.3%,implying that the presented model is suitable for real-time cutter-head torque prediction.Furthermore,an incremental learning method based on the above base model is introduced to improve the adaptability of the model during the TBM tunneling.Comparison of the prediction performance between the base and incremental learning models in the same tunneling section shows that:(1)the MAPE of the predicted results of the BLSTM-based real-time cutter-head torque prediction model remains below 10%,and both the coefficient of determination(R^(2))and correlation coefficient(r)between measured and predicted values exceed 0.95;and(2)the incremental learning method is suitable for realtime cutter-head torque prediction and can effectively improve the prediction accuracy and generalization capacity of the model during the excavation process.
基金supported by National Natural Science Foundation of China,China(No.42004016)HuBei Natural Science Fund,China(No.2020CFB329)+1 种基金HuNan Natural Science Fund,China(No.2023JJ60559,2023JJ60560)the State Key Laboratory of Geodesy and Earth’s Dynamics self-deployment project,China(No.S21L6101)。
文摘Short-term(up to 30 days)predictions of Earth Rotation Parameters(ERPs)such as Polar Motion(PM:PMX and PMY)play an essential role in real-time applications related to high-precision reference frame conversion.Currently,least squares(LS)+auto-regressive(AR)hybrid method is one of the main techniques of PM prediction.Besides,the weighted LS+AR hybrid method performs well for PM short-term prediction.However,the corresponding covariance information of LS fitting residuals deserves further exploration in the AR model.In this study,we have derived a modified stochastic model for the LS+AR hybrid method,namely the weighted LS+weighted AR hybrid method.By using the PM data products of IERS EOP 14 C04,the numerical results indicate that for PM short-term forecasting,the proposed weighted LS+weighted AR hybrid method shows an advantage over both the LS+AR hybrid method and the weighted LS+AR hybrid method.Compared to the mean absolute errors(MAEs)of PMX/PMY sho rt-term prediction of the LS+AR hybrid method and the weighted LS+AR hybrid method,the weighted LS+weighted AR hybrid method shows average improvements of 6.61%/12.08%and 0.24%/11.65%,respectively.Besides,for the slopes of the linear regression lines fitted to the errors of each method,the growth of the prediction error of the proposed method is slower than that of the other two methods.
基金Project(2012CB725403)supported by the National Basic Research Program of ChinaProjects(71210001,51338008)supported by the National Natural Science Foundation of ChinaProject supported by World Capital Cities Smooth Traffic Collaborative Innovation Center and Singapore National Research Foundation Under Its Campus for Research Excellence and Technology Enterprise(CREATE)Programme
文摘Short-term traffic flow prediction is one of the essential issues in intelligent transportation systems(ITS). A new two-stage traffic flow prediction method named AKNN-AVL method is presented, which combines an advanced k-nearest neighbor(AKNN)method and balanced binary tree(AVL) data structure to improve the prediction accuracy. The AKNN method uses pattern recognition two times in the searching process, which considers the previous sequences of traffic flow to forecast the future traffic state. Clustering method and balanced binary tree technique are introduced to build case database to reduce the searching time. To illustrate the effects of these developments, the accuracies performance of AKNN-AVL method, k-nearest neighbor(KNN) method and the auto-regressive and moving average(ARMA) method are compared. These methods are calibrated and evaluated by the real-time data from a freeway traffic detector near North 3rd Ring Road in Beijing under both normal and incident traffic conditions.The comparisons show that the AKNN-AVL method with the optimal neighbor and pattern size outperforms both KNN method and ARMA method under both normal and incident traffic conditions. In addition, the combinations of clustering method and balanced binary tree technique to the prediction method can increase the searching speed and respond rapidly to case database fluctuations.