Accurate traffic flow prediction(TFP)is vital for efficient and sustainable transportation management and the development of intelligent traffic systems.However,missing data in real-world traffic datasets poses a sign...Accurate traffic flow prediction(TFP)is vital for efficient and sustainable transportation management and the development of intelligent traffic systems.However,missing data in real-world traffic datasets poses a significant challenge to maintaining prediction precision.This study introduces REPTF-TMDI,a novel method that combines a Reduced Error Pruning Tree Forest(REPTree Forest)with a newly proposed Time-based Missing Data Imputation(TMDI)approach.The REP Tree Forest,an ensemble learning approach,is tailored for time-related traffic data to enhance predictive accuracy and support the evolution of sustainable urbanmobility solutions.Meanwhile,the TMDI approach exploits temporal patterns to estimate missing values reliably whenever empty fields are encountered.The proposed method was evaluated using hourly traffic flow data from a major U.S.roadway spanning 2012-2018,incorporating temporal features(e.g.,hour,day,month,year,weekday),holiday indicator,and weather conditions(temperature,rain,snow,and cloud coverage).Experimental results demonstrated that the REPTF-TMDI method outperformed conventional imputation techniques across various missing data ratios by achieving an average 11.76%improvement in terms of correlation coefficient(R).Furthermore,REPTree Forest achieved improvements of 68.62%in RMSE and 70.52%in MAE compared to existing state-of-the-art models.These findings highlight the method’s ability to significantly boost traffic flow prediction accuracy,even in the presence of missing data,thereby contributing to the broader objectives of sustainable urban transportation systems.展开更多
Slope stability prediction plays a significant role in landslide disaster prevention and mitigation.This paper’s reduced error pruning(REP)tree and random tree(RT)models are developed for slope stability evaluation a...Slope stability prediction plays a significant role in landslide disaster prevention and mitigation.This paper’s reduced error pruning(REP)tree and random tree(RT)models are developed for slope stability evaluation and meeting the high precision and rapidity requirements in slope engineering.The data set of this study includes five parameters,namely slope height,slope angle,cohesion,internal friction angle,and peak ground acceleration.The available data is split into two categories:training(75%)and test(25%)sets.The output of the RT and REP tree models is evaluated using performance measures including accuracy(Acc),Matthews correlation coefficient(Mcc),precision(Prec),recall(Rec),and F-score.The applications of the aforementionedmethods for predicting slope stability are compared to one another and recently established soft computing models in the literature.The analysis of the Acc together with Mcc,and F-score for the slope stability in the test set demonstrates that the RT achieved a better prediction performance with(Acc=97.1429%,Mcc=0.935,F-score for stable class=0.979 and for unstable case F-score=0.935)succeeded by the REP tree model with(Acc=95.4286%,Mcc=0.896,F-score stable class=0.967 and for unstable class F-score=0.923)for the slope stability dataset The analysis of performance measures for the slope stability dataset reveals that the RT model attains comparatively better and reliable results and thus should be encouraged in further research.展开更多
Every mobile operator of today's world switches their technology over from 2G(second generation)to 3G(third generation)network.Operators are keen analyzing their CDR(call detail record)obtained over the past usage...Every mobile operator of today's world switches their technology over from 2G(second generation)to 3G(third generation)network.Operators are keen analyzing their CDR(call detail record)obtained over the past usage for predicting the behavior of their customers and their usage.The operators are willing to mine knowledge from real-world dataset which implies the pattern of user mentality on this changing world.To identify the usage of 2G and 3G services the classification models were trained using the data collected from PAKDD 2006 dataset.In order to obtain the prediction accuracy,the classifiers were evaluated using 10 folds cross validation.On comparing the results of the experiment,J48 performed more accurately and random tree consumed less time.展开更多
文摘Accurate traffic flow prediction(TFP)is vital for efficient and sustainable transportation management and the development of intelligent traffic systems.However,missing data in real-world traffic datasets poses a significant challenge to maintaining prediction precision.This study introduces REPTF-TMDI,a novel method that combines a Reduced Error Pruning Tree Forest(REPTree Forest)with a newly proposed Time-based Missing Data Imputation(TMDI)approach.The REP Tree Forest,an ensemble learning approach,is tailored for time-related traffic data to enhance predictive accuracy and support the evolution of sustainable urbanmobility solutions.Meanwhile,the TMDI approach exploits temporal patterns to estimate missing values reliably whenever empty fields are encountered.The proposed method was evaluated using hourly traffic flow data from a major U.S.roadway spanning 2012-2018,incorporating temporal features(e.g.,hour,day,month,year,weekday),holiday indicator,and weather conditions(temperature,rain,snow,and cloud coverage).Experimental results demonstrated that the REPTF-TMDI method outperformed conventional imputation techniques across various missing data ratios by achieving an average 11.76%improvement in terms of correlation coefficient(R).Furthermore,REPTree Forest achieved improvements of 68.62%in RMSE and 70.52%in MAE compared to existing state-of-the-art models.These findings highlight the method’s ability to significantly boost traffic flow prediction accuracy,even in the presence of missing data,thereby contributing to the broader objectives of sustainable urban transportation systems.
基金supported by the National Key Research and Development Plan of China under Grant No.2021YFB2600703.
文摘Slope stability prediction plays a significant role in landslide disaster prevention and mitigation.This paper’s reduced error pruning(REP)tree and random tree(RT)models are developed for slope stability evaluation and meeting the high precision and rapidity requirements in slope engineering.The data set of this study includes five parameters,namely slope height,slope angle,cohesion,internal friction angle,and peak ground acceleration.The available data is split into two categories:training(75%)and test(25%)sets.The output of the RT and REP tree models is evaluated using performance measures including accuracy(Acc),Matthews correlation coefficient(Mcc),precision(Prec),recall(Rec),and F-score.The applications of the aforementionedmethods for predicting slope stability are compared to one another and recently established soft computing models in the literature.The analysis of the Acc together with Mcc,and F-score for the slope stability in the test set demonstrates that the RT achieved a better prediction performance with(Acc=97.1429%,Mcc=0.935,F-score for stable class=0.979 and for unstable case F-score=0.935)succeeded by the REP tree model with(Acc=95.4286%,Mcc=0.896,F-score stable class=0.967 and for unstable class F-score=0.923)for the slope stability dataset The analysis of performance measures for the slope stability dataset reveals that the RT model attains comparatively better and reliable results and thus should be encouraged in further research.
文摘Every mobile operator of today's world switches their technology over from 2G(second generation)to 3G(third generation)network.Operators are keen analyzing their CDR(call detail record)obtained over the past usage for predicting the behavior of their customers and their usage.The operators are willing to mine knowledge from real-world dataset which implies the pattern of user mentality on this changing world.To identify the usage of 2G and 3G services the classification models were trained using the data collected from PAKDD 2006 dataset.In order to obtain the prediction accuracy,the classifiers were evaluated using 10 folds cross validation.On comparing the results of the experiment,J48 performed more accurately and random tree consumed less time.