Monitoring sensors in complex engineering environments often record abnormal data,leading to significant positioning errors.To reduce the influence of abnormal arrival times,we introduce an innovative,outlier-robust l...Monitoring sensors in complex engineering environments often record abnormal data,leading to significant positioning errors.To reduce the influence of abnormal arrival times,we introduce an innovative,outlier-robust localization method that integrates kernel density estimation(KDE)with damping linear correction to enhance the precision of microseismic/acoustic emission(MS/AE)source positioning.Our approach systematically addresses abnormal arrival times through a three-step process:initial location by 4-arrival combinations,elimination of outliers based on three-dimensional KDE,and refinement using a linear correction with an adaptive damping factor.We validate our method through lead-breaking experiments,demonstrating over a 23%improvement in positioning accuracy with a maximum error of 9.12 mm(relative error of 15.80%)—outperforming 4 existing methods.Simulations under various system errors,outlier scales,and ratios substantiate our method’s superior performance.Field blasting experiments also confirm the practical applicability,with an average positioning error of 11.71 m(relative error of 7.59%),compared to 23.56,66.09,16.95,and 28.52 m for other methods.This research is significant as it enhances the robustness of MS/AE source localization when confronted with data anomalies.It also provides a practical solution for real-world engineering and safety monitoring applications.展开更多
Previous research has identified specific areas of frequent tropical cyclone activity in the North Atlantic basin. This study examines long-term and decadal spatio-temporal patterns of Atlantic tropical cyclone freque...Previous research has identified specific areas of frequent tropical cyclone activity in the North Atlantic basin. This study examines long-term and decadal spatio-temporal patterns of Atlantic tropical cyclone frequencies from 1944 to 2009, and analyzes categorical and decadal centroid patterns using kernel density estimation (KDE) and centrographic statistics. Results corroborate previous research which has suggested that the Bermuda-Azores anticyclone plays an integral role in the direction of tropical cyclone tracks. Other teleconnections such as the North Atlantic Oscillation (NAO) may also have an impact on tropical cyclone tracks, but at a different temporal resolution. Results expand on existing knowledge of the spatial trends of tropical cyclones based on storm category and time through the use of spatial statistics. Overall, location of peak frequency varies by tropical cyclone category, with stronger storms being more concentrated in narrow regions of the southern Caribbean Sea and Gulf of Mexico, while weaker storms occur in a much larger area that encompasses much of the Caribbean Sea, Gulf of Mexico, and Atlantic Ocean off of the east coast of the United States. Additionally, the decadal centroids of tropical cyclone tracks have oscillated over a large area of the Atlantic Ocean for much of recorded history. Data collected since 1944 can be analyzed confidently to reveal these patterns.展开更多
In this paper, we propose a new method that combines collage error in fractal domain and Hu moment invariants for image retrieval with a statistical method - variable bandwidth Kernel Density Estimation (KDE). The pro...In this paper, we propose a new method that combines collage error in fractal domain and Hu moment invariants for image retrieval with a statistical method - variable bandwidth Kernel Density Estimation (KDE). The proposed method is called CHK (KDE of Collage error and Hu moment) and it is tested on the Vistex texture database with 640 natural images. Experimental results show that the Average Retrieval Rate (ARR) can reach into 78.18%, which demonstrates that the proposed method performs better than the one with parameters respectively as well as the commonly used histogram method both on retrieval rate and retrieval time.展开更多
The sixth-generation fighter has superior stealth performance,but for the traditional kernel density estimation(KDE),precision requirements are difficult to satisfy when dealing with the fluctuation characteristics of...The sixth-generation fighter has superior stealth performance,but for the traditional kernel density estimation(KDE),precision requirements are difficult to satisfy when dealing with the fluctuation characteristics of complex radar cross section(RCS).To solve this problem,this paper studies the KDE algorithm for F/AXX stealth fighter.By considering the accuracy lack of existing fixed bandwidth algorithms,a novel adaptive kernel density estimation(AKDE)algorithm equipped with least square cross validation and integrated squared error criterion is proposed to optimize the bandwidth.Meanwhile,an adaptive RCS density estimation can be obtained according to the optimized bandwidth.Finally,simulations verify that the estimation accuracy of the adaptive bandwidth RCS density estimation algorithm is more than 50%higher than that of the traditional algorithm.Based on the proposed algorithm(i.e.,AKDE),statistical characteristics of the considered fighter are more accurately acquired,and then the significant advantages of the AKDE algorithm in solving cumulative distribution function estimation of RCS less than 1 m2 are analyzed.展开更多
Traffic incidents are a major cause of non-recurrent congestion and delays,making accu-rate incident duration(ID)prediction essential for effective traffic management.While machine learning(ML)and deep learning(DL)mod...Traffic incidents are a major cause of non-recurrent congestion and delays,making accu-rate incident duration(ID)prediction essential for effective traffic management.While machine learning(ML)and deep learning(DL)models have been developed to predict ID and its sub-periods(verification,response,and clearance times),their reliability has not been systematically assessed.This study introduces a novel framework to evaluate the reli-ability of these predictions using 4000 traffic incident records.Non-parametric kernel den-sity estimation(KDE)effectively captured variations in the data,outperforming traditional parametric methods.Monte Carlo simulation(MCS)was then used to assess prediction reliability.Among the models tested,bagged ensemble trees provided the best balance between accuracy and complexity,showing strong reliability for predicting total ID and sub-periods.Adding 5–25%buffer adjustments further improved reliability by accounting for prediction uncertainties.This framework offers a robust tool for assessing prediction reliability,is adaptable to various ML and DL models,and represents a significant step for-ward in traffic incident management.展开更多
As a production quality index of hematite grinding process,particle size(PS)is hard to be measured in real time.To achieve the PS estimation,this paper proposes a novel data driven model of PS using stochastic configu...As a production quality index of hematite grinding process,particle size(PS)is hard to be measured in real time.To achieve the PS estimation,this paper proposes a novel data driven model of PS using stochastic configuration network(SCN)with robust technique,namely,robust SCN(RSCN).Firstly,this paper proves the universal approximation property of RSCN with weighted least squares technique.Secondly,three robust algorithms are presented by employing M-estimation with Huber loss function,M-estimation with interquartile range(IQR)and nonparametric kernel density estimation(NKDE)function respectively to set the penalty weight.Comparison experiments are first carried out based on the UCI standard data sets to verify the effectiveness of these methods,and then the data-driven PS model based on the robust algorithms are established and verified.Experimental results show that the RSCN has an excellent performance for the PS estimation.展开更多
Road network is a critical component of public infrastructure,and the supporting system of social and economic development.Based on a modified kernel density estimate(KDE)algorithm,this study evaluated the road servic...Road network is a critical component of public infrastructure,and the supporting system of social and economic development.Based on a modified kernel density estimate(KDE)algorithm,this study evaluated the road service capacity provided by a road network composed of multi-level roads(i.e.national,provincial,county and rural roads),by taking account of the differences of effect extent and intensity for roads of different levels.Summarized at town scale,the population burden and the annual rural economic income of unit road service capacity were used as the surrogates of social and economic demands for road service.This method was applied to the road network of the Three Parallel River Region,the northwestern Yunnan Province,China to evaluate the development of road network in this region.In results,the total road length of this region in 2005 was 3.70×104km,and the length ratio between national,provincial,county and rural roads was 1∶2∶8∶47.From 1989 to 2005,the regional road service capacity increased by 13.1%,of which the contributions from the national,provincial,county and rural roads were 11.1%,19.4%,22.6%,and 67.8%,respectively,revealing the effect of′All Village Accessible′policy of road development in the mountainous regions in the last decade.The spatial patterns of population burden and economic requirement of unit road service suggested that the areas farther away from the national and provincial roads have higher road development priority(RDP).Based on the modified KDE model and the framework of RDP evaluation,this study provided a useful approach for developing an optimal plan of road development at regional scale.展开更多
由于烧结过程中存在众多不确定性因素,使得机理分析和点预测结果的可靠性不足.基于此提出随机森林-极限树-核密度估计(random forest-extreme tree-kernel density estimation,RF-ET-KDE)算法对物理指标(粒度、水分)进行区间预测.首先,...由于烧结过程中存在众多不确定性因素,使得机理分析和点预测结果的可靠性不足.基于此提出随机森林-极限树-核密度估计(random forest-extreme tree-kernel density estimation,RF-ET-KDE)算法对物理指标(粒度、水分)进行区间预测.首先,采用数据预处理和特征选择操作筛选出最适合建模的特征变量.其次,使用基于Stacking的RF-ET算法对指标进行点预测,该算法使得模型有较高的准确性和泛化性.然后,采用KDE算法计算指标的预测误差,得到了一定置信水平下的分布区间和区间预测结果.最后,用所建模型与其余组合模型进行对比.结果表明,RF-ET算法有较高的点预测效果,KDE算法可以很好地量化指标的误差,可以得到较高可靠度的区间预测结果.展开更多
区别于经典的基于Parzen窗口法的概率密度函数估计器构建策略,提出了基于近邻误差度量函数的启发式核密度估计器(Heuristic kernel density estimator,HKDE),用以提升对模相近数据概率密度函数拟合的准确性。首次从数据不确定性和模型...区别于经典的基于Parzen窗口法的概率密度函数估计器构建策略,提出了基于近邻误差度量函数的启发式核密度估计器(Heuristic kernel density estimator,HKDE),用以提升对模相近数据概率密度函数拟合的准确性。首次从数据不确定性和模型不确定性的角度分析了传统核密度估计器解决模相近数据概率密度函数估计问题时的缺陷:利用概率密度值对于直方图箱宽参数的收敛性确定观测数据的启发式概率密度值,降低数据概率密度值计算的不确定性;基于启发式概率密度值构建用于确定核密度估计器最优带宽的目标函数,降低最优带宽优化过程中的不确定性。在18个模相近数据集上对新估计器HKDE的可行性、合理性和有效性进行了系统性的验证。实验结果表明,与7种具有代表性的概率密度函数估计器相比,HKDE能够获得更加优异的概率分布近似表现,具有比其他估计器更低的估计误差,能够确定出更接近真实值的概率密度函数估计值。展开更多
Traffic accident frequency has been decreasing in Japan in recent years.Nevertheless,many accidents still occur on residential roads.Area-wide traffic calming measures including Zone 30,which discourages traffic by se...Traffic accident frequency has been decreasing in Japan in recent years.Nevertheless,many accidents still occur on residential roads.Area-wide traffic calming measures including Zone 30,which discourages traffic by setting a speed limit of 30 km/h in residential areas,have been implemented.However,no objective implementation method has been established.Development of a model for traffic accident density estimation explained by GIS data can enable the determination of dangerous areas objectively and easily,indicating where area-wide traffic calming can be implemented preferentially.This study examined the relations between traffic accidents and city characteristics,such as population,road factors,and spatial factors.A model was developed to estimate traffic accident density.Kernel density estimation(KDE)techniques were used to assess the relations efficiently.Besides,16 models were developed by combining accident locations,accident types,and data types.By using them,the applicability of traffic accident density estimation models was examined.Results obtained using Spearman rank correlation show high coefficients between the predicted number and the actual number.The model can indicate the relative accident risk in cities.Results of this study can be used for objective determination of areas where area-wide traffic calming can be implemented preferentially,even if sufficient traffic accident data are not available.展开更多
基金the financial support provided by the National Key Research and Development Program for Young Scientists(No.2021YFC2900400)Postdoctoral Fellowship Program of China Postdoctoral Science Foundation(CPSF)(No.GZB20230914)+2 种基金National Natural Science Foundation of China(No.52304123)China Postdoctoral Science Foundation(No.2023M730412)Chongqing Outstanding Youth Science Foundation Program(No.CSTB2023NSCQ-JQX0027).
文摘Monitoring sensors in complex engineering environments often record abnormal data,leading to significant positioning errors.To reduce the influence of abnormal arrival times,we introduce an innovative,outlier-robust localization method that integrates kernel density estimation(KDE)with damping linear correction to enhance the precision of microseismic/acoustic emission(MS/AE)source positioning.Our approach systematically addresses abnormal arrival times through a three-step process:initial location by 4-arrival combinations,elimination of outliers based on three-dimensional KDE,and refinement using a linear correction with an adaptive damping factor.We validate our method through lead-breaking experiments,demonstrating over a 23%improvement in positioning accuracy with a maximum error of 9.12 mm(relative error of 15.80%)—outperforming 4 existing methods.Simulations under various system errors,outlier scales,and ratios substantiate our method’s superior performance.Field blasting experiments also confirm the practical applicability,with an average positioning error of 11.71 m(relative error of 7.59%),compared to 23.56,66.09,16.95,and 28.52 m for other methods.This research is significant as it enhances the robustness of MS/AE source localization when confronted with data anomalies.It also provides a practical solution for real-world engineering and safety monitoring applications.
文摘Previous research has identified specific areas of frequent tropical cyclone activity in the North Atlantic basin. This study examines long-term and decadal spatio-temporal patterns of Atlantic tropical cyclone frequencies from 1944 to 2009, and analyzes categorical and decadal centroid patterns using kernel density estimation (KDE) and centrographic statistics. Results corroborate previous research which has suggested that the Bermuda-Azores anticyclone plays an integral role in the direction of tropical cyclone tracks. Other teleconnections such as the North Atlantic Oscillation (NAO) may also have an impact on tropical cyclone tracks, but at a different temporal resolution. Results expand on existing knowledge of the spatial trends of tropical cyclones based on storm category and time through the use of spatial statistics. Overall, location of peak frequency varies by tropical cyclone category, with stronger storms being more concentrated in narrow regions of the southern Caribbean Sea and Gulf of Mexico, while weaker storms occur in a much larger area that encompasses much of the Caribbean Sea, Gulf of Mexico, and Atlantic Ocean off of the east coast of the United States. Additionally, the decadal centroids of tropical cyclone tracks have oscillated over a large area of the Atlantic Ocean for much of recorded history. Data collected since 1944 can be analyzed confidently to reveal these patterns.
基金Supported by the Fundamental Research Funds for the Central Universities (No. NS2012093)
文摘In this paper, we propose a new method that combines collage error in fractal domain and Hu moment invariants for image retrieval with a statistical method - variable bandwidth Kernel Density Estimation (KDE). The proposed method is called CHK (KDE of Collage error and Hu moment) and it is tested on the Vistex texture database with 640 natural images. Experimental results show that the Average Retrieval Rate (ARR) can reach into 78.18%, which demonstrates that the proposed method performs better than the one with parameters respectively as well as the commonly used histogram method both on retrieval rate and retrieval time.
基金the National Natural Science Foundation of China(Nos.61074090 and 60804025)。
文摘The sixth-generation fighter has superior stealth performance,but for the traditional kernel density estimation(KDE),precision requirements are difficult to satisfy when dealing with the fluctuation characteristics of complex radar cross section(RCS).To solve this problem,this paper studies the KDE algorithm for F/AXX stealth fighter.By considering the accuracy lack of existing fixed bandwidth algorithms,a novel adaptive kernel density estimation(AKDE)algorithm equipped with least square cross validation and integrated squared error criterion is proposed to optimize the bandwidth.Meanwhile,an adaptive RCS density estimation can be obtained according to the optimized bandwidth.Finally,simulations verify that the estimation accuracy of the adaptive bandwidth RCS density estimation algorithm is more than 50%higher than that of the traditional algorithm.Based on the proposed algorithm(i.e.,AKDE),statistical characteristics of the considered fighter are more accurately acquired,and then the significant advantages of the AKDE algorithm in solving cumulative distribution function estimation of RCS less than 1 m2 are analyzed.
文摘Traffic incidents are a major cause of non-recurrent congestion and delays,making accu-rate incident duration(ID)prediction essential for effective traffic management.While machine learning(ML)and deep learning(DL)models have been developed to predict ID and its sub-periods(verification,response,and clearance times),their reliability has not been systematically assessed.This study introduces a novel framework to evaluate the reli-ability of these predictions using 4000 traffic incident records.Non-parametric kernel den-sity estimation(KDE)effectively captured variations in the data,outperforming traditional parametric methods.Monte Carlo simulation(MCS)was then used to assess prediction reliability.Among the models tested,bagged ensemble trees provided the best balance between accuracy and complexity,showing strong reliability for predicting total ID and sub-periods.Adding 5–25%buffer adjustments further improved reliability by accounting for prediction uncertainties.This framework offers a robust tool for assessing prediction reliability,is adaptable to various ML and DL models,and represents a significant step for-ward in traffic incident management.
基金Projects(61603393,61741318)supported in part by the National Natural Science Foundation of ChinaProject(BK20160275)supported by the Natural Science Foundation of Jiangsu Province,China+1 种基金Project(2015M581885)supported by the Postdoctoral Science Foundation of ChinaProject(PAL-N201706)supported by the Open Project Foundation of State Key Laboratory of Synthetical Automation for Process Industries of Northeastern University,China
文摘As a production quality index of hematite grinding process,particle size(PS)is hard to be measured in real time.To achieve the PS estimation,this paper proposes a novel data driven model of PS using stochastic configuration network(SCN)with robust technique,namely,robust SCN(RSCN).Firstly,this paper proves the universal approximation property of RSCN with weighted least squares technique.Secondly,three robust algorithms are presented by employing M-estimation with Huber loss function,M-estimation with interquartile range(IQR)and nonparametric kernel density estimation(NKDE)function respectively to set the penalty weight.Comparison experiments are first carried out based on the UCI standard data sets to verify the effectiveness of these methods,and then the data-driven PS model based on the robust algorithms are established and verified.Experimental results show that the RSCN has an excellent performance for the PS estimation.
基金Under the auspices of National Natural Science Foundation of China(No.41371190,31021001)Scientific and Tech-nical Projects of Western China Transportation Construction,Ministry of Transport of China(No.2008-318-799-17)
文摘Road network is a critical component of public infrastructure,and the supporting system of social and economic development.Based on a modified kernel density estimate(KDE)algorithm,this study evaluated the road service capacity provided by a road network composed of multi-level roads(i.e.national,provincial,county and rural roads),by taking account of the differences of effect extent and intensity for roads of different levels.Summarized at town scale,the population burden and the annual rural economic income of unit road service capacity were used as the surrogates of social and economic demands for road service.This method was applied to the road network of the Three Parallel River Region,the northwestern Yunnan Province,China to evaluate the development of road network in this region.In results,the total road length of this region in 2005 was 3.70×104km,and the length ratio between national,provincial,county and rural roads was 1∶2∶8∶47.From 1989 to 2005,the regional road service capacity increased by 13.1%,of which the contributions from the national,provincial,county and rural roads were 11.1%,19.4%,22.6%,and 67.8%,respectively,revealing the effect of′All Village Accessible′policy of road development in the mountainous regions in the last decade.The spatial patterns of population burden and economic requirement of unit road service suggested that the areas farther away from the national and provincial roads have higher road development priority(RDP).Based on the modified KDE model and the framework of RDP evaluation,this study provided a useful approach for developing an optimal plan of road development at regional scale.
文摘由于烧结过程中存在众多不确定性因素,使得机理分析和点预测结果的可靠性不足.基于此提出随机森林-极限树-核密度估计(random forest-extreme tree-kernel density estimation,RF-ET-KDE)算法对物理指标(粒度、水分)进行区间预测.首先,采用数据预处理和特征选择操作筛选出最适合建模的特征变量.其次,使用基于Stacking的RF-ET算法对指标进行点预测,该算法使得模型有较高的准确性和泛化性.然后,采用KDE算法计算指标的预测误差,得到了一定置信水平下的分布区间和区间预测结果.最后,用所建模型与其余组合模型进行对比.结果表明,RF-ET算法有较高的点预测效果,KDE算法可以很好地量化指标的误差,可以得到较高可靠度的区间预测结果.
文摘区别于经典的基于Parzen窗口法的概率密度函数估计器构建策略,提出了基于近邻误差度量函数的启发式核密度估计器(Heuristic kernel density estimator,HKDE),用以提升对模相近数据概率密度函数拟合的准确性。首次从数据不确定性和模型不确定性的角度分析了传统核密度估计器解决模相近数据概率密度函数估计问题时的缺陷:利用概率密度值对于直方图箱宽参数的收敛性确定观测数据的启发式概率密度值,降低数据概率密度值计算的不确定性;基于启发式概率密度值构建用于确定核密度估计器最优带宽的目标函数,降低最优带宽优化过程中的不确定性。在18个模相近数据集上对新估计器HKDE的可行性、合理性和有效性进行了系统性的验证。实验结果表明,与7种具有代表性的概率密度函数估计器相比,HKDE能够获得更加优异的概率分布近似表现,具有比其他估计器更低的估计误差,能够确定出更接近真实值的概率密度函数估计值。
文摘Traffic accident frequency has been decreasing in Japan in recent years.Nevertheless,many accidents still occur on residential roads.Area-wide traffic calming measures including Zone 30,which discourages traffic by setting a speed limit of 30 km/h in residential areas,have been implemented.However,no objective implementation method has been established.Development of a model for traffic accident density estimation explained by GIS data can enable the determination of dangerous areas objectively and easily,indicating where area-wide traffic calming can be implemented preferentially.This study examined the relations between traffic accidents and city characteristics,such as population,road factors,and spatial factors.A model was developed to estimate traffic accident density.Kernel density estimation(KDE)techniques were used to assess the relations efficiently.Besides,16 models were developed by combining accident locations,accident types,and data types.By using them,the applicability of traffic accident density estimation models was examined.Results obtained using Spearman rank correlation show high coefficients between the predicted number and the actual number.The model can indicate the relative accident risk in cities.Results of this study can be used for objective determination of areas where area-wide traffic calming can be implemented preferentially,even if sufficient traffic accident data are not available.