期刊文献+
共找到1,305篇文章
< 1 2 66 >
每页显示 20 50 100
Comparison of the performance of gradient boost,linear regression,decision tree,and voting algorithms to separate geochemical anomalies areas in the fractal environment
1
作者 Mirmahdi Seyedrahimi-Niaraq Hossein Mahdiyanfar Mohammad hossein Olyaee 《Artificial Intelligence in Geosciences》 2025年第2期290-305,共16页
In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,inclu... In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,including Mo,Cu,Pb,Zn,Ag,Ni,Co,Mn,Fe,and As,were used with these machine learning algorithms(MLAs)to predict Au concentration values in the Doostbigloo porphyry Cu-Au-Mo mineralization area.The performance of the models was evaluated using the Mean Absolute Percentage Error(MAPE)and Root Mean Square Error(RMSE)metrics.The proposed ensemble Voting algorithm outperformed the other models,yielding more ac-curate predictions according to both metrics.The predicted data from the GB,LR,DT,and Voting MLAs were modeled using the Concentration-Area fractal method,and Au geochemical anomalies were mapped.To compare and validate the results,factors such as the location of the mineral deposits,their surface extent,and mineralization trend were considered.The results indicate that integrating hybrid MLAs with fractal modeling signifi-cantly improves geochemical prospectivity mapping.Among the four models,three(DT,GB,Voting)accurately identified both mineral deposits.The LR model,however,only identified Deposit I(central),and its mineralization trend diverged from the field data.The GB and Voting models produced similar results,with their final maps derived from fractal modeling showing the same anomalous areas.The anomaly boundaries identified by these two models are consistent with the two known reserves in the region.The results and plots related to prediction indicators and error rates for these two models also show high similarity,with lower error rates than the other models.Notably,the Voting model demonstrated superior performance in accurately delineating mineral deposit locations and identifying realistic mineralization trends while minimizing false anomalies. 展开更多
关键词 Gradient boost Linear regression Decision tree Voting algorithm C-A fractal modeling Geochemical mapping
在线阅读 下载PDF
Groundwater level prediction of landslide based on classification and regression tree 被引量:2
2
作者 Yannan Zhao Yuan Li +1 位作者 Lifen Zhang Qiuliang Wang 《Geodesy and Geodynamics》 2016年第5期348-355,共8页
According to groundwater level monitoring data of Shuping landslide in the Three Gorges Reservoir area, based on the response relationship between influential factors such as rainfall and reservoir level and the chang... According to groundwater level monitoring data of Shuping landslide in the Three Gorges Reservoir area, based on the response relationship between influential factors such as rainfall and reservoir level and the change of groundwater level, the influential factors of groundwater level were selected. Then the classification and regression tree(CART) model was constructed by the subset and used to predict the groundwater level. Through the verification, the predictive results of the test sample were consistent with the actually measured values, and the mean absolute error and relative error is 0.28 m and 1.15%respectively. To compare the support vector machine(SVM) model constructed using the same set of factors, the mean absolute error and relative error of predicted results is 1.53 m and 6.11% respectively. It is indicated that CART model has not only better fitting and generalization ability, but also strong advantages in the analysis of landslide groundwater dynamic characteristics and the screening of important variables. It is an effective method for prediction of ground water level in landslides. 展开更多
关键词 LANDSLIDE Groundwater level PREDICTION Classification and regression tree Three Gorges Reservoir area
原文传递
Prediction method of restoring force based on online AdaBoost regression tree algorithm in hybrid test 被引量:1
3
作者 Wang Yanhua Lü Jing +1 位作者 Wu Jing Wang Cheng 《Journal of Southeast University(English Edition)》 EI CAS 2020年第2期181-187,共7页
In order to solve the poor generalization ability of the back-propagation(BP)neural network in the model updating hybrid test,a novel method called the AdaBoost regression tree algorithm is introduced into the model u... In order to solve the poor generalization ability of the back-propagation(BP)neural network in the model updating hybrid test,a novel method called the AdaBoost regression tree algorithm is introduced into the model updating procedure in hybrid tests.During the learning phase,the regression tree is selected as a weak regression model to be trained,and then multiple trained weak regression models are integrated into a strong regression model.Finally,the training results are generated through voting by all the selected regression models.A 2-DOF nonlinear structure was numerically simulated by utilizing the online AdaBoost regression tree algorithm and the BP neural network algorithm as a contrast.The results show that the prediction accuracy of the online AdaBoost regression algorithm is 48.3%higher than that of the BP neural network algorithm,which verifies that the online AdaBoost regression tree algorithm has better generalization ability compared to the BP neural network algorithm.Furthermore,it can effectively eliminate the influence of weight initialization and improve the prediction accuracy of the restoring force in hybrid tests. 展开更多
关键词 hybrid test restoring force prediction generalization ability AdaBoost regression tree
在线阅读 下载PDF
A New Approach to Predict Financial Failure: Classification and Regression Trees (CART) 被引量:1
4
作者 Ayse Guel Yllgoer UEmit Dogrul Guelhan Orekici Temel 《Journal of Modern Accounting and Auditing》 2011年第4期329-339,共11页
The increase of competition, economic recession and financial crises has increased business failure and depending on this the researchers have attempted to develop new approaches which can yield more correct and more ... The increase of competition, economic recession and financial crises has increased business failure and depending on this the researchers have attempted to develop new approaches which can yield more correct and more reliable results. The classification and regression tree (CART) is one of the new modeling techniques which is developed for this purpose. In this study, the classification and regression trees method is explained and tested the power of the financial failure prediction. CART is applied for the data of industry companies which is trade in Istanbul Stock Exchange (ISE) between 1997-2007. As a result of this study, it has been observed that, CART has a high predicting power of financial failure one, two and three years prior to failure, and profitability ratios being the most important ratios in the prediction of failure. 展开更多
关键词 business failure financial distress PREDICTION classification and regression trees (CART)
在线阅读 下载PDF
Monthly Electricity Consumption Forecast Based on Multi-Target Regression
5
作者 Haiming Li Ping Chen 《Journal of Computer and Communications》 2019年第7期231-242,共12页
Urban grid power forecasting is one of the important tasks of power system operators, which helps to analyze the development trend of the city. As the demand for electricity in various industries is affected by many f... Urban grid power forecasting is one of the important tasks of power system operators, which helps to analyze the development trend of the city. As the demand for electricity in various industries is affected by many factors, the data of relevant influencing factors are scarce, resulting in great deviations in the accuracy of prediction results. In order to improve the prediction results, this paper proposes a model based on Multi-Target Tree Regression to predict the monthly electricity consumption of different industrial structures. Due to few data characteristics of actual electricity consumption in Shanghai from 2013 to the first half of 2017. Thus, we collect data on GDP growth, weather conditions, and tourism season distribution in various industries in Shanghai, model and train the electricity consumption data of different industries in different months. The multi-target tree regression model was tested with actual values to verify the reliability of the model and predict the monthly electricity consumption of each industry in the second half of 2017. The experimental results show that the model can accurately predict the monthly electricity consumption of various industries. 展开更多
关键词 Forecasting multi-target tree regression ELECTRICITY MONTHLY ELECTRICITY CONSUMPTION PREDICT
暂未订购
An Empirical Comparison on Multi-Target Regression Learning
6
作者 Xuefeng Xi Victor S.Sheng +2 位作者 Binqi Sun Lei Wang Fuyuan Hu 《Computers, Materials & Continua》 SCIE EI 2018年第8期185-198,共14页
Multi-target regression is concerned with the simultaneous prediction of multiple continuous target variables based on the same set of input variables.It has received relatively small attention from the Machine Learni... Multi-target regression is concerned with the simultaneous prediction of multiple continuous target variables based on the same set of input variables.It has received relatively small attention from the Machine Learning community.However,multi-target regression exists in many real-world applications.In this paper we conduct extensive experiments to investigate the performance of three representative multi-target regression learning algorithms(i.e.Multi-Target Stacking(MTS),Random Linear Target Combination(RLTC),and Multi-Objective Random Forest(MORF)),comparing the baseline single-target learning.Our experimental results show that all three multi-target regression learning algorithms do improve the performance of the single-target learning.Among them,MTS performs the best,followed by RLTC,followed by MORF.However,the single-target learning sometimes still performs very well,even the best.This analysis sheds the light on multi-target regression learning and indicates that the single-target learning is a competitive baseline for multi-target regression learning on multi-target domains. 展开更多
关键词 multi-target regression multi-label classification multi-target stacking
在线阅读 下载PDF
Predicting the Underlying Structure for Phylogenetic Trees Using Neural Networks and Logistic Regression
7
作者 Hassan W. Kayondo Samuel Mwalili 《Open Journal of Statistics》 2020年第2期239-251,共13页
Understanding an underlying structure for phylogenetic trees is very important as it informs on the methods that should be employed during phylogenetic inference. The methods used under a structured population differ ... Understanding an underlying structure for phylogenetic trees is very important as it informs on the methods that should be employed during phylogenetic inference. The methods used under a structured population differ from those needed when a population is not structured. In this paper, we compared two supervised machine learning techniques, that is artificial neural network (ANN) and logistic regression models for prediction of an underlying structure for phylogenetic trees. We carried out parameter tuning for the models to identify optimal models. We then performed 10-fold cross-validation on the optimal models for both logistic regression?and ANN. We also performed a non-supervised technique called clustering to identify the number of clusters that could be identified from simulated phylogenetic trees. The trees were from?both structured?and non-structured populations. Clustering and prediction using classification techniques were?done using tree statistics such as Colless, Sackin and cophenetic indices, among others. Results from 10-fold cross-validation revealed that both logistic regression and ANN models had comparable results, with both models having average accuracy rates of over 0.75. Most of the clustering indices used resulted in 2 or 3 as the optimal number of clusters. 展开更多
关键词 Artificial NEURAL Networks LOGISTIC regression PHYLOGENETIC tree tree STATISTICS Classification Clustering
在线阅读 下载PDF
Mapping species assemblages of tropical forests at different hierarchical levels based on multivariate regression trees
8
作者 Qi Yang Maaike Y.Bader +3 位作者 Guang Feng Jialing Li Dexu Zhang Wenxing Long 《Forest Ecosystems》 SCIE CSCD 2023年第3期387-397,共11页
Background: Vegetation distribution maps are of great significance for nature protection and management. In diverse tropical forests, accurate spatial mapping of vegetation types is challenging;the high species divers... Background: Vegetation distribution maps are of great significance for nature protection and management. In diverse tropical forests, accurate spatial mapping of vegetation types is challenging;the high species diversity and abundance of rare species challenge classification concepts, while remote sensing signals may not vary systematically with species composition, complicating the technical capability for delineating vegetation types in the landscape.Methods: We used a combination of field-based compositional data and their relations to environmental variables to predict the distribution of forest types in the Wuzhishan National Natural Reserve(WNNR), Hainan Island,China, using multivariate regression trees(MRT). The MRT was based on arboreal vegetation composition in 132plots of 20 m×20 m with a regular spacing of 1 km. Apart from the MRT, non-metric multidimensional scaling(NMDS) was used to evaluate vegetation-environment relationships.Results: The MRT model worked best when using 14 key environmental variables including topography, climate,latitude and soil, although the difference with the simpler model including only topographical variables was small. The full model classified the 132 plots into 3 vegetation types, 6 formation groups, 20 formations and 65associations at different hierarchical syntaxonomic levels. This model was the basis for forest vegetation maps for the WNNR. MRT and NMDS showed that elevation was the main driving force for the distribution of vegetation types and formation groups. Climate, latitude, and soil(especially available P), together with topographic variables, all influenced the distribution of formations and associations.Conclusions: While elevation determines forest-type distributions, lower-level syntaxonomic forest classes respond to the topographic diversity typical for mountains. Apart from providing the first detailed forest vegetation map for any part of WNNR, we show how, in spite of limitations, MRT with existing environmental data can be a useful method for mapping diverse and remote tropical forests. 展开更多
关键词 Species assemblages Tropical forest MAPPING Multivariate regression trees Non-metric multidimensional scaling
在线阅读 下载PDF
Using Boosted Regression Trees and Remotely Sensed Data to Drive Decision-Making
9
作者 Brigitte Colin Samuel Clifford +2 位作者 Paul Wu Samuel Rathmanner Kerrie Mengersen 《Open Journal of Statistics》 2017年第5期859-875,共17页
Challenges in Big Data analysis arise due to the way the data are recorded, maintained, processed and stored. We demonstrate that a hierarchical, multivariate, statistical machine learning algorithm, namely Boosted Re... Challenges in Big Data analysis arise due to the way the data are recorded, maintained, processed and stored. We demonstrate that a hierarchical, multivariate, statistical machine learning algorithm, namely Boosted Regression Tree (BRT) can address Big Data challenges to drive decision making. The challenge of this study is lack of interoperability since the data, a collection of GIS shapefiles, remotely sensed imagery, and aggregated and interpolated spatio-temporal information, are stored in monolithic hardware components. For the modelling process, it was necessary to create one common input file. By merging the data sources together, a structured but noisy input file, showing inconsistencies and redundancies, was created. Here, it is shown that BRT can process different data granularities, heterogeneous data and missingness. In particular, BRT has the advantage of dealing with missing data by default by allowing a split on whether or not a value is missing as well as what the value is. Most importantly, the BRT offers a wide range of possibilities regarding the interpretation of results and variable selection is automatically performed by considering how frequently a variable is used to define a split in the tree. A comparison with two similar regression models (Random Forests and Least Absolute Shrinkage and Selection Operator, LASSO) shows that BRT outperforms these in this instance. BRT can also be a starting point for sophisticated hierarchical modelling in real world scenarios. For example, a single or ensemble approach of BRT could be tested with existing models in order to improve results for a wide range of data-driven decisions and applications. 展开更多
关键词 Boosted regression trees Remotely Sensed DATA BIG DATA MODELLING Approach MISSING DATA
暂未订购
Point-Tree Structure Genetic Programming Method for Discontinuous Function's Regression
10
作者 Xiong Sheng-wu, Wang Wei-wuSchool of Computer Science and Technology, Wuhan University of Technology, Wuhan 430070, Hubei. China 《Wuhan University Journal of Natural Sciences》 CAS 2003年第S1期323-326,共4页
A new point-tree data structure genetic programming (PTGP) method is proposed. For the discontinuous function regression problem, the proposed method is able to identify both the function structure and discontinuities... A new point-tree data structure genetic programming (PTGP) method is proposed. For the discontinuous function regression problem, the proposed method is able to identify both the function structure and discontinuities points simultaneously. It is also easy to be used to solve the continuous function's regression problems. The numerical experiment results demonstrate that the point-tree GP is an efficient alternative way to the complex function identification problems. 展开更多
关键词 genetic programming symbolic regression point-tree structure
在线阅读 下载PDF
Iceberg Draft Prediction Using Several Tree-Based Machine Learning Models
11
作者 AZIMI Hamed SHIRI Hodjat 《Journal of Ocean University of China》 2025年第5期1269-1288,共20页
The Arctic region is experiencing accelerated sea ice melt and increased iceberg detachment from glaciers due to climate change.These drifting icebergs present a risk and engineering challenge for subsea installations... The Arctic region is experiencing accelerated sea ice melt and increased iceberg detachment from glaciers due to climate change.These drifting icebergs present a risk and engineering challenge for subsea installations traversing shallow waters,where ice-berg keels may reach the seabed,potentially damaging subsea structures.Consequently,costly and time-intensive iceberg manage-ment operations,such as towing and rerouting,are undertaken to safeguard subsea and offshore infrastructure.This study,therefore,explores the application of extra tree regression(ETR)as a robust solution for estimating iceberg draft,particularly in the preliminary phases of decision-making for iceberg management projects.Nine ETR models were developed using parameters influencing iceberg draft.Subsequent analyses identified the most effective models and significant input variables.Uncertainty analysis revealed that the superior ETR model tended to overestimate iceberg drafts;however,it achieved the highest precision,correlation,and simplicity in estimation.Comparison with decision tree regression,random forest regression,and empirical methods confirmed the superior perfor-mance of ETR in predicting iceberg drafts. 展开更多
关键词 sea-bottom founded structures iceberg draft extra tree regression decision tree regression random forest regression
在线阅读 下载PDF
A classification tree for seismic evaluation of strip foundations on liquefiable soils
12
作者 Rohollah Taslimian Parisa Delalat 《Earthquake Engineering and Engineering Vibration》 2025年第3期675-695,共21页
The feasibility of constructing shallow foundations on saturated sands remains uncertain.Seismic design standards simply stipulate that geotechnical investigations for a shallow foundation on such soils shall be condu... The feasibility of constructing shallow foundations on saturated sands remains uncertain.Seismic design standards simply stipulate that geotechnical investigations for a shallow foundation on such soils shall be conducted to mitigate the effects of the liquefaction hazard.This study investigates the seismic behavior of strip foundations on typical two-layered soil profiles-a natural loose sand layer supported by a dense sand layer.Coupled nonlinear dynamic analyses have been conducted to calculate response parameters,including seismic settlement,the acceleration response on the ground surface,and excess pore pressure beneath strip foundations.A novel liquefaction potential index(LPI_(footing)),based on excess pore pressure ratios across a given region of soil mass beneath footings is introduced to classify liquefaction severity into three distinct levels:minor,moderate,and severe.To validate the proposed LPI_(footing),the foundation settlement is evaluated for the different liquefaction potential classes.A classification tree model has been grown to predict liquefaction susceptibility,utilizing various input variables,including earthquake intensity on the ground surface,foundation pressure,sand permeability,and top layer thickness.Moreover,a nonlinear regression function has been established to map LPI_(footing) in relation to these input predictors.The models have been constructed using a substantial dataset comprising 13,824 excess pore pressure ratio time histories.The performance of the developed models has been examined using various methods,including the 10-fold cross-validation method.The predictive capability of the tree also has been validated through existing experimental studies.The results indicate that the classification tree is not only interpretable but also highly predictive,with a testing accuracy level of 78.1%.The decision tree provides valuable insights for engineers assessing liquefaction potential beneath strip foundations. 展开更多
关键词 computational geomechanics liquefaction potential index shallow foundation finite element method machine learning decision tree CLASSIFICATION regression
在线阅读 下载PDF
基于决策树回归的单通道热流法体温预测
13
作者 熊凌浩 宋晋忠 +3 位作者 张煜 余新明 张欣苗 于闯 《传感器与微系统》 北大核心 2026年第1期41-45,共5页
人体核心体温(CBT)的连续监测,在很多户外工作场景上具有较高的研究价值。使用基于单通道热流(HF)法的传感器,分别从传感器测量原理、数据采集方式、特征工程选择、模型构建、评估指标等方面进行介绍。通过网格搜索,在训练集上进行算法... 人体核心体温(CBT)的连续监测,在很多户外工作场景上具有较高的研究价值。使用基于单通道热流(HF)法的传感器,分别从传感器测量原理、数据采集方式、特征工程选择、模型构建、评估指标等方面进行介绍。通过网格搜索,在训练集上进行算法模型训练,得到最终的决策树回归算法。在10人的测试集上,平均绝对误差(MAE)为0.257℃,且对于核心体温的变化具有一定的跟随性,可为单通道热流法测量人体核心体温的推广应用提供一定的支撑。 展开更多
关键词 单通道热流法 核心体温 决策树回归法
在线阅读 下载PDF
一种基于ExtraTrees的差分隐私保护算法 被引量:6
14
作者 李杨 陈子彬 谢光强 《计算机工程》 CAS CSCD 北大核心 2020年第2期134-140,共7页
为在同等隐私保护级别下提高模型的预测准确率并降低误差,提出一种基于ExtraTrees的差分隐私保护算法DiffPETs。在决策树生成过程中,根据不同的准则计算出各特征的结果值,利用指数机制选择得分最高的特征,通过拉普拉斯机制在叶子节点上... 为在同等隐私保护级别下提高模型的预测准确率并降低误差,提出一种基于ExtraTrees的差分隐私保护算法DiffPETs。在决策树生成过程中,根据不同的准则计算出各特征的结果值,利用指数机制选择得分最高的特征,通过拉普拉斯机制在叶子节点上进行加噪,使算法能够提供ε-差分隐私保护。将DiffPETs算法应用于决策树分类和回归分析中,对于分类树,选择基尼指数作为指数机制的可用性函数并给出基尼指数的敏感度,在回归树上,将方差作为指数机制的可用性函数并给出方差的敏感度。实验结果表明,与决策树差分隐私分类和回归算法相比,DiffPETs算法能有效降低预测误差。 展开更多
关键词 差分隐私 Extratrees算法 分类 回归分析 决策树
在线阅读 下载PDF
基于数值模拟和决策树回归的金属切削力学性能预测
15
作者 程一晋 冯志强 李燕 《应用数学和力学》 北大核心 2026年第1期32-45,共14页
快速预测金属切削的各种力学性能对工业制造的优化设计和产能提高十分关键.当前相关预测模型通常需要昂贵且耗时的实验和分析过程.构建了一种基于金属切削模拟和决策树回归(decision tree regression,DTR)的预测模型,用于获取不同切削... 快速预测金属切削的各种力学性能对工业制造的优化设计和产能提高十分关键.当前相关预测模型通常需要昂贵且耗时的实验和分析过程.构建了一种基于金属切削模拟和决策树回归(decision tree regression,DTR)的预测模型,用于获取不同切削工况下的力学性能.首先,采用自适应光滑粒子流体动力学(adaptive smoothed particle hydrodynamics,ASPH)模拟金属切削过程,捕获了不同模拟参数下的多种力学性能,组成2000种切削工况的模拟数据集;其次,利用DTR算法学习模拟数据集,训练和构建金属切削预测模型,并通过交叉验证和网格搜索评估了不同剪枝策略下预测模型的效果.结果表明,建立的预测模型可以快速地预测不同模拟参数下的多种力学性能,适宜的剪枝策略可以提升预测模型的准确度、泛化能力和稳定性. 展开更多
关键词 金属切削 力学性能预测 数值模拟 自适应光滑粒子流体动力学 决策树回归
在线阅读 下载PDF
基于机器学习的岩溶裂隙空间分布预测研究:以北京房山为例
16
作者 乔小娟 罗承可 +1 位作者 柴新宇 于文瑾 《地学前缘》 北大核心 2026年第1期405-418,共14页
岩溶裂隙发育具有高维、非线性及空间异质性特征,如何刻画裂隙的空间展布是岩溶发育规律研究的难点。以多源数据驱动的机器学习建模方法可以有效捕捉裂隙系统中隐含的非线性、非连续的特征,从而显著地提高裂隙识别与刻画的效率与精度。... 岩溶裂隙发育具有高维、非线性及空间异质性特征,如何刻画裂隙的空间展布是岩溶发育规律研究的难点。以多源数据驱动的机器学习建模方法可以有效捕捉裂隙系统中隐含的非线性、非连续的特征,从而显著地提高裂隙识别与刻画的效率与精度。本研究以北京市房山张坊地区为研究对象,基于翔实的野外裂隙实测数据,系统融合了地表地形信息、区域构造背景、地层岩性分布以及地下水位等多源数据集。利用机器学习框架构建了一套综合性的定量化特征体系,该体系涵盖了断层空间影响、地层岩性组合特征、地下水埋深变化以及高精度地形衍生属性(如坡度、曲率等)等多个维度的指标。重点研究对比了支持向量回归、极致梯度提升树及随机森林这三种机器学习方法,旨在预测研究区内岩溶裂隙的发育与空间分布情况。结果表明,基于随机森林构建的预测模型表现最为优异。该模型的裂隙密度、节理走向与倾角的模拟结果与实测统计数据最符合,模型表现最为稳健,具有良好的泛化能力和方法适用性,在表达多期次裂隙发育等复杂地质过程方面具有独特优势。本研究的结果揭示,将数据驱动模型与深入的地质机理分析相融合,是突破复杂岩溶系统定量化表征与预测难题的一条有效途径。 展开更多
关键词 岩溶裂隙 机器学习 支持向量回归 梯度提升树 随机森林 北京房山
在线阅读 下载PDF
基于全流程仿真机理数据的城市固废焚烧过程尾气排放建模与分析
17
作者 汤健 王天峥 +4 位作者 夏恒 陈佳昆 梁永琪 庄家宾 乔俊飞 《北京工业大学学报》 北大核心 2026年第1期41-54,共14页
针对城市固废焚烧(municipal solid waste incineration,MSWI)过程中“布风布料”操作变量与尾气排放气体间的精确机理模型难以构建的问题,提出了基于全流程仿真机理数据的MSWI过程尾气排放建模与分析方法。首先,在进行面向操作变量与... 针对城市固废焚烧(municipal solid waste incineration,MSWI)过程中“布风布料”操作变量与尾气排放气体间的精确机理模型难以构建的问题,提出了基于全流程仿真机理数据的MSWI过程尾气排放建模与分析方法。首先,在进行面向操作变量与尾气排放的工艺流程描述的基础上,耦合流体动力焚烧代码(fluid dynamic incinerator code,FLIC)、Fluent和Aspen Plus这3种数值仿真软件对MSWI过程所包含的炉排固相燃烧、炉内气相燃烧、余热交换与烟气处理等阶段进行全流程模拟,进而获得基准运行工况下的数值仿真模型;接着,面向操作变量进行正交实验设计和实验实施,获得多种运行工况下仿真机理数据;最后,构建以操作变量为输入、以主要尾气排放为输出的基于多入多出线性回归决策树(multiple-input multiple-output least regression decision tree,MIMO-LRDT)的尾气排放模型,并分别采用单因素和双因素法可视化分析两者间的映射关系。采用面向北京某MSWI厂构建的数值仿真和数据驱动模型验证了所提方法的有效性。 展开更多
关键词 城市固废焚烧过程 尾气排放建模 数值仿真模型 仿真机理数据 数据驱动模型 多入多出线性回归决策树
在线阅读 下载PDF
Segmented Linear Regression Trees
18
作者 Xiangyu Zheng Songxi Chen 《Acta Mathematica Sinica,English Series》 2025年第2期498-521,共24页
Tree-based models have been widely applied in both academic and industrial settings due to the natural interpretability, good predictive accuracy, and high scalability. In this paper, we focus on improving the single-... Tree-based models have been widely applied in both academic and industrial settings due to the natural interpretability, good predictive accuracy, and high scalability. In this paper, we focus on improving the single-tree method and propose the segmented linear regression trees(SLRT) model that replaces the traditional constant leaf model with linear ones. From the parametric view, SLRT can be employed as a recursive change point detect procedure for segmented linear regression(SLR) models,which is much more efficient and flexible than the traditional grid search method. Along this way,we propose to use the conditional Kendall's τ correlation coefficient to select the underlying change points. From the non-parametric view, we propose an efficient greedy splitting method that selects the splits by analyzing the association between residuals and each candidate split variable. Further, with the SLRT as a single-tree predictor, we propose a linear random forest approach that aggregates the SLRTs by a weighted average. Both simulation and empirical studies showed significant improvements than the CART trees and even the random forest. 展开更多
关键词 regression trees segmented linear regression split selection tree pruning
原文传递
Services on Academic Achievement: A Robust Estimation Using Bayesian Additive Regression Trees
19
作者 Yuntian Zuo 《Open Journal of Statistics》 2025年第6期445-472,共28页
Determining the causal effect of special education is a critical topic when mak-ing educational policy that focuses on student achievement.However,current special education research is facing challenges from persisten... Determining the causal effect of special education is a critical topic when mak-ing educational policy that focuses on student achievement.However,current special education research is facing challenges from persistent selection bias and complex confounding.Bayesian Additive Regression Trees(BART)is em-ployed in this study to provide a flexible estimation of the academic perfor-mance.Targeted Maximum Likelihood Estimation(TMLE)is also integrated into the BART model,supporting doubly robust estimation of the special ed-ucation effect.This study extracted survey data from the Early Childhood Lon-gitudinal Study,Kindergarten Class(ECLS-K),to estimate the causal impact of special education status on students’combined mathematics and reading achievement scores.The analysis results of the BART-TMLE model show that children receiving special education services demonstrated approximately 9 points lower scores on average for combined math and reading scores,even adjusting for a considerable number of covariates,compared to their peers who did not receive these services.The estimated negative treatment effect persists after controlling for observed covariates that are closely correlated to the combined test score.The negative effect likely reflects unobserved factors,such as the underlying severity of learning disabilities,parent involvement and other potential traits,which are actual factors that determine the placement of special education status,rather than indicating the ineffectiveness of special education service.The achievement gap in academic performance reflects the current observable status of special education.The estimated effect could be improved by future research incorporating educational domain knowledge,allowing the model to be constructed more accurately. 展开更多
关键词 Bayesian Additive regression trees Targeted Maximum Likelihood
在线阅读 下载PDF
Estimating canopy closure density and above-ground tree biomass using partial least square methods in Chinese boreal forests 被引量:5
20
作者 LEI Cheng-liang JU Cun-yong +3 位作者 CAI Ti-jiu J1NG Xia WEI Xiao-hua DI Xue-ying 《Journal of Forestry Research》 CAS CSCD 2012年第2期191-196,共6页
Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used parti... Boreal forests play an important role in global environment systems. Understanding boreal forest ecosystem structure and function requires accurate monitoring and estimating of forest canopy and biomass. We used partial least square regression (PLSR) models to relate forest parameters, i.e. canopy closure density and above ground tree biomass, to Landsat ETM+ data. The established models were optimized according to the variable importance for projection (VIP) criterion and the bootstrap method, and their performance was compared using several statistical indices. All variables selected by the VIP criterion passed the bootstrap test (p〈0.05). The simplified models without insignificant variables (VIP 〈1) performed as well as the full model but with less computation time. The relative root mean square error (RMSE%) was 29% for canopy closure density, and 58% for above ground tree biomass. We conclude that PLSR can be an effective method for estimating canopy closure density and above ground biomass. 展开更多
关键词 above-ground tree biomass bootstrap method canopy clo- sure density partial least square regression (PLSR) VIP criterion
在线阅读 下载PDF
上一页 1 2 66 下一页 到第
使用帮助 返回顶部