期刊文献+
共找到81篇文章
< 1 2 5 >
每页显示 20 50 100
Services on Academic Achievement: A Robust Estimation Using Bayesian Additive Regression Trees
1
作者 Yuntian Zuo 《Open Journal of Statistics》 2025年第6期445-472,共28页
Determining the causal effect of special education is a critical topic when mak-ing educational policy that focuses on student achievement.However,current special education research is facing challenges from persisten... Determining the causal effect of special education is a critical topic when mak-ing educational policy that focuses on student achievement.However,current special education research is facing challenges from persistent selection bias and complex confounding.Bayesian Additive Regression Trees(BART)is em-ployed in this study to provide a flexible estimation of the academic perfor-mance.Targeted Maximum Likelihood Estimation(TMLE)is also integrated into the BART model,supporting doubly robust estimation of the special ed-ucation effect.This study extracted survey data from the Early Childhood Lon-gitudinal Study,Kindergarten Class(ECLS-K),to estimate the causal impact of special education status on students’combined mathematics and reading achievement scores.The analysis results of the BART-TMLE model show that children receiving special education services demonstrated approximately 9 points lower scores on average for combined math and reading scores,even adjusting for a considerable number of covariates,compared to their peers who did not receive these services.The estimated negative treatment effect persists after controlling for observed covariates that are closely correlated to the combined test score.The negative effect likely reflects unobserved factors,such as the underlying severity of learning disabilities,parent involvement and other potential traits,which are actual factors that determine the placement of special education status,rather than indicating the ineffectiveness of special education service.The achievement gap in academic performance reflects the current observable status of special education.The estimated effect could be improved by future research incorporating educational domain knowledge,allowing the model to be constructed more accurately. 展开更多
关键词 Bayesian Additive regression trees Targeted Maximum Likelihood
在线阅读 下载PDF
A New Approach to Predict Financial Failure: Classification and Regression Trees (CART) 被引量:1
2
作者 Ayse Guel Yllgoer UEmit Dogrul Guelhan Orekici Temel 《Journal of Modern Accounting and Auditing》 2011年第4期329-339,共11页
The increase of competition, economic recession and financial crises has increased business failure and depending on this the researchers have attempted to develop new approaches which can yield more correct and more ... The increase of competition, economic recession and financial crises has increased business failure and depending on this the researchers have attempted to develop new approaches which can yield more correct and more reliable results. The classification and regression tree (CART) is one of the new modeling techniques which is developed for this purpose. In this study, the classification and regression trees method is explained and tested the power of the financial failure prediction. CART is applied for the data of industry companies which is trade in Istanbul Stock Exchange (ISE) between 1997-2007. As a result of this study, it has been observed that, CART has a high predicting power of financial failure one, two and three years prior to failure, and profitability ratios being the most important ratios in the prediction of failure. 展开更多
关键词 business failure financial distress PREDICTION classification and regression trees (CART)
在线阅读 下载PDF
Mapping species assemblages of tropical forests at different hierarchical levels based on multivariate regression trees
3
作者 Qi Yang Maaike Y.Bader +3 位作者 Guang Feng Jialing Li Dexu Zhang Wenxing Long 《Forest Ecosystems》 SCIE CSCD 2023年第3期387-397,共11页
Background: Vegetation distribution maps are of great significance for nature protection and management. In diverse tropical forests, accurate spatial mapping of vegetation types is challenging;the high species divers... Background: Vegetation distribution maps are of great significance for nature protection and management. In diverse tropical forests, accurate spatial mapping of vegetation types is challenging;the high species diversity and abundance of rare species challenge classification concepts, while remote sensing signals may not vary systematically with species composition, complicating the technical capability for delineating vegetation types in the landscape.Methods: We used a combination of field-based compositional data and their relations to environmental variables to predict the distribution of forest types in the Wuzhishan National Natural Reserve(WNNR), Hainan Island,China, using multivariate regression trees(MRT). The MRT was based on arboreal vegetation composition in 132plots of 20 m×20 m with a regular spacing of 1 km. Apart from the MRT, non-metric multidimensional scaling(NMDS) was used to evaluate vegetation-environment relationships.Results: The MRT model worked best when using 14 key environmental variables including topography, climate,latitude and soil, although the difference with the simpler model including only topographical variables was small. The full model classified the 132 plots into 3 vegetation types, 6 formation groups, 20 formations and 65associations at different hierarchical syntaxonomic levels. This model was the basis for forest vegetation maps for the WNNR. MRT and NMDS showed that elevation was the main driving force for the distribution of vegetation types and formation groups. Climate, latitude, and soil(especially available P), together with topographic variables, all influenced the distribution of formations and associations.Conclusions: While elevation determines forest-type distributions, lower-level syntaxonomic forest classes respond to the topographic diversity typical for mountains. Apart from providing the first detailed forest vegetation map for any part of WNNR, we show how, in spite of limitations, MRT with existing environmental data can be a useful method for mapping diverse and remote tropical forests. 展开更多
关键词 Species assemblages Tropical forest MAPPING Multivariate regression trees Non-metric multidimensional scaling
在线阅读 下载PDF
Segmented Linear Regression Trees
4
作者 Xiangyu Zheng Songxi Chen 《Acta Mathematica Sinica,English Series》 2025年第2期498-521,共24页
Tree-based models have been widely applied in both academic and industrial settings due to the natural interpretability, good predictive accuracy, and high scalability. In this paper, we focus on improving the single-... Tree-based models have been widely applied in both academic and industrial settings due to the natural interpretability, good predictive accuracy, and high scalability. In this paper, we focus on improving the single-tree method and propose the segmented linear regression trees(SLRT) model that replaces the traditional constant leaf model with linear ones. From the parametric view, SLRT can be employed as a recursive change point detect procedure for segmented linear regression(SLR) models,which is much more efficient and flexible than the traditional grid search method. Along this way,we propose to use the conditional Kendall's τ correlation coefficient to select the underlying change points. From the non-parametric view, we propose an efficient greedy splitting method that selects the splits by analyzing the association between residuals and each candidate split variable. Further, with the SLRT as a single-tree predictor, we propose a linear random forest approach that aggregates the SLRTs by a weighted average. Both simulation and empirical studies showed significant improvements than the CART trees and even the random forest. 展开更多
关键词 regression trees segmented linear regression split selection tree pruning
原文传递
Predicting plant disease epidemics using boosted regression trees
5
作者 Chun Peng Xingyue Zhang Weiming Wang 《Infectious Disease Modelling》 CSCD 2024年第4期1138-1146,共9页
Plant epidemics are often associated with weather-related variables.It is difficult to identify weather-related predictors for models predicting plant epidemics.In the article by Shah et al.,to predict Fusarium head b... Plant epidemics are often associated with weather-related variables.It is difficult to identify weather-related predictors for models predicting plant epidemics.In the article by Shah et al.,to predict Fusarium head blight(FHB)epidemics of wheat,they explored a functional approach using scalar-on-function regression to model a binary outcome(FHB epidemic or non-epidemic)with respect to weather time series spanning 140 days relative to anthesis.The scalar-on-function models fit the data better than previously described logistic regression models.In this work,given the same dataset and models,we attempt to reproduce the article by Shah et al.using a different approach,boosted regression trees.After fitting,the classification accuracy and model statistics are surprisingly good. 展开更多
关键词 Plant disease epidemics Scalar-on-function model Boosted regression trees
原文传递
Partially fixed bayesian additive regression trees
6
作者 Hao Ran Yang Bai 《Statistical Theory and Related Fields》 CSCD 2024年第3期232-242,共11页
Bayesian Additive Regression Trees(BART)is a widely popular nonparametric regression model known for its accurate prediction capabilities.In certain situations,there is knowledge suggesting the existence of certain do... Bayesian Additive Regression Trees(BART)is a widely popular nonparametric regression model known for its accurate prediction capabilities.In certain situations,there is knowledge suggesting the existence of certain dominant variables.However,the BART model fails to fully utilize the knowledge.To tackle this problem,the paper introduces a modification to BART known as the Partially Fixed BART model.By fixing a portion of the trees’structure,this model enables more efficient utilization of prior knowledge,resulting in enhanced estimation accuracy.Moreover,the Partially Fixed BART model can offer more precise estimates and valuable insights for future analysis even when such prior knowledge is absent.Empirical results substantiate the enhancement of the proposed model in comparison to the original BART. 展开更多
关键词 Bayesian additive regression trees nonparametric model machine learning variable importance
原文传递
Groundwater level prediction of landslide based on classification and regression tree 被引量:2
7
作者 Yannan Zhao Yuan Li +1 位作者 Lifen Zhang Qiuliang Wang 《Geodesy and Geodynamics》 2016年第5期348-355,共8页
According to groundwater level monitoring data of Shuping landslide in the Three Gorges Reservoir area, based on the response relationship between influential factors such as rainfall and reservoir level and the chang... According to groundwater level monitoring data of Shuping landslide in the Three Gorges Reservoir area, based on the response relationship between influential factors such as rainfall and reservoir level and the change of groundwater level, the influential factors of groundwater level were selected. Then the classification and regression tree(CART) model was constructed by the subset and used to predict the groundwater level. Through the verification, the predictive results of the test sample were consistent with the actually measured values, and the mean absolute error and relative error is 0.28 m and 1.15%respectively. To compare the support vector machine(SVM) model constructed using the same set of factors, the mean absolute error and relative error of predicted results is 1.53 m and 6.11% respectively. It is indicated that CART model has not only better fitting and generalization ability, but also strong advantages in the analysis of landslide groundwater dynamic characteristics and the screening of important variables. It is an effective method for prediction of ground water level in landslides. 展开更多
关键词 LANDSLIDE Groundwater level PREDICTION Classification and regression tree Three Gorges Reservoir area
原文传递
Prediction method of restoring force based on online AdaBoost regression tree algorithm in hybrid test 被引量:1
8
作者 Wang Yanhua Lü Jing +1 位作者 Wu Jing Wang Cheng 《Journal of Southeast University(English Edition)》 EI CAS 2020年第2期181-187,共7页
In order to solve the poor generalization ability of the back-propagation(BP)neural network in the model updating hybrid test,a novel method called the AdaBoost regression tree algorithm is introduced into the model u... In order to solve the poor generalization ability of the back-propagation(BP)neural network in the model updating hybrid test,a novel method called the AdaBoost regression tree algorithm is introduced into the model updating procedure in hybrid tests.During the learning phase,the regression tree is selected as a weak regression model to be trained,and then multiple trained weak regression models are integrated into a strong regression model.Finally,the training results are generated through voting by all the selected regression models.A 2-DOF nonlinear structure was numerically simulated by utilizing the online AdaBoost regression tree algorithm and the BP neural network algorithm as a contrast.The results show that the prediction accuracy of the online AdaBoost regression algorithm is 48.3%higher than that of the BP neural network algorithm,which verifies that the online AdaBoost regression tree algorithm has better generalization ability compared to the BP neural network algorithm.Furthermore,it can effectively eliminate the influence of weight initialization and improve the prediction accuracy of the restoring force in hybrid tests. 展开更多
关键词 hybrid test restoring force prediction generalization ability AdaBoost regression tree
在线阅读 下载PDF
基于区间Ⅱ型FNN的MSWI过程炉膛温度控制 被引量:4
9
作者 汤健 田昊 +1 位作者 夏恒 乔俊飞 《北京工业大学学报》 北大核心 2025年第2期157-172,共16页
针对城市固废焚烧(municipal solid waste incineration,MSWI)过程的炉膛温度难以实现有效控制的问题,提出基于区间Ⅱ型模糊神经网络(interval type-Ⅱfuzzy neural network,IT2FNN)的炉膛温度控制方法。首先,进行炉膛温度控制特性分析... 针对城市固废焚烧(municipal solid waste incineration,MSWI)过程的炉膛温度难以实现有效控制的问题,提出基于区间Ⅱ型模糊神经网络(interval type-Ⅱfuzzy neural network,IT2FNN)的炉膛温度控制方法。首先,进行炉膛温度控制特性分析以确定对其产生影响的关键操作变量;然后,根据上述操作变量基于线性回归决策树(linear regression decision tree,LRDT)建立多入单出(multiple-input single-output,MISO)炉膛温度模型;最后,构建具有自适应参数学习的IT2FNN控制器,并证明其稳定性。在MSWI过程数据集上构建模型并进行控制,实验结果验证了所提方法的有效性。 展开更多
关键词 城市固废焚烧(municipal solid waste incineration MSWI) 炉膛温度控制 线性回归决策树(linear regression decision tree LRDT) 区间Ⅱ型模糊神经网络(interval type-Ⅱfuzzy neural network IT2FNN) 梯度下降法 李雅普诺夫稳定性分析
在线阅读 下载PDF
Quantitative analysis of factors driving the variations in snow cover fraction in the Qilian Mountains,China
10
作者 JIN Zizhen QIN Xiang +6 位作者 LI Xiaoying ZHAO Qiudong ZHANG Jingtian MA Xinxin WANG Chunlin HE Rui WANG Renjun 《Journal of Arid Land》 2025年第7期888-911,共24页
Understanding the impact of meteorological and topographical factors on snow cover fraction(SCF)is crucial for water resource management in the Qilian Mountains(QLM),China.However,there is still a lack of adequate qua... Understanding the impact of meteorological and topographical factors on snow cover fraction(SCF)is crucial for water resource management in the Qilian Mountains(QLM),China.However,there is still a lack of adequate quantitative analysis of the impact of these factors.This study investigated the spatiotemporal characteristics and trends of SCF in the QLM based on the cloud-removed Moderate Resolution Imaging Spectroradiometer(MODIS)SCF dataset during 2000-2021 and conducted a quantitative analysis of the drivers using a histogram-based gradient boosting regression tree(HGBRT)model.The results indicated that the monthly distribution of SCF exhibited a bimodal pattern.The SCF showed a pattern of higher values in the western regions and lower values in the eastern regions.Overall,the SCF showed a decreasing trend during 2000-2021.The decrease in SCF occurred at higher elevations,while an increase was observed at lower elevations.At the annual scale,the SCF showed a downward trend in the western regions affected by westerly(52.84%of the QLM).However,the opposite trend was observed in the eastern regions affected by monsoon(45.73%of the QLM).The SCF displayed broadly similar spatial patterns in autumn and winter,with a significant decrease in the western regions and a slight increase in the central and eastern regions.The effect of spring SCF on spring surface runoff was more pronounced than that of winter SCF.Furthermore,compared with meteorological factors,a variation of 46.53%in spring surface runoff can be attributed to changes in spring SCF.At the annual scale,temperature and relative humidity were the most important drivers of SCF change.An increase in temperature exceeding 0.04°C/a was observed to result in a decline in SCF,with a maximum decrease of 0.22%/a.An increase in relative humidity of more than 0.02%/a stabilized the rise in SCF(about 0.06%/a).The impacts of slope and aspect were found to be minimal.At the seasonal scale,the primary factors impacting SCF change varied.In spring,precipitation and wind speed emerged as the primary drivers.In autumn,precipitation and temperature were identified as the primary drivers.In winter,relative humidity and precipitation were the most important drivers.In contrast to the other seasons,slope exerted the strongest influence on SCF change in summer.This study facilitates a detailed quantitative description of SCF change in the QLM,enhancing the effectiveness of watershed water resource management and ecological conservation efforts in this region. 展开更多
关键词 snow cover fraction surface runoff machine learning histogram-based gradient boosting regression tree(HGBRT)model hydrological effects Qinghai-Xizang Plateau
在线阅读 下载PDF
Multidimensional factors influencing ecosystem services and their relationships in alpine ecosystems:A case study of the Daxing'anling forest area,Inner Mongolia
11
作者 Laixian Xu Jiang He +3 位作者 Youjun He Liang Zhang Hui Xu Chunwei Tang 《Forest Ecosystems》 2025年第6期1296-1318,共23页
Understanding the influencing factors of ecosystem services(ESs)and their relationships is essential for sustainable ecosystem management in degraded alpine ecosystems.There is a lack of integrated multi-model approac... Understanding the influencing factors of ecosystem services(ESs)and their relationships is essential for sustainable ecosystem management in degraded alpine ecosystems.There is a lack of integrated multi-model approaches to explore the multidimensional influences on ESs and their relationships in alpine ecosystems.Taking the Daxing'anling forest area,Inner Mongolia(DFAIM)as a case study,this study used the integrated valuation of ecosystem services and trade-offs(InVEST)model to quantify four ESs—soil conservation(SC),water yield(WY),carbon storage(CS),and habitat quality(HQ)—from 2013 to 2018.We adopted root mean square deviation(RMSD)and coupling coordination degree models(CCDM)to analyze their relationships,and integrated three complementary approaches—optimal parameter-based geographical detector model(OPGDM),gradient boosting regression tree model(GBRTM),and quantile regression model(QRM)—to reveal multidimensional influencing factors.Key findings include the following:(1)From 2013 to 2018,WY,SC,and HQ declined while CS increased.WY was primarily influenced by mean annual precipitation(MAP),forest ratio(RF),and soil bulk density(SBD);CS and HQ by RF and population density(PD);and SC by slope(S),RF,and MAP.Mean annual temperature(MAT),gross domestic product(GDP),and road network density(RND)showed increasing negative impacts.(2)Low trade-off intensity(TI<0.15)dominated all ES pairs,with RF,MAP,PD,and normalized difference vegetation index(NDVI)being the dominant factors.The factor interactions primarily showed two-factor enhancement patterns.(3)The average coupling coordination degree(CCD)of the four ESs was low and declined over time,with low-CCD areas becoming increasingly prevalent.RF,S,SBD,and NDVI positively influenced CCD,while PD,MAT,GDP,and RND had increasing negative impacts,with over 62%of the factor interactions exceeding the individual factor effects.In summary,ES supply generally decreased.Local relationships showed moderate coordination,while overall relationships indicated primary dysfunction.Land use and natural factors primarily shaped these ES and their relationships,while climate and socioeconomic changes diminished ES supply and intensified competition.We recommend enhancing the resilience of natural systems rather than replacing them,establishing climate adaptation monitoring systems,and promoting conservation tillage and cross-departmental coordination mechanisms for collaborative ES optimization.These results provide valuable insights into the sustainable management of alpine ecosystems. 展开更多
关键词 Trade-off intensity(TI) Coupling coordination degree(CCD) Influencing factor Optimal parameter-based geographical detector model(OPGDM) Gradient boosting regression tree model(GBRTM) Quantile regression model(QRM) Trade-offs and synergies
在线阅读 下载PDF
Enhanced extrapolative machine learning for designing high-performance multi-principal-element superalloys
12
作者 Qiu-Ling Tao Long-Ke Bao +6 位作者 Jia-Wen Cao Rong-Pei Shi Yi-Lu Zhao Tao Yang Xue Jia Zhi-Fu Yao Xing-Jun Liu 《Rare Metals》 2025年第10期7859-7875,共17页
Machine learning(ML)has become a powerful tool for accelerating the design and development of new materials.Among various traditional ML algorithms,decision tree-based ensemble learning methods are frequently chosen f... Machine learning(ML)has become a powerful tool for accelerating the design and development of new materials.Among various traditional ML algorithms,decision tree-based ensemble learning methods are frequently chosen for their strong predictive capabilities.However,decision trees are limited in regression tasks to interpolating within the data range of the training set,which restricts their usefulness for designing materials with enhanced properties.Herein,we focused on predicting and optimizing the L1_(2)-phase solvus temperature(T_(L12))and density,two critical properties for multi-principal-element superalloys(MPESAs).To achieve this,we employed the piecewise symbolic regression tree(PS-Tree),which demonstrates excellent extrapolation capability.Our model successfully predicted high T_(L12)values exceeding the training data range(1242℃),with four candidate alloys achieving TL12values of 1246,1249,1254,and 1274℃.Experimental validation confirmed the accuracy of these predictions,verifying the robust extrapolative capability of the PS-Tree method.Notably,one alloy exhibited a T_(L12)of 1267℃and a density of 7.94 g cm^(-3),outperforming most MPESAs.Additionally,another alloy exhibited a compressive yield strength of 897 MPa at 750℃,with a specific yield strength at this temperature higher than that of most L1_(2)-strengthened alloys and Co/Ni-based superalloys.Moreover,the model provided generalized insights,indicating that alloys with δ_(r)>5.3 and ΔH_(mix)<-12.8 J mol^(-1)K^(-1)tend to favor higher T_(L12). 展开更多
关键词 Machine learning for material design Piecewise symbolic regression tree Extrapolation capability Multi-principal-element superalloys
原文传递
Iceberg Draft Prediction Using Several Tree-Based Machine Learning Models
13
作者 AZIMI Hamed SHIRI Hodjat 《Journal of Ocean University of China》 2025年第5期1269-1288,共20页
The Arctic region is experiencing accelerated sea ice melt and increased iceberg detachment from glaciers due to climate change.These drifting icebergs present a risk and engineering challenge for subsea installations... The Arctic region is experiencing accelerated sea ice melt and increased iceberg detachment from glaciers due to climate change.These drifting icebergs present a risk and engineering challenge for subsea installations traversing shallow waters,where ice-berg keels may reach the seabed,potentially damaging subsea structures.Consequently,costly and time-intensive iceberg manage-ment operations,such as towing and rerouting,are undertaken to safeguard subsea and offshore infrastructure.This study,therefore,explores the application of extra tree regression(ETR)as a robust solution for estimating iceberg draft,particularly in the preliminary phases of decision-making for iceberg management projects.Nine ETR models were developed using parameters influencing iceberg draft.Subsequent analyses identified the most effective models and significant input variables.Uncertainty analysis revealed that the superior ETR model tended to overestimate iceberg drafts;however,it achieved the highest precision,correlation,and simplicity in estimation.Comparison with decision tree regression,random forest regression,and empirical methods confirmed the superior perfor-mance of ETR in predicting iceberg drafts. 展开更多
关键词 sea-bottom founded structures iceberg draft extra tree regression decision tree regression random forest regression
在线阅读 下载PDF
Variations in the biomass of Eucalyptus plantations at a regional scale in Southern China 被引量:2
14
作者 Quanyi Qiu Guoliang Yun +6 位作者 Shudi Zuo Jing Yan Lizhong Hua Yin Ren Jianfeng Tang Yaying Li Qi Chen 《Journal of Forestry Research》 SCIE CAS CSCD 2018年第5期1263-1276,共14页
We quantified deviations in regional forest biomass from simple extrapolation of plot data by the biomass expansion factor method(BEF) versus estimates obtained from a local biomass model,based on large-scale empiri... We quantified deviations in regional forest biomass from simple extrapolation of plot data by the biomass expansion factor method(BEF) versus estimates obtained from a local biomass model,based on large-scale empirical field inventory sampling data.The sources and relative contributions of deviations between the two models were analyzed by the boosted regression trees method.Relative to the local model,BEF overestimated accumulative biomass by 22.12%.The predominant sources of the total deviation (70.94%) were stand-structure variables.Stand age and diameter at breast height are the major factors.Compared with biotic variables,abiotic variables had a smaller overall contribution (29.06%),with elevation and soil depth being the most important among the examined abiotic factors.Large deviations in regional forest biomass and carbon stock estimates are likely to be obtained with BEF relative to estimates based on local data.To minimize deviations,stand age and elevation should be included in regional forest-biomass estimation. 展开更多
关键词 BEF Boosted regression trees Eucalyptus plantations Local biomass model Regional biomass estimation Biotic versus abiotic factors Uncertainty analysis
在线阅读 下载PDF
Hybrid XGBoost model with hyperparameter tuning for prediction of liver disease with better accuracy 被引量:2
15
作者 Surjeet Dalal Edeh Michael Onyema Amit Malik 《World Journal of Gastroenterology》 SCIE CAS 2022年第46期6551-6563,共13页
BACKGROUND Liver disease indicates any pathology that can harm or destroy the liver or prevent it from normal functioning.The global community has recently witnessed an increase in the mortality rate due to liver dise... BACKGROUND Liver disease indicates any pathology that can harm or destroy the liver or prevent it from normal functioning.The global community has recently witnessed an increase in the mortality rate due to liver disease.This could be attributed to many factors,among which are human habits,awareness issues,poor healthcare,and late detection.To curb the growing threats from liver disease,early detection is critical to help reduce the risks and improve treatment outcome.Emerging technologies such as machine learning,as shown in this study,could be deployed to assist in enhancing its prediction and treatment.AIM To present a more efficient system for timely prediction of liver disease using a hybrid eXtreme Gradient Boosting model with hyperparameter tuning with a view to assist in early detection,diagnosis,and reduction of risks and mortality associated with the disease.METHODS The dataset used in this study consisted of 416 people with liver problems and 167 with no such history.The data were collected from the state of Andhra Pradesh,India,through https://www.kaggle.com/datasets/uciml/indian-liver-patientrecords.The population was divided into two sets depending on the disease state of the patient.This binary information was recorded in the attribute"is_patient".RESULTS The results indicated that the chi-square automated interaction detection and classification and regression trees models achieved an accuracy level of 71.36%and 73.24%,respectively,which was much better than the conventional method.The proposed solution would assist patients and physicians in tackling the problem of liver disease and ensuring that cases are detected early to prevent it from developing into cirrhosis(scarring)and to enhance the survival of patients.The study showed the potential of machine learning in health care,especially as it concerns disease prediction and monitoring.CONCLUSION This study contributed to the knowledge of machine learning application to health and to the efforts toward combating the problem of liver disease.However,relevant authorities have to invest more into machine learning research and other health technologies to maximize their potential. 展开更多
关键词 Liver infection Machine learning Chi-square automated interaction detection Classification and regression trees Decision tree XGBoost Hyperparameter tuning
在线阅读 下载PDF
An Insight into Machine Learning Algorithms to Map the Occurrence of the Soil Mattic Horizon in the Northeastern Qinghai-Tibetan Plateau 被引量:1
16
作者 ZHI Junjun ZHANG Ganlin +6 位作者 YANG Renmin YANG Fei JIN Chengwei LIU Feng SONG Xiaodong ZHAO Yuguo LI Decheng 《Pedosphere》 SCIE CAS CSCD 2018年第5期739-750,共12页
Soil diagnostic horizons, which each have a set of quantified properties, play a key role in soil classification. However, they are difficult to predict, and few attempts have been made to map their spatial occurrence... Soil diagnostic horizons, which each have a set of quantified properties, play a key role in soil classification. However, they are difficult to predict, and few attempts have been made to map their spatial occurrence. We evaluated and compared four machine learning algorithms, namely, the classification and regression tree(CART), random forest(RF), boosted regression trees(BRT), and support vector machine(SVM), to map the occurrence of the soil mattic horizon in the northeastern Qinghai-Tibetan Plateau using readily available ancillary data. The mechanisms of resampling and ensemble techniques significantly improved prediction accuracies(measured based on area under the receiver operator characteristic curve score(AUC)) and produced more stable results for the BRT(AUC of 0.921 ± 0.012, mean ± standard deviation) and RF(0.908 ± 0.013) algorithms compared to the CART algorithm(0.784 ± 0.012), which is the most commonly used machine learning method. Although the SVM algorithm yielded a comparable AUC value(0.906 ± 0.006) to the RF and BRT algorithms, it is sensitive to parameter settings, which are extremely time-consuming.Therefore, we consider it inadequate for occurrence-distribution modeling. Considering the obvious advantages of high prediction accuracy, robustness to parameter settings, the ability to estimate uncertainty in prediction, and easy interpretation of predictor variables, BRT seems to be the most desirable method. These results provide an insight into the use of machine learning algorithms to map the mattic horizon and potentially other soil diagnostic horizons. 展开更多
关键词 boosted regression trees classification and regression tree digital soil mapping random forest soil diagnostic horizons support vector machine
原文传递
Patterns of forest composition and their long term environmental drivers in the tropical dry forest transition zone of southern Africa 被引量:1
17
作者 Vera De Cauwer Coert J.Geldenhuys +2 位作者 Raf Aerts Miya Kabajani Bart Muys 《Forest Ecosystems》 SCIE CSCD 2017年第1期33-44,共12页
Background: Tropical dry forests cover less than 13 % of the world's tropical forests and their area and biodiversity are declining. In southern Africa, the major threat is increasing population pressure, while drou... Background: Tropical dry forests cover less than 13 % of the world's tropical forests and their area and biodiversity are declining. In southern Africa, the major threat is increasing population pressure, while drought caused by climate change is a potential threat in the drier transition zones to shrub land. Monitoring climate change impacts in these transition zones is difficult as there is inadequate information on forest composition to allow disentanglement from other environmental drivers. Methods: This study combined historical and modern forest inventories covering an area of 21,000 km2 in a transition zone in Namibia and Angola to distinguish late succession tree communities, to understand their dependence on site factors, and to detect trends in the forest composition over the last 40 years. Results: The woodlands were dominated by six tree species that represented 84 % of the total basal area and can be referred to as Bdikioea - Pterocarpus woodlands. A boosted regression tree analysis revealed that late succession tree communities are primarily determined by climate and topography. The Schinziophyton rautanenfi and Baikiaea plurijuga communities are common on slightly inclined dune or valley slopes and had the highest basal area (5.5 - 6.2 m^2 ha&-1). The Burkea africana - Guibourtia coleosperma and Pterocarpus angolensis - Diafium englerianum communities are typical for the sandy plateaux and have a higher proportion of smaller stems caused by a higher fire frequency. A decrease in overall basal area or a trend of increasing domination by the more drought and cold resilient B. africana community was not confirmed by the historical data, but there were significant decreases in basal area for Ochna pulchra and the valuable fruit tree D. englerianum. Conclusions: The slope communities are more sheltered from fire, frost and drought but are more susceptible to human expansion. The community with the important timber tree P. angolensis can best withstand high fire frequency but shows signs of a higher vulnerability to climate change. Conservation and climate adaptation strategies should include protection of the slope communities through refuges. Follow-up studies are needed on short term dynamics, especially near the edges of the transition zone towards shrub land. 展开更多
关键词 Baikiaea woodland Tree community Namibia boosted regression trees Pterocarpus ango/ensis Disturbance Miombo Ecoregion Climate change
在线阅读 下载PDF
Forecasting Flowering and Maturity Times of Barley Using Six Machine Learning Algorithms 被引量:1
18
作者 Mingyuan Cheng Mingchu Zhang 《Journal of Agricultural Science and Technology(B)》 2019年第6期373-391,共19页
Interior Alaska has a short growing season of 110 d.The knowledge of timings of crop flowering and maturity will provide the information for the agricultural decision making.In this study,six machine learning algorith... Interior Alaska has a short growing season of 110 d.The knowledge of timings of crop flowering and maturity will provide the information for the agricultural decision making.In this study,six machine learning algorithms,namely Linear Discriminant Analysis(LDA),Support Vector Machines(SVMs),k-nearest neighbor(kNN),Naïve Bayes(NB),Recursive Partitioning and Regression Trees(RPART),and Random Forest(RF),were selected to forecast the timings of barley flowering and maturity based on the Alaska Crop Datasets and climate data from 1991 to 2016 in Fairbanks,Alaska.Among 32 models fit to forecast flowering time,two from LDA,12 from SVMs,four from NB,three from RF outperformed models from other algorithms with the highest accuracy.Models from kNN performed worst to forecast flowering time.Among 32 models fit to forecast maturity time,two models from LDA outperformed the models from other algorithms.Models from kNN and RPART performed worst to forecast maturity time.Models from machine learning methods also provided a variable importance explanation.In this study,four out of six algorithms gave the same variable importance order.Sowing date was the most important variable to forecast flowering but less important variable to forecast maturity.The daily maximum temperature may be more important than daily minimum temperature to fit flowering models while daily minimum temperature may be more important than daily maximum temperature to fit maturity models.The results indicate that models from machine learning provide a promising technique in forecasting the timings of flowering and maturity of barley. 展开更多
关键词 Machine learning flowering and maturity Linear Discriminant Analysis Support Vector Machines k-nearest neighbor Naïve Bayes recursive partitioning regression trees Random Forest
在线阅读 下载PDF
Data-driven methods for predicting the representative temperature of bridge cable based on limited measured data
19
作者 WANG Fen DAI Gong-lian +2 位作者 HE Chang-lin GE Hao RAO Hui-ming 《Journal of Central South University》 SCIE EI CAS CSCD 2024年第9期3168-3186,共19页
Cable-stayed bridges have been widely used in high-speed railway infrastructure.The accurate determination of cable’s representative temperatures is vital during the intricate processes of design,construction,and mai... Cable-stayed bridges have been widely used in high-speed railway infrastructure.The accurate determination of cable’s representative temperatures is vital during the intricate processes of design,construction,and maintenance of cable-stayed bridges.However,the representative temperatures of stayed cables are not specified in the existing design codes.To address this issue,this study investigates the distribution of the cable temperature and determinates its representative temperature.First,an experimental investigation,spanning over a period of one year,was carried out near the bridge site to obtain the temperature data.According to the statistical analysis of the measured data,it reveals that the temperature distribution is generally uniform along the cable cross-section without significant temperature gradient.Then,based on the limited data,the Monte Carlo,the gradient boosted regression trees(GBRT),and univariate linear regression(ULR)methods are employed to predict the cable’s representative temperature throughout the service life.These methods effectively overcome the limitations of insufficient monitoring data and accurately predict the representative temperature of the cables.However,each method has its own advantages and limitations in terms of applicability and accuracy.A comprehensive evaluation of the performance of these methods is conducted,and practical recommendations are provided for their application.The proposed methods and representative temperatures provide a good basis for the operation and maintenance of in-service long-span cable-stayed bridges. 展开更多
关键词 cable-stayed bridges representative temperature gradient boosted regression trees(GBRT)method field test limited measured data
在线阅读 下载PDF
Modelling the Psychological Impact of COVID-19 in Saudi Arabia Using Machine Learning
20
作者 Mohammed A.Aleid Khaled A.Z.Alyamani +3 位作者 Mohieddine Rahmouni Theyazn H.H.Aldhyani Nizar Alsharif Mohammed Y.Alzahrani 《Computers, Materials & Continua》 SCIE EI 2021年第5期2029-2047,共19页
This article aims to assess health habits,safety behaviors,and anxiety factors in the community during the novel coronavirus disease(COVID-19)pandemic in Saudi Arabia based on primary data collected through a question... This article aims to assess health habits,safety behaviors,and anxiety factors in the community during the novel coronavirus disease(COVID-19)pandemic in Saudi Arabia based on primary data collected through a questionnaire with 320 respondents.In other words,this paper aims to provide empirical insights into the correlation and the correspondence between sociodemographic factors(gender,nationality,age,citizenship factors,income,and education),and psycho-behavioral effects on individuals in response to the emergence of this new pandemic.To focus on the interaction between these variables and their effects,we suggest different methods of analysis,comprising regression trees and support vector machine regression(SVMR)algorithms.According to the regression tree results,the age variable plays a predominant role in health habits,safety behaviors,and anxiety.The health habit index,which focuses on the extent of behavioral change toward the commitment to use the health and protection methods,is highly affected by gender and age factors.The average monthly income is also a relevant factor but has contrasting effects during the COVID-19 pandemic period.The results of the SVMR model reveal a strong positive effect of income,with R^(2) values of 99.59%,99.93%and 99.88%corresponding to health habits,safety behaviors,and anxiety. 展开更多
关键词 COVID-19 health habits safety behaviors ANXIETY support vector machine regression regression trees
在线阅读 下载PDF
上一页 1 2 5 下一页 到第
使用帮助 返回顶部