期刊文献+
共找到1,701篇文章
< 1 2 86 >
每页显示 20 50 100
Evaluation of the susceptibility to landslide geological disasters based on different slope units and an information content random forest model:a case study of the Longhua District,Shenzhen
1
作者 XIONG Haoyu RAN Xiangjin XUE Linfu 《Global Geology》 2026年第1期86-100,共15页
Slope units are divided according to the real topography and have clear geological characteristics,making them ideal units for evaluating the susceptibility to geological disasters.Based on the results of automaticall... Slope units are divided according to the real topography and have clear geological characteristics,making them ideal units for evaluating the susceptibility to geological disasters.Based on the results of automatically and manually corrected hydrological slope unit division,the Longhua District,Shenzhen City,Guangdong Province,was selected as the study area.A total of 15 influencing factors,namely Fluctuation,slope,slope aspect,curvature,topographic witness index(TWI),stream power index(SPI),topographic roughness index(TRI),annual average rainfall,distance to water system,engineering rock group,distance to fault,land use,normalized difference vegetation index(NDVI),nighttime light,and distance to road,were selected as evaluation indicators.The information volume model(IV)and random points were used to select non-geological disaster units,and then the random forest model(RF)was used to evaluate the susceptibility to geological disasters.The automatic slope unit and the hydrological slope unit were compared and analyzed in the random forest and information volume random forest models.The results show that the area under the curve(AUC)values of the automatic slope unit evaluation results are 0.931 for the IV-RF model and 0.716 for the RF model,which are 0.6%(IV-RF model)and 1.9%(RF model)higher than those for the hydrological slope unit.Based on a comparison of the evaluation methods based on the two types of slope units,the hydrological slope unit evaluation method based on manual correction is highly subjective,is complicated to operate,and has a low evaluation accuracy,whereas the evaluation method based on automatic slope unit division is efficient and accurate,is suitable for large-scale efficient geological disaster evaluation,and can better deal with the problem of geological disaster susceptibility evaluation. 展开更多
关键词 geological hazards slope unit information content random forest model susceptibility assessment SHENZHEN
在线阅读 下载PDF
AI-Driven Malware Detection with VGG Feature Extraction and Artificial Rabbits Optimized Random Forest Model
2
作者 Brij B.Gupta Akshat Gaurav +3 位作者 Wadee Alhalabi Varsha Arya Shavi Bansal Ching-Hsien Hsu 《Computers, Materials & Continua》 2025年第9期4755-4772,共18页
Detecting cyber attacks in networks connected to the Internet of Things(IoT)is of utmost importance because of the growing vulnerabilities in the smart environment.Conventional models,such as Naive Bayes and support v... Detecting cyber attacks in networks connected to the Internet of Things(IoT)is of utmost importance because of the growing vulnerabilities in the smart environment.Conventional models,such as Naive Bayes and support vector machine(SVM),as well as ensemble methods,such as Gradient Boosting and eXtreme gradient boosting(XGBoost),are often plagued by high computational costs,which makes it challenging for them to perform real-time detection.In this regard,we suggested an attack detection approach that integrates Visual Geometry Group 16(VGG16),Artificial Rabbits Optimizer(ARO),and Random Forest Model to increase detection accuracy and operational efficiency in Internet of Things(IoT)networks.In the suggested model,the extraction of features from malware pictures was accomplished with the help of VGG16.The prediction process is carried out by the random forest model using the extracted features from the VGG16.Additionally,ARO is used to improve the hyper-parameters of the random forest model of the random forest.With an accuracy of 96.36%,the suggested model outperforms the standard models in terms of accuracy,F1-score,precision,and recall.The comparative research highlights our strategy’s success,which improves performance while maintaining a lower computational cost.This method is ideal for real-time applications,but it is effective. 展开更多
关键词 Malware detection VGG feature extraction artificial rabbits OPTIMIZATION random forest model
在线阅读 下载PDF
Desertification status mapping in MuttumaWatershed by using Random Forest Model 被引量:1
3
作者 S.Dharumarajan Thomas F.A.Bishop 《Research in Cold and Arid Regions》 CSCD 2022年第1期32-42,共11页
Potential of the Random Forest Model on mapping of different desertification processes was studied in Muttuma watershed of mid-Murrumbidgee river region of New South Wales,Australia.Desertification vulnerability index... Potential of the Random Forest Model on mapping of different desertification processes was studied in Muttuma watershed of mid-Murrumbidgee river region of New South Wales,Australia.Desertification vulnerability index was developed using climate,terrain,vegetation,soil and land quality indices to identify environmentally sensitive areas for desertification.Random Forest Model(RFM)was used to predict the different desertification processes such as soil erosion,salinization and waterlogging in the watershed and the information needed to train classification algorithms was obtained from satellite imagery interpretation and ground truth data.Climatic factors(evaporation,rainfall,temperature),terrain factors(aspect,slope,slope length,steepness,and wetness index),soil properties(pH,organic carbon,clay and sand content)and vulnerability indices were used as an explanatory variable.Classification accuracy and kappa index were calculated for training and testing datasets.We recorded an overall accuracy rate of 87.7%and 72.1%for training and testing sites,respectively.We found larger discrepancies between overall accuracy rate and kappa index for testing datasets(72.2%and 27.5%,respectively)suggesting that all the classes are not predicted well.The prediction of soil erosion and no desertification process was good and poor for salinization and water-logging process.Overall,the results observed give a new idea of using the knowledge of desertification process in training areas that can be used to predict the desertification processes at unvisited areas. 展开更多
关键词 desertification processes vulnerability indices random forest model EXTRAPOLATION
在线阅读 下载PDF
Experimental study of population density using an optimized random forest model
4
作者 LI Lingling LIU Jinsong +3 位作者 LI Zhi WEN Peizhang LI Yancheng LIU Yi 《Journal of Geographical Sciences》 SCIE CSCD 2024年第8期1636-1656,共21页
Random forest model is the mainstream research method used to accurately describe the distribution law and impact mechanism of regional population.We took Shijiazhuang as the research area,with comprehensive zoning ba... Random forest model is the mainstream research method used to accurately describe the distribution law and impact mechanism of regional population.We took Shijiazhuang as the research area,with comprehensive zoning based on endowments as the modeling unit,conducted stratified sampling on a hectare grid cell,and systematically carried out incremental selection experiments of population density impact factors,optimizing the population density random forest model throughout the process(zonal modeling,stratified sampling,factor selection,weighted output).The results are as follows:(1)Zonal modeling addresses the issue of confusion in population distribution laws caused by a single model.Sampling on a grid cell not only ensures the quality of training data by avoiding the modifiable areal unit problem(MAUP)but also attempts to mitigate the adverse effects of the ecological fallacy.Stratified sampling ensures the stability of population density label values(target variable)in the training sample.(2)Zonal selection experiments on population density impact factors help identify suitable combinations of factors,leading to a significant improvement in the goodness of fit(R^(2))of the zonal models.(3)Weighted combination output of the population density prediction dataset substantially enhances the model's robustness.(4)The population density dataset exhibits multi-scale superposition characteristics.On a large scale,the population density in plains is higher than that in mountainous areas,while on a small scale,urban areas have higher density compared to rural areas.The optimization scheme for the population density random forest model that we propose offers a unified technical framework for uncovering local population distribution law and the impact mechanisms. 展开更多
关键词 population density random forest model endowment zones stratified sampling factor selection weighted output
原文传递
Identification of Mixtures of Two Types of Body Fluids Using the Multiplex Methylation System and Random Forest Models
5
作者 Han-xiao WANG Xiao-zhao LIU +3 位作者 Xi-miao HE Chao XIAO Dai-xin HUANG Shao-hua YI 《Current Medical Science》 SCIE CAS 2023年第5期908-918,共11页
Objective Body fluid mixtures are complex biological samples that frequently occur in crime scenes,and can provide important clues for criminal case analysis.DNA methylation assay has been applied in the identificatio... Objective Body fluid mixtures are complex biological samples that frequently occur in crime scenes,and can provide important clues for criminal case analysis.DNA methylation assay has been applied in the identification of human body fluids,and has exhibited excellent performance in predicting single-source body fluids.The present study aims to develop a methylation SNaPshot multiplex system for body fluid identification,and accurately predict the mixture samples.In addition,the value of DNA methylation in the prediction of body fluid mixtures was further explored.Methods In the present study,420 samples of body fluid mixtures and 250 samples of single body fluids were tested using an optimized multiplex methylation system.Each kind of body fluid sample presented the specific methylation profiles of the 10 markers.Results Significant differences in methylation levels were observed between the mixtures and single body fluids.For all kinds of mixtures,the Spearman’s correlation analysis revealed a significantly strong correlation between the methylation levels and component proportions(1:20,1:10,1:5,1:1,5:1,10:1 and 20:1).Two random forest classification models were trained for the prediction of mixture types and the prediction of the mixture proportion of 2 components,based on the methylation levels of 10 markers.For the mixture prediction,Model-1 presented outstanding prediction accuracy,which reached up to 99.3%in 427 training samples,and had a remarkable accuracy of 100%in 243 independent test samples.For the mixture proportion prediction,Model-2 demonstrated an excellent accuracy of 98.8%in 252 training samples,and 98.2%in 168 independent test samples.The total prediction accuracy reached 99.3%for body fluid mixtures and 98.6%for the mixture proportions.Conclusion These results indicate the excellent capability and powerful value of the multiplex methylation system in the identification of forensic body fluid mixtures. 展开更多
关键词 body fluid identification MIXTURE mixing ratio DNA methylation multiplex assay random forest model
暂未订购
Modeling the Spatial Distribution of Soil Heavy Metals Using Random Forest Model—A Case Study of Nairobi and Thirirka Rivers’ Confluence 被引量:2
6
作者 Evans Omondi Mark Boitt 《Journal of Geographic Information System》 2020年第6期597-619,共23页
Modeling the spatial distribution of soil heavy metals is important in determining the safety of contaminated soils for agricultural use. This study utilized 60 topsoil samples (0 - 30 cm), multispectral images (Senti... Modeling the spatial distribution of soil heavy metals is important in determining the safety of contaminated soils for agricultural use. This study utilized 60 topsoil samples (0 - 30 cm), multispectral images (Sentinel-2), spectral indices, and ancillary data to model the spatial distribution of heavy metals in the soils along the Nairobi River. The model was generated using the Random Forest package in R. Using R2 to assess the prediction accuracy, the Random Forest model generated satisfactory results for all the elements. It also ranked the variables in order of their importance in the overall prediction. Spectral indices were the most important variables within the rankings. From the predicted topsoil maps, there were high concentrations of Cadmium on the easterly end of the river. Cadmium is an impurity in detergents, and this section is in close proximity to the Nairobi water sewerage plant, which could be a direct source of Cadmium. Some farms had Zinc levels which were above the World Health Organization recommended limit. The Random Forest model performed satisfactorily. However, the predictions can be improved further if the spatial resolutions of the various variables are increased and through the addition of more predictor variables. 展开更多
关键词 random forest Sentinel 2 Heavy Metals Spectral Indices Spatial modeling
在线阅读 下载PDF
Modelling the Densities of Soil Organic Carbon,Total Nitrogen and Phosphorus Using Random Forest Model,and Their Spatial Distributions of Cultivated Lands in the YLN Region of Xizang
7
作者 SUN Wei LI Tianyu +8 位作者 LI Shaowei ZHA Xinjie HAN Fusong HUANG Shaolin Dorblha CHEN Chuhong Dawaqiongda Luobu FU Gang 《Journal of Resources and Ecology》 2025年第6期1842-1850,共9页
The“Yarlung Zangbo River,Lhasa River and Nyangqu River”(YLN)region is the main grain producing area on which the Tibetan people depend for survival.The densities of soil organic carbon(SOC),total nitrogen(TN)and tot... The“Yarlung Zangbo River,Lhasa River and Nyangqu River”(YLN)region is the main grain producing area on which the Tibetan people depend for survival.The densities of soil organic carbon(SOC),total nitrogen(TN)and total phosphorus(TP)in farmlands are closely related to grain production.Scientific management and regulation of these nutrient densities are of great significance for ensuring food security.However,accurate simulations of spatial variations in the densities of SOC(SOCD),TN(TND)and TP(TPD)and the spatial distributions of SOCD,TND and TPD are still unclear.In this study,388 samples of cultivated soils at 0–10 and 10–20 cm in the YLN region were collected to determine the SOC,TN,and TP contents,as well as pH and bulk density(BD).Random forest models of SOCD,TND and TPD were constructed using longitude,latitude,elevation,mean annual temperature,mean annual precipitation,mean annual radiation and vegetation index,which were then used to obtain the spatial distribution maps of SOCD,TND and TPD,and the storages of SOC(SOCS),TN(TNS)and TP(TPS).Mean annual radiation can partially explain the spatial variations of SOCD and TND,in addition to temperature and precipitation.The relative biases between modelled and observed SOCD,TND,TPD,SOCS,TNS and TPS ranged from–9.43%to 7.57%.The SOCD and TND increased from west to east,but they were both low in the middle and high in the north and south.The SOCD and TND decreased with increasing pH and BD.SOCD,TND and TPD were low at mid-elevations but high at low and high elevations.The SOCD,TND,TPD,SOCS,TNS and TPS were 2.72 kg m^(-2),0.30 kg m^(-2),0.18 kg m^(-2),4.88 Tg,0.54 Tg and 0.32 Tg,respectively,at 0–20 cm over the cultivated lands of the YLN region.Based on these results,the random forest models constructed in this study can be used for subsequent related studies.Besides warming and precipitation changes,radiation changes can also affect SOCD and TND.In terms of the production of food crops such as highland barley,the farmland soils in the YLN region currently can have relative deficiencies of nitrogen and phosphorus nutrients.In the future,measures such as increasing the application of organic fertilizers should be taken to improve the carbon sequestration capacity and nitrogen and phosphorus nutrition of the soil.These findings have important guiding significance for the fertilization management of cultivated lands in the YLN region and other alpine regions similar to the YLN region. 展开更多
关键词 carbon storage random forest model soil organic carbon density YLN
原文传递
A Hybrid Air Quality Prediction Method Based on VAR and Random Forest
8
作者 Minghao Yi Fuming Lin 《Journal of Computer and Communications》 2025年第2期142-154,共13页
To improve the efficiency of air quality analysis and the accuracy of predictions, this paper proposes a composite method based on Vector Autoregressive (VAR) and Random Forest (RF) models. In the theoretical section,... To improve the efficiency of air quality analysis and the accuracy of predictions, this paper proposes a composite method based on Vector Autoregressive (VAR) and Random Forest (RF) models. In the theoretical section, the model introduction and estimation algorithms are provided. In the empirical analysis section, global air quality data from 2022 to 2024 are used, and the proposed method is applied. Specifically, principal component analysis (PCA) is first conducted, and then VAR and Random Forest methods are used for prediction on the reduced-dimensional data. The results show that the RMSE of the hybrid model is 45.27, significantly lower than the 49.11 of the VAR model alone, verifying its superiority. The stability and predictive performance of the model are effectively enhanced. 展开更多
关键词 Var model Principal Component Analysis random forest model
在线阅读 下载PDF
Mapping landslide susceptibility at the Three Gorges Reservoir, China, using gradient boosting decision tree,random forest and information value models 被引量:14
9
作者 CHEN Tao ZHU Li +3 位作者 NIU Rui-qing TRINDER C John PENG Ling LEI Tao 《Journal of Mountain Science》 SCIE CSCD 2020年第3期670-685,共16页
This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting de... This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting decision tree(GBDT), random forest(RF) and information value(InV) models, were used, and the performances were assessed and compared. In total, 202 landslides were mapped by using a series of field surveys, aerial photographs, and reviews of historical and bibliographical data. Nine causative factors were then considered in landslide susceptibility map generation by using the GBDT, RF and InV models. All of the maps of the causative factors were resampled to a resolution of 28.5 m. Of the 486289 pixels in the area,28526 pixels were landslide pixels, and 457763 pixels were non-landslide pixels. Finally, landslide susceptibility maps were generated by using the three machine learning models, and their performances were assessed through receiver operating characteristic(ROC) curves, the sensitivity, specificity,overall accuracy(OA), and kappa coefficient(KAPPA). The results showed that the GBDT, RF and In V models in overall produced reasonable accurate landslide susceptibility maps. Among these three methods, the GBDT method outperforms the other two machine learning methods, which can provide strong technical support for producing landslide susceptibility maps in TGR. 展开更多
关键词 MAPPING LANDSLIDE SUSCEPTIBILITY Gradient BOOSTING DECISION tree random forest Information value model Three Gorges Reservoir
原文传递
Random forest algorithm and regional applications of spectral inversion model for estimating canopy nitrogen concentration in rice 被引量:1
10
作者 LI Xuqing LIU Xiangnan LIU Meiling WU Ling 《遥感学报》 CSCD 北大核心 2014年第4期923-945,共23页
原文传递
A Data-Driven Car-Following Model Based on the Random Forest
11
作者 Huili Shi Tingli Wang +3 位作者 Fusheng Zhong Hanqing Wang Junyan Han Xiaoyuan Wang 《World Journal of Engineering and Technology》 2021年第3期503-515,共13页
The car-following models are the research basis of traffic flow theory and microscopic traffic simulation. Among the previous work, the theory-driven models are dominant, while the data-driven ones are relatively rare... The car-following models are the research basis of traffic flow theory and microscopic traffic simulation. Among the previous work, the theory-driven models are dominant, while the data-driven ones are relatively rare. In recent years, the related technologies of Intelligent Transportation System (ITS) re</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">presented by the Vehicles to Everything (V2X) technology have been developing rapidly. Utilizing the related technologies of ITS, the large-scale vehicle microscopic trajectory data with high quality can be acquired, which provides the research foundation for modeling the car-following behavior based on the data-driven methods. According to this point, a data-driven car-following model based on the Random Forest (RF) method was constructed in this work, and the Next Generation Simulation (NGSIM) dataset was used to calibrate and train the constructed model. The Artificial Neural Network (ANN) model, GM model, and Full Velocity Difference (FVD) model are em</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">ployed to comparatively verify the proposed model. The research results suggest that the model proposed in this work can accurately describe the car-</span><span style="font-family:Verdana;"> </span><span style="font-family:Verdana;">following behavior with better performance under multiple performance indicators. 展开更多
关键词 Traffic Flow Car-Following model Data-Driven Method random forest Intelligent Transportation System
在线阅读 下载PDF
Linear and Nonlinear Trading Models with Gradient Boosted Random Forests and Application to Singapore Stock Market
12
作者 Qin Qin Qing-Guo Wang +1 位作者 Jin Li Shuzhi Sam Ge 《Journal of Intelligent Learning Systems and Applications》 2013年第1期1-10,共10页
This paper presents new trading models for the stock market and test whether they are able to consistently generate excess returns from the Singapore Exchange (SGX). Instead of conventional ways of modeling stock pric... This paper presents new trading models for the stock market and test whether they are able to consistently generate excess returns from the Singapore Exchange (SGX). Instead of conventional ways of modeling stock prices, we construct models which relate the market indicators to a trading decision directly. Furthermore, unlike a reversal trading system or a binary system of buy and sell, we allow three modes of trades, namely, buy, sell or stand by, and the stand-by case is important as it caters to the market conditions where a model does not produce a strong signal of buy or sell. Linear trading models are firstly developed with the scoring technique which weights higher on successful indicators, as well as with the Least Squares technique which tries to match the past perfect trades with its weights. The linear models are then made adaptive by using the forgetting factor to address market changes. Because stock markets could be highly nonlinear sometimes, the Random Forest is adopted as a nonlinear trading model, and improved with Gradient Boosting to form a new technique—Gradient Boosted Random Forest. All the models are trained and evaluated on nine stocks and one index, and statistical tests such as randomness, linear and nonlinear correlations are conducted on the data to check the statistical significance of the inputs and their relation with the output before a model is trained. Our empirical results show that the proposed trading methods are able to generate excess returns compared with the buy-and-hold strategy. 展开更多
关键词 Stock modeling SCORING TECHNIQUE Least Square TECHNIQUE random forest GRADIENT Boosted random forest
暂未订购
Understory terrain estimation using multi-source remote sensing data under different forest-type conditions
13
作者 HUANG Jia-Peng FAN Qing-Nan ZHANG Yue 《红外与毫米波学报》 北大核心 2025年第6期919-932,共14页
Accurate estimation of understory terrain has significant scientific importance for maintaining ecosystem balance and biodiversity conservation.Addressing the issue of inadequate representation of spatial heterogeneit... Accurate estimation of understory terrain has significant scientific importance for maintaining ecosystem balance and biodiversity conservation.Addressing the issue of inadequate representation of spatial heterogeneity when traditional forest topographic inversion methods consider the entire forest as the inversion unit,this study pro⁃poses a differentiated modeling approach to forest types based on refined land cover classification.Taking Puerto Ri⁃co and Maryland as study areas,a multi-dimensional feature system is constructed by integrating multi-source re⁃mote sensing data:ICESat-2 spaceborne LiDAR is used to obtain benchmark values for understory terrain,topo⁃graphic factors such as slope and aspect are extracted based on SRTM data,and vegetation cover characteristics are analyzed using Landsat-8 multispectral imagery.This study incorporates forest type as a classification modeling con⁃dition and applies the random forest algorithm to build differentiated topographic inversion models.Experimental re⁃sults indicate that,compared to traditional whole-area modeling methods(RMSE=5.06 m),forest type-based classi⁃fication modeling significantly improves the accuracy of understory terrain estimation(RMSE=2.94 m),validating the effectiveness of spatial heterogeneity modeling.Further sensitivity analysis reveals that canopy structure parame⁃ters(with RMSE variation reaching 4.11 m)exert a stronger regulatory effect on estimation accuracy compared to forest cover,providing important theoretical support for optimizing remote sensing models of forest topography. 展开更多
关键词 understory terrain forest type multi-source remote sensing data random forest model
在线阅读 下载PDF
Mixed-effects modeling for tree height prediction models of Oriental beech in the Hyrcanian forests 被引量:9
14
作者 Siavash Kalbi Asghar Fallah +2 位作者 Pete Bettinger Shaban Shataee Rassoul Yousefpour 《Journal of Forestry Research》 SCIE CAS CSCD 2018年第5期1195-1204,共10页
Height–diameter relationships are essential elements of forest assessment and modeling efforts.In this work,two linear and eighteen nonlinear height–diameter equations were evaluated to find a local model for Orient... Height–diameter relationships are essential elements of forest assessment and modeling efforts.In this work,two linear and eighteen nonlinear height–diameter equations were evaluated to find a local model for Oriental beech(Fagus orientalis Lipsky) in the Hyrcanian Forest in Iran.The predictive performance of these models was first assessed by different evaluation criteria: adjusted R^2(R^2_(adj)),root mean square error(RMSE),relative RMSE(%RMSE),bias,and relative bias(%bias) criteria.The best model was selected for use as the base mixed-effects model.Random parameters for test plots were estimated with different tree selection options.Results show that the Chapman–Richards model had better predictive ability in terms of adj R^2(0.81),RMSE(3.7 m),%RMSE(12.9),bias(0.8),%Bias(2.79) than the other models.Furthermore,the calibration response,based on a selection of four trees from the sample plots,resulted in a reduction percentage for bias and RMSE of about 1.6–2.7%.Our results indicate that the calibrated model produced the most accurate results. 展开更多
关键词 random effects Tree height CALIBRATION Sangdeh forest Chapman–Richards model Oriental beech
在线阅读 下载PDF
多中心机器学习构建预测潜在器官捐献者的模型与决策曲线验证研究
15
作者 王旭 李文秀 +5 位作者 王凤华 吴淑莉 贾栋 葛鑫 单志华 李峒作 《器官移植》 北大核心 2026年第1期106-115,共10页
目的评估在多中心环境下构建的不同机器学习模型对潜在器官捐献者的预测价值并验证其临床应用可行性。方法研究纳入国内5家三级甲等医院在2020年1月至2023年12月收治的2000例符合潜在器官捐献评估标准的住院患者,随机分为训练集和内部... 目的评估在多中心环境下构建的不同机器学习模型对潜在器官捐献者的预测价值并验证其临床应用可行性。方法研究纳入国内5家三级甲等医院在2020年1月至2023年12月收治的2000例符合潜在器官捐献评估标准的住院患者,随机分为训练集和内部验证集(7∶3),另纳入2024年1月至2025年4月在哈尔滨医科大学附属第一医院收治的300例同类患者作为外部验证集。比较3种模型的曲线下面积(AUC)、灵敏度、特异度、准确率、F1-score,并对潜在器官捐献者判定流程一致性进行检验。采用多因素logistic回归分析潜在器官捐献者的预测因素,利用决策曲线分析(DCA)验证各模型的资源效益,评估阈值区间与干预平衡点。结果各中心除年龄外其他基本特征差异均无统计学意义(均为P>0.05),各中心研究者潜在器官捐献者判定流程间一致性良好[均为95%可信区间(CI)下限>0]。内部验证集中,XGBoost模型的预测性能最佳(AUC=0.92,95%CI0.89~0.94)且校准最佳(P=0.441,Brier分数0.099);外部验证集中,XGBoost模型的预测性能最佳(AUC=0.91,95%CI 0.88~0.94),均优于logistic回归与随机森林。多因素logistic回归显示使用机械通气影响最大(比值比=2.06,95%CI 1.54~2.76,P<0.001)。DCA显示XGBoost模型在0.2~0.6阈值区间净获益最高,“全部干预”策略仅在极低阈值略占优势,推荐阈值区间兼顾≥50%PPV与≤50例/100例高危患者转介量,可平衡干预成本与临床受益。结论多中心环境下建立的XGBoost模型在预测潜在器官捐献者方面准确率与校准度均较理想,结合DCA可有效指导临床干预时机与资源分配,为脑死亡后器官捐献评估与管理提供新思路。 展开更多
关键词 多中心机器学习 潜在器官捐献者 预测模型 决策曲线分析 极端梯度提升树 器官捐献评估 LOGISTIC回归模型 随机森林模型
暂未订购
基于Landsat影像的大型水体水下地形分区反演
16
作者 窦明 史玉仙 +2 位作者 屈凌波 王继华 邢澳琪 《郑州大学学报(工学版)》 北大核心 2026年第2期128-135,共8页
针对缺资料大型水体水下地形资料获取困难的问题,以丹江口水库为研究对象,提出了一种基于Landsat遥感影像和水深分区的大型水体水下地形反演方法,分别采用水位线克里金插值法和4种水深反演模型(单波段、双波段比值、BP神经网络、多波段... 针对缺资料大型水体水下地形资料获取困难的问题,以丹江口水库为研究对象,提出了一种基于Landsat遥感影像和水深分区的大型水体水下地形反演方法,分别采用水位线克里金插值法和4种水深反演模型(单波段、双波段比值、BP神经网络、多波段随机森林)对丹江口水库浅水区和深水区水下地形进行反演,并评价其反演精度。结果显示,浅水区水下地形反演效果良好(均方根误差RMSE=2.553 m);深水区反演中,汉库水域采用多波段随机森林模型表现最佳(RMSE=2.428 m),丹库水域采用BP神经网络模型表现最佳(RMSE=1.599 m);不同反演模型精度针对不同水深和不同区域具有差异性,多波段随机森林模型在深水域水下地形反演上存在优势。研究结果可为缺资料大型水体提供一种快捷的地形资料收集方法。 展开更多
关键词 水下地形反演 Landsat遥感影像 BP神经网络模型 多波段随机森林模型 丹江口水库
在线阅读 下载PDF
基于机器学习算法的雷州半岛桉树复层混交林土壤呼吸模拟
17
作者 竹万宽 王志超 +4 位作者 许宇星 黄润霞 陶怡 钟源源 杜阿朋 《林业科学》 北大核心 2026年第1期67-82,共16页
【目的】利用桉树复层混交林固定样地土壤呼吸及其1年期环境因子连续观测数据,构建并筛选多因子土壤呼吸预测模型,明确影响该地区人工林土壤呼吸时空变异的关键环境因素,为提升人工林碳排放模拟精度及大尺度预测模型的校准提供科学依据... 【目的】利用桉树复层混交林固定样地土壤呼吸及其1年期环境因子连续观测数据,构建并筛选多因子土壤呼吸预测模型,明确影响该地区人工林土壤呼吸时空变异的关键环境因素,为提升人工林碳排放模拟精度及大尺度预测模型的校准提供科学依据。【方法】以雷州半岛桉树-灰木莲复层混交林为研究对象,引入6种机器学习算法(随机森林、时间卷积神经网络、长短期记忆网络、支持向量机回归、极限学习机、BP神经网络)和2种传统经验模型(Q10模型、Gamma模型),在1 h和24 h尺度上模拟土壤呼吸变化,比较模型精度评价指标,筛选适合研究区的最优模型算法。【结果】桉树复层混交林土壤呼吸表现为雨季高于旱季,土壤呼吸累积通量在雨季为616.83 g·m^(-2),在旱季为319.81 g·m^(-2),全年为936.64 g·m^(-2),旱季土壤呼吸波动程度高于雨季。6种机器学习算法和2种经验模型均能成功模拟桉树复层混交林土壤呼吸变化,但机器学习模型模拟结果明显优于经验模型。机器学习算法中随机森林模型表现最稳定,当输入变量为土壤温、湿度双自变量时,决定系数R^(2)为0.89(训练集)和0.76(测试集),当输入变量增加土壤电导率、土壤热通量、空气温度、空气相对湿度、太阳总辐射、光合有效辐射后,模型决定系数R^(2)提高至0.99(训练集)和0.93(测试集)。除土壤温、湿度外,土壤电导率对土壤呼吸变化具有显著影响。【结论】桉树复层混交林土壤呼吸具有明显的旱雨季变化特征,机器学习算法相比于传统经验模型在预测土壤呼吸变化时更具优势,其中随机森林模型表现最佳;通过增加土壤电导率等输入变量能大幅提高随机森林模型的预测能力,考虑增加这些因素能更好地预测土壤呼吸的变化,为评估人工林碳收支状况提供可靠依据。 展开更多
关键词 土壤呼吸 预测模型 随机森林 桉树 复层混交林
在线阅读 下载PDF
基于可解释随机森林的多类别交通事故风险研判
18
作者 董春娇 万雨杰 李鹏辉 《北京工业大学学报》 北大核心 2026年第3期315-323,共9页
为了探究影响因素对不同事故类别的影响程度,考虑道路条件、天气状况和交通流状态三方面因素,采用网格搜索方法确定超参数的最优参数,建立了基于随机森林的道路交通事故风险研判模型,研判是车车碰撞还是人车碰撞事故风险,是受伤事故还... 为了探究影响因素对不同事故类别的影响程度,考虑道路条件、天气状况和交通流状态三方面因素,采用网格搜索方法确定超参数的最优参数,建立了基于随机森林的道路交通事故风险研判模型,研判是车车碰撞还是人车碰撞事故风险,是受伤事故还是死亡事故风险。为了量化影响因素对事故风险研判结果的贡献,提出基于SHAP(Shapley additive explanations)的交通事故风险致因解释方法。利用北京市京开高速和南六环等路段的事故数据对构建的模型方法进行参数标定和测试,并与传统随机森林模型、逻辑回归模型和支持向量机(support vector machines,SVM)模型进行了对比。研究结果表明:构建的模型在人车碰撞事故风险研判上表现最优,有较高的测试精度,召回率(recall,REC)相较传统随机森林模型、逻辑回归模型和支持向量机模型分别有30%、40%和40%的提升;在总体交通事故和受伤事故风险研判上表现次之,相较对比模型分别提升约20%、10%和10%;在车车碰撞事故上相较逻辑回归模型有30%的提升;而在死亡事故上无显著提升。影响因素中,当前车道车头间距、时间占用率、降水等对总体事故风险研判分别有约30%、30%和10%的影响;而在各类细分事故上降水是主导因素,其次才是车道车头间距、时间占用率因素。 展开更多
关键词 交通安全 交通事故 风险研判 随机森林 模型可解释性 多源数据
在线阅读 下载PDF
基于AHC-PSO-RF代理模型的大型集装箱船参数横摇运动快速预报
19
作者 孙强 谭杰 周耀华 《中国舰船研究》 北大核心 2026年第1期104-113,共10页
[目的]针对传统基于水动力学的数值模拟方法计算船舶参数横摇存在计算成本高、操作要求高且无法覆盖所有装载工况等问题,提出一种融合特征物理化重构、凝聚层次聚类(AHC)与改进随机森林(RF)的集成机器学习替代模型,用于高效预测船舶参... [目的]针对传统基于水动力学的数值模拟方法计算船舶参数横摇存在计算成本高、操作要求高且无法覆盖所有装载工况等问题,提出一种融合特征物理化重构、凝聚层次聚类(AHC)与改进随机森林(RF)的集成机器学习替代模型,用于高效预测船舶参数横摇幅值。[方法]利用AHC压缩特征维度,降低模型复杂度和计算开销;采用粒子群算法(PSO)对RF超参数进行全局寻优。[结果]基于某大型集装箱船多工况水动力数值模拟结果数据的验证结果表明:与广义回归神经网络(GRNN)及未优化RF模型相比,在迎浪和艉随浪工况下,该模型(AHC–PSO–RF)在横摇有义值预测中的决定系数(R2)平均提升5.84%与0.27%,均方根误差(RMSE)平均降低59.28%与10.69%,预测精度较高。此外,模型在单个装载工况的平均计算耗时相比于水动力数值模拟方法减少84.5%。[结论]该模型在批量预测任务中具备显著效率优势,证明了其作为高效替代方案的工程实用价值。 展开更多
关键词 船舶稳性 参数横摇 代理模型 凝聚层次聚类−粒子群优化−随机森林模型 集装箱船 动稳性预报
在线阅读 下载PDF
上一页 1 2 86 下一页 到第
使用帮助 返回顶部