期刊文献+
共找到1,636篇文章
< 1 2 82 >
每页显示 20 50 100
AI-Driven Malware Detection with VGG Feature Extraction and Artificial Rabbits Optimized Random Forest Model
1
作者 Brij B.Gupta Akshat Gaurav +3 位作者 Wadee Alhalabi Varsha Arya Shavi Bansal Ching-Hsien Hsu 《Computers, Materials & Continua》 2025年第9期4755-4772,共18页
Detecting cyber attacks in networks connected to the Internet of Things(IoT)is of utmost importance because of the growing vulnerabilities in the smart environment.Conventional models,such as Naive Bayes and support v... Detecting cyber attacks in networks connected to the Internet of Things(IoT)is of utmost importance because of the growing vulnerabilities in the smart environment.Conventional models,such as Naive Bayes and support vector machine(SVM),as well as ensemble methods,such as Gradient Boosting and eXtreme gradient boosting(XGBoost),are often plagued by high computational costs,which makes it challenging for them to perform real-time detection.In this regard,we suggested an attack detection approach that integrates Visual Geometry Group 16(VGG16),Artificial Rabbits Optimizer(ARO),and Random Forest Model to increase detection accuracy and operational efficiency in Internet of Things(IoT)networks.In the suggested model,the extraction of features from malware pictures was accomplished with the help of VGG16.The prediction process is carried out by the random forest model using the extracted features from the VGG16.Additionally,ARO is used to improve the hyper-parameters of the random forest model of the random forest.With an accuracy of 96.36%,the suggested model outperforms the standard models in terms of accuracy,F1-score,precision,and recall.The comparative research highlights our strategy’s success,which improves performance while maintaining a lower computational cost.This method is ideal for real-time applications,but it is effective. 展开更多
关键词 Malware detection VGG feature extraction artificial rabbits OPTIMIZATION random forest model
在线阅读 下载PDF
Desertification status mapping in MuttumaWatershed by using Random Forest Model 被引量:1
2
作者 S.Dharumarajan Thomas F.A.Bishop 《Research in Cold and Arid Regions》 CSCD 2022年第1期32-42,共11页
Potential of the Random Forest Model on mapping of different desertification processes was studied in Muttuma watershed of mid-Murrumbidgee river region of New South Wales,Australia.Desertification vulnerability index... Potential of the Random Forest Model on mapping of different desertification processes was studied in Muttuma watershed of mid-Murrumbidgee river region of New South Wales,Australia.Desertification vulnerability index was developed using climate,terrain,vegetation,soil and land quality indices to identify environmentally sensitive areas for desertification.Random Forest Model(RFM)was used to predict the different desertification processes such as soil erosion,salinization and waterlogging in the watershed and the information needed to train classification algorithms was obtained from satellite imagery interpretation and ground truth data.Climatic factors(evaporation,rainfall,temperature),terrain factors(aspect,slope,slope length,steepness,and wetness index),soil properties(pH,organic carbon,clay and sand content)and vulnerability indices were used as an explanatory variable.Classification accuracy and kappa index were calculated for training and testing datasets.We recorded an overall accuracy rate of 87.7%and 72.1%for training and testing sites,respectively.We found larger discrepancies between overall accuracy rate and kappa index for testing datasets(72.2%and 27.5%,respectively)suggesting that all the classes are not predicted well.The prediction of soil erosion and no desertification process was good and poor for salinization and water-logging process.Overall,the results observed give a new idea of using the knowledge of desertification process in training areas that can be used to predict the desertification processes at unvisited areas. 展开更多
关键词 desertification processes vulnerability indices random forest model EXTRAPOLATION
在线阅读 下载PDF
Experimental study of population density using an optimized random forest model
3
作者 LI Lingling LIU Jinsong +3 位作者 LI Zhi WEN Peizhang LI Yancheng LIU Yi 《Journal of Geographical Sciences》 SCIE CSCD 2024年第8期1636-1656,共21页
Random forest model is the mainstream research method used to accurately describe the distribution law and impact mechanism of regional population.We took Shijiazhuang as the research area,with comprehensive zoning ba... Random forest model is the mainstream research method used to accurately describe the distribution law and impact mechanism of regional population.We took Shijiazhuang as the research area,with comprehensive zoning based on endowments as the modeling unit,conducted stratified sampling on a hectare grid cell,and systematically carried out incremental selection experiments of population density impact factors,optimizing the population density random forest model throughout the process(zonal modeling,stratified sampling,factor selection,weighted output).The results are as follows:(1)Zonal modeling addresses the issue of confusion in population distribution laws caused by a single model.Sampling on a grid cell not only ensures the quality of training data by avoiding the modifiable areal unit problem(MAUP)but also attempts to mitigate the adverse effects of the ecological fallacy.Stratified sampling ensures the stability of population density label values(target variable)in the training sample.(2)Zonal selection experiments on population density impact factors help identify suitable combinations of factors,leading to a significant improvement in the goodness of fit(R^(2))of the zonal models.(3)Weighted combination output of the population density prediction dataset substantially enhances the model's robustness.(4)The population density dataset exhibits multi-scale superposition characteristics.On a large scale,the population density in plains is higher than that in mountainous areas,while on a small scale,urban areas have higher density compared to rural areas.The optimization scheme for the population density random forest model that we propose offers a unified technical framework for uncovering local population distribution law and the impact mechanisms. 展开更多
关键词 population density random forest model endowment zones stratified sampling factor selection weighted output
原文传递
Identification of Mixtures of Two Types of Body Fluids Using the Multiplex Methylation System and Random Forest Models
4
作者 Han-xiao WANG Xiao-zhao LIU +3 位作者 Xi-miao HE Chao XIAO Dai-xin HUANG Shao-hua YI 《Current Medical Science》 SCIE CAS 2023年第5期908-918,共11页
Objective Body fluid mixtures are complex biological samples that frequently occur in crime scenes,and can provide important clues for criminal case analysis.DNA methylation assay has been applied in the identificatio... Objective Body fluid mixtures are complex biological samples that frequently occur in crime scenes,and can provide important clues for criminal case analysis.DNA methylation assay has been applied in the identification of human body fluids,and has exhibited excellent performance in predicting single-source body fluids.The present study aims to develop a methylation SNaPshot multiplex system for body fluid identification,and accurately predict the mixture samples.In addition,the value of DNA methylation in the prediction of body fluid mixtures was further explored.Methods In the present study,420 samples of body fluid mixtures and 250 samples of single body fluids were tested using an optimized multiplex methylation system.Each kind of body fluid sample presented the specific methylation profiles of the 10 markers.Results Significant differences in methylation levels were observed between the mixtures and single body fluids.For all kinds of mixtures,the Spearman’s correlation analysis revealed a significantly strong correlation between the methylation levels and component proportions(1:20,1:10,1:5,1:1,5:1,10:1 and 20:1).Two random forest classification models were trained for the prediction of mixture types and the prediction of the mixture proportion of 2 components,based on the methylation levels of 10 markers.For the mixture prediction,Model-1 presented outstanding prediction accuracy,which reached up to 99.3%in 427 training samples,and had a remarkable accuracy of 100%in 243 independent test samples.For the mixture proportion prediction,Model-2 demonstrated an excellent accuracy of 98.8%in 252 training samples,and 98.2%in 168 independent test samples.The total prediction accuracy reached 99.3%for body fluid mixtures and 98.6%for the mixture proportions.Conclusion These results indicate the excellent capability and powerful value of the multiplex methylation system in the identification of forensic body fluid mixtures. 展开更多
关键词 body fluid identification MIXTURE mixing ratio DNA methylation multiplex assay random forest model
暂未订购
Modeling the Spatial Distribution of Soil Heavy Metals Using Random Forest Model—A Case Study of Nairobi and Thirirka Rivers’ Confluence 被引量:1
5
作者 Evans Omondi Mark Boitt 《Journal of Geographic Information System》 2020年第6期597-619,共23页
Modeling the spatial distribution of soil heavy metals is important in determining the safety of contaminated soils for agricultural use. This study utilized 60 topsoil samples (0 - 30 cm), multispectral images (Senti... Modeling the spatial distribution of soil heavy metals is important in determining the safety of contaminated soils for agricultural use. This study utilized 60 topsoil samples (0 - 30 cm), multispectral images (Sentinel-2), spectral indices, and ancillary data to model the spatial distribution of heavy metals in the soils along the Nairobi River. The model was generated using the Random Forest package in R. Using R2 to assess the prediction accuracy, the Random Forest model generated satisfactory results for all the elements. It also ranked the variables in order of their importance in the overall prediction. Spectral indices were the most important variables within the rankings. From the predicted topsoil maps, there were high concentrations of Cadmium on the easterly end of the river. Cadmium is an impurity in detergents, and this section is in close proximity to the Nairobi water sewerage plant, which could be a direct source of Cadmium. Some farms had Zinc levels which were above the World Health Organization recommended limit. The Random Forest model performed satisfactorily. However, the predictions can be improved further if the spatial resolutions of the various variables are increased and through the addition of more predictor variables. 展开更多
关键词 random forest Sentinel 2 Heavy Metals Spectral Indices Spatial modeling
在线阅读 下载PDF
Modelling the Densities of Soil Organic Carbon,Total Nitrogen and Phosphorus Using Random Forest Model,and Their Spatial Distributions of Cultivated Lands in the YLN Region of Xizang
6
作者 SUN Wei LI Tianyu +8 位作者 LI Shaowei ZHA Xinjie HAN Fusong HUANG Shaolin Dorblha CHEN Chuhong Dawaqiongda Luobu FU Gang 《Journal of Resources and Ecology》 2025年第6期1842-1850,共9页
The“Yarlung Zangbo River,Lhasa River and Nyangqu River”(YLN)region is the main grain producing area on which the Tibetan people depend for survival.The densities of soil organic carbon(SOC),total nitrogen(TN)and tot... The“Yarlung Zangbo River,Lhasa River and Nyangqu River”(YLN)region is the main grain producing area on which the Tibetan people depend for survival.The densities of soil organic carbon(SOC),total nitrogen(TN)and total phosphorus(TP)in farmlands are closely related to grain production.Scientific management and regulation of these nutrient densities are of great significance for ensuring food security.However,accurate simulations of spatial variations in the densities of SOC(SOCD),TN(TND)and TP(TPD)and the spatial distributions of SOCD,TND and TPD are still unclear.In this study,388 samples of cultivated soils at 0–10 and 10–20 cm in the YLN region were collected to determine the SOC,TN,and TP contents,as well as pH and bulk density(BD).Random forest models of SOCD,TND and TPD were constructed using longitude,latitude,elevation,mean annual temperature,mean annual precipitation,mean annual radiation and vegetation index,which were then used to obtain the spatial distribution maps of SOCD,TND and TPD,and the storages of SOC(SOCS),TN(TNS)and TP(TPS).Mean annual radiation can partially explain the spatial variations of SOCD and TND,in addition to temperature and precipitation.The relative biases between modelled and observed SOCD,TND,TPD,SOCS,TNS and TPS ranged from–9.43%to 7.57%.The SOCD and TND increased from west to east,but they were both low in the middle and high in the north and south.The SOCD and TND decreased with increasing pH and BD.SOCD,TND and TPD were low at mid-elevations but high at low and high elevations.The SOCD,TND,TPD,SOCS,TNS and TPS were 2.72 kg m^(-2),0.30 kg m^(-2),0.18 kg m^(-2),4.88 Tg,0.54 Tg and 0.32 Tg,respectively,at 0–20 cm over the cultivated lands of the YLN region.Based on these results,the random forest models constructed in this study can be used for subsequent related studies.Besides warming and precipitation changes,radiation changes can also affect SOCD and TND.In terms of the production of food crops such as highland barley,the farmland soils in the YLN region currently can have relative deficiencies of nitrogen and phosphorus nutrients.In the future,measures such as increasing the application of organic fertilizers should be taken to improve the carbon sequestration capacity and nitrogen and phosphorus nutrition of the soil.These findings have important guiding significance for the fertilization management of cultivated lands in the YLN region and other alpine regions similar to the YLN region. 展开更多
关键词 carbon storage random forest model soil organic carbon density YLN
原文传递
A Hybrid Air Quality Prediction Method Based on VAR and Random Forest
7
作者 Minghao Yi Fuming Lin 《Journal of Computer and Communications》 2025年第2期142-154,共13页
To improve the efficiency of air quality analysis and the accuracy of predictions, this paper proposes a composite method based on Vector Autoregressive (VAR) and Random Forest (RF) models. In the theoretical section,... To improve the efficiency of air quality analysis and the accuracy of predictions, this paper proposes a composite method based on Vector Autoregressive (VAR) and Random Forest (RF) models. In the theoretical section, the model introduction and estimation algorithms are provided. In the empirical analysis section, global air quality data from 2022 to 2024 are used, and the proposed method is applied. Specifically, principal component analysis (PCA) is first conducted, and then VAR and Random Forest methods are used for prediction on the reduced-dimensional data. The results show that the RMSE of the hybrid model is 45.27, significantly lower than the 49.11 of the VAR model alone, verifying its superiority. The stability and predictive performance of the model are effectively enhanced. 展开更多
关键词 Var model Principal Component Analysis random forest model
在线阅读 下载PDF
Mapping landslide susceptibility at the Three Gorges Reservoir, China, using gradient boosting decision tree,random forest and information value models 被引量:14
8
作者 CHEN Tao ZHU Li +3 位作者 NIU Rui-qing TRINDER C John PENG Ling LEI Tao 《Journal of Mountain Science》 SCIE CSCD 2020年第3期670-685,共16页
This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting de... This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting decision tree(GBDT), random forest(RF) and information value(InV) models, were used, and the performances were assessed and compared. In total, 202 landslides were mapped by using a series of field surveys, aerial photographs, and reviews of historical and bibliographical data. Nine causative factors were then considered in landslide susceptibility map generation by using the GBDT, RF and InV models. All of the maps of the causative factors were resampled to a resolution of 28.5 m. Of the 486289 pixels in the area,28526 pixels were landslide pixels, and 457763 pixels were non-landslide pixels. Finally, landslide susceptibility maps were generated by using the three machine learning models, and their performances were assessed through receiver operating characteristic(ROC) curves, the sensitivity, specificity,overall accuracy(OA), and kappa coefficient(KAPPA). The results showed that the GBDT, RF and In V models in overall produced reasonable accurate landslide susceptibility maps. Among these three methods, the GBDT method outperforms the other two machine learning methods, which can provide strong technical support for producing landslide susceptibility maps in TGR. 展开更多
关键词 MAPPING LANDSLIDE SUSCEPTIBILITY Gradient BOOSTING DECISION tree random forest Information value model Three Gorges Reservoir
原文传递
Random forest algorithm and regional applications of spectral inversion model for estimating canopy nitrogen concentration in rice 被引量:1
9
作者 LI Xuqing LIU Xiangnan LIU Meiling WU Ling 《遥感学报》 CSCD 北大核心 2014年第4期923-945,共23页
原文传递
A Data-Driven Car-Following Model Based on the Random Forest
10
作者 Huili Shi Tingli Wang +3 位作者 Fusheng Zhong Hanqing Wang Junyan Han Xiaoyuan Wang 《World Journal of Engineering and Technology》 2021年第3期503-515,共13页
The car-following models are the research basis of traffic flow theory and microscopic traffic simulation. Among the previous work, the theory-driven models are dominant, while the data-driven ones are relatively rare... The car-following models are the research basis of traffic flow theory and microscopic traffic simulation. Among the previous work, the theory-driven models are dominant, while the data-driven ones are relatively rare. In recent years, the related technologies of Intelligent Transportation System (ITS) re</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">presented by the Vehicles to Everything (V2X) technology have been developing rapidly. Utilizing the related technologies of ITS, the large-scale vehicle microscopic trajectory data with high quality can be acquired, which provides the research foundation for modeling the car-following behavior based on the data-driven methods. According to this point, a data-driven car-following model based on the Random Forest (RF) method was constructed in this work, and the Next Generation Simulation (NGSIM) dataset was used to calibrate and train the constructed model. The Artificial Neural Network (ANN) model, GM model, and Full Velocity Difference (FVD) model are em</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">ployed to comparatively verify the proposed model. The research results suggest that the model proposed in this work can accurately describe the car-</span><span style="font-family:Verdana;"> </span><span style="font-family:Verdana;">following behavior with better performance under multiple performance indicators. 展开更多
关键词 Traffic Flow Car-Following model Data-Driven Method random forest Intelligent Transportation System
在线阅读 下载PDF
Linear and Nonlinear Trading Models with Gradient Boosted Random Forests and Application to Singapore Stock Market
11
作者 Qin Qin Qing-Guo Wang +1 位作者 Jin Li Shuzhi Sam Ge 《Journal of Intelligent Learning Systems and Applications》 2013年第1期1-10,共10页
This paper presents new trading models for the stock market and test whether they are able to consistently generate excess returns from the Singapore Exchange (SGX). Instead of conventional ways of modeling stock pric... This paper presents new trading models for the stock market and test whether they are able to consistently generate excess returns from the Singapore Exchange (SGX). Instead of conventional ways of modeling stock prices, we construct models which relate the market indicators to a trading decision directly. Furthermore, unlike a reversal trading system or a binary system of buy and sell, we allow three modes of trades, namely, buy, sell or stand by, and the stand-by case is important as it caters to the market conditions where a model does not produce a strong signal of buy or sell. Linear trading models are firstly developed with the scoring technique which weights higher on successful indicators, as well as with the Least Squares technique which tries to match the past perfect trades with its weights. The linear models are then made adaptive by using the forgetting factor to address market changes. Because stock markets could be highly nonlinear sometimes, the Random Forest is adopted as a nonlinear trading model, and improved with Gradient Boosting to form a new technique—Gradient Boosted Random Forest. All the models are trained and evaluated on nine stocks and one index, and statistical tests such as randomness, linear and nonlinear correlations are conducted on the data to check the statistical significance of the inputs and their relation with the output before a model is trained. Our empirical results show that the proposed trading methods are able to generate excess returns compared with the buy-and-hold strategy. 展开更多
关键词 Stock modeling SCORING TECHNIQUE Least Square TECHNIQUE random forest GRADIENT Boosted random forest
暂未订购
Mixed-effects modeling for tree height prediction models of Oriental beech in the Hyrcanian forests 被引量:9
12
作者 Siavash Kalbi Asghar Fallah +2 位作者 Pete Bettinger Shaban Shataee Rassoul Yousefpour 《Journal of Forestry Research》 SCIE CAS CSCD 2018年第5期1195-1204,共10页
Height–diameter relationships are essential elements of forest assessment and modeling efforts.In this work,two linear and eighteen nonlinear height–diameter equations were evaluated to find a local model for Orient... Height–diameter relationships are essential elements of forest assessment and modeling efforts.In this work,two linear and eighteen nonlinear height–diameter equations were evaluated to find a local model for Oriental beech(Fagus orientalis Lipsky) in the Hyrcanian Forest in Iran.The predictive performance of these models was first assessed by different evaluation criteria: adjusted R^2(R^2_(adj)),root mean square error(RMSE),relative RMSE(%RMSE),bias,and relative bias(%bias) criteria.The best model was selected for use as the base mixed-effects model.Random parameters for test plots were estimated with different tree selection options.Results show that the Chapman–Richards model had better predictive ability in terms of adj R^2(0.81),RMSE(3.7 m),%RMSE(12.9),bias(0.8),%Bias(2.79) than the other models.Furthermore,the calibration response,based on a selection of four trees from the sample plots,resulted in a reduction percentage for bias and RMSE of about 1.6–2.7%.Our results indicate that the calibrated model produced the most accurate results. 展开更多
关键词 random effects Tree height CALIBRATION Sangdeh forest Chapman–Richards model Oriental beech
在线阅读 下载PDF
Establishment and evaluation of a risk prediction model for gestational diabetes mellitus 被引量:7
13
作者 Qing Lin Zhuan-Ji Fang 《World Journal of Diabetes》 SCIE 2023年第10期1541-1550,共10页
BACKGROUND Gestational diabetes mellitus(GDM)is a condition characterized by high blood sugar levels during pregnancy.The prevalence of GDM is on the rise globally,and this trend is particularly evident in China,which... BACKGROUND Gestational diabetes mellitus(GDM)is a condition characterized by high blood sugar levels during pregnancy.The prevalence of GDM is on the rise globally,and this trend is particularly evident in China,which has emerged as a significant issue impacting the well-being of expectant mothers and their fetuses.Identifying and addressing GDM in a timely manner is crucial for maintaining the health of both expectant mothers and their developing fetuses.Therefore,this study aims to establish a risk prediction model for GDM and explore the effects of serum ferritin,blood glucose,and body mass index(BMI)on the occurrence of GDM.AIM To develop a risk prediction model to analyze factors leading to GDM,and evaluate its efficiency for early prevention.METHODS The clinical data of 406 pregnant women who underwent routine prenatal examination in Fujian Maternity and Child Health Hospital from April 2020 to December 2022 were retrospectively analyzed.According to whether GDM occurred,they were divided into two groups to analyze the related factors affecting GDM.Then,according to the weight of the relevant risk factors,the training set and the verification set were divided at a ratio of 7:3.Subsequently,a risk prediction model was established using logistic regression and random forest models,and the model was evaluated and verified.RESULTS Pre-pregnancy BMI,previous history of GDM or macrosomia,hypertension,hemoglobin(Hb)level,triglyceride level,family history of diabetes,serum ferritin,and fasting blood glucose levels during early pregnancy were determined.These factors were found to have a significant impact on the development of GDM(P<0.05).According to the nomogram model’s prediction of GDM in pregnancy,the area under the curve(AUC)was determined to be 0.883[95%confidence interval(CI):0.846-0.921],and the sensitivity and specificity were 74.1%and 87.6%,respectively.The top five variables in the random forest model for predicting the occurrence of GDM were serum ferritin,fasting blood glucose in early pregnancy,pre-pregnancy BMI,Hb level and triglyceride level.The random forest model achieved an AUC of 0.950(95%CI:0.927-0.973),the sensitivity was 84.8%,and the specificity was 91.4%.The Delong test showed that the AUC value of the random forest model was higher than that of the decision tree model(P<0.05).CONCLUSION The random forest model is superior to the nomogram model in predicting the risk of GDM.This method is helpful for early diagnosis and appropriate intervention of GDM. 展开更多
关键词 Gestational diabetes mellitus Prediction model model evaluation random forest model NOMOGRAMS Risk factor
暂未订购
An Intelligent Fine-Tuned Forecasting Technique for Covid-19 Prediction Using Neuralprophet Model 被引量:5
14
作者 Savita Khurana Gaurav Sharma +5 位作者 Neha Miglani Aman Singh Abdullah Alharbi Wael Alosaimi Hashem Alyami Nitin Goyal 《Computers, Materials & Continua》 SCIE EI 2022年第4期629-649,共21页
COVID-19,being the virus of fear and anxiety,is one of the most recent and emergent of various respiratory disorders.It is similar to the MERS-COV and SARS-COV,the viruses that affected a large population of different... COVID-19,being the virus of fear and anxiety,is one of the most recent and emergent of various respiratory disorders.It is similar to the MERS-COV and SARS-COV,the viruses that affected a large population of different countries in the year 2012 and 2002,respectively.Various standard models have been used for COVID-19 epidemic prediction but they suffered from low accuracy due to lesser data availability and a high level of uncertainty.The proposed approach used a machine learning-based time-series Facebook NeuralProphet model for prediction of the number of death as well as confirmed cases and compared it with Poisson Distribution,and Random Forest Model.The analysis upon dataset has been performed considering the time duration from January 1st 2020 to16th July 2021.The model has been developed to obtain the forecast values till September 2021.This study aimed to determine the pandemic prediction of COVID-19 in the second wave of coronavirus in India using the latest Time-Series model to observe and predict the coronavirus pandemic situation across the country.In India,the cases are rapidly increasing day-by-day since mid of Feb 2021.The prediction of death rate using the proposed model has a good ability to forecast the COVID-19 dataset essentially in the second wave.To empower the prediction for future validation,the proposed model works effectively. 展开更多
关键词 Covid-19 machine learning neuralprophet model poisson distribution PREDICTION random forest model
暂未订购
结核性支气管扩张症患者转归情况及其影响因素研究
15
作者 汪雪 杨程茗 +1 位作者 王子云 周忠 《中国全科医学》 北大核心 2026年第5期606-611,共6页
背景结核性支气管扩张症是结核病的重要后遗症之一,目前关于结核性支气管扩张症患者预后的具体影响因素尚未十分明确。目的了解结核性支气管扩张症患者转归情况及影响因素,为改善患者预后提供科学依据。方法选取2023年4月—2024年4月在... 背景结核性支气管扩张症是结核病的重要后遗症之一,目前关于结核性支气管扩张症患者预后的具体影响因素尚未十分明确。目的了解结核性支气管扩张症患者转归情况及影响因素,为改善患者预后提供科学依据。方法选取2023年4月—2024年4月在贵阳市公共卫生救治中心住院的结核性支气管扩张症患者337例,对其出院后6个月的转归情况进行电话随访,并将其分为转归良好和转归不良。自制资料采集表,通过查阅病历收集临床资料。使用随机森林模型和LASSO回归筛选和分析转归情况的影响因素并对其重要性进行排序。结果337例结核性支气管扩张症患者转归不良发生率为41.2%(139/337)。转归良好和转归不良患者年龄、文化程度、BMI、既往吸烟、第1秒用力呼气量占预计值的百分比(FEV1%pred)、肺泡灌洗、合并慢性肺源性心脏病、合并慢性阻塞性肺疾病、合并高血压、铜绿假单胞菌感染、出院后吸烟、出院后饮酒、缺乏日常运动、独居、未按医嘱服药及出院白介素6(IL-6)、出院超敏C反应蛋白(hs-CRP)水平比较,差异均有统计学意义(P<0.05)。将单因素分析中有意义的变量进行LASSO回归模型分析,共筛选出9个变量。随机森林模型结果显示,各影响因素重要性排序依次为:未按医嘱服药、出院后吸烟、缺乏日常运动、文化程度、合并慢性肺源性心脏病、出院IL-6、年龄、出院后饮酒、独居。多因素Logistic回归分析结果显示,未按医嘱服药(OR=8.394,95%CI=3.323~21.203)、出院后吸烟(OR=5.294,95%CI=2.313~12.116)、缺乏日常运动(OR=4.555,95%CI=2.206~9.406)、合并慢性肺源性心脏病(OR=4.331,95%CI=2.012~9.323)、出院IL-6(OR=1.054,95%CI=1.018~1.091)和年龄≥80岁(OR=5.246,95%CI=1.097~25.077)是结核性支气管扩张症患者转归的危险因素(P<0.05),文化程度为初等教育(OR=0.358,95%CI=0.143~0.898)和中等教育(OR=0.202,95%CI=0.074~0.554)是结核性支气管扩张症患者转归的保护因素(P<0.05)。结论结核性支气管扩张症患者转归不良发生率较高,未按医嘱服药、出院后吸烟、缺乏日常运动、合并慢性肺源性心脏病、出院IL-6、年龄≥80岁、文化程度是结核性支气管扩张症的重要影响因素,可根据影响因素采取针对性措施。 展开更多
关键词 结核性支气管扩张症 转归 影响因素 随机森林模型
暂未订购
Establishment of models to predict factors influencing periodontitis in patients with type 2 diabetes mellitus 被引量:3
16
作者 Hong-Miao Xu Xuan-Jiang Shen Jia Liu 《World Journal of Diabetes》 SCIE 2023年第12期1793-1802,共10页
BACKGROUND Type 2 diabetes mellitus(T2DM)is associated with periodontitis.Currently,there are few studies proposing predictive models for periodontitis in patients with T2DM.AIM To determine the factors influencing pe... BACKGROUND Type 2 diabetes mellitus(T2DM)is associated with periodontitis.Currently,there are few studies proposing predictive models for periodontitis in patients with T2DM.AIM To determine the factors influencing periodontitis in patients with T2DM by constructing logistic regression and random forest models.METHODS In this a retrospective study,300 patients with T2DM who were hospitalized at the First People’s Hospital of Wenling from January 2022 to June 2022 were selected for inclusion,and their data were collected from hospital records.We used logistic regression to analyze factors associated with periodontitis in patients with T2DM,and random forest and logistic regression prediction models were established.The prediction efficiency of the models was compared using the area under the receiver operating characteristic curve(AUC).RESULTS Of 300 patients with T2DM,224 had periodontitis,with an incidence of 74.67%.Logistic regression analysis showed that age[odds ratio(OR)=1.047,95%confidence interval(CI):1.017-1.078],teeth brushing frequency(OR=4.303,95%CI:2.154-8.599),education level(OR=0.528,95%CI:0.348-0.800),glycosylated hemoglobin(HbA1c)(OR=2.545,95%CI:1.770-3.661),total cholesterol(TC)(OR=2.872,95%CI:1.725-4.781),and triglyceride(TG)(OR=3.306,95%CI:1.019-10.723)influenced the occurrence of periodontitis(P<0.05).The random forest model showed that the most influential variable was HbA1c followed by age,TC,TG, education level, brushing frequency, and sex. Comparison of the prediction effects of the two models showedthat in the training dataset, the AUC of the random forest model was higher than that of the logistic regressionmodel (AUC = 1.000 vs AUC = 0.851;P < 0.05). In the validation dataset, there was no significant difference in AUCbetween the random forest and logistic regression models (AUC = 0.946 vs AUC = 0.915;P > 0.05).CONCLUSION Both random forest and logistic regression models have good predictive value and can accurately predict the riskof periodontitis in patients with T2DM. 展开更多
关键词 Type 2 diabetes mellitus PERIODONTITIS Logistic regression Prediction model random forest model Gingival disease
暂未订购
初治涂阳肺结核患者标准强化治疗期限内未转阴的影响因素分析
17
作者 吴斌军 吕麟亚 +2 位作者 王建宾 黄承刚 张俊臣 《贵州医药》 2026年第1期122-126,共5页
目的 探讨初治涂阳肺结核患者标准强化治疗期限内未转阴的影响因素。方法 选择病原学检查阳性的初治肺结核(PTB)患者181例作为研究对象。所有患者均收集一般资料及治疗前实验室检查结果资料,其中实验室检查资料包含铁代谢相关指标。对... 目的 探讨初治涂阳肺结核患者标准强化治疗期限内未转阴的影响因素。方法 选择病原学检查阳性的初治肺结核(PTB)患者181例作为研究对象。所有患者均收集一般资料及治疗前实验室检查结果资料,其中实验室检查资料包含铁代谢相关指标。对所有患者进行为期半年的随访,统计标准强化治疗期内(强化治疗后2月内)痰涂片转阴情况并对患者进行分组,分析强化治疗周期内未转阴的相关影响因素。结果 截止随访结束,其中133例患者标准强化治疗期限内转阴,48例患者标准强化治疗期限内未转阴,最终分组为转阴组(n=133)、未转阴组(n=48)。未转阴组患者年龄大于转阴组(P<0.05)。治疗后,未转阴组患者铁调素、IL-6、IL-10及TNF-α水平显著高于转阴组(P<0.05)。年龄、铁调素、IL-6及TNF-α是影响转阴时限>2个月的独立危险因素(P<0.05)。随机森林模型预测精度平均下降量对重要预测变量的排名为:年龄>TNF-α>铁调素>IL-6。采用梯度提升回归树算法调参并设置每片森林的决策树数目为20棵后构建的随机森林的集成分类算法ROC曲线下面积为0.998。F1-score为0.941,准确率为94.34%,敏感性为97.73%,特异性为93.71%,阳性预测值为98.71%,阴性预测值为93.91%。而多因素Logistic回归分析模型下ROC曲线下面积为0.740。结论 年龄、铁调素、IL-6及TNF-α与初治病原学检查阳性肺结核患者强化治疗后未转阴显著相关。 展开更多
关键词 肺结核 铁调素 随机森林模型
暂未订购
Machine Learning Models for Predicting Order Returns in Cross-Border E-Commerce
18
作者 Jia Cai Ronaldo Juanatas +1 位作者 Apollo Portez Jonan Rose Montaña 《Proceedings of Business and Economic Studies》 2024年第6期34-43,共10页
This study investigates the application of machine learning models to address after-sales service issues in cross-border e-commerce,focusing on predicting order returns to reduce return costs and optimize customer exp... This study investigates the application of machine learning models to address after-sales service issues in cross-border e-commerce,focusing on predicting order returns to reduce return costs and optimize customer experience.Using H cross-border e-commerce company as a case study,the research employs Random Forest and XGBoost models to identify high-risk return orders.By comparing the performance of these two models,the study highlights their respective strengths and weaknesses and proposes optimization strategies.The findings provide a valuable reference for e-commerce companies to refine their business models,reduce return rates,improve operational efficiency,and enhance customer satisfaction. 展开更多
关键词 random forest model XGBoost model After-sales issues Prediction
在线阅读 下载PDF
道路性能影响因素选择方法研究
19
作者 李毅帆 王维庄 +3 位作者 丁小平 倪静哲 张利维 曹江 《山西建筑》 2026年第1期147-151,共5页
道路性能预测模型在道路养护与管理中发挥着至关重要的作用,然而道路监测数据中包含的影响因素复杂多样,数据规模大、维度高、质量参差不齐,并且影响因素之间存在复杂的非线性关系,评价道路性能影响因素的重要性有助于提高决策模型的有... 道路性能预测模型在道路养护与管理中发挥着至关重要的作用,然而道路监测数据中包含的影响因素复杂多样,数据规模大、维度高、质量参差不齐,并且影响因素之间存在复杂的非线性关系,评价道路性能影响因素的重要性有助于提高决策模型的有效性和可靠性。文中对递归特征消除法进行改进,将道路性能影响因素的筛选流程分成两个阶段:第一阶段基于Spearman系数和随机森林方法进行初筛;第二阶段基于递归消除法进行复筛。研究表明,基于改进递归特征消除法的道路性能影响因素选择方法,降低了模型迭代训练过程中的计算资源消耗,避免了冗余特征的干扰,能够更精确地量化各影响因素与道路性能之间的关联强度,为道路性能的评估提供了更可靠的分析工具。 展开更多
关键词 特征选择 Spearman系数 随机森林 递归特征消除 计算模型
在线阅读 下载PDF
上一页 1 2 82 下一页 到第
使用帮助 返回顶部