期刊文献+
共找到554篇文章
< 1 2 28 >
每页显示 20 50 100
A Novel Hybrid Sine Cosine-Flower Pollination Algorithm for Optimized Feature Selection
1
作者 Sumbul Azeem Shazia Javed +3 位作者 Farheen Ibraheem Uzma Bashir Nazar Waheed Khursheed Aurangzeb 《Computers, Materials & Continua》 2026年第5期1916-1930,共15页
Data serves as the foundation for training and testing machine learning and artificial intelligencemodels.The most fundamental part of data is its attributes or features.The feature set size changes from one dataset t... Data serves as the foundation for training and testing machine learning and artificial intelligencemodels.The most fundamental part of data is its attributes or features.The feature set size changes from one dataset to another.Only the relevant features contributemeaningfully to classificationaccuracy.The presence of irrelevant features reduces the system’s effectiveness.Classification performance often deteriorates on high-dimensional datasets due to the large search space.Thus,one of the significant obstacles affecting the performance of the learning process in the majority of machine learning and data mining techniques is the dimensionality of the datasets.Feature selection(FS)is an effective preprocessing step in classification tasks.The aim of applying FS is to exclude redundant and unrelated features while retaining the most informative ones to optimize classification capability and compress computational complexity.In this paper,a novel hybrid binary metaheuristic algorithm,termed hSC-FPA,is proposed by hybridizing the Flower Pollination Algorithm(FPA)and the Sine Cosine Algorithm(SCA).Hybridization controls the exploration capacity of SCA and the exploitation behavior of FPA to maintain a balanced search process.SCA guides the global search in the early iterations,while FPA’s local pollination refines promising solutions in later stages.A binary conversion mechanism using a threshold function is implemented to handle the discrete nature of the feature selection problem.The functionality of the proposed hSC-FPA is authenticated on fourteen standard datasets from the UCI repository using the K-Nearest Neighbors(K-NN)classifier.Experimental results are benchmarked against the standalone SCA and FPA algorithms.The hSC-FPA consistently achieves higher classification accuracy,selects a more compact feature subset,and demonstrates superior convergence behavior.These findings support the stability and outperformance of the hybrid feature selection method presented. 展开更多
关键词 Classification algorithms feature selection process flower pollination algorithm hybrid model metaheuristics multi-objective optimization search algorithm sine cosine algorithm
在线阅读 下载PDF
The distribution modeling and analysis of Antarctic krill:impacts of algorithm and spatial resolution
2
作者 LI Wenxiong YING Yiping +5 位作者 ZHANG Jichang ZHAO Yunxia ZHU Jiancheng FAN Gangzhou MU Xiuxia WANG Xinliang 《Advances in Polar Science》 2025年第4期373-391,共19页
Antarctic krill(Euphausia superba),widely distributes around Antarctica,is a key species supporting the biodiversity of the Southern Ocean ecosystem.The Commission for the Conservation of Antarctic Marine Living Resou... Antarctic krill(Euphausia superba),widely distributes around Antarctica,is a key species supporting the biodiversity of the Southern Ocean ecosystem.The Commission for the Conservation of Antarctic Marine Living Resources(CCAMLR)has thus managed the krill fishery according to a precautionary way.Currently,CCAMLR is making effort to develop a refined krill fishery management approach based on more solid science,which requires accurate predictions of krill distribution.To address this need,this study investigated the effects of algorithm and spatial resolution on the performance of Antarctic krill distribution modelling.We integrated acoustic data from 4 surveys conducted in the waters adjacent to the Antarctic Peninsula with 11 environmental variables characterizing krill prey conditions,water mass properties,and seafloor topography.These data were processed at 4 spatial resolutions(5,10,15,and 20 km)to fit distribution models using 4 algorithms:Random Forests(RF),Generalized Additive Models(GAM),Extreme Gradient Boosting(XGBoost),and Artificial Neural Networks(ANN).Model performance was assessed and compared in terms of goodness-of-fit and predictive accuracy.The results showed that RF achieved the highest predictive performance at most resolutions,whereas GAM performed best at the coarsest resolution(20 km).XGBoost closely following RF in accuracy and demonstrated robustness as evidenced by the highly consistent partial dependence curves across resolutions.In contrast,ANN exhibited limitations with smaller sample sizes,resulting in comparatively poorer predictive performance.The analysis revealed a trade-off whereby reducing spatial resolution improved model fit and mitigated zero-inflation at the expense of fine-scale information and overall predictive accuracy.Ensemble models,integrating RF,GAM,and XGBoost,are proposed as potential balanced solutions to improve predictive stability,offering a more robust scientific basis for the refinement of krill management. 展开更多
关键词 Antarctic krill species distribution model algorithm selection spatial resolution machine learning
在线阅读 下载PDF
A systematic data-driven modelling framework for nonlinear distillation processes incorporating data intervals clustering and new integrated learning algorithm
3
作者 Zhe Wang Renchu He Jian Long 《Chinese Journal of Chemical Engineering》 2025年第5期182-199,共18页
The distillation process is an important chemical process,and the application of data-driven modelling approach has the potential to reduce model complexity compared to mechanistic modelling,thus improving the efficie... The distillation process is an important chemical process,and the application of data-driven modelling approach has the potential to reduce model complexity compared to mechanistic modelling,thus improving the efficiency of process optimization or monitoring studies.However,the distillation process is highly nonlinear and has multiple uncertainty perturbation intervals,which brings challenges to accurate data-driven modelling of distillation processes.This paper proposes a systematic data-driven modelling framework to solve these problems.Firstly,data segment variance was introduced into the K-means algorithm to form K-means data interval(KMDI)clustering in order to cluster the data into perturbed and steady state intervals for steady-state data extraction.Secondly,maximal information coefficient(MIC)was employed to calculate the nonlinear correlation between variables for removing redundant features.Finally,extreme gradient boosting(XGBoost)was integrated as the basic learner into adaptive boosting(AdaBoost)with the error threshold(ET)set to improve weights update strategy to construct the new integrated learning algorithm,XGBoost-AdaBoost-ET.The superiority of the proposed framework is verified by applying this data-driven modelling framework to a real industrial process of propylene distillation. 展开更多
关键词 Integrated learning algorithm Data intervals clustering Feature selection Application of artificial intelligence in distillation industry Data-driven modelling
在线阅读 下载PDF
Construction and validation of a machine learning algorithm-based predictive model for difficult colonoscopy insertion
4
作者 Ren-Xuan Gao Xin-Lei Wang +6 位作者 Ming-Jie Tian Xiao-Ming Li Jia-Jia Zhang Jun-Jing Wang Jing Gao Chao Zhang Zhi-Ting Li 《World Journal of Gastrointestinal Endoscopy》 2025年第7期149-161,共13页
BACKGROUND Difficulty of colonoscopy insertion(DCI)significantly affects colonoscopy effectiveness and serves as a key quality indicator.Predicting and evaluating DCI risk preoperatively is crucial for optimizing intr... BACKGROUND Difficulty of colonoscopy insertion(DCI)significantly affects colonoscopy effectiveness and serves as a key quality indicator.Predicting and evaluating DCI risk preoperatively is crucial for optimizing intraoperative strategies.AIM To evaluate the predictive performance of machine learning(ML)algorithms for DCI by comparing three modeling approaches,identify factors influencing DCI,and develop a preoperative prediction model using ML algorithms to enhance colonoscopy quality and efficiency.METHODS This cross-sectional study enrolled 712 patients who underwent colonoscopy at a tertiary hospital between June 2020 and May 2021.Demographic data,past medical history,medication use,and psychological status were collected.The endoscopist assessed DCI using the visual analogue scale.After univariate screening,predictive models were developed using multivariable logistic regression,least absolute shrinkage and selection operator(LASSO)regression,and random forest(RF)algorithms.Model performance was evaluated based on discrimination,calibration,and decision curve analysis(DCA),and results were visualized using nomograms.RESULTS A total of 712 patients(53.8%male;mean age 54.5 years±12.9 years)were included.Logistic regression analysis identified constipation[odds ratio(OR)=2.254,95%confidence interval(CI):1.289-3.931],abdominal circumference(AC)(77.5–91.9 cm,OR=1.895,95%CI:1.065-3.350;AC≥92 cm,OR=1.271,95%CI:0.730-2.188),and anxiety(OR=1.071,95%CI:1.044-1.100)as predictive factors for DCI,validated by LASSO and RF methods.Model performance revealed training/validation sensitivities of 0.826/0.925,0.924/0.868,and 1.000/0.981;specificities of 0.602/0.511,0.510/0.562,and 0.977/0.526;and corresponding area under the receiver operating characteristic curves(AUCs)of 0.780(0.737-0.823)/0.726(0.654-0.799),0.754(0.710-0.798)/0.723(0.656-0.791),and 1.000(1.000-1.000)/0.754(0.688-0.820),respectively.DCA indicated optimal net benefit within probability thresholds of 0-0.9 and 0.05-0.37.The RF model demonstrated superior diagnostic accuracy,reflected by perfect training sensitivity(1.000)and highest validation AUC(0.754),outperforming other methods in clinical applicability.CONCLUSION The RF-based model exhibited superior predictive accuracy for DCI compared to multivariable logistic and LASSO regression models.This approach supports individualized preoperative optimization,enhancing colonoscopy quality through targeted risk stratification. 展开更多
关键词 COLONOSCOPY Difficulty of colonoscopy insertion Machine learning algorithms Predictive model Logistic regression Least absolute shrinkage and selection operator regression Random forest
暂未订购
An Optimization Algorithm Employing Multiple Metamodels and Optimizers 被引量:1
5
作者 Yoel Tenne 《International Journal of Automation and computing》 EI CSCD 2013年第3期227-241,共15页
Modern engineering design optimization often relies on computer simulations to evaluate candidate designs, a setup which results in expensive black-box optimization problems. Such problems introduce unique challenges,... Modern engineering design optimization often relies on computer simulations to evaluate candidate designs, a setup which results in expensive black-box optimization problems. Such problems introduce unique challenges, which has motivated the application of metamodel-assisted computational intelligence algorithms to solve them. Such algorithms combine a computational intelligence optimizer which employs a population of candidate solutions, with a metamodel which is a computationally cheaper approximation of the expensive computer simulation. However, although a variety of metamodels and optimizers have been proposed, the optimal types to employ are problem dependant. Therefore, a priori prescribing the type of metamodel and optimizer to be used may degrade its effectiveness. Leveraging on this issue, this study proposes a new computational intelligence algorithm which autonomously adapts the type of the metamodel and optimizer during the search by selecting the most suitable types out of a family of candidates at each stage. Performance analysis using a set of test functions demonstrates the effectiveness of the proposed algorithm, and highlights the merit of the proposed adaptation approach. 展开更多
关键词 Expensive optimization problems computational intelligence adaptive algorithms METAmodelLING model selection.
原文传递
MultiDMet: Designing a Hybrid Multidimensional Metrics Framework to Predictive Modeling for Performance Evaluation and Feature Selection
6
作者 Tesfay Gidey Hailu Taye Abdulkadir Edris 《Intelligent Information Management》 2023年第6期391-425,共35页
In a competitive digital age where data volumes are increasing with time, the ability to extract meaningful knowledge from high-dimensional data using machine learning (ML) and data mining (DM) techniques and making d... In a competitive digital age where data volumes are increasing with time, the ability to extract meaningful knowledge from high-dimensional data using machine learning (ML) and data mining (DM) techniques and making decisions based on the extracted knowledge is becoming increasingly important in all business domains. Nevertheless, high-dimensional data remains a major challenge for classification algorithms due to its high computational cost and storage requirements. The 2016 Demographic and Health Survey of Ethiopia (EDHS 2016) used as the data source for this study which is publicly available contains several features that may not be relevant to the prediction task. In this paper, we developed a hybrid multidimensional metrics framework for predictive modeling for both model performance evaluation and feature selection to overcome the feature selection challenges and select the best model among the available models in DM and ML. The proposed hybrid metrics were used to measure the efficiency of the predictive models. Experimental results show that the decision tree algorithm is the most efficient model. The higher score of HMM (m, r) = 0.47 illustrates the overall significant model that encompasses almost all the user’s requirements, unlike the classical metrics that use a criterion to select the most appropriate model. On the other hand, the ANNs were found to be the most computationally intensive for our prediction task. Moreover, the type of data and the class size of the dataset (unbalanced data) have a significant impact on the efficiency of the model, especially on the computational cost, and the interpretability of the parameters of the model would be hampered. And the efficiency of the predictive model could be improved with other feature selection algorithms (especially hybrid metrics) considering the experts of the knowledge domain, as the understanding of the business domain has a significant impact. 展开更多
关键词 Predictive modeling Hybrid Metrics Feature selection model selection algorithm Analysis Machine Learning
在线阅读 下载PDF
Test selection and optimization for PHM based on failure evolution mechanism model 被引量:8
7
作者 Jing Qiu Xiaodong Tan +1 位作者 Guanjun Liu Kehong L 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2013年第5期780-792,共13页
The test selection and optimization (TSO) can improve the abilities of fault diagnosis, prognosis and health-state evalua- tion for prognostics and health management (PHM) systems. Traditionally, TSO mainly focuse... The test selection and optimization (TSO) can improve the abilities of fault diagnosis, prognosis and health-state evalua- tion for prognostics and health management (PHM) systems. Traditionally, TSO mainly focuses on fault detection and isolation, but they cannot provide an effective guide for the design for testability (DFT) to improve the PHM performance level. To solve the problem, a model of TSO for PHM systems is proposed. Firstly, through integrating the characteristics of fault severity and propa- gation time, and analyzing the test timing and sensitivity, a testability model based on failure evolution mechanism model (FEMM) for PHM systems is built up. This model describes the fault evolution- test dependency using the fault-symptom parameter matrix and symptom parameter-test matrix. Secondly, a novel method of in- herent testability analysis for PHM systems is developed based on the above information. Having completed the analysis, a TSO model, whose objective is to maximize fault trackability and mini- mize the test cost, is proposed through inherent testability analysis results, and an adaptive simulated annealing genetic algorithm (ASAGA) is introduced to solve the TSO problem. Finally, a case of a centrifugal pump system is used to verify the feasibility and effectiveness of the proposed models and methods. The results show that the proposed technology is important for PHM systems to select and optimize the test set in order to improve their performance level. 展开更多
关键词 test selection and optimization (TSO) prognostics and health management (PHM) failure evolution mechanism model (FEMM) adaptive simulated annealing genetic algorithm (ASAGA).
在线阅读 下载PDF
Application of Genetic Algorithm in Estimation of Gyro Drift Error Model 被引量:1
8
作者 LI Dongmei BAI Taixun +1 位作者 HE Xiaoxia ZHANG Rong 《Aerospace China》 2019年第1期3-8,共6页
Extended Kalman Filter(EKF)algorithm is widely used in parameter estimation for nonlinear systems.The estimation precision is sensitively dependent on EKF’s initial state covariance matrix and state noise matrix.The ... Extended Kalman Filter(EKF)algorithm is widely used in parameter estimation for nonlinear systems.The estimation precision is sensitively dependent on EKF’s initial state covariance matrix and state noise matrix.The grid optimization method is always used to find proper initial matrix for off-line estimation.However,the grid method has the draw back being time consuming hence,coarse grid followed by a fine grid method is adopted.To further improve efficiency without the loss of estimation accuracy,we propose a genetic algorithm for the coarse grid optimization in this paper.It is recognized that the crossover rate and mutation rate are the main influencing factors for the performance of the genetic algorithm,so sensitivity experiments for these two factors are carried out and a set of genetic algorithm parameters with good adaptability were selected by testing with several gyros’experimental data.Experimental results show that the proposed algorithm has higher efficiency and better estimation accuracy than the traversing grid algorithm. 展开更多
关键词 genetic algorithm traversing GRID algorithm coarse GRID optimization GYRO DRIFT error model CROSSOVER RATE and mutation RATE selecting
在线阅读 下载PDF
Variable selection for skew-normal mixture of joint location and scale models
9
作者 WU Liu-cang YANG Song-qin TAO Ye 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2021年第4期475-491,共17页
Although there are many papers on variable selection methods based on mean model in the nite mixture of regression models,little work has been done on how to select signi cant explanatory variables in the modeling of ... Although there are many papers on variable selection methods based on mean model in the nite mixture of regression models,little work has been done on how to select signi cant explanatory variables in the modeling of the variance parameter.In this paper,we propose and study a novel class of models:a skew-normal mixture of joint location and scale models to analyze the heteroscedastic skew-normal data coming from a heterogeneous population.The problem of variable selection for the proposed models is considered.In particular,a modi ed Expectation-Maximization(EM)algorithm for estimating the model parameters is developed.The consistency and the oracle property of the penalized estimators is established.Simulation studies are conducted to investigate the nite sample performance of the proposed methodolo-gies.An example is illustrated by the proposed methodologies. 展开更多
关键词 heterogeneous population skew-normal(SN)distribution mixture of joint location and scale models variable selection EM algorithm
在线阅读 下载PDF
Theoretical Basis in Regression Model Based Selection of the Most Cost Effective Parameters of Hard Rock Surface Mining
10
作者 Antipas T. S. Massawe Karim R. Baruti Paul S. M. Gongo 《Engineering(科研)》 2011年第2期156-161,共6页
What determines selection of the most cost effective parameters of hard rock surface mining is consideration of all alternative variants of mine design and the conflicting effect of their parameters on cost. Considera... What determines selection of the most cost effective parameters of hard rock surface mining is consideration of all alternative variants of mine design and the conflicting effect of their parameters on cost. Consideration could be realized based on the mathematical model of the cumulative influence of rockmass and mine design variables on the overall cost per ton of the hard rock drilled, blasted, hauled and primary crushed. Available works on the topic mostly dwelt on four processes of hard rock surface mining separately. This paper dwells on the theoretical part of a research proposed to enhance effectiveness in the selection of the parameters of hard rock surface mining design based on the regression model of overall cost per tonne of the rock mined fit on the determinant variations of rockmass and mine design. The regression model could be developed based on the statistical data generated by many of the hard rock surface mines operating in variable conditions of rockmass and mine design worldwide. Also, a regression model based general algorithm has been formulated for the development of software and computer aided selection of the most cost effective parameters of hard rock surface mining. 展开更多
关键词 PARAMETERS of Rockmass PARAMETERS of MINING Design Regression model algorithm of selection
暂未订购
Gaussian mixture model clustering with completed likelihood minimum message length criterion 被引量:1
11
作者 曾洪 卢伟 宋爱国 《Journal of Southeast University(English Edition)》 EI CAS 2013年第1期43-47,共5页
An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the ... An improved Gaussian mixture model (GMM)- based clustering method is proposed for the difficult case where the true distribution of data is against the assumed GMM. First, an improved model selection criterion, the completed likelihood minimum message length criterion, is derived. It can measure both the goodness-of-fit of the candidate GMM to the data and the goodness-of-partition of the data. Secondly, by utilizing the proposed criterion as the clustering objective function, an improved expectation- maximization (EM) algorithm is developed, which can avoid poor local optimal solutions compared to the standard EM algorithm for estimating the model parameters. The experimental results demonstrate that the proposed method can rectify the over-fitting tendency of representative GMM-based clustering approaches and can robustly provide more accurate clustering results. 展开更多
关键词 Gaussian mixture model non-Gaussian distribution model selection expectation-maximization algorithm completed likelihood minimum message length criterion
在线阅读 下载PDF
考虑特征选择的土石坝溃口峰值流量预测模型
12
作者 张美满 李晶 +5 位作者 张友明 杨旭 雷天宇 陈瀚 徐津 王玲玲 《水利水电科技进展》 北大核心 2026年第1期54-59,共6页
针对溃口特征维度高且特征间高度相关性导致预测模型性能下降的问题,提出了一种融合Lasso算法和XGBoost模型的土石坝溃口峰值流量预测模型。该模型采用斯皮尔曼相关系数法分析溃口特征间的相关性,使用Lasso算法进行进一步的特征选择,并... 针对溃口特征维度高且特征间高度相关性导致预测模型性能下降的问题,提出了一种融合Lasso算法和XGBoost模型的土石坝溃口峰值流量预测模型。该模型采用斯皮尔曼相关系数法分析溃口特征间的相关性,使用Lasso算法进行进一步的特征选择,并通过剔除冗余特征得到最优特征子集,再将该特征子集输入XGBoost模型进行溃口峰值流量预测。与支持向量回归和岭回归机器学习模型对比结果表明,该模型具有良好的非线性信息挖掘能力,可对高维特征进行有效降维,在减少模型复杂性的同时提高了模型预测精度。 展开更多
关键词 土石坝溃口 峰值流量预测 特征选择 多重共线性 Lasso算法 XGBoost模型
在线阅读 下载PDF
基于可解释性因子选择的多模型耦合式大坝变形预测方法
13
作者 柳聪聪 张锋 +2 位作者 胡超 张启灵 郭永成 《长江科学院院报》 北大核心 2026年第1期144-154,共11页
目前,传统、单一模型难以全面捕捉大坝变形数据的复杂性和多样性,导致其预测性能和解释能力受限。为解决上述问题,通过对多种预测模型的组合与优化,提出了一种高效且具备可解释性的大坝变形预测方法。首先,利用最小绝对值收缩和选择算子... 目前,传统、单一模型难以全面捕捉大坝变形数据的复杂性和多样性,导致其预测性能和解释能力受限。为解决上述问题,通过对多种预测模型的组合与优化,提出了一种高效且具备可解释性的大坝变形预测方法。首先,利用最小绝对值收缩和选择算子(LASSO)在众多环境变量中高效筛选,既简化模型输入,又解释了因子选择的可靠性。然后,采用长短期记忆(LSTM)网络对大坝变形进行预测,并引入注意力机制,增强对重要信息的提取。最后,通过Bagging算法集成多个模型预测结果,进一步提高整体预测的准确度、稳定性和泛化能力。以某碾压混凝土重力坝为例,所构建的模型具有较高的预测精度,各测点上平均MAE、MSE、RMSE依次为0.052、0.005、0.067 mm。将耦合模型与多种常用模型对比分析,结果表明耦合模型能够更准确地捕捉到大坝变形的动态变化,为预测模型研究提供了一种简洁高效的方法。 展开更多
关键词 大坝变形预测 最小绝对值收缩和选择算子(LASSO) 注意力机制 长短期记忆(LSTM) BAGGING算法 耦合模型
在线阅读 下载PDF
基于混合优化模型和关键气象因素的参考作物蒸散量估算研究
14
作者 周罕觅 马林爽 +4 位作者 牛晓丽 秦龙 向友珍 李纪琛 苏裕民 《应用基础与工程科学学报》 北大核心 2026年第1期132-147,共16页
准确获取参考作物蒸散量(Reference crop evapotranspiration,ET_(0))对衡量作物需水量和优化灌溉水资源管理具有重要意义.为在气象观测信息受限条件下实现ET_(0)的高效、准确估算,采用鲸鱼优化算法(Whale optimization algorithm,WOA)... 准确获取参考作物蒸散量(Reference crop evapotranspiration,ET_(0))对衡量作物需水量和优化灌溉水资源管理具有重要意义.为在气象观测信息受限条件下实现ET_(0)的高效、准确估算,采用鲸鱼优化算法(Whale optimization algorithm,WOA)对轻量级梯度提升机(Light gradient boosting machine,LightGBM)的超参数进行优化,建立了一种混合优化模型WOA-LightGBM.选取河南省15个气象站点逐日气象数据系统评估此模型的性能,并将其与LightGBM、K最近邻算法(K-nearest neighbor,KNN)和随机森林(Random forest,RF)模型进行比较.采用RF、自适应增强(Adaptive boosting,AdaBoost)和梯度提升树(Gradient boosting decision tree,GBDT)构建集成嵌入式特征选择方法(Ensemble embedded feature selection,EEFS),用于分析对估算ET_(0)有重要影响的气象因素,以确定最佳输入组合.研究结果表明:基于EEFS方法得出的4种重要组合作为输入,模型能更准确估算ET_(0);在河南省的ET_(0)估算中,所有模型均表现出显著的空间差异.相较于LightGBM、KNN和RF模型,WOA-LightGBM模型在不同区域均表现出稳定的高估算精度,其决定系数R^(2)、纳什效率系数NSE、均方根误差RMSE和平均绝对误差MAE分别为0.897~0.998、0.897~0.998、0.071~0.545mm·d^(-1)和0.052~0.409mm·d^(-1);模型的ET_(0)估算精度存在季节差异,春季和秋季的估算精度受湿度和风速影响较大,尤其是冬季,其受到的影响更为显著;在目标站点数据不足的情况下,基于邻近站点数据训练的WOA-LightGBM模型仍能维持较高精度.总之,该研究为在气象资料有限的情况下准确估算ET_(0)提供了可靠的解决方案. 展开更多
关键词 蒸散 模型 有限气象资料 轻量级梯度提升机 集成嵌入式特征选择 鲸鱼优化算法
原文传递
基于动态阈值调整特征选择下Transformer模型对阿尔茨海默病病程分类
15
作者 施转芳 范炤 《山西医科大学学报》 2026年第2期215-222,共8页
目的采用Transformer模型,融合结构磁共振成像(sMRI)数据与人口统计学资料,以实现对阿尔茨海默病(AD)病程阶段的分类识别。方法数据来源于阿尔茨海默病神经影像学倡议数据库(ADNI),随机选取543例研究对象,其中包括139例认知功能正常者(... 目的采用Transformer模型,融合结构磁共振成像(sMRI)数据与人口统计学资料,以实现对阿尔茨海默病(AD)病程阶段的分类识别。方法数据来源于阿尔茨海默病神经影像学倡议数据库(ADNI),随机选取543例研究对象,其中包括139例认知功能正常者(NC)、220例早期轻度认知障碍(EMCI)、108例晚期轻度认知障碍(LMCI)以及76例AD患者。采用基于动态阈值调整的L1正则化(L1-DTFS)及基于动态阈值调整的梯度提升决策树(GBDT-DTFS)算法,分别对这些研究对象的272项sMRI数据进行特征选择,筛选出最优特征子集。将筛选后的sMRI特征与3项人口统计学指标(年龄、性别、受教育程度)及简易精神状态检查(MMSE)评分共同输入Transformer模型和逻辑回归(LR)模型,观察其在区分AD连续病程中所有两两组合[共分为NC-EMCI(表示NC组与EMCI组分类,下同)、NC-LMCI、NC-AD、EMCI-LMCI、EMCI-AD以及LMCI-AD 6个分类组]时的分类效果,并通过受试者工作特征曲线下面积(AUC)评价模型的判别性能。结果L1-DTFS和GBDT-DTFS两种特征选择方法均筛选出了6组分类任务中最有贡献的优势特征,且L1-DTFS特征选择下的Transformer模型对NC与LMCI组的分类预测准确率、精确度、敏感度均达100%,AUC值为1.00。结论Transformer模型在AD病程分类中有较好且稳定的表现,其中在NC与LMCI病程分类组表现更佳。 展开更多
关键词 阿尔茨海默病 轻度认知障碍 磁共振成像 Transformer模型 LR模型 特征选择算法 动态阈值
暂未订购
一种基于错误发现率的模型选择规则
16
作者 荣晶晶 冶继民 《广西师范大学学报(自然科学版)》 北大核心 2026年第1期110-118,共9页
针对高维稀疏线性回归模型,本文从后验估计角度提出基于错误发现率(false discovery rate,FDR)的模型选择FDR规则;之后在其基础上引入动态信噪比(signal-to-noise ratio,SNR)变化因子,提出对SNR变化更稳健且对数据尺度具有不变性的FDR_... 针对高维稀疏线性回归模型,本文从后验估计角度提出基于错误发现率(false discovery rate,FDR)的模型选择FDR规则;之后在其基础上引入动态信噪比(signal-to-noise ratio,SNR)变化因子,提出对SNR变化更稳健且对数据尺度具有不变性的FDR_(R)规则;结合OMP算法,仿真实验对比分别采用FDR规则、FDR_(R)规则和已有规则下成功选择全部真正变量的概率和FDR值,结果表明,相较于其他规则,FDR_(R)规则在高SNR或大样本量下更稳健,对数据缩放问题更加鲁棒,且错误发现率最低;最后,将所提方法应用到套细胞淋巴瘤患者的真实数据,筛选出影响细胞增殖的基因编号。 展开更多
关键词 高维模型选择 错误发现率 FDR_(R) OMP算法 信噪比
在线阅读 下载PDF
基于Cox模型的ST企业复发风险预警研究
17
作者 武燕婷 黄希芬 冯龙 《数理统计与管理》 北大核心 2026年第1期178-190,共13页
伴随着经济的快速发展,上市公司陷入财务困境的情况备受关注,企业多次被实施特别处理(ST)的现象也常有发生。为了能够提前预知并规避风险,从而防止企业ST复发风险的发生,我们引入Cox比例风险模型来构建有效的财务预警系统。Cox模型是一... 伴随着经济的快速发展,上市公司陷入财务困境的情况备受关注,企业多次被实施特别处理(ST)的现象也常有发生。为了能够提前预知并规避风险,从而防止企业ST复发风险的发生,我们引入Cox比例风险模型来构建有效的财务预警系统。Cox模型是一种半参数模型,文章将利用MM算法分离参数的优点来实现模型的参数估计。然而,在以251家上市公司的50个预警指标来构建财务预警系统时,高维度的财务指标数据会影响模型的可靠性和预测精度。因此,文章还将MCP、SCAD惩罚函数引入Cox模型中进行变量选择实现压缩降维,从而筛选出高维协变量中对上市公司发生ST复发风险有重要影响的因素。数值模拟结果表明,两种惩罚函数在Cox模型中的变量选择准确率较高,且MM算法在Cox模型的半参数估计中表现良好。最后,实证分析结果表明,高维Cox回归模型以及所提出的MM优化算法在上市公司财务危机预测中效果良好,能够较为准确地识别财务风险。 展开更多
关键词 ST复发风险 高维Cox回归模型 MM算法 变量选择
原文传递
基于路径选择模型的分路段分时段差异化收费策略优化
18
作者 张奇樊 《公路交通科技》 北大核心 2026年第1期48-58,共11页
【目标】针对高速公路分路段分时段差异化收费中局部路段因过度折扣导致整体收益下滑的问题,本研究提出融合多维度成本分析的路径选择模型,量化车辆路径选择与收益的关联机制,以优化原有收费策略。【方法】基于货车驾驶员出行路径选择行... 【目标】针对高速公路分路段分时段差异化收费中局部路段因过度折扣导致整体收益下滑的问题,本研究提出融合多维度成本分析的路径选择模型,量化车辆路径选择与收益的关联机制,以优化原有收费策略。【方法】基于货车驾驶员出行路径选择行为,构建改进的Logit离散选择模型,将经济成本、时间成本、舒适性成本及安全性成本纳入其效用函数,计算路径选择概率,且通过动态调整步长参数,提出自适应步长的随机梯度上升算法,提升模型求解精度与收敛速度。结合福建省实施分路段分时段差异化收费的9条高速公路数据,设计优惠折扣和优惠路段调整的双重优化方案:针对价格敏感的六类货车,保持折扣不变;缩减四、五类货车日间和夜间折扣;对亏损路段提出替代的优惠路段,通过改进的Logit离散选择模型求解方案的引流成效。【结果】推荐方案日均吸引竞争路段车流727辆及减少低效优惠,可实现全年增加路方收入1634万元,其中费率调整贡献1450万元、路段置换贡献184万元,整体路网收入较政策调整前增加5.6%。【结论】该方法为高速公路差异化收费策略的优化提供了科学依据和实践指导。 展开更多
关键词 智能交通 差异化收费策略 LOGIT模型 货车路径选择 梯度上升算法
原文传递
基于改进NSGA-Ⅱ算法的电动汽车充电站选址方法
19
作者 俞宁 冯鑫 +1 位作者 汤爱华 舒梓荣 《郑州大学学报(理学版)》 北大核心 2026年第2期64-69,共6页
电动汽车充电站的战略布局对电动汽车的发展至关重要。充电站的不合理布局会导致运营成本增加和用户满意度下降。为应对这些挑战,构建了一个综合模型来量化充电站的总运营成本和用户满意度。在该模型中,运营商总成本分为土地成本、建设... 电动汽车充电站的战略布局对电动汽车的发展至关重要。充电站的不合理布局会导致运营成本增加和用户满意度下降。为应对这些挑战,构建了一个综合模型来量化充电站的总运营成本和用户满意度。在该模型中,运营商总成本分为土地成本、建设成本、运营成本和政府补贴。用户满意度则通过充电距离和等待时间来量化。设计了改进的非支配排序遗传算法Ⅱ(non-dominated sorting genetic algorithmⅡ,NSGA-Ⅱ)求解多目标优化模型,解决了在剩余电量、充电距离、充电桩数量等约束下,运营成本最小化、用户满意度最大化的问题。最后,以江北区为例,模拟电动汽车充电站的选址,证实了所提方法的有效性。 展开更多
关键词 电动汽车 充电站选址 多目标模型 改进NSGA-Ⅱ算法
在线阅读 下载PDF
数据驱动下基于时间序列云模型的特征选择聚类算法研究
20
作者 刘小红 张人龙 《统计与决策》 北大核心 2026年第5期41-47,共7页
由于时间序列数据具有多变量性和高维度性特征,增加了重要特征提取的难度,进而降低了高维数据聚类的精度与准确度。因此,针对多变量时间序列数据具有的非线性、高维冗余等特征,文章首先在传统特征选择算法、云模型、复杂时间序列等研究... 由于时间序列数据具有多变量性和高维度性特征,增加了重要特征提取的难度,进而降低了高维数据聚类的精度与准确度。因此,针对多变量时间序列数据具有的非线性、高维冗余等特征,文章首先在传统特征选择算法、云模型、复杂时间序列等研究的基础上,提出了有效的、可拓展的基于时间序列云模型的混合特征选择聚类算法;其次,针对提取出来的多维度时间序列数据特征,应用云模型时间相似度与多目标粒子群优化算法相结合的方法进行特征筛选与特征优化,以获取更多高质量的特征,从而有效提高混合算法的聚类精度;最后,基于高维数据集进行仿真实验,实验结果表明,该混合特征选择算法能有效解决多维度时间序列数据的复杂特征问题。 展开更多
关键词 时间序列 云模型 多目标粒子群优化 混合特征选择 聚类算法
原文传递
上一页 1 2 28 下一页 到第
使用帮助 返回顶部