ELM-APDPs:An Explainable Ensemble Learning Method for Accurate Prediction of Druggable Proteins

下载PDF

导出

摘要 Identifying druggable proteins,which are capable of binding therapeutic compounds,remains a critical and resource-intensive challenge in drug discovery.To address this,we propose CEL-IDP(Comparison of Ensemble Learning Methods for Identification of Druggable Proteins),a computational framework combining three feature extraction methods Dipeptide Deviation from Expected Mean(DDE),Enhanced Amino Acid Composition(EAAC),and Enhanced Grouped Amino Acid Composition(EGAAC)with ensemble learning strategies(Bagging,Boosting,Stacking)to classify druggable proteins from sequence data.DDE captures dipeptide frequency deviations,EAAC encodes positional amino acid information,and EGAAC groups residues by physicochemical properties to generate discriminative feature vectors.These features were analyzed using ensemble models to overcome the limitations of single classifiers.EGAAC outperformed DDE and EAAC,with Random Forest(Bagging)and XGBoost(Boosting)achieving the highest accuracy of 71.66%,demonstrating superior performance in capturing critical biochemical patterns.Stacking showed intermediate results(68.33%),while EAAC and DDE-based models yielded lower accuracies(56.66%–66.87%).CEL-IDP streamlines large-scale druggability prediction,reduces reliance on costly experimental screening,and aligns with global initiatives like Target 2035 to expand action-able drug targets.This work advances machine learning-driven drug discovery by systematizing feature engineering and ensemble model optimization,providing a scalable workflow to accelerate target identification and validation.

作者 Mujeebu Rehman Qinghua Liu Ali Ghulam Tariq Ahmad Jawad Khan Dildar Hussain Yeong Hyeon Gu

机构地区 School of Information and Communication Engineering Information Technology Centre School of Electrical and Information Engineering School of Computing Department AI and Data Science

出处《Computer Modeling in Engineering & Sciences》 2025年第10期779-805,共27页 工程与科学中的计算机建模(英文)

基金 supported by the MSIT(Ministry of Science and ICT),Korea,under the ITRC(Information Technology Research Centre)support program(IITP-2024-RS-2024-00437191) supervised by the IITP(Institute for Information&Communications Technology Planning&Evaluation).

关键词 Druggable proteins ensemble learning computational drug discovery pharmacological target identification machine learning feature extraction

分类号 TP181 [自动化与计算机技术—控制理论与控制工程] R91 [医药卫生—药学]

引文网络
相关文献

1黄建,钟永洪.一种新型咸味肽的合成及其性能研究[J].食品科技,2025,50(7):253-256.
2Heng Zhao,Jiehao Chen,Xianghua Fu.Regularization for Deep Imbalanced Regression Based on Quantitative Relationship[J].Big Data Mining and Analytics,2025,8(4):951-965.
3Qiang Zhang,Wenxiang Zhao,Tao Tao,Zongwang Li.Suppressing Dual Zero-sequence Current in Dual Three-phase Open-winding PMSM Using Multi-zero-vectors Hysteresis Control[J].Chinese Journal of Electrical Engineering,2025,11(3):167-177.
4杨德全,陈一,战欣.双壳贝类足丝蛋白预测模型构建和功能保守区域特征[J].海洋与湖沼,2025,56(5):1226-1233.
5Jun Liu,Zhaoyu Feng,Renming Pan,Xiaolong Yu,Meijuan Zhou,Gang Zhao,Hongyu Wang.Enantioselective regulation to coronal polyheterocyclic compounds via phosphonium salt-catalyzed cycloadditions of azomethine imines with γ-butenolides[J].Chinese Chemical Letters,2025,36(8):283-289.
6Peihua Wangyang,Xiaolin Huang,Xiao-Lei Shi,Niuniu Zhang,Yu Ye,Shuangzhi Zhao,Jiamin Zhang,Yingbo Liu,Fabi Zhang,Xingpeng Liu,Haiou Li,Tangyou Sun,Ying Peng,Zhi-Gang Chen.Advances in Schottky parameter extraction and applications[J].Journal of Materials Science & Technology,2025,218(15):317-335.
7Qingqing Shi,Pengchao Liu,Biao Yu,Peng Xu.Efficient Synthesis of Anticoagulant Fondaparinux via Orthogonal One-Pot Glycosylation and Microwave-Assisted Simultaneous O,N-Sulfonation[J].Chinese Journal of Chemistry,2025,43(18):2325-2330.
8Minghui Shang,Chen Tang,Yunfei Sun,Yongxu Cheng.Nutritional quality of adult Eriocheir sinensis from the Yangtze River and Yellow River populations cultured in ponds at the Yellow River estuary area[J].Aquaculture and Fisheries,2025,10(5):836-842.
9Haiyang Zhou,Yu Wu,Chunhui Liu,Haozhe Geng,Chenyu Yao.Study on the diffusion and migration law of CO_(2)sequestrated in abandoned coal mine goaf[J].Deep Underground Science and Engineering,2025,4(4):530-547.
10Naibiao Yu,Dengshuai Cui,Chenyu Li,Siyu Yang,Chuanmin Qiao,Lei Xie.Multi-omics integration reveals Chr1 associated QTL mediating backfat thickness in pigs[J].Journal of Animal Science and Biotechnology,2025,16(6):2641-2657.

Computer Modeling in Engineering & Sciences

2025年第10期

浏览历史

内容加载中请稍等...

ELM-APDPs:An Explainable Ensemble Learning Method for Accurate Prediction of Druggable Proteins

相关作者

相关机构

相关主题

浏览历史