The methods of network attacks have become increasingly sophisticated,rendering traditional cybersecurity defense mechanisms insufficient to address novel and complex threats effectively.In recent years,artificial int...The methods of network attacks have become increasingly sophisticated,rendering traditional cybersecurity defense mechanisms insufficient to address novel and complex threats effectively.In recent years,artificial intelligence has achieved significant progress in the field of network security.However,many challenges and issues remain,particularly regarding the interpretability of deep learning and ensemble learning algorithms.To address the challenge of enhancing the interpretability of network attack prediction models,this paper proposes a method that combines Light Gradient Boosting Machine(LGBM)and SHapley Additive exPlanations(SHAP).LGBM is employed to model anomalous fluctuations in various network indicators,enabling the rapid and accurate identification and prediction of potential network attack types,thereby facilitating the implementation of timely defense measures,the model achieved an accuracy of 0.977,precision of 0.985,recall of 0.975,and an F1 score of 0.979,demonstrating better performance compared to other models in the domain of network attack prediction.SHAP is utilized to analyze the black-box decision-making process of the model,providing interpretability by quantifying the contribution of each feature to the prediction results and elucidating the relationships between features.The experimental results demonstrate that the network attack predictionmodel based on LGBM exhibits superior accuracy and outstanding predictive capabilities.Moreover,the SHAP-based interpretability analysis significantly improves the model’s transparency and interpretability.展开更多
The first 2^(+)excited states of the nucleus directly reflect the interaction between the shell structure and the nucleus,providing insights into the validity of the shell model and nuclear structure characteristics.A...The first 2^(+)excited states of the nucleus directly reflect the interaction between the shell structure and the nucleus,providing insights into the validity of the shell model and nuclear structure characteristics.Although the features of the first 2^(+)excited states can be measured for stable nuclei and calculated using nuclear models,significant uncertainty remains.This study employs a machine learning model based on a light gradient boosting machine(LightGBM)to investigate the first 2^(+)excited states.Specifically,the training of the LightGBM algorithm and the prediction of the first 2^(+)properties of 642 nuclei are presented.Furthermore,detailed comparisons of the LightGBM predictions were performed with available experimental data,shell model calculations,and Bayesian neural network predictions.The results revealed that the average difference between the LightGBM predictions and the experimental data was 18 times smaller than that obtained by the shell model and only 70%of the BNN prediction results.Considering Mg,Ca,Kr,Sm,and Pb isotopes as examples,it was also observed that LightGBM can effectively reproduce the magic number mutation caused by shell effects,with the energy being as low as 0.04 MeV due to shape coexistence.Therefore,we believe that leveraging LightGBM-based machine learning can profoundly enhance our insights into nuclear structures and provide new avenues for nuclear physics research.展开更多
Addressing classification and prediction challenges, tree ensemble models have gained significant importance. Boosting ensemble techniques are commonly employed for forecasting Type-II diabetes mellitus. Light Gradien...Addressing classification and prediction challenges, tree ensemble models have gained significant importance. Boosting ensemble techniques are commonly employed for forecasting Type-II diabetes mellitus. Light Gradient Boosting Machine (LightGBM) is a widely used algorithm known for its leaf growth strategy, loss reduction, and enhanced training precision. However, LightGBM is prone to overfitting. In contrast, CatBoost utilizes balanced base predictors known as decision tables, which mitigate overfitting risks and significantly improve testing time efficiency. CatBoost’s algorithm structure counteracts gradient boosting biases and incorporates an overfitting detector to stop training early. This study focuses on developing a hybrid model that combines LightGBM and CatBoost to minimize overfitting and improve accuracy by reducing variance. For the purpose of finding the best hyperparameters to use with the underlying learners, the Bayesian hyperparameter optimization method is used. By fine-tuning the regularization parameter values, the hybrid model effectively reduces variance (overfitting). Comparative evaluation against LightGBM, CatBoost, XGBoost, Decision Tree, Random Forest, AdaBoost, and GBM algorithms demonstrates that the hybrid model has the best F1-score (99.37%), recall (99.25%), and accuracy (99.37%). Consequently, the proposed framework holds promise for early diabetes prediction in the healthcare industry and exhibits potential applicability to other datasets sharing similarities with diabetes.展开更多
In this paper,an advanced and optimized Light Gradient Boosting Machine(LGBM)technique is proposed to identify the intrusive activities in the Internet of Things(IoT)network.The followings are the major contributions:...In this paper,an advanced and optimized Light Gradient Boosting Machine(LGBM)technique is proposed to identify the intrusive activities in the Internet of Things(IoT)network.The followings are the major contributions:i)An optimized LGBM model has been developed for the identification of malicious IoT activities in the IoT network;ii)An efficient evolutionary optimization approach has been adopted for finding the optimal set of hyper-parameters of LGBM for the projected problem.Here,a Genetic Algorithm(GA)with k-way tournament selection and uniform crossover operation is used for efficient exploration of hyper-parameter search space;iii)Finally,the performance of the proposed model is evaluated using state-of-the-art ensemble learning and machine learning-based model to achieve overall generalized performance and efficiency.Simulation outcomes reveal that the proposed approach is superior to other considered methods and proves to be a robust approach to intrusion detection in an IoT environment.展开更多
Coal rock mass instability fracture may result in serious hazards to underground coal mining.Acoustic emissions(AE)stimulated by internal structure fracture should carry lots of favorable information about health cond...Coal rock mass instability fracture may result in serious hazards to underground coal mining.Acoustic emissions(AE)stimulated by internal structure fracture should carry lots of favorable information about health condition of rock mass.AE as a sensitive non-destructive test method is gradually utilized to detect anomaly conditions of coal rock.This paper proposes an improved multi-resolution feature to extract AE waveform at different frequency resolutions using Coilflet Wavelet Transform method(CWT).It is further adopt an efficient Light Gradient Boosting Machine(LightGBM)by several cascaded sub weak classifier models to merge AE features at different views of frequency for coal rock anomaly damage recognition.The results denote that the proposed method achieves excellent recognition performance on anomaly damage levels of coal rock.It is an effective method to detect the critical stability further to predict the rock mass bursting in time.展开更多
基金supported by the National Natural Science Foundation of China Project(No.62302540)please visit their website at https://www.nsfc.gov.cn/(accessed on 18 June 2024).
文摘The methods of network attacks have become increasingly sophisticated,rendering traditional cybersecurity defense mechanisms insufficient to address novel and complex threats effectively.In recent years,artificial intelligence has achieved significant progress in the field of network security.However,many challenges and issues remain,particularly regarding the interpretability of deep learning and ensemble learning algorithms.To address the challenge of enhancing the interpretability of network attack prediction models,this paper proposes a method that combines Light Gradient Boosting Machine(LGBM)and SHapley Additive exPlanations(SHAP).LGBM is employed to model anomalous fluctuations in various network indicators,enabling the rapid and accurate identification and prediction of potential network attack types,thereby facilitating the implementation of timely defense measures,the model achieved an accuracy of 0.977,precision of 0.985,recall of 0.975,and an F1 score of 0.979,demonstrating better performance compared to other models in the domain of network attack prediction.SHAP is utilized to analyze the black-box decision-making process of the model,providing interpretability by quantifying the contribution of each feature to the prediction results and elucidating the relationships between features.The experimental results demonstrate that the network attack predictionmodel based on LGBM exhibits superior accuracy and outstanding predictive capabilities.Moreover,the SHAP-based interpretability analysis significantly improves the model’s transparency and interpretability.
基金supported by the National Key R&D Program of China (No. 2022YFA1603300)the Romanian Ministry of Research,Innovation and Digitalization under Contract PN 23.21.01.06+1 种基金The ELI-RO project with Contract ELI-RORDI-2024-008 (AMAP)a grant from the Romanian Ministry of Research,Innovation and Digitization,CNCS-UEFIS-CDI,with project numbers PN-Ⅲ-P4-PCE-2021-1014, PN-Ⅲ-P4-PCE-2021-0595, and PN-Ⅲ-P1-1.1-TE2021-1464 within PNCDI Ⅲ
文摘The first 2^(+)excited states of the nucleus directly reflect the interaction between the shell structure and the nucleus,providing insights into the validity of the shell model and nuclear structure characteristics.Although the features of the first 2^(+)excited states can be measured for stable nuclei and calculated using nuclear models,significant uncertainty remains.This study employs a machine learning model based on a light gradient boosting machine(LightGBM)to investigate the first 2^(+)excited states.Specifically,the training of the LightGBM algorithm and the prediction of the first 2^(+)properties of 642 nuclei are presented.Furthermore,detailed comparisons of the LightGBM predictions were performed with available experimental data,shell model calculations,and Bayesian neural network predictions.The results revealed that the average difference between the LightGBM predictions and the experimental data was 18 times smaller than that obtained by the shell model and only 70%of the BNN prediction results.Considering Mg,Ca,Kr,Sm,and Pb isotopes as examples,it was also observed that LightGBM can effectively reproduce the magic number mutation caused by shell effects,with the energy being as low as 0.04 MeV due to shape coexistence.Therefore,we believe that leveraging LightGBM-based machine learning can profoundly enhance our insights into nuclear structures and provide new avenues for nuclear physics research.
文摘Addressing classification and prediction challenges, tree ensemble models have gained significant importance. Boosting ensemble techniques are commonly employed for forecasting Type-II diabetes mellitus. Light Gradient Boosting Machine (LightGBM) is a widely used algorithm known for its leaf growth strategy, loss reduction, and enhanced training precision. However, LightGBM is prone to overfitting. In contrast, CatBoost utilizes balanced base predictors known as decision tables, which mitigate overfitting risks and significantly improve testing time efficiency. CatBoost’s algorithm structure counteracts gradient boosting biases and incorporates an overfitting detector to stop training early. This study focuses on developing a hybrid model that combines LightGBM and CatBoost to minimize overfitting and improve accuracy by reducing variance. For the purpose of finding the best hyperparameters to use with the underlying learners, the Bayesian hyperparameter optimization method is used. By fine-tuning the regularization parameter values, the hybrid model effectively reduces variance (overfitting). Comparative evaluation against LightGBM, CatBoost, XGBoost, Decision Tree, Random Forest, AdaBoost, and GBM algorithms demonstrates that the hybrid model has the best F1-score (99.37%), recall (99.25%), and accuracy (99.37%). Consequently, the proposed framework holds promise for early diabetes prediction in the healthcare industry and exhibits potential applicability to other datasets sharing similarities with diabetes.
文摘In this paper,an advanced and optimized Light Gradient Boosting Machine(LGBM)technique is proposed to identify the intrusive activities in the Internet of Things(IoT)network.The followings are the major contributions:i)An optimized LGBM model has been developed for the identification of malicious IoT activities in the IoT network;ii)An efficient evolutionary optimization approach has been adopted for finding the optimal set of hyper-parameters of LGBM for the projected problem.Here,a Genetic Algorithm(GA)with k-way tournament selection and uniform crossover operation is used for efficient exploration of hyper-parameter search space;iii)Finally,the performance of the proposed model is evaluated using state-of-the-art ensemble learning and machine learning-based model to achieve overall generalized performance and efficiency.Simulation outcomes reveal that the proposed approach is superior to other considered methods and proves to be a robust approach to intrusion detection in an IoT environment.
基金This work is supported by the National Nature Science Foundation of China(No.51875100,No.61673108,No.61674133)The authors would like to thank anonymous reviewers and the associate editor,whose constructive comments help improve the presentation of this work.
文摘Coal rock mass instability fracture may result in serious hazards to underground coal mining.Acoustic emissions(AE)stimulated by internal structure fracture should carry lots of favorable information about health condition of rock mass.AE as a sensitive non-destructive test method is gradually utilized to detect anomaly conditions of coal rock.This paper proposes an improved multi-resolution feature to extract AE waveform at different frequency resolutions using Coilflet Wavelet Transform method(CWT).It is further adopt an efficient Light Gradient Boosting Machine(LightGBM)by several cascaded sub weak classifier models to merge AE features at different views of frequency for coal rock anomaly damage recognition.The results denote that the proposed method achieves excellent recognition performance on anomaly damage levels of coal rock.It is an effective method to detect the critical stability further to predict the rock mass bursting in time.
文摘针对历史负荷特征提取困难所导致的短期电力负荷预测精度不高的问题,提出了基于堆叠泛化集成思想的逻辑斯谛灰狼优化-极限梯度提升-轻量级梯度提升机-门控循环单元(logistic grey wolf optimizer-extreme gradient boosting-light gradient boosting machine-gated recurrent unit, LGWO-XGBoost-LightGBM-GRU)的短期电力负荷预测算法。该算法首先使用逻辑斯谛映射对灰狼优化(grey wolf optimizer, GWO)算法进行改进得到LGWO算法,接着使用LGWO算法分别对XGBoost、LightGBM、GRU算法进行参数寻优,然后使用XGBoost、LightGBM算法对数据的不同特征进行初步提炼,最后将提炼的特征合并到历史负荷数据集中作为输入,并使用GRU进行最终的负荷预测,得到预测结果。以某工业园区的负荷预测为例进行验证,结果表明,该算法与最小二乘支持向量机(least squares support vector machines, LS-SVM)算法相比,均方根误差降低了68.85%,平均绝对误差降低了69.57%,平均绝对百分比误差降低了69.97%,决定系数提高了8.42%。该算法提高了短期电力负荷预测的精度。