Discovering new superconductors via traditional trial-and-error experimental approaches is apparently a time-consuming process,and the correlations between the critical temperature(Tc) and material features are still ...Discovering new superconductors via traditional trial-and-error experimental approaches is apparently a time-consuming process,and the correlations between the critical temperature(Tc) and material features are still obscure.The rise of machine learning(ML) technology provides new opportunities to speed up inefficient exploration processes,and could potentially uncover new hints on the unclear correlations.In this work,we utilize open-source materials data,ML models,and data mining methods to explore the correlation between the chemical features and Tcvalues of superconducting materials.To further improve the prediction accuracy,a new model is created by integrating three basic algorithms,showing an enhanced accuracy with the coefficient of determination(R2) score of 95.9 % and root mean square error(RMSE) of 6.3 K.The average marginal contributions of material features towards Tcvalues are estimated to determine the importance of various features during prediction processes.The results suggest that the range thermal conductivity plays a critical role in Tcprediction among all element features.Furthermore,the integrated ML model is utilized to screen out potential twenty superconducting materials with Tcvalues beyond 50.0 K.This study provides insights towards Tcprediction to accelerate the exploration of potential high-Tcsuperconductors.展开更多
In this study,the future landslide population amount risk(LPAR)is assessed based on integrated machine learning models(MLMs)and scenario simulation techniques in Shuicheng County,China.Firstly,multiple MLMs were selec...In this study,the future landslide population amount risk(LPAR)is assessed based on integrated machine learning models(MLMs)and scenario simulation techniques in Shuicheng County,China.Firstly,multiple MLMs were selected and hyperparameters were optimized,and the generated 11 models were crossintegrated to select the best model to calculate landslide susceptibility;by calculating precipitation for different extreme precipitation recurrence periods and combining the susceptibility results to assess the landslide hazard.Using the town as the basic unit,the exposure and vulnerability of the future landslide population under different Shared Socioeconomic Pathways(SSPs)scenarios in each town were assessed,and then combined with the hazard to estimate the LPAR in 2050.The results showed that the integrated model with the optimized random forest model as the combination strategy had the best comprehensive performance in susceptibility assessment.The distribution of hazard classes is similar to susceptibility,and with an increase in precipitation,the low-hazard area and high-hazard decrease and shift to medium-hazard and very high-hazard classes.The high-risk areas for future landslide populations in Shuicheng County are mainly concentrated in the three southwestern towns with high vulnerability,whereas the northern towns of Baohua and Qinglin are at the lowest risk class.The LPAR increased with the intensity of extreme precipitation.The LPAR differs significantly among the SSPs scenarios,with the lowest in the“fossil-fueled development(SSP5)”scenario and the highest in the“regional rivalry(SSP3)”scenario.In summary,the landslide susceptibility model based on integrated machine learning proposed in this study has a high predictive capability.The results of future LPAR assessment can provide theoretical guidance for relevant departments to cope with future socioeconomic development challenges and make corresponding disaster prevention and mitigation plans to prevent landslide risks from a developmental perspective.展开更多
Online banking fraud occurs whenever a criminal can seize accounts and transfer funds from an individual’s online bank account.Successfully preventing this requires the detection of as many fraudsters as possible,wit...Online banking fraud occurs whenever a criminal can seize accounts and transfer funds from an individual’s online bank account.Successfully preventing this requires the detection of as many fraudsters as possible,without producing too many false alarms.This is a challenge for machine learning owing to the extremely imbalanced data and complexity of fraud.In addition,classical machine learning methods must be extended,minimizing expected financial losses.Finally,fraud can only be combated systematically and economically if the risks and costs in payment channels are known.We define three models that overcome these challenges:machine learning-based fraud detection,economic optimization of machine learning results,and a risk model to predict the risk of fraud while considering countermeasures.The models were tested utilizing real data.Our machine learning model alone reduces the expected and unexpected losses in the three aggregated payment channels by 15%compared to a benchmark consisting of static if-then rules.Optimizing the machine-learning model further reduces the expected losses by 52%.These results hold with a low false positive rate of 0.4%.Thus,the risk framework of the three models is viable from a business and risk perspective.展开更多
基金financial supports from the Fund of Science and Technology on Reactor Fuel and Materials Laboratory(JCKYS2019201074)the Affiliated Hospital of Putian University,the Shenzhen Fundamental Research Program(JCYJ20220531095404009)+1 种基金the Shenzhen Knowledge Innovation Plan-Fundamental Research(Discipline Distribution)(JCYJ20180507184623297)the Major Science and Technology Infrastructure Project of Material Genome Big-science Facilities Platform supported by Municipal Development and Reform Commission of Shenzhen。
文摘Discovering new superconductors via traditional trial-and-error experimental approaches is apparently a time-consuming process,and the correlations between the critical temperature(Tc) and material features are still obscure.The rise of machine learning(ML) technology provides new opportunities to speed up inefficient exploration processes,and could potentially uncover new hints on the unclear correlations.In this work,we utilize open-source materials data,ML models,and data mining methods to explore the correlation between the chemical features and Tcvalues of superconducting materials.To further improve the prediction accuracy,a new model is created by integrating three basic algorithms,showing an enhanced accuracy with the coefficient of determination(R2) score of 95.9 % and root mean square error(RMSE) of 6.3 K.The average marginal contributions of material features towards Tcvalues are estimated to determine the importance of various features during prediction processes.The results suggest that the range thermal conductivity plays a critical role in Tcprediction among all element features.Furthermore,the integrated ML model is utilized to screen out potential twenty superconducting materials with Tcvalues beyond 50.0 K.This study provides insights towards Tcprediction to accelerate the exploration of potential high-Tcsuperconductors.
基金supported by“The National Key Research and Development Program of China(2018YFC1508804)The Key Scientific and Technology Program of Jilin Province(20170204035SF)+2 种基金The Key Scientific and Technology Research and Development Program of Jilin Province(20200403074SF)The Key Scientific and Technology Research and Development Program of Jilin Province(20180201035SF)National Natural Science Fund for Young Scholars of China(41907238)”.
文摘In this study,the future landslide population amount risk(LPAR)is assessed based on integrated machine learning models(MLMs)and scenario simulation techniques in Shuicheng County,China.Firstly,multiple MLMs were selected and hyperparameters were optimized,and the generated 11 models were crossintegrated to select the best model to calculate landslide susceptibility;by calculating precipitation for different extreme precipitation recurrence periods and combining the susceptibility results to assess the landslide hazard.Using the town as the basic unit,the exposure and vulnerability of the future landslide population under different Shared Socioeconomic Pathways(SSPs)scenarios in each town were assessed,and then combined with the hazard to estimate the LPAR in 2050.The results showed that the integrated model with the optimized random forest model as the combination strategy had the best comprehensive performance in susceptibility assessment.The distribution of hazard classes is similar to susceptibility,and with an increase in precipitation,the low-hazard area and high-hazard decrease and shift to medium-hazard and very high-hazard classes.The high-risk areas for future landslide populations in Shuicheng County are mainly concentrated in the three southwestern towns with high vulnerability,whereas the northern towns of Baohua and Qinglin are at the lowest risk class.The LPAR increased with the intensity of extreme precipitation.The LPAR differs significantly among the SSPs scenarios,with the lowest in the“fossil-fueled development(SSP5)”scenario and the highest in the“regional rivalry(SSP3)”scenario.In summary,the landslide susceptibility model based on integrated machine learning proposed in this study has a high predictive capability.The results of future LPAR assessment can provide theoretical guidance for relevant departments to cope with future socioeconomic development challenges and make corresponding disaster prevention and mitigation plans to prevent landslide risks from a developmental perspective.
基金from any funding agency in the public,commercial,or not-for-profit sectors.
文摘Online banking fraud occurs whenever a criminal can seize accounts and transfer funds from an individual’s online bank account.Successfully preventing this requires the detection of as many fraudsters as possible,without producing too many false alarms.This is a challenge for machine learning owing to the extremely imbalanced data and complexity of fraud.In addition,classical machine learning methods must be extended,minimizing expected financial losses.Finally,fraud can only be combated systematically and economically if the risks and costs in payment channels are known.We define three models that overcome these challenges:machine learning-based fraud detection,economic optimization of machine learning results,and a risk model to predict the risk of fraud while considering countermeasures.The models were tested utilizing real data.Our machine learning model alone reduces the expected and unexpected losses in the three aggregated payment channels by 15%compared to a benchmark consisting of static if-then rules.Optimizing the machine-learning model further reduces the expected losses by 52%.These results hold with a low false positive rate of 0.4%.Thus,the risk framework of the three models is viable from a business and risk perspective.