Titanium-based semiconductors are known for their high chemical stability and suitable band gap widths. However, conventional experimental screening methods are inefficient because of the wide variety of candidate materials. To speed up the selection process, this work focuses on interpretable feature learning and band gap prediction for titanium-based semiconductors. First, titanium compounds were selected from the Materials Project database by machine learning, and elemental features were extracted using the Magpie descriptors. Then, principal component analysis (PCA) was applied to reduce the data dimensionality, creating a representative dataset. Meanwhile, heatmaps and SHAP (SHapley Additive exPlanations) methods were used to demonstrate the influence of key features such as electronegativity, covalent radius, period number, and unit cell volume on the band gap, clarifying the relationship between the material's properties and performance. After comparing different machine learning models, including Random Forest (RF), Support Vector Machines (SVM), Linear Regression (LR), and Gradient Boosting Regression (GBR), RF was found to be the most accurate for band gap prediction. Finally, the model performance was improved through parameter tuning, showing high accuracy. These findings provide strong data support and design guidance for the development of materials in fields such as photocatalysis and solar cells.
Predicting monsoon climate is one of the major endeavors in climate science and is becoming increasingly challenging due to global warming. The accuracy of monsoon seasonal predictions significantly impacts the lives of billions who depend on or are affected by monsoons, as it is essential for the water cycle, food security, ecology, disaster prevention, and the economy of monsoon regions. Given the extensive literature on Asian monsoon climate prediction, we limit our focus to reviewing the seasonal prediction and predictability of the Asian Summer Monsoon (ASM). However, much of this review is also relevant to monsoon predictions in other seasons and regions. Over the past two decades, considerable progress has been made in the seasonal forecasting of the ASM, driven by an enhanced understanding of the sources of predictability and the dynamics of seasonal variability, along with advanced development in sophisticated models and technologies. This review centers on advances in understanding the physical foundation for monsoon climate prediction (section 2), significant findings and insights into the primary and regional sources of predictability arising from feedback processes among various climate components (sections 3 and 4), the effects of global warming and external forcings on predictability (section 5), developments in seasonal prediction models and techniques (section 6), the challenges and limitations of monsoon climate prediction (section 7), and emerging research trends with suggestions for future directions (section 8). We hope this review will stimulate creative activities to enhance monsoon climate prediction.
Pinus radiata (D. Don) dominates New Zealand's forestry industry, constituting 91% of plantations, and is among the world's most important plantation species. Given the socio-economic and environmental importance of this species, accurate and precise projections over time are needed to make efficient decisions for forest management and greenfield investments in afforestation projects, especially for permanent carbon forests. Future projections of any natural resource system rely on modeling; however, the acceleration of climate change makes future projections of yield less certain. These challenges also affect national expectations of the contribution planted forests will make to addressing climate change and meeting international commitments under the Paris Agreement. Using a large national-scale set of contemporary ground-measured data (2013–2023), this study investigates the performance of two growth models developed over 30 years ago that are widely used by NZ plantation growers: 1) the Pumice Plateau Model 1988 (PPM88) and 2) the 300-index (including a model variant of regional drift). Model simulations were made using the FORECASTER modeling suite with geographic boundaries to adjust for drift in space and time. Basal area (BA, m²·ha⁻¹) and volume (m³·ha⁻¹) were simulated, and standard errors and goodness-of-fit metrics were calculated up to a typical rotation age of 30 years. Model residuals were then separated and analysed for the main plantation growing regions. The models overpredicted observed growth by between 6.8% and 16.2%, but model predictions and errors varied significantly between regions. The results of this study provide clear evidence of divergence between the outputs of both models and the measured data. Finally, this study suggests future measures to address the challenges posed by these discrepancies, which will provide better information for forest management and investment decisions in a changing climate.
The objective of the current study is to investigate an adaptive predictive observer-based autopilot for a skid-to-turn (STT) missile model with uncertainties and unknown dynamic equations. A predictive control for the STT missile is designed based on nonlinear model predictive control (NMPC) using Taylor series expansion, after which the unknown functions are approximated via a neural network (NN). The present study also evaluates an adaptive optimal observer for the nonlinear system based on a new strategy. Specifically, to estimate future missile states such as normal acceleration and its derivatives, a Taylor series expansion of the states was first obtained to any specified order over their receding horizons. To address the problem of prediction error, an analytic solution was derived that yields a closed form for the nonlinear optimal observer. From the gains resulting from this analytic solution, the proposed observer gain was selected optimally to satisfy the stability condition. The adaptive predictive autopilot was then combined with the adaptive optimal observer scheme to secure performance, requiring only the estimated normal acceleration and its derivatives. No angular velocity measurement or wind angle estimation was required. Ultimately, the proposed technique was found effective, as confirmed by the qualitative simulation results.
Deep learning has undeniably sharpened our ability to forecast risk in neuropsychiatry [1]. Yet the very success of prediction has exposed a deeper limitation: we are still remarkably uncertain about which levers to pull to change patient trajectories [2]. Accurate risk scores that cannot be translated into credible actions leave clinicians where they began, testing symptomatic fixes and hoping for the best. If we want to move beyond this impasse, the next step is not simply to train larger models, but to rethink what we ask of them.
Thermal power plants are the main contributors to greenhouse gas emissions. Predicting these emissions supports decision makers and environmental sustainability. The objective of this study is to enhance the accuracy of emission prediction models, supporting more effective real-time monitoring and enabling informed operational decisions that align with environmental compliance efforts. This paper presents a data-driven approach for the accurate prediction of gas emissions, specifically nitrogen oxides (NOx) and carbon monoxide (CO), in natural gas power plants using an optimized hybrid machine learning framework. The proposed model integrates a Feedforward Neural Network (FFNN) trained using Particle Swarm Optimization (PSO) to capture the nonlinear emission dynamics under varying gas turbine operating conditions. To further enhance predictive performance, the K-Nearest Neighbor (K-NN) algorithm serves as a post-processing step that adjusts and refines the IPSO-FFNN predictions, while Neighborhood Components Analysis (NCA) is used to identify and rank the most influential operational variables. The main contribution of the study is the combination of NCA feature selection, PSO global optimization, FFNN nonlinear modelling, and K-NN error correction in one unified system for emission prediction. The model was developed and tested using a real-world dataset collected from gas-fired turbine operations, with validated results demonstrating robust accuracy: Root Mean Square Error (RMSE) values of 0.355 for CO and 0.368 for NOx. When benchmarked against conventional models such as a standard FFNN, Support Vector Regression, and Long Short-Term Memory networks, the hybrid model achieved substantial improvements: up to 97.8% in Mean Squared Error (MSE), 95% in Mean Absolute Error (MAE), and 85.19% in RMSE for CO; and 97.16% in MSE, 93.4% in MAE, and 83.15% in RMSE for NOx. These results underscore the model's potential for improving emission prediction, thereby supporting enhanced operational efficiency and adherence to environmental standards.
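The MSE and RMSE improvements quoted above can be cross-checked against each other, because RMSE is the square root of MSE. The short sketch below shows that arithmetic. It assumes that both improvements are measured against the same baseline model on the same test data, which the abstract does not state; the function name is illustrative only.

```python
import math


def rmse_improvement_from_mse(mse_improvement_pct: float) -> float:
    """Relative RMSE reduction (%) implied by a relative MSE reduction (%).

    Assumption: both reductions compare the same pair of models on the
    same data, so RMSE_new / RMSE_old = sqrt(MSE_new / MSE_old).
    """
    remaining_mse_fraction = 1.0 - mse_improvement_pct / 100.0
    if not 0.0 <= remaining_mse_fraction <= 1.0:
        raise ValueError("MSE improvement must lie between 0 and 100 percent")
    return (1.0 - math.sqrt(remaining_mse_fraction)) * 100.0


# Implied RMSE reductions for the reported MSE reductions (CO, then NOx).
for label, mse_pct in (("CO", 97.8), ("NOx", 97.16)):
    print(label, round(rmse_improvement_from_mse(mse_pct), 2))
```

Under that assumption, the implied RMSE reductions fall close to the reported 85.19% (CO) and 83.15% (NOx), which suggests the reported figures are internally consistent.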
Artificial Intelligence (AI) in healthcare enables diabetes prediction using data-driven methods instead of traditional screening techniques such as hemoglobin A1c (HbA1c), the oral glucose tolerance test (OGTT), and fasting plasma glucose (FPG), which are invasive and limited in scale. Machine learning (ML) and deep neural network (DNN) models use large datasets to learn complex, nonlinear feature interactions, but conventional ML algorithms are data sensitive and often show unstable predictive accuracy. DNN models are more robust, though reaching a high accuracy rate consistently on heterogeneous datasets is still an open challenge. For predicting diabetes, this work proposes a hybrid DNN approach that integrates a bidirectional long short-term memory (BiLSTM) network with a bidirectional gated recurrent unit (BiGRU). A robust deep learning (DL) model was developed by combining various datasets with weighted coefficients, dense operations in the connections between deep layers, and output aggregation using batch normalization and dropout to avoid overfitting. The goal of this hybrid model is better generalization and consistency across datasets, which facilitates effective management and early intervention. The proposed DNN model exhibits excellent predictive performance compared with state-of-the-art and baseline ML and DNN models for diabetes prediction tasks. The robust performance indicates the possible usefulness of DL-based models for disease prediction in healthcare and in other areas that demand high-quality analytics.
The genetic basis of early-stage salt tolerance in alfalfa (Medicago sativa L.), a key factor limiting its productivity, remains poorly understood. To dissect this complex trait, we integrate genome-wide association studies (GWAS) and transcriptomics from 176 accessions within a machine-learning-based genomic prediction framework. Analysis reveals weak genetic correlations among four salt-tolerance traits and a gradual decline in performance under increasing salt stress. GWAS identify 60 significantly associated SNPs, with the highest number detected under 100 mM salt stress. Salt tolerance exhibits an additive effect from favorable haplotypes, which are most abundant in Chinese accessions. GWAS-associated genes are related to key regulators of hormone signaling and osmotic adjustment, while transcriptome analysis indicates a global repression of stress-responsive transcription factors. Integrating these multi-omics datasets allows us to identify 14 candidate genes, including MsHSD1 (seed dormancy) and MsMTATP6 (energy metabolism). Crucially, incorporating these markers into genomic prediction models improves cross-population predictive accuracy to an average of 54.4%. This study provides insights into the genetic architecture of salt tolerance in alfalfa and offers valuable markers to facilitate molecular breeding.
Arctic sea ice concentration (SIC) prediction on a subseasonal scale plays an important role in polar navigation. To reduce the high uncertainty of daily forecasts, three time series prediction models are combined with empirical orthogonal function (EOF) decomposition to forecast Arctic pentad-mean SIC. Each month is divided into six pentad-means: the first five each span five days, and the last covers the remaining days, so its length may vary. The models were trained on SIC data from 1989 to 2018 and tested from 2019 to 2023, with lead times ranging from 1 to 12 pentad-means. Model skill was evaluated based on SIC spatial patterns, sea ice area (SIA), and the sea ice edge in September from 2019 to 2023. The moving-averaged 2-m temperature helps reduce the long short-term memory model's error in the Beaufort and Chukchi Seas. Based on the models' scores for each EOF time series, weighted ensemble prediction results were obtained. These results outperform two benchmark models across all lead times. In addition, the ensemble prediction better reproduces the seasonal cycle of the SIA, with relative errors ranging from 1.04% to 3.85%. The predicted September ice edge closely matches observations, with binary accuracy consistently above 90%. Forecast models show the lowest errors in the central Arctic, while relatively higher errors appear in the Barents and Kara Seas.
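The pentad-mean division described above (five 5-day blocks, then one block holding the rest of the month) can be sketched as follows. This is a minimal illustration rather than the authors' code; the function names `pentad_bounds` and `pentad_means` are hypothetical, and the input is assumed to be one value per calendar day of a single month.

```python
import calendar


def pentad_bounds(year: int, month: int) -> list[tuple[int, int]]:
    """Return six (first_day, last_day) pairs, 1-indexed and inclusive."""
    days_in_month = calendar.monthrange(year, month)[1]
    # Pentads 1-5 always cover days 1-5, 6-10, ..., 21-25.
    bounds = [(1 + 5 * k, 5 + 5 * k) for k in range(5)]
    # Pentad 6 covers day 26 to month end (3 to 6 days long).
    bounds.append((26, days_in_month))
    return bounds


def pentad_means(daily_values: list[float], year: int, month: int) -> list[float]:
    """Average one month of daily values into six pentad means."""
    bounds = pentad_bounds(year, month)
    if len(daily_values) != bounds[-1][1]:
        raise ValueError("expected one value per day of the month")
    return [
        sum(daily_values[first - 1:last]) / (last - first + 1)
        for first, last in bounds
    ]


# The last pentad of February 2020 (a leap year) spans days 26 to 29.
print(pentad_bounds(2020, 2)[-1])
```

Because the sixth block absorbs the leftover days, every month yields exactly six values, which keeps lead times expressed in pentad-means aligned with calendar months.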
With the widespread deployment of assembly robots in smart manufacturing, efficiently offloading tasks and allocating resources in highly dynamic industrial environments has become a critical challenge for Mobile Edge Computing (MEC). To address this challenge, this paper constructs a cloud-edge-end collaborative MEC system that enables assembly robots to offload complex workflow tasks via multiple paths (horizontal, vertical, and hybrid collaboration). To mitigate uncertainties arising from mobility, a location prediction module is employed. This enables proactive channel-quality estimation, providing forward-looking insights for offloading decisions. Furthermore, we propose a fairness-aware joint optimization framework. Using an improved Multi-Agent Deep Reinforcement Learning (MADRL) algorithm whose reward function incorporates total system cost, positional reliability, and timeout penalties, the framework aims to balance resource distribution among assembly robots while maximizing system utility. Simulation results demonstrate that the proposed framework outperforms traditional offloading strategies. By integrating predictive mobility management with fairness-aware optimization, the framework offers a robust solution for dynamic industrial MEC environments.
Satellite clock bias (SCB) prediction is essential for enhancing the accuracy and reliability of real-time precise point positioning (RT-PPP) in Global Navigation Satellite Systems (GNSS). To address the nonlinearity, non-stationarity, and short-term interruptions of SCB data under complex environments, this paper proposes an enhanced SCB prediction model combining Temporal Convolutional Networks (TCN) and Transformers. Experimental results indicate that, in a 24-h prediction task, the proposed model reduces root mean square error (RMSE) by 95.6%, 86.0%, and 61.3% and range error (RE) by 93.7%, 86.3%, and 58.8% compared with the LSTM, Transformer, and CNN-BiGRU-Attention models, respectively, while improving computational efficiency by 48.6% over the Transformer. Moreover, although the clock bias products generated by the proposed method result in slightly higher static PPP positioning errors than the International GNSS Service (IGS) rapid clock products, the error differences are generally at the millimeter level, demonstrating the feasibility of using predicted clock bias products to replace rapid clock products in the short term. This method addresses the PPP positioning issue during short-term network service interruptions from the perspective of time series prediction and provides potential solutions for engineering applications such as landslide, earthquake, and subsidence monitoring.
The integration of machine learning (ML) technology with Internet of Things (IoT) systems is producing essential changes in healthcare operations. Healthcare personnel can track patients around the clock thanks to healthcare IoT (H-IoT) technology, which also provides proactive statistical findings and precise medical diagnoses that enhance healthcare performance. This study examines how ML might support IoT-based healthcare systems, namely in the areas of prognostic systems, disease detection, patient tracking, and healthcare operations control. The study looks at the benefits and drawbacks of several machine learning techniques for H-IoT applications. It also examines the fundamental problems these systems face, such as data security, cyberthreats, and high processing demands. In addition, the study discusses the advantages of the technologies involved, including machine learning, deep learning, and the Internet of Things, as well as the significant difficulties that arise when integrating them into healthcare forecasting.
Accurate prediction of rockburst intensity levels is crucial for ensuring the safety of deep hard rock engineering construction. This paper introduces an expert system for rockburst intensity level prediction that employs machine learning algorithms as the basis for its inference rules. The system comprises four modules: a database, a repository, an inference engine, and an interpreter. A database containing 1114 rockburst cases was used to construct 357 datasets that serve as the repository for the expert system. Additionally, 19 types of machine learning algorithms were used to establish 6783 micro-models to construct cognitive rules within the inference engine. By integrating probability theory and marginal analysis, a fuzzy scoring method based on the SoftMax function was developed and applied to the interpreter for rockburst intensity level prediction, effectively restoring the continuity of rockburst characteristics. The research results indicate that ensemble algorithms based on decision trees are more effective in capturing the characteristics of rockburst. Key factors for accurate prediction of rockburst intensity include uniaxial compressive strength, elastic energy index, the maximum principal stress, tangential stress, and their composite indicators. The accuracy of the proposed expert system was verified using 20 engineering rockburst cases, with predictions aligning closely with the actual rockburst intensity levels.
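The abstract does not give the form of its SoftMax-based fuzzy scoring method, so the following is only a generic illustration of how a softmax can turn discrete class scores into a continuous intensity value. The per-level scores, the four-level scale (indexed 0 to 3), and the use of an expected level are all assumptions for illustration, not the authors' method.

```python
import math


def softmax(scores: list[float]) -> list[float]:
    """Numerically stable softmax: subtract the max before exponentiating."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def continuous_level(scores: list[float]) -> float:
    """Expected intensity level under the softmax probabilities.

    `scores` holds one aggregated score per intensity level (hypothetical,
    e.g. summed outputs of many micro-models). The result is a real number
    between 0 and len(scores) - 1 rather than a hard class label.
    """
    probabilities = softmax(scores)
    return sum(level * p for level, p in enumerate(probabilities))


# Scores leaning toward level 2, with some support for levels 1 and 3.
print(continuous_level([0.1, 1.2, 2.5, 1.0]))
```

A value between two integer levels then signals a borderline case instead of forcing it into one class, which is one plausible reading of "restoring the continuity" of the rockburst characteristics.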
Precise forecasts of wildfire danger are crucial for proactive fuel management and emergency responses, yet they pose a challenge at the subseasonal scale due to limitations in prediction capabilities and a gap between forecast outputs and the needs of decision-makers. This study introduces an innovative hybrid modeling framework that integrates artificial intelligence (AI) with climate dynamic prediction systems to accurately forecast High Fire-Danger Days (HFDDs) for the following month. These HFDDs are derived from historical satellite fire data and the optimum fire danger index, with a particular focus on Southwest China as a case study. The AI module, based on the ResNet-18 neural network model, integrates observational and physically constrained analysis to establish links between HFDDs and optimal predictors of atmospheric circulation from both the concurrent and preceding months. Leveraging climate dynamical forecasting, this hybrid model provides more reliable deterministic predictions for monthly HFDDs than conventional methods that rely solely on terrestrial variables such as precipitation. More importantly, the integration of dynamical ensemble prediction enhances the model's capability for skillful probabilistic predictions of HFDDs, facilitating the creation of customized fire danger outlooks and emergency action maps tailored to stakeholders' needs. The model's added economic value was also evaluated, demonstrating its potential to improve decision-making in disaster management and bridge the "last-mile gap" in climate service delivery. This work contributes to the Seamless Prediction and Services for Sustainable Natural and Built Environment (SEPRESS) Program (2025–32), under the United Nations Educational, Scientific and Cultural Organization (UNESCO) International Decade of Sciences for Sustainable Development (2024–33).
Anthropogenic ammonia emissions primarily originate from agriculture, especially field fertilization. These emissions represent nitrogen loss for farmers and contribute to air pollution, posing risks to human health and the environment. Estimating ammonia emissions is crucial for national inventories and policy-making. Various models exist for predicting emissions, including mechanistic, empirical, and semi-empirical approaches. While machine learning (ML) is widely used in environmental science, its application to ammonia emissions remains limited. In this study, we used 5939 ammonia emission data points from 538 trials, extracted from the ALFAM2 database, to train three machine learning methods (random forest, gradient boosting, and lasso) for predicting cumulative ammonia emissions 72 h after manure application. These methods were compared to the semi-empirical ALFAM2 model using an independent test dataset. Random forest (RMSE = 4.51, r = 0.94, MAE = 3.28, Bias = 0.92) and gradient boosting (RMSE = 6.19, r = 0.89, MAE = 4.10, Bias = 0.51) showed the best performance, while the lasso log-linear model (RMSE = 7.30, r = 0.84, MAE = 5.57, Bias = -1.38) performed worst. Both random forest and gradient boosting outperformed the semi-empirical ALFAM2 model, which showed performance comparable to the lasso model. We then used these models and the ALFAM2 model to compare five slurry management techniques, varying in application method (trailing hoses, trailing shoes, and open slot) and post-application incorporation, across 128 scenarios with different manure types and weather conditions. Compared to broadcast application, the alternative techniques reduced emissions by a median of 13.6% to 61.7%. This study highlights the promise of ML models in assessing ammonia emission reduction methods, while emphasizing the importance of evaluating model sensitivity to algorithm choice.
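The four scores reported for each model (RMSE, r, MAE, and bias) can be computed from paired observed and predicted values as sketched below. The abstract does not define its bias sign convention, so the choice of mean(predicted minus observed) here is an assumption, as is the function name.

```python
import math


def regression_metrics(observed: list[float], predicted: list[float]) -> dict[str, float]:
    """Compute RMSE, MAE, bias, and Pearson r for paired values."""
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 values")

    # Assumed convention: positive error means the model overpredicts.
    errors = [p - o for o, p in zip(observed, predicted)]
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mae = sum(abs(e) for e in errors) / n
    bias = sum(errors) / n

    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n
    covariance = sum((o - mean_obs) * (p - mean_pred) for o, p in zip(observed, predicted))
    var_obs = sum((o - mean_obs) ** 2 for o in observed)
    var_pred = sum((p - mean_pred) ** 2 for p in predicted)
    if var_obs == 0.0 or var_pred == 0.0:
        raise ValueError("Pearson r is undefined for constant input")
    r = covariance / math.sqrt(var_obs * var_pred)

    return {"rmse": rmse, "mae": mae, "bias": bias, "r": r}


# Made-up values purely to exercise the function.
print(regression_metrics([10.0, 20.0, 30.0, 40.0], [12.0, 19.0, 33.0, 38.0]))
```

Reporting bias alongside RMSE and MAE matters here because a model can have a small average error while still systematically over- or underestimating emissions.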
Customer churn is the rate at which customers discontinue doing business with a company over a given time period. Monitoring churn is essential for businesses because high churn rates often indicate underlying issues with services, products, or customer experience and result in considerable income loss. Predicting customer churn is therefore a crucial task for retaining customers and maintaining revenue growth. Traditional machine learning (ML) models often struggle to capture complex temporal dependencies in client behavior data. To address this, an optimized deep learning (DL) approach using a Regularized Bidirectional Long Short-Term Memory (RBiLSTM) model is proposed to mitigate overfitting and improve generalization. The model integrates dropout, L2 regularization, and early stopping to enhance predictive accuracy while preventing over-reliance on specific patterns. Moreover, this study investigates the effect of optimization techniques on the training efficiency of the developed model. Experimental results on a recent public customer churn dataset demonstrate that the trained model outperforms traditional ML models and some other DL models, such as Long Short-Term Memory (LSTM) and Deep Neural Network (DNN), in churn prediction performance and stability. The proposed approach achieves 96.1% accuracy, compared with 94.5% for LSTM and 94.1% for DNN. These results confirm that the proposed approach can be a valuable tool for businesses to identify at-risk customers proactively and implement targeted retention strategies.
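Of the three regularizers named above, early stopping is the easiest to show without a deep learning framework. The sketch below implements a common patience-based rule on a list of per-epoch validation losses. The patience value, the `min_delta` threshold, and the function name are assumptions, since the abstract does not describe the authors' exact stopping criterion.

```python
def early_stopping_epoch(
    val_losses: list[float], patience: int = 3, min_delta: float = 0.0
) -> tuple[int, int]:
    """Return (best_epoch, stop_epoch), both 0-indexed.

    Training stops once the validation loss has failed to improve by more
    than `min_delta` for `patience` consecutive epochs. The weights from
    `best_epoch` are the ones a training loop would typically restore.
    """
    best_loss = float("inf")
    best_epoch = -1
    epochs_without_improvement = 0

    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            best_loss = loss
            best_epoch = epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return best_epoch, epoch

    # Patience was never exhausted: training ran through every epoch.
    return best_epoch, len(val_losses) - 1


# Illustrative loss curve: improvement stalls after the third epoch.
print(early_stopping_epoch([0.90, 0.70, 0.60, 0.65, 0.64, 0.66, 0.50]))
```

Note that the rule stops before the late drop to 0.50 in this made-up curve, which illustrates the trade-off in choosing `patience`: small values save training time but can miss late improvements.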
To address the challenges of complex fluvial sandbody distribution and difficult remaining oil recovery in mature continental oilfields, this study focuses on key issues in reservoir identification such as ambiguous narrow-channel boundaries and the subdivision of multi-stage superimposed sandbodies. Taking the Upper Cretaceous continental sandstone in the Sazhong Oilfield of the Daqing Placanticline as an example, a technical system integrating OVT high-resolution processing, multi-attribute fusion, and variable-scale inversion was developed to establish a complete workflow from seismic processing to reservoir prediction and remaining oil recovery. The following results were obtained. First, the Offset Vector Tile (OVT) seismic processing technology is extended, for the first time, from fracture imaging to sandbody prediction, in order to address the weak seismic responses from the boundaries of narrow and thin sandbodies. A geology-oriented OVT partitioning method is developed that significantly improves imaging accuracy, enabling identification of channel sandbodies as narrow as 50 m. Second, an amplitude-coherence dual-attribute fusion method is proposed for predicting narrow channel boundaries between wells. Constrained by a sedimentary unit-level sequence chronostratigraphic framework, this method accurately delineates 800–2000 m long subaqueous distributary channels with bifurcation-convergence features. Third, considering the superimposition of multi-stage channels, a three-level variable-scale stratigraphic model (sandstone groups, sublayers, sedimentary units) is constructed to overcome single-scale modeling limitations, successfully characterizing key sedimentary features such as meandering river "cut-offs" through 3D seismic inversion. Based on these advances, a direct link between seismic prediction and remaining oil recovery is established. The horizontal wells deployed using narrow-channel predictions achieved a 97% oil-bearing sandstone encounter rate in the horizontal section and an initial daily production of 12.5 t per well. Precise identification of individual channel boundaries within 17 composite sandbodies guided recovery processes in 135 wells, yielding an average daily increase of 2.8 t per well and a cumulative increase of 13.6×10^4 t.
Sepsis poses a serious threat to patient survival, making timely risk assessment crucial. Predicting in-hospital mortality based on clinical indicators can aid in making better clinical decisions. Previous studies have focused on classifier selection but lacked a comprehensive analysis of feature selection and data preprocessing. This study optimized machine learning models for sepsis mortality prediction by: (1) comprehensively comparing feature selection and classification methods to identify the best combination, (2) building a high-performing model with fewer features, and (3) identifying key clinically relevant indicators. Methods: Using the MIMIC-III sepsis cohort, we conducted a comprehensive analysis to determine the optimal model, including data preprocessing, data balancing, classifier selection, and feature selection. Feature importance was further analyzed to identify the key predictors of in-hospital mortality. Results: The proposed Synthetic Minority Oversampling Technique-Random Forest Recursive Feature Elimination-Extreme Gradient Boosting (SMOTE-(RF-RFE)-XGB) model achieved high predictive performance with a mean Area Under the Curve (AUC) of 0.8507, while reducing the number of features from 78 to 39. Compared to other feature selection methods evaluated in this study and those reported in related literature, Random Forest Recursive Feature Elimination (RF-RFE) offers the best trade-off between accuracy, feature compactness, and stability. Additionally, feature importance rankings consistently identified Acute Physiology Score III (APS III), Ventilation on First Day, and Depression as the top three most influential predictors, besides the Length of Stay in ICU and Hospital. Conclusions: This study addresses key gaps by conducting a comprehensive evaluation of classifiers and feature selection methods for predicting in-hospital mortality in patients with sepsis. The proposed SMOTE-(RF-RFE)-XGB model achieved high predictive performance and stability with a compact feature set. APS III, Ventilation on First Day, and Depression were consistently identified as key predictors besides Length of Stay in ICU and Hospital.
To achieve low-carbon regulation of electric vehicle (EV) charging loads under the “dual carbon” goals, this paper proposes a coordinated scheduling strategy that integrates dynamic carbon factor prediction and multi-objective optimization. First, a dual-convolution enhanced improved Crossformer prediction model is constructed, which employs parallel 1×1 global and 3×3 local convolution modules (Integrated Convolution Block, ICB) for multiscale feature extraction, combined with an Adaptive Spectral Block (ASB) to enhance time-series fluctuation modeling. Based on high-precision predictions, a carbon-electricity cost joint optimization model is further designed to balance economic, environmental, and grid-friendly objectives. The model’s superiority was validated through a case study using real-world data from a renewable-heavy grid. Simulation results show that the proposed multi-objective strategy achieved a better balance than baseline and benchmark models, with a 15.8% reduction in carbon emissions and a 5.2% reduction in economic costs, while still providing a substantial 22.2% reduction in the peak-valley difference. Its balanced performance significantly outperformed both a single-objective strategy and a state-of-the-art Model Predictive Control (MPC) benchmark, highlighting the advantage of a global optimization approach. This study provides theoretical and technical pathways for dynamic carbon factor-driven EV charging optimization.
Accurate purchase prediction in e-commerce critically depends on the quality of behavioral features. This paper proposes a layered and interpretable feature engineering framework that organizes user signals into three layers: Basic, Conversion & Stability (efficiency and volatility across actions), and Advanced Interactions & Activity (cross-behavior synergies and intensity). Using real Taobao (Alibaba’s primary e-commerce platform) logs (57,976 records for 10,203 users; 25 November–03 December 2017), we conducted a hierarchical, layer-wise evaluation that holds data splits and hyperparameters fixed while varying only the feature set to quantify each layer’s marginal contribution. Across logistic regression (LR), decision tree, random forest, XGBoost, and CatBoost models with stratified 5-fold cross-validation, performance improved monotonically from Basic to Conversion & Stability to Advanced features. With LR, F1 increased from 0.613 (Basic) to 0.962 (Advanced); boosted models achieved high discrimination (AUC of 0.995) and an F1 score of up to 0.983. Calibration and precision–recall analyses indicated strong ranking quality; we acknowledge potential dataset and period biases given the short (9-day) window. By making feature contributions measurable and reproducible, the framework complements model-centric advances and offers a transparent blueprint for production-grade behavioral modeling. The code and processed artifacts are publicly available, and future work will extend the validation to longer, seasonal datasets and hybrid approaches that combine automated feature learning with domain-driven design.
Abstract: Titanium-based semiconductors are known for their high chemical stability and suitable band gap widths. However, conventional experimental screening methods are inefficient due to the wide variety of materials. To speed up the selection process, this work focuses on interpretable feature learning and band gap prediction for titanium-based semiconductors. First, titanium compounds were selected from the Materials Project database by machine learning, and elemental features were extracted using the Magpie descriptors. Then, principal component analysis (PCA) was applied to reduce the data dimensionality, creating a representative dataset. Meanwhile, heatmaps and SHAP (SHapley Additive exPlanations) methods were used to demonstrate the influence of key features such as electronegativity, covalent radius, period number, and unit cell volume on the band gap, clarifying the relationship between the material’s properties and performance. After comparing different machine learning models, including Random Forest (RF), Support Vector Machines (SVM), Linear Regression (LR), and Gradient Boosting Regression (GBR), RF was found to be the most accurate for band gap prediction. Finally, the model performance was improved through parameter tuning, showing high accuracy. These findings provide strong data support and design guidance for the development of materials in fields like photocatalysis and solar cells.
Funding: Supported by the National Natural Science Foundation of China (Grant No. U2342208), with support from NSF/Climate Dynamics Award #2025057.
Abstract: Predicting monsoon climate is one of the major endeavors in climate science and is becoming increasingly challenging due to global warming. The accuracy of monsoon seasonal predictions significantly impacts the lives of billions who depend on or are affected by monsoons, as it is essential for the water cycle, food security, ecology, disaster prevention, and the economy of monsoon regions. Given the extensive literature on Asian monsoon climate prediction, we limit our focus to reviewing the seasonal prediction and predictability of the Asian Summer Monsoon (ASM). However, much of this review is also relevant to monsoon predictions in other seasons and regions. Over the past two decades, considerable progress has been made in the seasonal forecasting of the ASM, driven by an enhanced understanding of the sources of predictability and the dynamics of seasonal variability, along with advances in sophisticated models and technologies. This review centers on advances in understanding the physical foundation for monsoon climate prediction (section 2), significant findings and insights into the primary and regional sources of predictability arising from feedback processes among various climate components (sections 3 and 4), the effects of global warming and external forcings on predictability (section 5), developments in seasonal prediction models and techniques (section 6), the challenges and limitations of monsoon climate prediction (section 7), and emerging research trends with suggestions for future directions (section 8). We hope this review will stimulate creative activities to enhance monsoon climate prediction.
Funding: Funded by Scion's Strategic Science Investment Fund (SSIF) and the Forest Growers Levy Trust (FGLT) through the Resilient Forests Programme (Task No. A89220).
Abstract: Pinus radiata (D. Don) dominates New Zealand's forestry industry, constituting 91% of plantations, and is among the world's most important plantation species. Given the socio-economic and environmental importance of this species, accurate and precise projections over time are needed to make efficient decisions for forest management and greenfield investments in afforestation projects, especially for permanent carbon forests. Future projections of any natural resource system rely on modeling; however, the acceleration of climate change makes future projections of yield less certain. These challenges also affect national expectations of the contribution planted forests will make to addressing climate change and meeting international commitments under the Paris Agreement. Using a large national-scale set of contemporary ground-measured data (2013–2023), this study investigates the performance of two growth models developed over 30 years ago that are widely used by NZ plantation growers: 1) the Pumice Plateau Model 1988 (PPM88) and 2) the 300-index (including a model variant of regional drift). Model simulations were made using the FORECASTER modeling suite with geographic boundaries to adjust for drift in space and time. Basal area (BA, m²·ha⁻¹) and volume (m³·ha⁻¹) were simulated, and standard errors and goodness-of-fit metrics were calculated up to a typical rotation age of 30 years. Model residuals were then separated and analysed for the main plantation growing regions. The models overpredicted observed growth by between 6.8% and 16.2%, but model predictions and errors varied significantly between regions. The results provide clear evidence of divergence between the outputs of both models and the measured data. Finally, this study suggests future measures to address the challenges posed by these discrepancies, which will provide better information for forest management and investment decisions in a changing climate.
Abstract: The objective of the current study is to investigate an adaptive predictive observer-based autopilot for a skid-to-turn (STT) missile model with uncertainties and unknown dynamic equations. A predictive control law for the STT missile is designed based on nonlinear model predictive control (NMPC) using Taylor series expansion, after which the unknown functions are approximated with a neural network (NN). The study also evaluates a new adaptive optimal observer strategy for the nonlinear system. Specifically, to estimate future missile states such as normal acceleration and its derivatives, the Taylor series expansion of the states was first obtained to any specified order over their receding horizons. To address the prediction error problem, an analytic solution was derived that leads to a closed form for the nonlinear optimal observer. From the gains produced by this analytic solution, the proposed observer gain was selected optimally to satisfy the stability condition. The adaptive predictive autopilot was then combined with the adaptive optimal observer scheme to secure performance, requiring only the estimated normal acceleration and its derivatives. No angular velocity measurement or wind angle estimation was required. Ultimately, the proposed technique was found to be effective, as confirmed by the qualitative simulation results.
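The Taylor-series prediction step mentioned in the missile autopilot abstract can be written in a generic form. The symbols below are notation assumed here for illustration (the abstract does not define any): y is the output being predicted (for example, normal acceleration), h is the prediction horizon, and r is the chosen expansion order.

```latex
\hat{y}(t+h) \;\approx\; \sum_{k=0}^{r} \frac{h^{k}}{k!}\, y^{(k)}(t)
  \;=\; y(t) + h\,\dot{y}(t) + \frac{h^{2}}{2}\,\ddot{y}(t) + \dots + \frac{h^{r}}{r!}\, y^{(r)}(t)
```

Under this form, predicting the output over the horizon requires the output and its first r derivatives at time t, which is consistent with the abstract's statement that only the estimated normal acceleration and its derivatives are needed.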
Abstract: Deep learning has undeniably sharpened our ability to forecast risk in neuropsychiatry [1]. Yet the very success of prediction has exposed a deeper limitation: we are still remarkably uncertain about which levers to pull to change patient trajectories [2]. Accurate risk scores that cannot be translated into credible actions leave clinicians where they began, testing symptomatic fixes and hoping for the best. If we want to move beyond this impasse, the next step is not simply to train larger models, but to rethink what we ask of them.
Abstract: Thermal power plants are major contributors to greenhouse gas emissions. Predicting these emissions supports decision makers and environmental sustainability. The objective of this study is to enhance the accuracy of emission prediction models, supporting more effective real-time monitoring and enabling informed operational decisions that align with environmental compliance efforts. This paper presents a data-driven approach for the accurate prediction of gas emissions, specifically nitrogen oxides (NOx) and carbon monoxide (CO), in natural gas power plants using an optimized hybrid machine learning framework. The proposed model integrates a Feedforward Neural Network (FFNN) trained using Particle Swarm Optimization (PSO) to capture the nonlinear emission dynamics under varying gas turbine operating conditions. To further enhance predictive performance, the K-Nearest Neighbor (K-NN) algorithm serves as a post-processing method that adjusts and refines the IPSO-FFNN predictions, while Neighborhood Components Analysis (NCA) is used to identify and rank the most influential operational variables. The main contribution of the study is the combination of NCA feature selection, PSO global optimization, FFNN nonlinear modelling, and K-NN error correction into one unified system that delivers precise emission predictions. The model was developed and tested using a real-world dataset collected from gas-fired turbine operations, with validated results demonstrating robust accuracy: Root Mean Square Error (RMSE) values of 0.355 for CO and 0.368 for NOx. When benchmarked against conventional models such as standard FFNN, Support Vector Regression, and Long Short-Term Memory networks, the hybrid model achieved substantial improvements of up to 97.8% in Mean Squared Error (MSE), 95% in Mean Absolute Error (MAE), and 85.19% in RMSE for CO, and 97.16% in MSE, 93.4% in MAE, and 83.15% in RMSE for NOx. These results underscore the model’s potential for improving emission prediction, thereby supporting enhanced operational efficiency and adherence to environmental standards.
Funding: Supported by the School of Digital Science, Universiti Brunei Darussalam, Brunei.
Abstract: Artificial Intelligence (AI) in healthcare enables diabetes prediction using data-driven methods instead of traditional screening techniques, which include hemoglobin A1c (HbA1c), oral glucose tolerance test (OGTT), and fasting plasma glucose (FPG) screening and are invasive and limited in scale. Machine learning (ML) and deep neural network (DNN) models use large datasets to learn complex, nonlinear feature interactions, but conventional ML algorithms are data sensitive and often show unstable predictive accuracy. Conversely, DNN models are more robust, though reaching a high accuracy rate consistently on heterogeneous datasets is still an open challenge. For predicting diabetes, this work proposes a hybrid DNN approach that integrates a bidirectional long short-term memory (BiLSTM) network with a bidirectional gated recurrent unit (BiGRU). A robust DL model is developed by combining various datasets with weighted coefficients, dense operations in the connection of deep layers, and output aggregation using batch normalization and dropout functions to avoid overfitting. The goal of this hybrid model is better generalization and consistency across datasets, which facilitates effective management and early intervention. The proposed DNN model exhibits excellent predictive performance compared with state-of-the-art and baseline ML and DNN models for diabetes prediction tasks. This robust performance indicates the possible usefulness of DL-based models for disease prediction in healthcare and in other areas that demand high-quality analytics.
Funding: Supported by the National Key Research and Development Program of China (2022YFF1003203), Biological Breeding-National Science and Technology Major Project (2022ZDo4011), the Central Public-interest Scientific Institution Basal Research Fund (Y2025YC44), the Central Public-interest Scientific Institution Basal Research Fund (2025-YWF-ZYSQ-04), and the China Postdoctoral Science Foundation (2023M733832).
Abstract: The genetic basis of early-stage salt tolerance in alfalfa (Medicago sativa L.), a key factor limiting its productivity, remains poorly understood. To dissect this complex trait, we integrate genome-wide association studies (GWAS) and transcriptomics from 176 accessions within a machine learning-based genomic prediction framework. Analysis reveals weak genetic correlations among four salt-tolerance traits and a gradual decline in performance under increasing salt stress. GWAS identifies 60 significantly associated SNPs, with the highest number detected under 100 mM salt stress. Salt tolerance exhibits an additive effect from favorable haplotypes, which are most abundant in Chinese accessions. GWAS-associated genes are related to key regulators of hormone signaling and osmotic adjustment, while transcriptome analysis indicates a global repression of stress-responsive transcription factors. Integrating these multi-omics datasets allows us to identify 14 candidate genes, including MsHSD1 (seed dormancy) and MsMTATP6 (energy metabolism). Crucially, incorporating these markers into genomic prediction models improves cross-population predictive accuracy to an average of 54.4%. This study provides insights into the genetic architecture of salt tolerance in alfalfa and offers valuable markers to facilitate molecular breeding.
Funding: Supported by the National Key Research and Development Program (No. 2023YFC2809101) and the Laoshan Laboratory Technology Innovation Project (No. LSKJ202202301).
Abstract: Arctic sea ice concentration (SIC) prediction on a subseasonal scale plays an important role in polar navigation. To reduce the high uncertainty of daily forecasts, three time series prediction models are combined with empirical orthogonal function (EOF) decomposition to forecast Arctic pentad-mean SIC, where each month is divided into six pentad-means: the first five each span five days, and the last encompasses the remaining days, which may vary in length. The models were trained on SIC data from 1989 to 2018 and tested from 2019 to 2023, with lead times ranging from 1 to 12 pentad-means. Model skill was evaluated based on SIC spatial patterns, sea ice area (SIA), and the sea ice edge in September from 2019 to 2023. The moving-averaged 2-m temperature helps reduce the long short-term memory model's error in the Beaufort and Chukchi Seas. Based on the models' scores for each EOF time series, weighted ensemble prediction results were obtained. These results outperform two benchmark models across all lead times. In addition, the ensemble prediction better reproduces the seasonal cycle of the SIA, with relative errors ranging from 1.04% to 3.85%. The predicted September ice edge closely matches observations, with binary accuracy consistently above 90%. Forecast models show the lowest errors in the central Arctic, while relatively higher errors appear in the Barents and Kara Seas.
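The pentad-mean convention described in the sea ice abstract (five 5-day windows plus one window holding the remaining days of the month) can be sketched as follows. The function names `pentad_bounds` and `pentad_means` are hypothetical helpers written for this illustration, not code from the study.

```python
import calendar

def pentad_bounds(year, month):
    """Return six (start_day, end_day) pairs, 1-indexed and inclusive."""
    n_days = calendar.monthrange(year, month)[1]
    # First five windows: days 1-5, 6-10, 11-15, 16-20, 21-25.
    bounds = [(1 + 5 * i, 5 + 5 * i) for i in range(5)]
    # Sixth window: day 26 to month end (3 to 6 days long).
    bounds.append((26, n_days))
    return bounds

def pentad_means(daily_values, year, month):
    """Average a month of daily values into six pentad means."""
    return [
        sum(daily_values[start - 1:end]) / (end - start + 1)
        for start, end in pentad_bounds(year, month)
    ]
```

For example, `pentad_bounds(2023, 2)` ends with a three-day window covering days 26 to 28, while in a 31-day month the last window covers six days.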
Funding: Supported by the National Key R&D Program of China under Grant Nos. 2024YFD2400200 and 2024YFD2400204, and supported in part by the Science and Technology Development Program for the Two Zones under Grant No. 2023LQ02004.
Abstract: With the widespread deployment of assembly robots in smart manufacturing, efficiently offloading tasks and allocating resources in highly dynamic industrial environments has become a critical challenge for Mobile Edge Computing (MEC). To address this challenge, this paper constructs a cloud-edge-end collaborative MEC system that enables assembly robots to offload complex workflow tasks via multiple paths (horizontal, vertical, and hybrid collaboration). To mitigate uncertainties arising from mobility, a location prediction module is employed. This enables proactive channel-quality estimation, providing forward-looking insights for offloading decisions. Furthermore, we propose a fairness-aware joint optimization framework. Utilizing an improved Multi-Agent Deep Reinforcement Learning (MADRL) algorithm whose reward function incorporates total system cost, positional reliability, and timeout penalties, the framework aims to balance resource distribution among assembly robots while maximizing system utility. Simulation results demonstrate that the proposed framework outperforms traditional offloading strategies. By integrating predictive mobility management with fairness-aware optimization, the framework offers a robust solution for dynamic industrial MEC environments.
Funding: Supported by the National Natural Science Foundation of China (42304050), the Major Science and Technology Projects in Anhui Province (202103a05020026), the Open Foundation of the Key Laboratory of Universities in Anhui Province for Prevention of Mine Geological Disasters (2022-MGDP-08), and the University Natural Science Research Project of Anhui Province (2023AH051190).
Abstract: Satellite clock bias (SCB) prediction is essential for enhancing the accuracy and reliability of real-time precise point positioning (RT-PPP) in Global Navigation Satellite Systems (GNSS). To address the nonlinearity, non-stationarity, and short-term interruptions of SCB data under complex environments, this paper proposes an enhanced SCB prediction model combining Temporal Convolutional Networks (TCN) and Transformers. Experimental results indicate that, in a 24-h prediction task, the proposed model reduces root mean square error (RMSE) by 95.6%, 86.0%, and 61.3% and range error (RE) by 93.7%, 86.3%, and 58.8% compared with the LSTM, Transformer, and CNN-BiGRU-Attention models, respectively, while improving computational efficiency by 48.6% over the Transformer. Moreover, although the clock bias products generated by the proposed method result in slightly higher static PPP positioning errors than the International GNSS Service (IGS) rapid clock products, the error differences are generally at the millimeter level, demonstrating the feasibility of using predicted clock bias products to replace rapid clock products in the short term. This method addresses the PPP positioning issue during short-term network service interruptions from the perspective of time series prediction and provides potential solutions for engineering applications such as landslide, earthquake, and subsidence monitoring.
Abstract: The integration of machine learning (ML) technology with Internet of Things (IoT) systems is producing essential changes in healthcare operations. Healthcare personnel can track patients around the clock thanks to healthcare IoT (H-IoT) technology, which also provides proactive statistical findings and precise medical diagnoses that enhance healthcare performance. This study examines how ML can support IoT-based healthcare systems, namely in the areas of prognostic systems, disease detection, patient tracking, and healthcare operations control. The study looks at the benefits and drawbacks of several machine learning techniques for H-IoT applications. It also examines the fundamental problems these systems face, such as data security and cyberthreats, as well as their high processing demands. Alongside this, the study discusses the advantages of the technologies involved, including machine learning, deep learning, and the Internet of Things, as well as the significant difficulties that arise when integrating them into healthcare forecasting.
Funding: Project (42077244) supported by the National Natural Science Foundation of China; Project (2020-05) supported by the Open Research Fund of Guangdong Provincial Key Laboratory of Deep Earth Sciences and Geothermal Energy Exploitation and Utilization, China.
Abstract: Accurate prediction of rockburst intensity levels is crucial for ensuring the safety of deep hard rock engineering construction. This paper introduces an expert system for rockburst intensity level prediction that employs machine learning algorithms as the basis for its inference rules. The system comprises four modules: a database, a repository, an inference engine, and an interpreter. A database containing 1114 rockburst cases was used to construct 357 datasets that serve as the repository for the expert system. Additionally, 19 types of machine learning algorithms were used to establish 6783 micro-models to construct cognitive rules within the inference engine. By integrating probability theory and marginal analysis, a fuzzy scoring method based on the SoftMax function was developed and applied to the interpreter for rockburst intensity level prediction, effectively restoring the continuity of rockburst characteristics. The research results indicate that ensemble algorithms based on decision trees are more effective in capturing the characteristics of rockburst. Key factors for accurate prediction of rockburst intensity include uniaxial compressive strength, elastic energy index, the maximum principal stress, tangential stress, and their composite indicators. The accuracy of the proposed rockburst intensity level prediction expert system was verified using 20 engineering rockburst cases, with predictions aligning closely with the actual rockburst intensity levels.
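One way a SoftMax function can turn discrete class evidence into a continuous score is sketched below. The abstract does not give the actual scoring rule, so everything here is an assumption for illustration: the helper names, the four-level scale, the use of an expected-level score, and the example evidence values.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of raw scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def continuous_level(level_scores):
    """Probability-weighted intensity level on a 1..N scale (assumed scoring rule)."""
    probs = softmax(level_scores)
    return sum(level * p for level, p in enumerate(probs, start=1))

# Hypothetical aggregated evidence for four intensity levels.
evidence = [0.2, 1.0, 2.5, 0.8]
score = continuous_level(evidence)
```

Because the score is a weighted average of the level indices, it varies smoothly between 1 and 4 instead of jumping between discrete classes, which is one reading of "restoring the continuity" in the abstract.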
Funding: J. YANG was supported by funding from the National Natural Science Foundation of China (Grant Nos. 42475022, 42261144671), the National Key R&D Program of China (Project No. 2024YFC3013100), and the Fundamental Research Funds for the Central Universities. M. LU was supported by the Otto Poon Centre of Climate Resilience and Sustainability at HKUST and the Hong Kong Research Grant Committee (Project No. 16300424). Data processing and storage were supported by the National Key Scientific and Technological Infrastructure project “Earth System Numerical Simulation Facility” (EarthLab).
Abstract: Precise forecasts of wildfire danger are crucial for proactive fuel management and emergency responses, yet they pose a challenge at the subseasonal scale due to limitations in prediction capabilities and a gap between forecast outputs and the needs of decision-makers. This study introduces a hybrid modeling framework that integrates artificial intelligence (AI) with climate dynamic prediction systems to forecast High Fire-Danger Days (HFDDs) for the following month. These HFDDs are derived from historical satellite fire data and the optimum fire danger index, with a particular focus on Southwest China as a case study. The AI module, based on the ResNet-18 neural network model, integrates observational and physically constrained analysis to establish links between HFDDs and optimal predictors of atmospheric circulation from both the concurrent and preceding months. Leveraging climate dynamical forecasting, this hybrid model provides more reliable deterministic predictions for monthly HFDDs than conventional methods that rely solely on terrestrial variables such as precipitation. More importantly, the integration of dynamical ensemble prediction enhances the model’s capability for skillful probabilistic predictions of HFDDs, facilitating the creation of customized fire danger outlooks and emergency action maps tailored to stakeholders’ needs. The model’s added economic value was also evaluated, demonstrating its potential to improve decision-making in disaster management and bridge the “last-mile gap” in climate service delivery. This work contributes to the Seamless Prediction and Services for Sustainable Natural and Built Environment (SEPRESS) Program (2025–32), under the United Nations Educational Scientific and Cultural Organization (UNESCO) International Decade of Sciences for Sustainable Development (2024–33).
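A common way to derive a probabilistic forecast from an ensemble is to count the fraction of members that reach a threshold. The sketch below assumes that approach; the abstract does not state how its probabilities are computed, and the function name, member values, and threshold are hypothetical.

```python
def exceedance_probability(member_forecasts, threshold):
    """Fraction of ensemble members forecasting at least `threshold` HFDDs."""
    hits = sum(1 for value in member_forecasts if value >= threshold)
    return hits / len(member_forecasts)

# Hypothetical monthly HFDD counts from an eight-member ensemble.
members = [3, 7, 9, 4, 10, 6, 8, 2]
prob_high = exceedance_probability(members, threshold=6)
```

With these illustrative values, five of the eight members reach the threshold, so the forecast probability is 0.625. A decision-maker could then act only when the probability passes a level suited to their own cost of a false alarm.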
Funding: French state aid managed by the ANR under the “Investissements d’avenir” programme with the reference ANR-16-CONV-0003, and support from the AgroEcoSystem department of INRAE. We are grateful to the INRAE MIGALE bioinformatics facility (MIGALE, INRAE, 2020. Migale bioinformatics Facility, doi:10.15454/1.5572390655343293E12) for providing help and/or computing and/or storage resources. We are also grateful to Sasha Hafner for his help in reproducing some of the results of Hafner et al. (2019).
Abstract: Anthropogenic ammonia emissions primarily originate from agriculture, especially field fertilization. These emissions represent nitrogen loss for farmers and contribute to air pollution, posing risks to human health and the environment. Estimating ammonia emissions is crucial for national inventories and policy-making. Various models exist for predicting emissions, including mechanistic, empirical, and semi-empirical approaches. While machine learning (ML) is widely used in environmental science, its application to ammonia emissions remains limited. In this study, we used 5939 ammonia emission observations from 538 trials, extracted from the ALFAM2 database, to train three machine learning methods (random forest, gradient boosting, and lasso) for predicting cumulative ammonia emissions 72 h after manure application. These methods were compared to the semi-empirical ALFAM2 model using an independent test dataset. Random forest (RMSE=4.51, r=0.94, MAE=3.28, Bias=0.92) and gradient boosting (RMSE=6.19, r=0.89, MAE=4.10, Bias=0.51) showed the best performance, while the lasso log-linear model (RMSE=7.30, r=0.84, MAE=5.57, Bias=-1.38) performed worst. Both random forest and gradient boosting outperformed the semi-empirical ALFAM2 model, which showed performance comparable to the lasso model. We then used these models and the ALFAM2 model to compare five slurry management techniques, varying in application method (trailing hoses, trailing shoes, and open slot) and post-application incorporation, across 128 scenarios with different manure types and weather conditions. Compared to broadcast application, alternative techniques reduced emissions by a median of 13.6% to 61.7%. This study highlights the promise of ML models in assessing ammonia emission reduction methods, while emphasizing the importance of evaluating model sensitivity to algorithm choice.
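The four scores reported in the ammonia abstract (RMSE, r, MAE, Bias) can be computed as in the sketch below. Two points are assumptions: bias is taken as the mean of predicted minus observed (the abstract does not state its sign convention), and the sample numbers are invented for illustration.

```python
import math

def regression_metrics(observed, predicted):
    """Return RMSE, MAE, bias (mean of predicted - observed), and Pearson r."""
    n = len(observed)
    errors = [p - o for o, p in zip(observed, predicted)]
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mae = sum(abs(e) for e in errors) / n
    bias = sum(errors) / n  # assumed sign convention: positive = overprediction
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    r = cov / math.sqrt(var_o * var_p)
    return {"rmse": rmse, "mae": mae, "bias": bias, "r": r}

# Invented example: every prediction is 2 units too high.
metrics = regression_metrics([10.0, 20.0, 30.0], [12.0, 22.0, 32.0])
```

The example shows why several scores are reported together: a constant offset leaves the correlation at 1 while RMSE, MAE, and bias all equal 2, so r alone would hide a systematic overprediction.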
Abstract: Customer churn is the rate at which customers discontinue doing business with a company over a given time period. It is an essential measure for businesses to monitor, as high churn rates often indicate underlying issues with services, products, or customer experience, resulting in considerable income loss. Prediction of customer churn is a crucial task aimed at retaining customers and maintaining revenue growth. Traditional machine learning (ML) models often struggle to capture complex temporal dependencies in client behavior data. To address this, an optimized deep learning (DL) approach using a Regularized Bidirectional Long Short-Term Memory (RBiLSTM) model is proposed to mitigate overfitting and reduce generalization error. The model integrates dropout, L2-regularization, and early stopping to enhance predictive accuracy while preventing over-reliance on specific patterns. Moreover, this study investigates the effect of optimization techniques on the training efficiency of the developed model. Experimental results on a recent public customer churn dataset demonstrate that the trained model outperforms traditional ML models and some other DL models, such as Long Short-Term Memory (LSTM) and Deep Neural Network (DNN), in churn prediction performance and stability. The proposed approach achieves 96.1% accuracy, compared with LSTM and DNN, which attain 94.5% and 94.1% accuracy, respectively. These results confirm that the proposed approach can be used as a valuable tool for businesses to identify at-risk consumers proactively and implement targeted retention strategies.
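Of the three regularizers named in the churn abstract, early stopping is the simplest to show in isolation. The sketch below is a generic patience-based rule, not the study's training loop; the function name, the `patience` and `min_delta` parameters, and the loss values are assumptions for illustration.

```python
def early_stopping_epoch(val_losses, patience=2, min_delta=0.0):
    """Return (stop_epoch, best_epoch) for a patience-based early stopping rule."""
    best_loss = float("inf")
    best_epoch = -1
    waited = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            # Validation loss improved: record it and reset the counter.
            best_loss, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                return epoch, best_epoch
    # Training ran to completion without triggering the rule.
    return len(val_losses) - 1, best_epoch

# Hypothetical validation losses per epoch.
stop_epoch, best_epoch = early_stopping_epoch([0.9, 0.7, 0.6, 0.65, 0.66, 0.5])
```

Here training halts at epoch 4 and the weights from epoch 2 would be restored. The later improvement at epoch 5 is never reached, which illustrates the trade-off in choosing a small patience value.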
Funding: Supported by the China National Science and Technology Major Project (2025ZD1407000) and the PetroChina Science and Technology Major Project (2023ZZ22).
Abstract: To address the challenges of complex fluvial sandbody distribution and difficult remaining oil recovery in mature continental oilfields, this study focuses on key issues in reservoir identification such as ambiguous narrow-channel boundaries and subdivision of multi-stage superimposed sandbodies. Taking the Upper Cretaceous continental sandstone in the Sazhong Oilfield of the Daqing Placanticline as an example, a technical system integrating OVT high-resolution processing, multi-attribute fusion, and variable-scale inversion was developed to establish a complete workflow from seismic processing to reservoir prediction and remaining oil recovery. The following results are obtained. First, the Offset Vector Tile (OVT) seismic processing technology is extended, for the first time, from fracture imaging to sandbody prediction, in order to address the weak seismic responses from boundaries of narrow and thin sandbodies. A geology-oriented OVT partitioning method is developed to significantly improve the imaging accuracy, enabling identification of channel sandbodies as narrow as 50 m. Second, an amplitude-coherence dual-attribute fusion method is proposed for predicting narrow channel boundaries between wells. Constrained by a sedimentary unit-level sequence chronostratigraphic framework, this method accurately delineates 800-2000 m long subaqueous distributary channels with bifurcation-convergence features. Third, considering the superimposition of multi-stage channels, a three-level variable-scale stratigraphic model (sandstone groups, sublayers, sedimentary units) is constructed to overcome single-scale modeling limitations, successfully characterizing key sedimentary features like meandering river “cut-offs” through 3D seismic inversion. Based on these advances, a direct link between seismic prediction and remaining oil recovery is established. The horizontal wells deployed using narrow-channel predictions achieved a 97% oil-bearing sandstone encounter rate in the horizontal section and an initial daily production of 12.5 t per well. Precise identification of individual channel boundaries within 17 composite sandbodies guided recovery processes in 135 wells, yielding an average daily increase of 2.8 t per well and a cumulative increase of 13.6×10⁴ t.
Abstract: Sepsis poses a serious threat to patient survival, making timely risk assessment crucial. Predicting in-hospital mortality from clinical indicators can support better clinical decisions. Previous studies have focused on classifier selection but lacked a comprehensive analysis of feature selection and data preprocessing. This study optimized machine learning models for sepsis mortality prediction by: (1) comprehensively comparing feature selection and classification methods to identify the best combination, (2) building a high-performing model with fewer features, and (3) identifying key clinically relevant indicators. Methods: Using the MIMIC-III sepsis cohort, we conducted a comprehensive analysis to determine the optimal model, covering data preprocessing, data balancing, classifier selection, and feature selection. Feature importance was further analyzed to identify the key predictors of in-hospital mortality. Results: The proposed Synthetic Minority Oversampling Technique-Random Forest Recursive Feature Elimination-Extreme Gradient Boosting (SMOTE-(RF-RFE)-XGB) model achieved high predictive performance, with a mean Area Under the Curve (AUC) of 0.8507, while reducing the number of features from 78 to 39. Compared with the other feature selection methods evaluated in this study and those reported in related literature, Random Forest Recursive Feature Elimination (RF-RFE) offers the best trade-off between accuracy, feature compactness, and stability. Additionally, feature importance rankings consistently identified Acute Physiology Score III (APS III), Ventilation on First Day, and Depression as the three most influential predictors, in addition to length of stay in the ICU and hospital. Conclusions: This study addresses key gaps by conducting a comprehensive evaluation of classifiers and feature selection methods for predicting in-hospital mortality in patients with sepsis. The proposed SMOTE-(RF-RFE)-XGB model achieved high predictive performance and stability with a compact feature set. APS III, Ventilation on First Day, and Depression were consistently identified as key predictors, in addition to length of stay in the ICU and hospital.
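The data-balancing step named in this abstract (SMOTE) can be illustrated with a minimal sketch. It shows only the core idea of SMOTE, which is to create a synthetic minority sample by interpolating between a minority point and one of its nearest minority neighbors. The function names, the toy data, and the choice of `k` are assumptions made for illustration; this is not the authors' pipeline, and the RF-RFE and XGBoost stages are not shown.

```python
import random


def smote_sample(x, neighbor, rng):
    """Interpolate one synthetic point on the segment between x and neighbor."""
    u = rng.random()  # uniform in [0, 1)
    return [xi + u * (ni - xi) for xi, ni in zip(x, neighbor)]


def nearest_neighbors(x, others, k):
    """Return the k points in `others` closest to x (squared Euclidean distance)."""
    def dist2(p):
        return sum((a - b) ** 2 for a, b in zip(x, p))
    return sorted(others, key=dist2)[:k]


def oversample_minority(minority, n_new, k=2, seed=0):
    """Generate n_new synthetic minority samples, SMOTE-style."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        others = [p for p in minority if p is not x]
        neighbor = rng.choice(nearest_neighbors(x, others, k))
        synthetic.append(smote_sample(x, neighbor, rng))
    return synthetic


# Toy minority class with two hypothetical features (not MIMIC-III data).
minority = [[1.0, 2.0], [1.5, 2.5], [2.0, 2.0], [1.2, 3.0]]
new_points = oversample_minority(minority, n_new=6)
```

Because every synthetic point lies on a segment between two real minority samples, the new points stay inside the region already occupied by the minority class rather than being exact duplicates.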
Funding: Supported by the State Grid Corporation of China Science and Technology Project "Research on Key Technologies for Intelligent Carbon Metrology in Vehicle-to-Grid Interaction" (Project Number: B3018524000Q).
Abstract: To achieve low-carbon regulation of electric vehicle (EV) charging loads under the "dual carbon" goals, this paper proposes a coordinated scheduling strategy that integrates dynamic carbon factor prediction and multi-objective optimization. First, a dual-convolution enhanced improved Crossformer prediction model is constructed, which employs parallel 1×1 global and 3×3 local convolution modules (Integrated Convolution Block, ICB) for multiscale feature extraction, combined with an Adaptive Spectral Block (ASB) to enhance the modeling of time-series fluctuations. Based on these high-precision predictions, a carbon-electricity cost joint optimization model is then designed to balance economic, environmental, and grid-friendly objectives. The model was validated through a case study using real-world data from a renewable-heavy grid. Simulation results show that the proposed multi-objective strategy strikes a better balance than the baseline and benchmark models, achieving a 15.8% reduction in carbon emissions and a 5.2% reduction in economic costs while still providing a substantial 22.2% reduction in the peak-valley difference. Its balanced performance significantly outperformed both a single-objective strategy and a state-of-the-art Model Predictive Control (MPC) benchmark, highlighting the advantage of a global optimization approach. This study provides theoretical and technical pathways for dynamic carbon factor-driven EV charging optimization.
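The trade-off among cost, carbon, and grid-friendliness described in this abstract can be illustrated with a small sketch. It is a greedy weighted-sum heuristic, not the paper's optimization model: the function names, equal weights, slot data, and charger power are all hypothetical. Each time slot receives a score combining normalized electricity price, predicted carbon factor, and base load, and charging is placed in the lowest-scoring slots.

```python
def normalize(values):
    """Min-max scale a list to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def peak_valley_difference(load):
    """Difference between the highest and lowest load in the horizon."""
    return max(load) - min(load)


def schedule_charging(price, carbon, base_load, slots_needed, power,
                      weights=(1 / 3, 1 / 3, 1 / 3)):
    """Pick the slots with the lowest weighted cost/carbon/load score."""
    w_cost, w_carbon, w_grid = weights
    p, c, b = normalize(price), normalize(carbon), normalize(base_load)
    score = [w_cost * pi + w_carbon * ci + w_grid * bi
             for pi, ci, bi in zip(p, c, b)]
    chosen = sorted(range(len(score)), key=lambda t: score[t])[:slots_needed]
    total_load = list(base_load)
    for t in chosen:
        total_load[t] += power  # add the EV charging power to the chosen slot
    return sorted(chosen), total_load


# Hypothetical six-slot horizon (illustrative numbers, not from the paper).
price = [0.9, 0.5, 0.3, 0.4, 0.8, 1.0]      # electricity price per kWh
carbon = [0.7, 0.5, 0.2, 0.3, 0.6, 0.8]     # predicted carbon factor
base_load = [80, 60, 40, 50, 90, 100]       # grid load before EV charging

chosen, total_load = schedule_charging(price, carbon, base_load,
                                       slots_needed=2, power=20)
```

In this toy case the charging lands in the two cheapest, cleanest, lowest-load slots, which also narrows the peak-valley difference from 60 to 40. A real multi-objective formulation would add constraints such as battery state of charge and departure times, which this sketch omits.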
Funding: Supported by the research fund of Hanyang University (HY-202500000001616).
Abstract: Accurate purchase prediction in e-commerce depends critically on the quality of behavioral features. This paper proposes a layered and interpretable feature engineering framework that organizes user signals into three layers: Basic, Conversion & Stability (efficiency and volatility across actions), and Advanced Interactions & Activity (cross-behavior synergies and intensity). Using real Taobao (Alibaba's primary e-commerce platform) logs (57,976 records for 10,203 users; 25 November–03 December 2017), we conducted a hierarchical, layer-wise evaluation that holds data splits and hyperparameters fixed while varying only the feature set, in order to quantify each layer's marginal contribution. Across logistic regression (LR), decision tree, random forest, XGBoost, and CatBoost models with stratified 5-fold cross-validation, performance improved monotonically from Basic to Conversion & Stability to Advanced features. With LR, F1 increased from 0.613 (Basic) to 0.962 (Advanced); boosted models achieved high discrimination (AUC of 0.995) and an F1 score of up to 0.983. Calibration and precision–recall analyses indicated strong ranking quality, although the short (9-day) window may introduce dataset and period biases. By making feature contributions measurable and reproducible, the framework complements model-centric advances and offers a transparent blueprint for production-grade behavioral modeling. The code and processed artifacts are publicly available, and future work will extend the validation to longer, seasonal datasets and to hybrid approaches that combine automated feature learning with domain-driven design.
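The three-layer organization described in this abstract can be sketched as follows. The event names (`pv`, `cart`, `fav`), the specific ratios, and the interaction terms are assumptions chosen for illustration; the paper's actual feature definitions, including its stability (volatility) features, are not given in the abstract and are not reproduced here.

```python
from collections import Counter


def ratio(numerator, denominator):
    """Safe division: return 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def layered_features(events):
    """Build Basic, Conversion, and Advanced feature layers for one user.

    `events` is a list of behavior labels for that user. The purchase
    label itself is kept out of the features to avoid target leakage.
    """
    counts = Counter(events)
    pv, cart, fav = counts["pv"], counts["cart"], counts["fav"]

    basic = {"pv": pv, "cart": cart, "fav": fav}
    conversion = {
        "cart_per_pv": ratio(cart, pv),  # how often a view leads to a cart add
        "fav_per_pv": ratio(fav, pv),    # how often a view leads to a favorite
    }
    advanced = {
        "cart_x_fav": cart * fav,        # cross-behavior interaction term
        "activity": pv + cart + fav,     # overall activity intensity
    }
    return {"basic": basic, "conversion": conversion, "advanced": advanced}


# Hypothetical event log for a single user.
events = ["pv", "pv", "pv", "pv", "cart", "fav", "pv", "cart"]
feats = layered_features(events)
```

Keeping the layers in separate dictionaries makes a layer-wise evaluation straightforward: a model can be trained on `basic` alone, then on `basic` plus `conversion`, then on all three, with everything else held fixed.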