Small-drone technology has opened a range of new applications for aerial transportation. These drones leverage the Internet of Things (IoT) to offer cross-location services for navigation. However, they are susceptibl...Small-drone technology has opened a range of new applications for aerial transportation. These drones leverage the Internet of Things (IoT) to offer cross-location services for navigation. However, they are susceptible to security and privacy threats due to hardware and architectural issues. Although small drones hold promise for expansion in both civil and defense sectors, they have safety, security, and privacy threats. Addressing these challenges is crucial to maintaining the security and uninterrupted operations of these drones. In this regard, this study investigates security, and preservation concerning both the drones and Internet of Drones (IoD), emphasizing the significance of creating drone networks that are secure and can robustly withstand interceptions and intrusions. The proposed framework incorporates a weighted voting ensemble model comprising three convolutional neural network (CNN) models to enhance intrusion detection within the network. The employed CNNs are customized 1D models optimized to obtain better performance. The output from these CNNs is voted using a weighted criterion using a 0.4, 0.3, and 0.3 ratio for three CNNs, respectively. Experiments involve using multiple benchmark datasets, achieving an impressive accuracy of up to 99.89% on drone data. The proposed model shows promising results concerning precision, recall, and F1 as indicated by their obtained values of 99.92%, 99.98%, and 99.97%, respectively. Furthermore, cross-validation and performance comparison with existing works is also carried out. Findings indicate that the proposed approach offers a prospective solution for detecting security threats for aerial systems and satellite systems with high accuracy.展开更多
With the rapid development of economy,air pollution caused by industrial expansion has caused serious harm to human health and social development.Therefore,establishing an effective air pollution concentration predict...With the rapid development of economy,air pollution caused by industrial expansion has caused serious harm to human health and social development.Therefore,establishing an effective air pollution concentration prediction system is of great scientific and practical significance for accurate and reliable predictions.This paper proposes a combination of pointinterval prediction system for pollutant concentration prediction by leveraging neural network,meta-heuristic optimization algorithm,and fuzzy theory.Fuzzy information granulation technology is used in data preprocessing to transform numerical sequences into fuzzy particles for comprehensive feature extraction.The golden Jackal optimization algorithm is employed in the optimization stage to fine-tune model hyperparameters.In the prediction stage,an ensemble learning method combines training results frommultiplemodels to obtain final point predictions while also utilizing quantile regression and kernel density estimation methods for interval predictions on the test set.Experimental results demonstrate that the combined model achieves a high goodness of fit coefficient of determination(R^(2))at 99.3% and a maximum difference between prediction accuracy mean absolute percentage error(MAPE)and benchmark model at 12.6%.This suggests that the integrated learning system proposed in this paper can provide more accurate deterministic predictions as well as reliable uncertainty analysis compared to traditionalmodels,offering practical reference for air quality early warning.展开更多
6G is desired to support more intelligence networks and this trend attaches importance to the self-healing capability if degradation emerges in the cellular networks.As a primary component of selfhealing networks,faul...6G is desired to support more intelligence networks and this trend attaches importance to the self-healing capability if degradation emerges in the cellular networks.As a primary component of selfhealing networks,fault detection is investigated in this paper.Considering the fast response and low timeand-computational consumption,it is the first time that the Online Broad Learning System(OBLS)is applied to identify outages in cellular networks.In addition,the Automatic-constructed Online Broad Learning System(AOBLS)is put forward to rationalize its structure and consequently avoid over-fitting and under-fitting.Furthermore,a multi-layer classification structure is proposed to further improve the classification performance.To face the challenges caused by imbalanced data in fault detection problems,a novel weighting strategy is derived to achieve the Multilayer Automatic-constructed Weighted Online Broad Learning System(MAWOBLS)and ensemble learning with retrained Support Vector Machine(SVM),denoted as EMAWOBLS,for superior treatment with this imbalance issue.Simulation results show that the proposed algorithm has excellent performance in detecting faults with satisfactory time usage.展开更多
Background:Stomach cancer(SC)is one of the most lethal malignancies worldwide due to late-stage diagnosis and limited treatment.The transcriptomic,epigenomic,and proteomic,etc.,omics datasets generated by high-through...Background:Stomach cancer(SC)is one of the most lethal malignancies worldwide due to late-stage diagnosis and limited treatment.The transcriptomic,epigenomic,and proteomic,etc.,omics datasets generated by high-throughput sequencing technology have become prominent in biomedical research,and they reveal molecular aspects of cancer diagnosis and therapy.Despite the development of advanced sequencing technology,the presence of high-dimensionality in multi-omics data makes it challenging to interpret the data.Methods:In this study,we introduce RankXLAN,an explainable ensemble-based multi-omics framework that integrates feature selection(FS),ensemble learning,bioinformatics,and in-silico validation for robust biomarker detection,potential therapeutic drug-repurposing candidates’identification,and classification of SC.To enhance the interpretability of the model,we incorporated explainable artificial intelligence(SHapley Additive exPlanations analysis),as well as accuracy,precision,F1-score,recall,cross-validation,specificity,likelihood ratio(LR)+,LR−,and Youden index results.Results:The experimental results showed that the top four FS algorithms achieved improved results when applied to the ensemble learning classification model.The proposed ensemble model produced an area under the curve(AUC)score of 0.994 for gene expression,0.97 for methylation,and 0.96 for miRNA expression data.Through the integration of bioinformatics and ML approach of the transcriptomic and epigenomic multi-omics dataset,we identified potential marker genes,namely,UBE2D2,HPCAL4,IGHA1,DPT,and FN3K.In-silico molecular docking revealed a strong binding affinity between ANKRD13C and the FDA-approved drug Everolimus(binding affinity−10.1 kcal/mol),identifying ANKRD13C as a potential therapeutic drug-repurposing target for SC.Conclusion:The proposed framework RankXLAN outperforms other existing frameworks for serum biomarker identification,therapeutic target identification,and SC classification with multi-omics datasets.展开更多
The surge in smishing attacks underscores the urgent need for robust,real-time detection systems powered by advanced deep learning models.This paper introduces PhishNet,a novel ensemble learning framework that integra...The surge in smishing attacks underscores the urgent need for robust,real-time detection systems powered by advanced deep learning models.This paper introduces PhishNet,a novel ensemble learning framework that integrates transformer-based models(RoBERTa)and large language models(LLMs)(GPT-OSS 120B,LLaMA3.370B,and Qwen332B)to enhance smishing detection performance significantly.To mitigate class imbalance,we apply synthetic data augmentation using T5 and leverage various text preprocessing techniques.Our system employs a duallayer voting mechanism:weighted majority voting among LLMs and a final ensemble vote to classify messages as ham,spam,or smishing.Experimental results show an average accuracy improvement from 96%to 98.5%compared to the best standalone transformer,and from 93%to 98.5%when compared to LLMs across datasets.Furthermore,we present a real-time,user-friendly application to operationalize our detection model for practical use.PhishNet demonstrates superior scalability,usability,and detection accuracy,filling critical gaps in current smishing detection methodologies.展开更多
Optical non-reciprocity is a fundamental phenomenon in photonics.It is crucial for developing devices that rely on directional signal control,such as optical isolators and circulators.However,most research in this fie...Optical non-reciprocity is a fundamental phenomenon in photonics.It is crucial for developing devices that rely on directional signal control,such as optical isolators and circulators.However,most research in this field has focused on systems in equilibrium or steady states.In this work,we demonstrate a room-temperature Rydberg atomic platform where the unidirectional propagation of light acts as a switch to mediate time-crystalline-like collective oscillations through atomic synchronization.展开更多
With the development of landslide monitoring system,many attempts have been made to predict landslide failure-time utilizing monitoring data of displacements.Classical models(e.g.,Verhulst,GM(1,1),and Saito models)tha...With the development of landslide monitoring system,many attempts have been made to predict landslide failure-time utilizing monitoring data of displacements.Classical models(e.g.,Verhulst,GM(1,1),and Saito models)that consider the characteristics of landslide displacement to determine the failuretime have been investigated extensively.In practice,monitoring is continuously implemented with monitoring data-set updated,meaning that the predicted landslide life expectancy(i.e.,the lag between the predicted failure-time and time node at each instant of conducting the prediction)should be re-evaluated with time.This manner is termed“dynamic prediction”.However,the performances of the classical models have not been discussed in the context of the dynamic prediction yet.In this study,such performances are investigated firstly,and disadvantages of the classical models are then reported,incorporating the monitoring data from four real landslides.Subsequently,a more qualified ensemble model is proposed,where the individual classical models are integrated by machine learning(ML)-based meta-model.To evaluate the quality of the models under the dynamic prediction,a novel indicator termed“discredit index(b)”is proposed,and a higher value of b indicates lower prediction quality.It is found that Verhulst and Saito models would produce predicted results with significantly higher b,while GM(1,1)model would indicate results with the highest mean absolute error.Meanwhile,the ensemble models are found to be more accurate and qualified than the classical models.Here,the performance of decision tree regression-based ensemble model is the best among the various ML-based ensemble models.展开更多
An initial conditions (ICs) perturbation method was developed with the aim to improve an operational regional ensemble prediction system (REPS). Three issues were identified and investigated: (1) the impacts of...An initial conditions (ICs) perturbation method was developed with the aim to improve an operational regional ensemble prediction system (REPS). Three issues were identified and investigated: (1) the impacts of perturbation scale on the ensemble spread and forecast skill of the REPS; (2) the scale characteristic of the IC perturbations of the REPS; and (3) whether the REPS's skill could be improved by adding large-scale information to the IC perturbations. Numerical experiments were conducted to reveal the impact of perturbation scale on the ensemble spread and forecast skill. The scales of IC perturbations from the REPS and an operational global ensemble prediction system (GEPS) were analyzed. A "multi-scale blending" (MSB) IC perturbation scheme was developed, and the main findings can be summarized as follows: The growth rates of the ensemble spread of the REPS are sensitive to the scale of the IC perturbations; the ensemble forecast skills can benefit from large-scale perturbations; the global ensemble IC perturbations exhibit more power at larger scales, while the regional ensemble IC perturbations contain more power at smaller scales; the MSB method can generate IC perturbations by combining the small-scale component from the REPS and the large-scale component from the GEPS; the energy norm growth of the MSB-generated perturbations can be appropriate at all forecast lead times; and the MSB-based REPS shows higher skill than the original system, as determined by ensemble forecast verification.展开更多
In this study, the Institute of Atmospheric Physics, Chinese Academy of Sciences - regional ensemble forecast system (IAP-REFS) described in Part I was further validated through a 65-day experiment using the summer ...In this study, the Institute of Atmospheric Physics, Chinese Academy of Sciences - regional ensemble forecast system (IAP-REFS) described in Part I was further validated through a 65-day experiment using the summer season of 2010. The verification results show that IAP-REFS is skillful for quantitative precipitation forecasts (QPF) and probabilistic QPF, but it has a systematic bias in forecasting near-surface variables. Applying a 7-day running mean bias correction to the forecasts of near-surface variables remarkably improved the reliability of the forecasts. In this study, the perturbation extraction and inflation method (proposed with the single case study in Part I) was further applied to the full season with different inflation factors. This method increased the ensemble spread and improved the accuracy of forecasts of precipitation and near-surface variables. The seasonal mean profiles of the IAP-REFS ensemble indicate good spread among ensemble members and some model biases at certain vertical levels.展开更多
A single-model, short-range, ensemble forecasting system (Institute of Atmospheric Physics, Regional Ensemble Forecast System, IAP REFS) with 15-km grid spacing, configured with multiple initial conditions, multiple...A single-model, short-range, ensemble forecasting system (Institute of Atmospheric Physics, Regional Ensemble Forecast System, IAP REFS) with 15-km grid spacing, configured with multiple initial conditions, multiple lateral boundary conditions, and multiple physics parameterizations with 11 ensemble members, was developed using the Weather and Research Forecasting Model Advanced Research modeling system for prediction of stratiform precipitation events in northern China. This is the first part of a broader research project to develop a novel cloud-seeding operational system in a probabilistic framework. The ensemble perturbations were extracted from selected members of the National Center for Environmental Prediction Global Ensemble Forecasting System (NCEP GEFS) forecasts, and an inflation factor of two was applied to compensate for the lack of spread in the GEFS forecasts over the research region. Experiments on an actual stratiform precipitation case that occurred on 5-7 June 2009 in northern China were conducted to validate the ensemble system. The IAP REFS system had reasonably good performance in predicting the observed stratiform precipitation system. The perturbation inflation enlarged the ensemble spread and alleviated the underdispersion caused by parent forecasts. Centering the extracted perturbations on higher-resolution NCEP Global Forecast System forecasts resulted in less ensemble mean root-mean-square error and better accuracy in probabilistic quantitative precipitation forecasts (PQPF). However, the perturbation inflation and recentering had less effect on near-surface-level variables compared to the mid-level variables, and its influence on PQPF resolution was limited as well.展开更多
The Cloud Aerosol- Radiation (CAR) ensemble modeling system has recently been built to better un- derstand cloud/aerosol/radiation processes and determine the uncertainties caused by different treatments of cloud/ae...The Cloud Aerosol- Radiation (CAR) ensemble modeling system has recently been built to better un- derstand cloud/aerosol/radiation processes and determine the uncertainties caused by different treatments of cloud/aerosol/radiation in climate models. The CAR system comprises a large scheme collection of cloud, aerosol, and radiation processes available in the literature, including those commonly used by the world's leading GCMs. In this study, detailed analyses of the overall accuracy and efficiency of the CAR system were performed. Despite the different observations used, the overall accuracies of the CAR ensemble means were found to be very good for both shortwave (SW) and longwave (LW) radiation calculations. Taking tile percentage errors for July 2004 compared to ISCCP (International Satellite Cloud Climatology Project) data over (60~N, 60~S) as an example, even among the 448 CAR members selected here, those errors of the CAR ensemble means were only about -0.67% (-0.6 W m-2) and -0.82% (-2.0 W m-2) for SW and LW upward fluxes at the top of atmosphere, and 0.06% (0.1 W m-2) and -2.12% (-7.8 W m 2) for SW and LW downward fluxes at the surface, respectively. Furthermore, model SW frequency distributions in July 2004 covered the observational ranges entirely, with ensemble means located in the middle of the ranges. Moreover, it was found that the accuracy of radiative transfer calculations can be significantly enhanced by" using certain combinations of cloud schemes for the cloud cover fraction, particle effective size, water path, and optical properties, along with better explicit treatments for unresolved cloud structures.展开更多
The ensemble optimal interpolation (EnOI) is applied to the regional ocean modeling system (ROMS) with the ability to assimilate the along-track sea level anomaly (TSLA). This system is tested with an eddy-resol...The ensemble optimal interpolation (EnOI) is applied to the regional ocean modeling system (ROMS) with the ability to assimilate the along-track sea level anomaly (TSLA). This system is tested with an eddy-resolving system of the South China Sea (SCS). Background errors are derived from a running seasonal ensemble to account for the seasonal variability within the SCS. A fifth-order localization function with a 250 km localization radius is chosen to reduce the negative effects of sampling errors. The data assimilation system is tested from January 2004 to December 2006. The results show that the root mean square deviation (RMSD) of the sea level anomaly decreased from 10.57 to 6.70 cm, which represents a 36.6% reduction of error. The data assimilation reduces error for temperature within the upper 800 m and for salinity within the upper 200 m, although error degrades slightly at deeper depths. Surface currents are in better agreement with trajectories of surface drifters after data assimilation. The variance of sea level improves significantly in terms of both the amplitude and position of the strong and weak variance regions after assimilating TSLA. Results with AGE error (AGE) perform better than no AGE error (NoAGE) when considering the improvements of the temperature and the salinity. Furthermore, reasons for the extremely strong variability in the northern SCS in high resolution models are investigated. The results demonstrate that the strong variability of sea level in the high resolution model is caused by an extremely strong Kuroshio intrusion. Therefore, it is demonstrated that it is necessary to assimilate the TSLA in order to better simulate the SCS with high resolution models.展开更多
An ensemble Kalman filter(EnKF) combined with the Advanced Research Weather Research and Forecasting model(WRF) is cycled and evaluated for western North Pacific(WNP) typhoons of year 2016. Conventional in situ data, ...An ensemble Kalman filter(EnKF) combined with the Advanced Research Weather Research and Forecasting model(WRF) is cycled and evaluated for western North Pacific(WNP) typhoons of year 2016. Conventional in situ data, radiance observations, and tropical cyclone(TC) minimum sea level pressure(SLP) are assimilated every 6 h using an 80-member ensemble. For all TC categories, the 6-h ensemble priors from the WRF/EnKF system have an appropriate amount of variance for TC tracks but have insufficient variance for TC intensity. The 6-h ensemble priors from the WRF/EnKF system tend to overestimate the intensity for weak storms but underestimate the intensity for strong storms. The 5-d deterministic forecasts launched from the ensemble mean analyses of WRF/EnKF are compared to the NCEP and ECMWF operational control forecasts. Results show that the WRF/EnKF forecasts generally have larger track errors than the NCEP and ECMWF forecasts for all TC categories because the regional simulation cannot represent the large-scale environment better than the global simulation. The WRF/EnKF forecasts produce smaller intensity errors and biases than the NCEP and ECMWF forecasts for typhoons, but the opposite is true for tropical storms and severe tropical storms. The 5-d ensemble forecasts from the WRF/EnKF system for seven typhoon cases show appropriate variance for TC track and intensity with short forecast lead times but have insufficient spread with long forecast lead times. The WRF/EnKF system provides better ensemble forecasts and higher predictability for TC intensity than the NCEP and ECMWF ensemble forecasts.展开更多
Formulating model uncertainties for a convection-allowing ensemble prediction system(CAEPS)is a much more challenging problem compared to well-utilized approaches in synoptic weather forecasting.A new approach is prop...Formulating model uncertainties for a convection-allowing ensemble prediction system(CAEPS)is a much more challenging problem compared to well-utilized approaches in synoptic weather forecasting.A new approach is proposed and tested through assuming that the model uncertainty should reasonably describe the fast nonlinear error growth of the convection-allowing model,due to the fast developing character and strong nonlinearity of convective events.The Conditional Nonlinear Optimal Perturbation related to Parameters(CNOP-P)is applied in this study.Also,an ensemble approach is adopted to solve the CNOP-P problem.By using five locally developed strong convective events that occurred in pre-rainy season of South China,the most sensitive parameters were detected based on CNOP-P,which resulted in the maximum variations in precipitation.A formulation of model uncertainty is designed by adding stochastic perturbations into these sensitive parameters.Through comparison ensemble experiments by using all the 13 heavy rainfall cases that occurred in the flood season of South China in 2017,the advantages of the CNOP-P-based method are examined and verified by comparing with the well-utilized stochastically perturbed physics tendencies(SPPT)scheme.The results indicate that the CNOP-P-based method has potential in improving the under-dispersive problem of the current CAEPS.展开更多
Atmospheric variability is driven not only by internal dynamics, but also by external forcing, such as soil states, SST, snow, sea-ice cover, and so on. To investigate the forecast uncertainties and effects of land su...Atmospheric variability is driven not only by internal dynamics, but also by external forcing, such as soil states, SST, snow, sea-ice cover, and so on. To investigate the forecast uncertainties and effects of land surface processes on numerical weather prediction, we added modules to perturb soil moisture and soil temperature into NCEP's Global Ensemble Forecast System (GEFS), and compared the results of a set of experiments involving different configurations of land surface and atmospheric perturbation. It was found that uncertainties in different soil layers varied due to the multiple timescales of interactions between land surface and atmospheric processes. Perturbations of the soil moisture and soil temperature at the land surface changed sensible and latent heat flux obviously, as compared to the less or indirect land surface perturbation experiment from the day-to-day forecasts. Soil state perturbations led to greater variation in surface heat fluxes that transferred to the upper troposphere, thus reflecting interactions and the response to atmospheric external forcing. Various verification scores were calculated in this study. The results indicated that taking the uncertainties of land surface processes into account in GEFS could contribute a slight improvement in forecast skill in terms of resolution and reliability, a noticeable reduction in forecast error, as well as an increase in ensemble spread in an under-dispersive system. This paper provides a preliminary evaluation of the effects of land surface processes on predictability. Further research using more complex and suitable methods is needed to fully explore our understanding in this area.展开更多
An ensemble-based method for the observation system simulation experiment(OSSE)is employed to design optimal observation stations and assess the present observation stations in the northeastern South China Sea(SCS).We...An ensemble-based method for the observation system simulation experiment(OSSE)is employed to design optimal observation stations and assess the present observation stations in the northeastern South China Sea(SCS).We employed the 20-year(1992-2012)sea surface height(SSH)data to design an array to monitor the intraseasonal to interannual variability.The results show that the most key region was found located at the northwest of Luzon Island(LI)where the energetic Luzon cyclonic gyre(LCG)occurs;other key regions include the edge of the LCG,the northwest of the Luzon Strait(LS),and the southwest of Taiwan,China.By contrast,we found that the present observation stations might oversample at the northwest of the LS and undersample at the northwest of LI.In addition,the optimal stations perform better in a larger area than the present stations.In vertical direction,the key layer is located within the upper 200-m depth,of which the surface and subsurface layers are most valuable to the observing system.展开更多
Ensemble forecasting has become the prevailing method in current operational weather forecasting. Although ensemble mean forecast skill has been studied for many ensemble prediction systems(EPSs) and different cases...Ensemble forecasting has become the prevailing method in current operational weather forecasting. Although ensemble mean forecast skill has been studied for many ensemble prediction systems(EPSs) and different cases, theoretical analysis regarding ensemble mean forecast skill has rarely been investigated, especially quantitative analysis without any assumptions of ensemble members. This paper investigates fundamental questions about the ensemble mean, such as the advantage of the ensemble mean over individual members, the potential skill of the ensemble mean, and the skill gain of the ensemble mean with increasing ensemble size. The average error coefficient between each pair of ensemble members is the most important factor in ensemble mean forecast skill, which determines the mean-square error of ensemble mean forecasts and the skill gain with increasing ensemble size. More members are useful if the errors of the members have lower correlations with each other, and vice versa. The theoretical investigation in this study is verified by application with the T213 EPS. A typical EPS has an average error coefficient of between 0.5 and 0.8; the 15-member T213 EPS used here reaches a saturation degree of 95%(i.e., maximum 5% skill gain by adding new members with similar skill to the existing members) for 1–10-day lead time predictions, as far as the mean-square error is concerned.展开更多
How to accurately address model uncertainties with consideration of the rapid nonlinear error growth characteristics in a convection-allowing system is a crucial issue for performing convection-scale ensemble forecast...How to accurately address model uncertainties with consideration of the rapid nonlinear error growth characteristics in a convection-allowing system is a crucial issue for performing convection-scale ensemble forecasts.In this study,a new nonlinear model perturbation technique for convective-scale ensemble forecasts is developed to consider a nonlinear representation of model errors in the Global and Regional Assimilation and Prediction Enhanced System(GRAPES)Convection-Allowing Ensemble Prediction System(CAEPS).The nonlinear forcing singular vector(NFSV)approach,that is,conditional nonlinear optimal perturbation-forcing(CNOP-F),is applied in this study,to construct a nonlinear model perturbation method for GRAPES-CAEPS.Three experiments are performed:One of them is the CTL experiment,without adding any model perturbation;the other two are NFSV-perturbed experiments,which are perturbed by NFSV with two different groups of constraint radii to test the sensitivity of the perturbation magnitude constraint.Verification results show that the NFSV-perturbed experiments achieve an overall improvement and produce more skillful forecasts compared to the CTL experiment,which indicates that the nonlinear NFSV-perturbed method can be used as an effective model perturbation method for convection-scale ensemble forecasts.Additionally,the NFSV-L experiment with large perturbation constraints generally performs better than the NFSV-S experiment with small perturbation constraints in the verification for upper-air and surface weather variables.But for precipitation verification,the NFSV-S experiment performs better in forecasts for light precipitation,and the NFSV-L experiment performs better in forecasts for heavier precipitation,indicating that for different precipitation events,the perturbation magnitude constraint must be carefully selected.All the findings above lay a foundation for the design of nonlinear model perturbation methods for future CAEPSs.展开更多
This study presented an evaluation of tropical cyclone(TC) intensity forecasts from five global ensemble prediction systems(EPSs) during 2015-2019 in the western North Pacific region. Notable error features include th...This study presented an evaluation of tropical cyclone(TC) intensity forecasts from five global ensemble prediction systems(EPSs) during 2015-2019 in the western North Pacific region. Notable error features include the underestimation of the TC intensity by ensemble mean forecast and the under-dispersion of the probability forecasts.The root mean square errors(brier scores) of the ensemble mean(probability forecasts) generally decrease consecutively at long lead times during the five years, but fluctuate between certain values at short lead times.Positive forecast skill appeared in the most recent two years(2018-2019) at 120 h or later as compared with the climatology forecasts. However, there is no obvious improvement for the intensity change forecasts during the 5-year period, with abrupt intensity change remaining a big challenge. The probability forecasts show no skill for strong TCs at all the lead times. Among the five EPSs, ECMWF-EPS ranks the best for the intensity forecast, while NCEPGEFS ranks the best for the intensity change forecast, according to the evaluation of ensemble mean and dispersion.As for the other probability forecast evaluation, ECMWF-EPS ranks the best at lead times shorter than 72 h, while NCEP-GEFS ranks the best later on.展开更多
Smart Industrial environments use the Industrial Internet of Things(IIoT)for their routine operations and transform their industrial operations with intelligent and driven approaches.However,IIoT devices are vulnerabl...Smart Industrial environments use the Industrial Internet of Things(IIoT)for their routine operations and transform their industrial operations with intelligent and driven approaches.However,IIoT devices are vulnerable to cyber threats and exploits due to their connectivity with the internet.Traditional signature-based IDS are effective in detecting known attacks,but they are unable to detect unknown emerging attacks.Therefore,there is the need for an IDS which can learn from data and detect new threats.Ensemble Machine Learning(ML)and individual Deep Learning(DL)based IDS have been developed,and these individual models achieved low accuracy;however,their performance can be improved with the ensemble stacking technique.In this paper,we have proposed a Deep Stacked Neural Network(DSNN)based IDS,which consists of two stacked Convolutional Neural Network(CNN)models as base learners and Extreme Gradient Boosting(XGB)as the meta learner.The proposed DSNN model was trained and evaluated with the next-generation dataset,TON_IoT.Several pre-processing techniques were applied to prepare a dataset for the model,including ensemble feature selection and the SMOTE technique.Accuracy,precision,recall,F1-score,and false positive rates were used to evaluate the performance of the proposed ensemble model.Our experimental results showed that the accuracy for binary classification is 99.61%,which is better than in the baseline individual DL and ML models.In addition,the model proposed for IDS has been compared with similar models.The proposed DSNN achieved better performance metrics than the other models.The proposed DSNN model will be used to develop enhanced IDS for threat mitigation in smart industrial environments.展开更多
文摘Small-drone technology has opened a range of new applications for aerial transportation. These drones leverage the Internet of Things (IoT) to offer cross-location services for navigation. However, they are susceptible to security and privacy threats due to hardware and architectural issues. Although small drones hold promise for expansion in both civil and defense sectors, they have safety, security, and privacy threats. Addressing these challenges is crucial to maintaining the security and uninterrupted operations of these drones. In this regard, this study investigates security, and preservation concerning both the drones and Internet of Drones (IoD), emphasizing the significance of creating drone networks that are secure and can robustly withstand interceptions and intrusions. The proposed framework incorporates a weighted voting ensemble model comprising three convolutional neural network (CNN) models to enhance intrusion detection within the network. The employed CNNs are customized 1D models optimized to obtain better performance. The output from these CNNs is voted using a weighted criterion using a 0.4, 0.3, and 0.3 ratio for three CNNs, respectively. Experiments involve using multiple benchmark datasets, achieving an impressive accuracy of up to 99.89% on drone data. The proposed model shows promising results concerning precision, recall, and F1 as indicated by their obtained values of 99.92%, 99.98%, and 99.97%, respectively. Furthermore, cross-validation and performance comparison with existing works is also carried out. Findings indicate that the proposed approach offers a prospective solution for detecting security threats for aerial systems and satellite systems with high accuracy.
基金supported by General Scientific Research Funding of the Science and Technology Development Fund(FDCT)in Macao(No.0150/2022/A)the Faculty Research Grants of Macao University of Science and Technology(No.FRG-22-074-FIE).
文摘With the rapid development of economy,air pollution caused by industrial expansion has caused serious harm to human health and social development.Therefore,establishing an effective air pollution concentration prediction system is of great scientific and practical significance for accurate and reliable predictions.This paper proposes a combination of pointinterval prediction system for pollutant concentration prediction by leveraging neural network,meta-heuristic optimization algorithm,and fuzzy theory.Fuzzy information granulation technology is used in data preprocessing to transform numerical sequences into fuzzy particles for comprehensive feature extraction.The golden Jackal optimization algorithm is employed in the optimization stage to fine-tune model hyperparameters.In the prediction stage,an ensemble learning method combines training results frommultiplemodels to obtain final point predictions while also utilizing quantile regression and kernel density estimation methods for interval predictions on the test set.Experimental results demonstrate that the combined model achieves a high goodness of fit coefficient of determination(R^(2))at 99.3% and a maximum difference between prediction accuracy mean absolute percentage error(MAPE)and benchmark model at 12.6%.This suggests that the integrated learning system proposed in this paper can provide more accurate deterministic predictions as well as reliable uncertainty analysis compared to traditionalmodels,offering practical reference for air quality early warning.
基金supported in part by the National Key Research and Development Project under Grant 2020YFB1806805partially funded through a grant from Qualcomm。
文摘6G is desired to support more intelligence networks and this trend attaches importance to the self-healing capability if degradation emerges in the cellular networks.As a primary component of selfhealing networks,fault detection is investigated in this paper.Considering the fast response and low timeand-computational consumption,it is the first time that the Online Broad Learning System(OBLS)is applied to identify outages in cellular networks.In addition,the Automatic-constructed Online Broad Learning System(AOBLS)is put forward to rationalize its structure and consequently avoid over-fitting and under-fitting.Furthermore,a multi-layer classification structure is proposed to further improve the classification performance.To face the challenges caused by imbalanced data in fault detection problems,a novel weighting strategy is derived to achieve the Multilayer Automatic-constructed Weighted Online Broad Learning System(MAWOBLS)and ensemble learning with retrained Support Vector Machine(SVM),denoted as EMAWOBLS,for superior treatment with this imbalance issue.Simulation results show that the proposed algorithm has excellent performance in detecting faults with satisfactory time usage.
基金the Deanship of Research and Graduate Studies at King Khalid University,KSA,for funding this work through the Large Research Project under grant number RGP2/164/46.
文摘Background:Stomach cancer(SC)is one of the most lethal malignancies worldwide due to late-stage diagnosis and limited treatment.The transcriptomic,epigenomic,and proteomic,etc.,omics datasets generated by high-throughput sequencing technology have become prominent in biomedical research,and they reveal molecular aspects of cancer diagnosis and therapy.Despite the development of advanced sequencing technology,the presence of high-dimensionality in multi-omics data makes it challenging to interpret the data.Methods:In this study,we introduce RankXLAN,an explainable ensemble-based multi-omics framework that integrates feature selection(FS),ensemble learning,bioinformatics,and in-silico validation for robust biomarker detection,potential therapeutic drug-repurposing candidates’identification,and classification of SC.To enhance the interpretability of the model,we incorporated explainable artificial intelligence(SHapley Additive exPlanations analysis),as well as accuracy,precision,F1-score,recall,cross-validation,specificity,likelihood ratio(LR)+,LR−,and Youden index results.Results:The experimental results showed that the top four FS algorithms achieved improved results when applied to the ensemble learning classification model.The proposed ensemble model produced an area under the curve(AUC)score of 0.994 for gene expression,0.97 for methylation,and 0.96 for miRNA expression data.Through the integration of bioinformatics and ML approach of the transcriptomic and epigenomic multi-omics dataset,we identified potential marker genes,namely,UBE2D2,HPCAL4,IGHA1,DPT,and FN3K.In-silico molecular docking revealed a strong binding affinity between ANKRD13C and the FDA-approved drug Everolimus(binding affinity−10.1 kcal/mol),identifying ANKRD13C as a potential therapeutic drug-repurposing target for SC.Conclusion:The proposed framework RankXLAN outperforms other existing frameworks for serum biomarker identification,therapeutic target identification,and SC classification with multi-omics datasets.
基金funded by the Deanship of Scientific Research(DSR)at King Abdulaziz University,Jeddah,under Grant No.(GPIP:1074-612-2024).
文摘The surge in smishing attacks underscores the urgent need for robust,real-time detection systems powered by advanced deep learning models.This paper introduces PhishNet,a novel ensemble learning framework that integrates transformer-based models(RoBERTa)and large language models(LLMs)(GPT-OSS 120B,LLaMA3.370B,and Qwen332B)to enhance smishing detection performance significantly.To mitigate class imbalance,we apply synthetic data augmentation using T5 and leverage various text preprocessing techniques.Our system employs a duallayer voting mechanism:weighted majority voting among LLMs and a final ensemble vote to classify messages as ham,spam,or smishing.Experimental results show an average accuracy improvement from 96%to 98.5%compared to the best standalone transformer,and from 93%to 98.5%when compared to LLMs across datasets.Furthermore,we present a real-time,user-friendly application to operationalize our detection model for practical use.PhishNet demonstrates superior scalability,usability,and detection accuracy,filling critical gaps in current smishing detection methodologies.
基金supported by the National Natural Science Foundation of China (Grant No.12274131)the Innovation Program for Quantum Science and Technology (Grant No.2024ZD0300101)。
文摘Optical non-reciprocity is a fundamental phenomenon in photonics.It is crucial for developing devices that rely on directional signal control,such as optical isolators and circulators.However,most research in this field has focused on systems in equilibrium or steady states.In this work,we demonstrate a room-temperature Rydberg atomic platform where the unidirectional propagation of light acts as a switch to mediate time-crystalline-like collective oscillations through atomic synchronization.
基金The work described in this paper was funded by grants from the Natural Science Foundation of Hunan Province,China(Grant Nos.2020JJ5704 and 2022JJ20058)the Special Fund for Safety Production Prevention and Emergency of Hunan Province(Grant No.2021YJ009)+2 种基金the Research Project of Geological Bureau of Hunan Province(Grant Nos.HNGSTP202106 and HNGSTP202202)the Fund of Wenzhou Municipal Science and Technology Bureau(Grant No.2022G0015)the Fundamental Research Funds for Central Universities of the Central South University(Grant No.2023ZZTS0470).
文摘With the development of landslide monitoring system,many attempts have been made to predict landslide failure-time utilizing monitoring data of displacements.Classical models(e.g.,Verhulst,GM(1,1),and Saito models)that consider the characteristics of landslide displacement to determine the failuretime have been investigated extensively.In practice,monitoring is continuously implemented with monitoring data-set updated,meaning that the predicted landslide life expectancy(i.e.,the lag between the predicted failure-time and time node at each instant of conducting the prediction)should be re-evaluated with time.This manner is termed“dynamic prediction”.However,the performances of the classical models have not been discussed in the context of the dynamic prediction yet.In this study,such performances are investigated firstly,and disadvantages of the classical models are then reported,incorporating the monitoring data from four real landslides.Subsequently,a more qualified ensemble model is proposed,where the individual classical models are integrated by machine learning(ML)-based meta-model.To evaluate the quality of the models under the dynamic prediction,a novel indicator termed“discredit index(b)”is proposed,and a higher value of b indicates lower prediction quality.It is found that Verhulst and Saito models would produce predicted results with significantly higher b,while GM(1,1)model would indicate results with the highest mean absolute error.Meanwhile,the ensemble models are found to be more accurate and qualified than the classical models.Here,the performance of decision tree regression-based ensemble model is the best among the various ML-based ensemble models.
基金supported by the National Natural Science Foundation of China (Grant No. 91437113)the Special Fund for Meteorological Scientific Research in the Public Interest (Grant Nos. GYHY201506007 and GYHY201006015)+1 种基金the National 973 Program of China (Grant Nos. 2012CB417204 and 2012CB955200)the Scientific Research & Innovation Projects for Academic Degree Students of Ordinary Universities of Jiangsu (Grant No. KYLX 0827)
文摘An initial conditions (ICs) perturbation method was developed with the aim to improve an operational regional ensemble prediction system (REPS). Three issues were identified and investigated: (1) the impacts of perturbation scale on the ensemble spread and forecast skill of the REPS; (2) the scale characteristic of the IC perturbations of the REPS; and (3) whether the REPS's skill could be improved by adding large-scale information to the IC perturbations. Numerical experiments were conducted to reveal the impact of perturbation scale on the ensemble spread and forecast skill. The scales of IC perturbations from the REPS and an operational global ensemble prediction system (GEPS) were analyzed. A "multi-scale blending" (MSB) IC perturbation scheme was developed, and the main findings can be summarized as follows: The growth rates of the ensemble spread of the REPS are sensitive to the scale of the IC perturbations; the ensemble forecast skills can benefit from large-scale perturbations; the global ensemble IC perturbations exhibit more power at larger scales, while the regional ensemble IC perturbations contain more power at smaller scales; the MSB method can generate IC perturbations by combining the small-scale component from the REPS and the large-scale component from the GEPS; the energy norm growth of the MSB-generated perturbations can be appropriate at all forecast lead times; and the MSB-based REPS shows higher skill than the original system, as determined by ensemble forecast verification.
基金supported by a project of the National Natural Science Foundation of China (Grant No. 40875079)
文摘In this study, the Institute of Atmospheric Physics, Chinese Academy of Sciences - regional ensemble forecast system (IAP-REFS) described in Part I was further validated through a 65-day experiment using the summer season of 2010. The verification results show that IAP-REFS is skillful for quantitative precipitation forecasts (QPF) and probabilistic QPF, but it has a systematic bias in forecasting near-surface variables. Applying a 7-day running mean bias correction to the forecasts of near-surface variables remarkably improved the reliability of the forecasts. In this study, the perturbation extraction and inflation method (proposed with the single case study in Part I) was further applied to the full season with different inflation factors. This method increased the ensemble spread and improved the accuracy of forecasts of precipitation and near-surface variables. The seasonal mean profiles of the IAP-REFS ensemble indicate good spread among ensemble members and some model biases at certain vertical levels.
基金supported by the project of the NSFC (Grants No. 40875079)
文摘A single-model, short-range, ensemble forecasting system (Institute of Atmospheric Physics, Regional Ensemble Forecast System, IAP REFS) with 15-km grid spacing, configured with multiple initial conditions, multiple lateral boundary conditions, and multiple physics parameterizations with 11 ensemble members, was developed using the Weather and Research Forecasting Model Advanced Research modeling system for prediction of stratiform precipitation events in northern China. This is the first part of a broader research project to develop a novel cloud-seeding operational system in a probabilistic framework. The ensemble perturbations were extracted from selected members of the National Center for Environmental Prediction Global Ensemble Forecasting System (NCEP GEFS) forecasts, and an inflation factor of two was applied to compensate for the lack of spread in the GEFS forecasts over the research region. Experiments on an actual stratiform precipitation case that occurred on 5-7 June 2009 in northern China were conducted to validate the ensemble system. The IAP REFS system had reasonably good performance in predicting the observed stratiform precipitation system. The perturbation inflation enlarged the ensemble spread and alleviated the underdispersion caused by parent forecasts. Centering the extracted perturbations on higher-resolution NCEP Global Forecast System forecasts resulted in less ensemble mean root-mean-square error and better accuracy in probabilistic quantitative precipitation forecasts (PQPF). However, the perturbation inflation and recentering had less effect on near-surface-level variables compared to the mid-level variables, and its influence on PQPF resolution was limited as well.
基金supported by the National Basic Research Program of China (973 Program) (Grant No. 2010CB951901)the U.S. DOE office of Biological and Environmental Research (BER) (Grant No. DE-SC0001683)+2 种基金the National Natural Science Foundation of China (Grant Nos. 40605026 and 40830103)the "Strategic Priority Research Program-Climate Change: Carbon Budget and Relevant Issues" of the Chinese Academy of Sciences (Grant No. XDA05110101)the National Energy Research Scientific Computing Center, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231
文摘The Cloud Aerosol- Radiation (CAR) ensemble modeling system has recently been built to better un- derstand cloud/aerosol/radiation processes and determine the uncertainties caused by different treatments of cloud/aerosol/radiation in climate models. The CAR system comprises a large scheme collection of cloud, aerosol, and radiation processes available in the literature, including those commonly used by the world's leading GCMs. In this study, detailed analyses of the overall accuracy and efficiency of the CAR system were performed. Despite the different observations used, the overall accuracies of the CAR ensemble means were found to be very good for both shortwave (SW) and longwave (LW) radiation calculations. Taking tile percentage errors for July 2004 compared to ISCCP (International Satellite Cloud Climatology Project) data over (60~N, 60~S) as an example, even among the 448 CAR members selected here, those errors of the CAR ensemble means were only about -0.67% (-0.6 W m-2) and -0.82% (-2.0 W m-2) for SW and LW upward fluxes at the top of atmosphere, and 0.06% (0.1 W m-2) and -2.12% (-7.8 W m 2) for SW and LW downward fluxes at the surface, respectively. Furthermore, model SW frequency distributions in July 2004 covered the observational ranges entirely, with ensemble means located in the middle of the ranges. Moreover, it was found that the accuracy of radiative transfer calculations can be significantly enhanced by" using certain combinations of cloud schemes for the cloud cover fraction, particle effective size, water path, and optical properties, along with better explicit treatments for unresolved cloud structures.
基金The Major State Basic Research Development Program of China under contract Nos 201-1CB403606 and 2011CB403500the National Natural Science Foundation of China under contract Nos 41222038,41076011and 41206023the National Marine Environmental Forecasting Center Operational Development Foundation of the State Oceanic Administration of China under contract No.2013002
文摘The ensemble optimal interpolation (EnOI) is applied to the regional ocean modeling system (ROMS) with the ability to assimilate the along-track sea level anomaly (TSLA). This system is tested with an eddy-resolving system of the South China Sea (SCS). Background errors are derived from a running seasonal ensemble to account for the seasonal variability within the SCS. A fifth-order localization function with a 250 km localization radius is chosen to reduce the negative effects of sampling errors. The data assimilation system is tested from January 2004 to December 2006. The results show that the root mean square deviation (RMSD) of the sea level anomaly decreased from 10.57 to 6.70 cm, which represents a 36.6% reduction of error. The data assimilation reduces error for temperature within the upper 800 m and for salinity within the upper 200 m, although error degrades slightly at deeper depths. Surface currents are in better agreement with trajectories of surface drifters after data assimilation. The variance of sea level improves significantly in terms of both the amplitude and position of the strong and weak variance regions after assimilating TSLA. Results with AGE error (AGE) perform better than no AGE error (NoAGE) when considering the improvements of the temperature and the salinity. Furthermore, reasons for the extremely strong variability in the northern SCS in high resolution models are investigated. The results demonstrate that the strong variability of sea level in the high resolution model is caused by an extremely strong Kuroshio intrusion. Therefore, it is demonstrated that it is necessary to assimilate the TSLA in order to better simulate the SCS with high resolution models.
基金jointly sponsored by the National Key R&D Program of China through Grant No. 2017YFC1501603the National Natural Science Foundation of China through Grant Nos. 41675052 and 41775057。
文摘An ensemble Kalman filter(EnKF) combined with the Advanced Research Weather Research and Forecasting model(WRF) is cycled and evaluated for western North Pacific(WNP) typhoons of year 2016. Conventional in situ data, radiance observations, and tropical cyclone(TC) minimum sea level pressure(SLP) are assimilated every 6 h using an 80-member ensemble. For all TC categories, the 6-h ensemble priors from the WRF/EnKF system have an appropriate amount of variance for TC tracks but have insufficient variance for TC intensity. The 6-h ensemble priors from the WRF/EnKF system tend to overestimate the intensity for weak storms but underestimate the intensity for strong storms. The 5-d deterministic forecasts launched from the ensemble mean analyses of WRF/EnKF are compared to the NCEP and ECMWF operational control forecasts. Results show that the WRF/EnKF forecasts generally have larger track errors than the NCEP and ECMWF forecasts for all TC categories because the regional simulation cannot represent the large-scale environment better than the global simulation. The WRF/EnKF forecasts produce smaller intensity errors and biases than the NCEP and ECMWF forecasts for typhoons, but the opposite is true for tropical storms and severe tropical storms. The 5-d ensemble forecasts from the WRF/EnKF system for seven typhoon cases show appropriate variance for TC track and intensity with short forecast lead times but have insufficient spread with long forecast lead times. The WRF/EnKF system provides better ensemble forecasts and higher predictability for TC intensity than the NCEP and ECMWF ensemble forecasts.
文摘Formulating model uncertainties for a convection-allowing ensemble prediction system(CAEPS)is a much more challenging problem compared to well-utilized approaches in synoptic weather forecasting.A new approach is proposed and tested through assuming that the model uncertainty should reasonably describe the fast nonlinear error growth of the convection-allowing model,due to the fast developing character and strong nonlinearity of convective events.The Conditional Nonlinear Optimal Perturbation related to Parameters(CNOP-P)is applied in this study.Also,an ensemble approach is adopted to solve the CNOP-P problem.By using five locally developed strong convective events that occurred in pre-rainy season of South China,the most sensitive parameters were detected based on CNOP-P,which resulted in the maximum variations in precipitation.A formulation of model uncertainty is designed by adding stochastic perturbations into these sensitive parameters.Through comparison ensemble experiments by using all the 13 heavy rainfall cases that occurred in the flood season of South China in 2017,the advantages of the CNOP-P-based method are examined and verified by comparing with the well-utilized stochastically perturbed physics tendencies(SPPT)scheme.The results indicate that the CNOP-P-based method has potential in improving the under-dispersive problem of the current CAEPS.
基金supported by the National Fundamental(973) Research Program of China(Grant No.2013CB430100)the Special Fund for Meteorological Scientific Research in the Public Interest(Grant No.GYHY201506005)the National Natural Science Foundation of China(Grant Nos.41475097,41075079,41275065 and 41475054)
文摘Atmospheric variability is driven not only by internal dynamics, but also by external forcing, such as soil states, SST, snow, sea-ice cover, and so on. To investigate the forecast uncertainties and effects of land surface processes on numerical weather prediction, we added modules to perturb soil moisture and soil temperature into NCEP's Global Ensemble Forecast System (GEFS), and compared the results of a set of experiments involving different configurations of land surface and atmospheric perturbation. It was found that uncertainties in different soil layers varied due to the multiple timescales of interactions between land surface and atmospheric processes. Perturbations of the soil moisture and soil temperature at the land surface changed sensible and latent heat flux obviously, as compared to the less or indirect land surface perturbation experiment from the day-to-day forecasts. Soil state perturbations led to greater variation in surface heat fluxes that transferred to the upper troposphere, thus reflecting interactions and the response to atmospheric external forcing. Various verification scores were calculated in this study. The results indicated that taking the uncertainties of land surface processes into account in GEFS could contribute a slight improvement in forecast skill in terms of resolution and reliability, a noticeable reduction in forecast error, as well as an increase in ensemble spread in an under-dispersive system. This paper provides a preliminary evaluation of the effects of land surface processes on predictability. Further research using more complex and suitable methods is needed to fully explore our understanding in this area.
基金Supported by the National Key Research&Development Plan of China(Nos.2016YFC1401703,2016YFC1401702,2018YFC0309803)the National Natural Science Foundation of China(Nos.41506002,41676010,41476011,41676015,41606026)+1 种基金the Institution of South China Sea Ecology and Environmental Engineering,Chinese Academy of Sciences(No.ISEE2019ZR0)the Guangzhou Science and Technology Foundation(No.201804010133)。
文摘An ensemble-based method for the observation system simulation experiment(OSSE)is employed to design optimal observation stations and assess the present observation stations in the northeastern South China Sea(SCS).We employed the 20-year(1992-2012)sea surface height(SSH)data to design an array to monitor the intraseasonal to interannual variability.The results show that the most key region was found located at the northwest of Luzon Island(LI)where the energetic Luzon cyclonic gyre(LCG)occurs;other key regions include the edge of the LCG,the northwest of the Luzon Strait(LS),and the southwest of Taiwan,China.By contrast,we found that the present observation stations might oversample at the northwest of the LS and undersample at the northwest of LI.In addition,the optimal stations perform better in a larger area than the present stations.In vertical direction,the key layer is located within the upper 200-m depth,of which the surface and subsurface layers are most valuable to the observing system.
基金supported by the National Basic Research (973) Program of China (Grant No. 2013CB430106)the R&D Special Fund for Public Welfare Industry (Meteorology) (Grant Nos. GYHY201306002 and GYHY201206005)+2 种基金the National Natural Science Foundation of China (Grant Nos. 40830958 and 41175087)the Jiangsu Collaborative Innovation Center for Climate Changethe High Performance Computing Center of Nanjing University
文摘Ensemble forecasting has become the prevailing method in current operational weather forecasting. Although ensemble mean forecast skill has been studied for many ensemble prediction systems(EPSs) and different cases, theoretical analysis regarding ensemble mean forecast skill has rarely been investigated, especially quantitative analysis without any assumptions of ensemble members. This paper investigates fundamental questions about the ensemble mean, such as the advantage of the ensemble mean over individual members, the potential skill of the ensemble mean, and the skill gain of the ensemble mean with increasing ensemble size. The average error coefficient between each pair of ensemble members is the most important factor in ensemble mean forecast skill, which determines the mean-square error of ensemble mean forecasts and the skill gain with increasing ensemble size. More members are useful if the errors of the members have lower correlations with each other, and vice versa. The theoretical investigation in this study is verified by application with the T213 EPS. A typical EPS has an average error coefficient of between 0.5 and 0.8; the 15-member T213 EPS used here reaches a saturation degree of 95%(i.e., maximum 5% skill gain by adding new members with similar skill to the existing members) for 1–10-day lead time predictions, as far as the mean-square error is concerned.
基金supported by the National Key Research and Development (R&D) Program of the Ministry of Science and Technology of China (Grant No. 2021YFC3000902)
文摘How to accurately address model uncertainties with consideration of the rapid nonlinear error growth characteristics in a convection-allowing system is a crucial issue for performing convection-scale ensemble forecasts.In this study,a new nonlinear model perturbation technique for convective-scale ensemble forecasts is developed to consider a nonlinear representation of model errors in the Global and Regional Assimilation and Prediction Enhanced System(GRAPES)Convection-Allowing Ensemble Prediction System(CAEPS).The nonlinear forcing singular vector(NFSV)approach,that is,conditional nonlinear optimal perturbation-forcing(CNOP-F),is applied in this study,to construct a nonlinear model perturbation method for GRAPES-CAEPS.Three experiments are performed:One of them is the CTL experiment,without adding any model perturbation;the other two are NFSV-perturbed experiments,which are perturbed by NFSV with two different groups of constraint radii to test the sensitivity of the perturbation magnitude constraint.Verification results show that the NFSV-perturbed experiments achieve an overall improvement and produce more skillful forecasts compared to the CTL experiment,which indicates that the nonlinear NFSV-perturbed method can be used as an effective model perturbation method for convection-scale ensemble forecasts.Additionally,the NFSV-L experiment with large perturbation constraints generally performs better than the NFSV-S experiment with small perturbation constraints in the verification for upper-air and surface weather variables.But for precipitation verification,the NFSV-S experiment performs better in forecasts for light precipitation,and the NFSV-L experiment performs better in forecasts for heavier precipitation,indicating that for different precipitation events,the perturbation magnitude constraint must be carefully selected.All the findings above lay a foundation for the design of nonlinear model perturbation methods for future CAEPSs.
基金National Key R&D Program of China(2017YFC1501604)National Natural Science Foundation of China (41875114)+1 种基金Shanghai Science&Technology Research Program (19dz1200101)Fundamental Research Funds of the STI/CMA (2020JB06)。
文摘This study presented an evaluation of tropical cyclone(TC) intensity forecasts from five global ensemble prediction systems(EPSs) during 2015-2019 in the western North Pacific region. Notable error features include the underestimation of the TC intensity by ensemble mean forecast and the under-dispersion of the probability forecasts.The root mean square errors(brier scores) of the ensemble mean(probability forecasts) generally decrease consecutively at long lead times during the five years, but fluctuate between certain values at short lead times.Positive forecast skill appeared in the most recent two years(2018-2019) at 120 h or later as compared with the climatology forecasts. However, there is no obvious improvement for the intensity change forecasts during the 5-year period, with abrupt intensity change remaining a big challenge. The probability forecasts show no skill for strong TCs at all the lead times. Among the five EPSs, ECMWF-EPS ranks the best for the intensity forecast, while NCEPGEFS ranks the best for the intensity change forecast, according to the evaluation of ensemble mean and dispersion.As for the other probability forecast evaluation, ECMWF-EPS ranks the best at lead times shorter than 72 h, while NCEP-GEFS ranks the best later on.
文摘Smart Industrial environments use the Industrial Internet of Things(IIoT)for their routine operations and transform their industrial operations with intelligent and driven approaches.However,IIoT devices are vulnerable to cyber threats and exploits due to their connectivity with the internet.Traditional signature-based IDS are effective in detecting known attacks,but they are unable to detect unknown emerging attacks.Therefore,there is the need for an IDS which can learn from data and detect new threats.Ensemble Machine Learning(ML)and individual Deep Learning(DL)based IDS have been developed,and these individual models achieved low accuracy;however,their performance can be improved with the ensemble stacking technique.In this paper,we have proposed a Deep Stacked Neural Network(DSNN)based IDS,which consists of two stacked Convolutional Neural Network(CNN)models as base learners and Extreme Gradient Boosting(XGB)as the meta learner.The proposed DSNN model was trained and evaluated with the next-generation dataset,TON_IoT.Several pre-processing techniques were applied to prepare a dataset for the model,including ensemble feature selection and the SMOTE technique.Accuracy,precision,recall,F1-score,and false positive rates were used to evaluate the performance of the proposed ensemble model.Our experimental results showed that the accuracy for binary classification is 99.61%,which is better than in the baseline individual DL and ML models.In addition,the model proposed for IDS has been compared with similar models.The proposed DSNN achieved better performance metrics than the other models.The proposed DSNN model will be used to develop enhanced IDS for threat mitigation in smart industrial environments.