Mapping and monitoring the distribution of croplands and crop types support policymakers and international organizations by reducing the risks to food security,notably from climate change and,for that purpose,remote s...Mapping and monitoring the distribution of croplands and crop types support policymakers and international organizations by reducing the risks to food security,notably from climate change and,for that purpose,remote sensing is routinely used.However,identifying specific crop types,cropland,and cropping patterns using space-based observations is challenging because different crop types and cropping patterns have similarity spectral signatures.This study applied a methodology to identify cropland and specific crop types,including tobacco,wheat,barley,and gram,as well as the following cropping patterns:wheat-tobacco,wheat-gram,wheat-barley,and wheat-maize,which are common in Gujranwala District,Pakistan,the study region.The methodology consists of combining optical remote sensing images from Sentinel-2 and Landsat-8 with Machine Learning(ML)methods,namely a Decision Tree Classifier(DTC)and a Random Forest(RF)algorithm.The best time-periods for differentiating cropland from other land cover types were identified,and then Sentinel-2 and Landsat 8 NDVI-based time-series were linked to phenological parameters to determine the different crop types and cropping patterns over the study region using their temporal indices and ML algorithms.The methodology was subsequently evaluated using Landsat images,crop statistical data for 2020 and 2021,and field data on cropping patterns.The results highlight the high level of accuracy of the methodological approach presented using Sentinel-2 and Landsat-8 images,together with ML techniques,for mapping not only the distribution of cropland,but also crop types and cropping patterns when validated at the county level.These results reveal that this methodology has benefits for monitoring and evaluating food security in Pakistan,adding to the evidence base of other studies on the use of remote sensing to identify crop types and cropping patterns in other countries.展开更多
In this work, a hardware intrusion detection system (IDS) model and its implementation are introduced to perform online real-time traffic monitoring and analysis. The introduced system gathers some advantages of man...In this work, a hardware intrusion detection system (IDS) model and its implementation are introduced to perform online real-time traffic monitoring and analysis. The introduced system gathers some advantages of many IDSs: hardware based from implementation point of view, network based from system type point of view, and anomaly detection from detection approach point of view. In addition, it can detect most of network attacks, such as denial of services (DOS), leakage, etc. from detection behavior point of view and can detect both internal and external intruders from intruder type point of view. Gathering these features in one IDS system gives lots of strengths and advantages of the work. The system is implemented by using field programmable gate array (FPGA), giving a more advantages to the system. A C5.0 decision tree classifier is used as inference engine to the system and gives a high detection ratio of 99.93%.展开更多
Snort rule-checking is one of the most popular forms of Network Intrusion Detection Systems (NIDS). In this article, we show that Snort priorities of true positive traffic (real attacks) can be approximated in real-ti...Snort rule-checking is one of the most popular forms of Network Intrusion Detection Systems (NIDS). In this article, we show that Snort priorities of true positive traffic (real attacks) can be approximated in real-time, in the context of high speed networks, by a decision tree classifier, using the information of only three easily extracted features (protocol, source port, and destination port), with an accuracy of 99%. Snort issues alert priorities based on its own default set of attack classes (34 classes) that are used by the default set of rules it provides. But the decision tree model is able to predict the priorities without using this default classification. The obtained tagger can provide a useful complement to an anomaly detection intrusion detection system.展开更多
To address the confrontation decision-making issues in multi-round air combat,a dynamic game decision method is proposed based on decision tree for the confrontation of unmanned aerial vehicle(UAV)air combat.Based on ...To address the confrontation decision-making issues in multi-round air combat,a dynamic game decision method is proposed based on decision tree for the confrontation of unmanned aerial vehicle(UAV)air combat.Based on game the-ory and the confrontation characteristics of air combat,a dynamic game process is constructed including the strategy sets,the situation information,and the maneuver decisions for both sides of air combat.By analyzing the UAV’s flight dyna-mics and the both sides’information,a payment matrix is estab-lished through the situation advantage function,performance advantage function,and profit function.Furthermore,the dynamic game decision problem is solved based on the linear induction method to obtain the Nash equilibrium solution,where the decision tree method is introduced to obtain the optimal maneuver decision,thereby improving the situation advantage in the next round of confrontation.According to the analysis,the simulation results for the confrontation scenarios of multi-round air combat are presented to verify the effectiveness and advan-tages of the proposed method.展开更多
System upgrades in unmanned systems have made Unmanned Aerial Vehicle(UAV)-based patrolling and monitoring a preferred solution for ocean surveillance.However,dynamic environments and large-scale deployments pose sign...System upgrades in unmanned systems have made Unmanned Aerial Vehicle(UAV)-based patrolling and monitoring a preferred solution for ocean surveillance.However,dynamic environments and large-scale deployments pose significant challenges for efficient decision-making,necessitating a modular multiagent control system.Deep Reinforcement Learning(DRL)and Decision Tree(DT)have been utilized for these complex decision-making tasks,but each has its limitations:DRL is highly adaptive but lacks interpretability,while DT is inherently interpretable but has limited adaptability.To overcome these challenges,we propose the Adaptive Interpretable Decision Tree(AIDT),an evolutionary-based algorithm that is both adaptable to diverse environmental settings and highly interpretable in its decision-making processes.We first construct a Markov decision process(MDP)-based simulation environment using the Cooperative Submarine Search task as a representative scenario for training and testing the proposed method.Specifically,we use the heat map as a state variable to address the issue of multi-agent input state proliferation.Next,we introduce the curiosity-guiding intrinsic reward to encourage comprehensive exploration and enhance algorithm performance.Additionally,we incorporate decision tree size as an influence factor in the adaptation process to balance task completion with computational efficiency.To further improve the generalization capability of the decision tree,we apply a normalization method to ensure consistent processing of input states.Finally,we validate the proposed algorithm in different environmental settings,and the results demonstrate both its adaptability and interpretability.展开更多
Despite the widespread use of Decision trees (DT) across various applications, their performance tends to suffer when dealing with imbalanced datasets, where the distribution of certain classes significantly outweighs...Despite the widespread use of Decision trees (DT) across various applications, their performance tends to suffer when dealing with imbalanced datasets, where the distribution of certain classes significantly outweighs others. Cost-sensitive learning is a strategy to solve this problem, and several cost-sensitive DT algorithms have been proposed to date. However, existing algorithms, which are heuristic, tried to greedily select either a better splitting point or feature node, leading to local optima for tree nodes and ignoring the cost of the whole tree. In addition, determination of the costs is difficult and often requires domain expertise. This study proposes a DT for imbalanced data, called Swarm-based Cost-sensitive DT (SCDT), using the cost-sensitive learning strategy and an enhanced swarm-based algorithm. The DT is encoded using a hybrid individual representation. A hybrid artificial bee colony approach is designed to optimize rules, considering specified costs in an F-Measure-based fitness function. Experimental results using datasets compared with state-of-the-art DT algorithms show that the SCDT method achieved the highest performance on most datasets. Moreover, SCDT also excels in other critical performance metrics, such as recall, precision, F1-score, and AUC, with notable results with average values of 83%, 87.3%, 85%, and 80.7%, respectively.展开更多
In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,inclu...In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,including Mo,Cu,Pb,Zn,Ag,Ni,Co,Mn,Fe,and As,were used with these machine learning algorithms(MLAs)to predict Au concentration values in the Doostbigloo porphyry Cu-Au-Mo mineralization area.The performance of the models was evaluated using the Mean Absolute Percentage Error(MAPE)and Root Mean Square Error(RMSE)metrics.The proposed ensemble Voting algorithm outperformed the other models,yielding more ac-curate predictions according to both metrics.The predicted data from the GB,LR,DT,and Voting MLAs were modeled using the Concentration-Area fractal method,and Au geochemical anomalies were mapped.To compare and validate the results,factors such as the location of the mineral deposits,their surface extent,and mineralization trend were considered.The results indicate that integrating hybrid MLAs with fractal modeling signifi-cantly improves geochemical prospectivity mapping.Among the four models,three(DT,GB,Voting)accurately identified both mineral deposits.The LR model,however,only identified Deposit I(central),and its mineralization trend diverged from the field data.The GB and Voting models produced similar results,with their final maps derived from fractal modeling showing the same anomalous areas.The anomaly boundaries identified by these two models are consistent with the two known reserves in the region.The results and plots related to prediction indicators and error rates for these two models also show high similarity,with lower error rates than the other models.Notably,the Voting model demonstrated superior performance in accurately delineating mineral deposit locations and identifying realistic mineralization trends while minimizing false anomalies.展开更多
The classification for handwritten Chinese character recognition can be viewed as a transformation in discrete vector space. In this paper, from the point of discrete vector space transformation, a new 4-corner codes ...The classification for handwritten Chinese character recognition can be viewed as a transformation in discrete vector space. In this paper, from the point of discrete vector space transformation, a new 4-corner codes classifier based on decision tree inductive learning algorithm ID3 for handwritten Chinese characters is presented. With a feature extraction controller, the classifier can reduce the number of extracted features and accelerate classification speed. Experimental results show that the 4-corner codes classifier performs well on both recognition accuracy and speed.展开更多
[Objective] The aim was to explore the feasibility of using single spectrum image to classify crops based on multi-spectral image and Decision Tree Method. [Method] Taking the typical agriculture plantation area in Hu...[Objective] The aim was to explore the feasibility of using single spectrum image to classify crops based on multi-spectral image and Decision Tree Method. [Method] Taking the typical agriculture plantation area in Hulunbeier area, according to field measured spectrum data, the optimum time of main crops, barley, wheat, rapeseed, based on crops spectrum characteristics, by dint of decision-making tree method, and considering spectral matching method, classification of crops was studied such as SAM. [Result] By dint of Landsat TM image gained in the first half of August, based on geographic and atmospheric proof-reading, decision-making tree was constructed. Plantation information about wheat, barley, and rapeseed and plantation grassland was extracted successfully. The general classification accuracy reached 86.90%. Kappa coefficient was 0.831 1. [Conclusion] Taking typical spectrum image as data source, and applying Decision Tree Method to get crops type's information had fine application future.展开更多
With western Jilin Province as the study region, spectral characteristics and texture features of remote sensing images were taken as the classification basis to construct a Decision Tree Model and extract information...With western Jilin Province as the study region, spectral characteristics and texture features of remote sensing images were taken as the classification basis to construct a Decision Tree Model and extract information about settlements in western Jilin Province, and the manually-extracted information about settlements in western Jilin Province was evaluated by confusion matrix. The results showed that Decision Tree Model was convenient for extracting settlements information by integrating spectral and texture features, and the accuracy of such a method was higher than that of the traditional Maximum Liklihood Method, in addition, calculation methods of extracting settlements information by this mean were concluded.展开更多
Based on the discuss of the basic concept of data mining technology and the decision tree method,combining with the data samples of wind and hailstorm disasters in some counties of Mudanjiang region,the forecasting mo...Based on the discuss of the basic concept of data mining technology and the decision tree method,combining with the data samples of wind and hailstorm disasters in some counties of Mudanjiang region,the forecasting model of agro-meteorological disaster grade was established by adopting the C4.5 classification algorithm of decision tree,which can forecast the direct economic loss degree to provide rational data mining model and obtain effective analysis results.展开更多
[Objective] This paper aims to construct an improved fuzzy decision tree which is based on clustering,and researches into its application in the screening of maize germplasm.[Method] A new decision tree algorithm base...[Objective] This paper aims to construct an improved fuzzy decision tree which is based on clustering,and researches into its application in the screening of maize germplasm.[Method] A new decision tree algorithm based upon clustering is adopted in this paper,which is improved against the defect that traditional decision tree algorithm fails to handle samples of no classes.Meanwhile,the improved algorithm is also applied to the screening of maize varieties.Through the indices as leaf area,plant height,dry weight,potassium(K) utilization and others,maize seeds with strong tolerance of hypokalemic are filtered out.[Result] The algorithm in the screening of maize germplasm has great applicability and good performance.[Conclusion] In the future more efforts should be made to compare improved the performance of fuzzy decision tree based upon clustering with the performance of traditional fuzzy one,and it should be applied into more realistic problems.展开更多
Accurate classification of cardiac arrhythmias is a crucial task because of the non-stationary nature of electrocardiogram(ECG)signals.In a life-threatening situation,an automated system is necessary for early detecti...Accurate classification of cardiac arrhythmias is a crucial task because of the non-stationary nature of electrocardiogram(ECG)signals.In a life-threatening situation,an automated system is necessary for early detection of beat abnormalities in order to reduce the mortality rate.In this paper,we propose an automatic classification system of ECG beats based on the multi-domain features derived from the ECG signals.The experimental study was evaluated on ECG signals obtained from the MIT-BIH Arrhythmia Database.The feature set comprises eight empirical mode decomposition(EMD)based features,three features from variational mode decomposition(VMD)and four features from RR intervals.In total,15 features are ranked according to a ranker search approach and then used as input to the support vector machine(SVM)and C4.5 decision tree classifiers for classifying six types of arrhythmia beats.The proposed method achieved best result in C4.5 decision tree classifier with an accuracy of 98.89%compared to cubic-SVM classifier which achieved an accuracy of 95.35%only.Besides accuracy measures,all other parameters such as sensitivity(Se),specificity(Sp)and precision rates of 95.68%,99.28%and 95.8%was achieved better in C4.5 classifier.Also the computational time of 0.65 s with an error rate of 0.11 was achieved which is very less compared to SVM.The multi-domain based features with decision tree classifier obtained the best results in classifying cardiac arrhythmias hence the system could be used efficiently in clinical practices.展开更多
One of the most important methods that finds usefulness in various applications, such as searching historical manuscripts, forensic search, bank check reading, mail sorting, book and handwritten notes transcription, i...One of the most important methods that finds usefulness in various applications, such as searching historical manuscripts, forensic search, bank check reading, mail sorting, book and handwritten notes transcription, is handwritten character recognition. The common issues in the character recognition are often due to different writing styles, orientation angle, size variation(regarding length and height), etc. This study presents a classification model using a hybrid classifier for the character recognition by combining holoentropy enabled decision tree(HDT) and deep neural network(DNN). In feature extraction, the local gradient features that include histogram oriented gabor feature and grid level feature, and grey level co-occurrence matrix(GLCM) features are extracted. Then, the extracted features are concatenated to encode shape, color, texture, local and statistical information, for the recognition of characters in the image by applying the extracted features to the hybrid classifier. In the experimental analysis, recognition accuracy of 96% is achieved. Thus, it can be suggested that the proposed model intends to provide more accurate character recognition rate compared to that of character recognition techniques used in the literature.展开更多
This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting de...This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting decision tree(GBDT), random forest(RF) and information value(InV) models, were used, and the performances were assessed and compared. In total, 202 landslides were mapped by using a series of field surveys, aerial photographs, and reviews of historical and bibliographical data. Nine causative factors were then considered in landslide susceptibility map generation by using the GBDT, RF and InV models. All of the maps of the causative factors were resampled to a resolution of 28.5 m. Of the 486289 pixels in the area,28526 pixels were landslide pixels, and 457763 pixels were non-landslide pixels. Finally, landslide susceptibility maps were generated by using the three machine learning models, and their performances were assessed through receiver operating characteristic(ROC) curves, the sensitivity, specificity,overall accuracy(OA), and kappa coefficient(KAPPA). The results showed that the GBDT, RF and In V models in overall produced reasonable accurate landslide susceptibility maps. Among these three methods, the GBDT method outperforms the other two machine learning methods, which can provide strong technical support for producing landslide susceptibility maps in TGR.展开更多
To build any spatial soil database, a set of environmental data including digital elevation model(DEM) and satellite images beside geomorphic landscape description are essentials. Such a database, integrates field obs...To build any spatial soil database, a set of environmental data including digital elevation model(DEM) and satellite images beside geomorphic landscape description are essentials. Such a database, integrates field observations and laboratory analyses data with the results obtained from qualitative and quantitative models. So far, various techniques have been developed for soil data processing. The performance of Artificial Neural Network(ANN) and Decision Tree(DT) models was compared to map out some soil attributes in Alborz Province, Iran. Terrain attributes derived from a DEM along with Landsat 8 ETM+, geomorphology map, and the routine laboratory analyses of the studied area were used as input data. The relationships between soil properties(including sand, silt, clay, electrical conductivity, organic carbon, and carbonates) and the environmental variables were assessed using the Pearson Correlation Coefficient and Principle Components Analysis. Slope, elevation, geomforms, carbonate index, stream network, wetness index, and the band’s number 2, 3, 4, and 5 were the most significantly correlated variables. ANN and DT did not show the same accuracy in predicting all parameters. The DT model showed higher performances in estimating sand(R^2=0.73), silt(R^2=0.70), clay(R^2=0.72), organic carbon(R^2=0.71), and carbonates(R^2=0.70). While the ANN model only showed higher performance in predicting soil electrical conductivity(R^2=0.95). The results showed that determination the best model to use, is dependent upon the relation between the considered soil properties with the environmental variables. However, the DT model showed more reasonable results than the ANN model in this study. The results showed that before using a certain model to predict variability of all soil parameters, it would be better to evaluate the efficiency of all possible models for choosing the best fitted model for each property. In other words, most of the developed models are sitespecific and may not be applicable to use for predicting other soil properties or other area.展开更多
AIM: To evaluate a different decision tree for safe liver resection and verify its efficiency.METHODS: A total of 2457 patients underwent hepatic resection between January 2004 and December 2010 at the Chinese PLA Gen...AIM: To evaluate a different decision tree for safe liver resection and verify its efficiency.METHODS: A total of 2457 patients underwent hepatic resection between January 2004 and December 2010 at the Chinese PLA General Hospital,and 634 hepatocellular carcinoma(HCC) patients were eligible for the final analyses. Post-hepatectomy liver failure(PHLF) was identified by the association of prothrombin time < 50% and serum bilirubin > 50 μmol/L(the "50-50" criteria),which were assessed at day 5 postoperatively or later. The Swiss-Clavien decision tree,Tokyo University-Makuuchi decision tree,and Chinese consensus decision tree were adopted to divide patients into two groups based on those decision trees in sequence,and the PHLF rates were recorded.RESULTS: The overall mortality and PHLF rate were 0.16% and 3.0%. A total of 19 patients experienced PHLF. The numbers of patients to whom the SwissClavien,Tokyo University-Makuuchi,and Chinese consensus decision trees were applied were 581,573,and 622,and the PHLF rates were 2.75%,2.62%,and 2.73%,respectively. Significantly more cases satisfied the Chinese consensus decision tree than the Swiss-Clavien decision tree and Tokyo University-Makuuchi decision tree(P < 0.01,P < 0.01); nevertheless,the latter two shared no difference(P = 0.147). The PHLF rate exhibited no significant difference with respect to the three decision trees.CONCLUSION: The Chinese consensus decision tree expands the indications for hepatic resection for HCC patients and does not increase the PHLF rate compared to the Swiss-Clavien and Tokyo UniversityMakuuchi decision trees. It would be a safe and effective algorithm for hepatectomy in patients with hepatocellular carcinoma.展开更多
In order to improve nitrogen removal in anoxic/oxic(A/O) process effectively for treating domestic wastewaters, the influence factors, DO(dissolved oxygen), nitrate recirculation, sludge recycle, SRT(solids residence ...In order to improve nitrogen removal in anoxic/oxic(A/O) process effectively for treating domestic wastewaters, the influence factors, DO(dissolved oxygen), nitrate recirculation, sludge recycle, SRT(solids residence time), influent COD/TN and HRT(hydraulic retention time) were studied. Results indicated that it was possible to increase nitrogen removal by using corresponding control strategies, such as, adjusting the DO set point according to effluent ammonia concentration; manipulating nitrate recirculation flow according to nitrate concentration at the end of anoxic zone. Based on the experiments results, a knowledge-based approach for supervision of the nitrogen removal problems was considered, and decision trees for diagnosing nitrification and denitrification problems were built and successfully applied to A/O process.展开更多
文摘Mapping and monitoring the distribution of croplands and crop types support policymakers and international organizations by reducing the risks to food security,notably from climate change and,for that purpose,remote sensing is routinely used.However,identifying specific crop types,cropland,and cropping patterns using space-based observations is challenging because different crop types and cropping patterns have similarity spectral signatures.This study applied a methodology to identify cropland and specific crop types,including tobacco,wheat,barley,and gram,as well as the following cropping patterns:wheat-tobacco,wheat-gram,wheat-barley,and wheat-maize,which are common in Gujranwala District,Pakistan,the study region.The methodology consists of combining optical remote sensing images from Sentinel-2 and Landsat-8 with Machine Learning(ML)methods,namely a Decision Tree Classifier(DTC)and a Random Forest(RF)algorithm.The best time-periods for differentiating cropland from other land cover types were identified,and then Sentinel-2 and Landsat 8 NDVI-based time-series were linked to phenological parameters to determine the different crop types and cropping patterns over the study region using their temporal indices and ML algorithms.The methodology was subsequently evaluated using Landsat images,crop statistical data for 2020 and 2021,and field data on cropping patterns.The results highlight the high level of accuracy of the methodological approach presented using Sentinel-2 and Landsat-8 images,together with ML techniques,for mapping not only the distribution of cropland,but also crop types and cropping patterns when validated at the county level.These results reveal that this methodology has benefits for monitoring and evaluating food security in Pakistan,adding to the evidence base of other studies on the use of remote sensing to identify crop types and cropping patterns in other countries.
文摘In this work, a hardware intrusion detection system (IDS) model and its implementation are introduced to perform online real-time traffic monitoring and analysis. The introduced system gathers some advantages of many IDSs: hardware based from implementation point of view, network based from system type point of view, and anomaly detection from detection approach point of view. In addition, it can detect most of network attacks, such as denial of services (DOS), leakage, etc. from detection behavior point of view and can detect both internal and external intruders from intruder type point of view. Gathering these features in one IDS system gives lots of strengths and advantages of the work. The system is implemented by using field programmable gate array (FPGA), giving a more advantages to the system. A C5.0 decision tree classifier is used as inference engine to the system and gives a high detection ratio of 99.93%.
文摘Snort rule-checking is one of the most popular forms of Network Intrusion Detection Systems (NIDS). In this article, we show that Snort priorities of true positive traffic (real attacks) can be approximated in real-time, in the context of high speed networks, by a decision tree classifier, using the information of only three easily extracted features (protocol, source port, and destination port), with an accuracy of 99%. Snort issues alert priorities based on its own default set of attack classes (34 classes) that are used by the default set of rules it provides. But the decision tree model is able to predict the priorities without using this default classification. The obtained tagger can provide a useful complement to an anomaly detection intrusion detection system.
基金supported by the Major Projects for Science and Technology Innovation 2030(2018AAA0100805).
文摘To address the confrontation decision-making issues in multi-round air combat,a dynamic game decision method is proposed based on decision tree for the confrontation of unmanned aerial vehicle(UAV)air combat.Based on game the-ory and the confrontation characteristics of air combat,a dynamic game process is constructed including the strategy sets,the situation information,and the maneuver decisions for both sides of air combat.By analyzing the UAV’s flight dyna-mics and the both sides’information,a payment matrix is estab-lished through the situation advantage function,performance advantage function,and profit function.Furthermore,the dynamic game decision problem is solved based on the linear induction method to obtain the Nash equilibrium solution,where the decision tree method is introduced to obtain the optimal maneuver decision,thereby improving the situation advantage in the next round of confrontation.According to the analysis,the simulation results for the confrontation scenarios of multi-round air combat are presented to verify the effectiveness and advan-tages of the proposed method.
文摘System upgrades in unmanned systems have made Unmanned Aerial Vehicle(UAV)-based patrolling and monitoring a preferred solution for ocean surveillance.However,dynamic environments and large-scale deployments pose significant challenges for efficient decision-making,necessitating a modular multiagent control system.Deep Reinforcement Learning(DRL)and Decision Tree(DT)have been utilized for these complex decision-making tasks,but each has its limitations:DRL is highly adaptive but lacks interpretability,while DT is inherently interpretable but has limited adaptability.To overcome these challenges,we propose the Adaptive Interpretable Decision Tree(AIDT),an evolutionary-based algorithm that is both adaptable to diverse environmental settings and highly interpretable in its decision-making processes.We first construct a Markov decision process(MDP)-based simulation environment using the Cooperative Submarine Search task as a representative scenario for training and testing the proposed method.Specifically,we use the heat map as a state variable to address the issue of multi-agent input state proliferation.Next,we introduce the curiosity-guiding intrinsic reward to encourage comprehensive exploration and enhance algorithm performance.Additionally,we incorporate decision tree size as an influence factor in the adaptation process to balance task completion with computational efficiency.To further improve the generalization capability of the decision tree,we apply a normalization method to ensure consistent processing of input states.Finally,we validate the proposed algorithm in different environmental settings,and the results demonstrate both its adaptability and interpretability.
文摘Despite the widespread use of Decision trees (DT) across various applications, their performance tends to suffer when dealing with imbalanced datasets, where the distribution of certain classes significantly outweighs others. Cost-sensitive learning is a strategy to solve this problem, and several cost-sensitive DT algorithms have been proposed to date. However, existing algorithms, which are heuristic, tried to greedily select either a better splitting point or feature node, leading to local optima for tree nodes and ignoring the cost of the whole tree. In addition, determination of the costs is difficult and often requires domain expertise. This study proposes a DT for imbalanced data, called Swarm-based Cost-sensitive DT (SCDT), using the cost-sensitive learning strategy and an enhanced swarm-based algorithm. The DT is encoded using a hybrid individual representation. A hybrid artificial bee colony approach is designed to optimize rules, considering specified costs in an F-Measure-based fitness function. Experimental results using datasets compared with state-of-the-art DT algorithms show that the SCDT method achieved the highest performance on most datasets. Moreover, SCDT also excels in other critical performance metrics, such as recall, precision, F1-score, and AUC, with notable results with average values of 83%, 87.3%, 85%, and 80.7%, respectively.
文摘In this investigation,the Gradient Boosting(GB),Linear Regression(LR),Decision Tree(DT),and Voting algo-rithms were applied to predict the distribution pattern of Au geochemical data.Trace and indicator elements,including Mo,Cu,Pb,Zn,Ag,Ni,Co,Mn,Fe,and As,were used with these machine learning algorithms(MLAs)to predict Au concentration values in the Doostbigloo porphyry Cu-Au-Mo mineralization area.The performance of the models was evaluated using the Mean Absolute Percentage Error(MAPE)and Root Mean Square Error(RMSE)metrics.The proposed ensemble Voting algorithm outperformed the other models,yielding more ac-curate predictions according to both metrics.The predicted data from the GB,LR,DT,and Voting MLAs were modeled using the Concentration-Area fractal method,and Au geochemical anomalies were mapped.To compare and validate the results,factors such as the location of the mineral deposits,their surface extent,and mineralization trend were considered.The results indicate that integrating hybrid MLAs with fractal modeling signifi-cantly improves geochemical prospectivity mapping.Among the four models,three(DT,GB,Voting)accurately identified both mineral deposits.The LR model,however,only identified Deposit I(central),and its mineralization trend diverged from the field data.The GB and Voting models produced similar results,with their final maps derived from fractal modeling showing the same anomalous areas.The anomaly boundaries identified by these two models are consistent with the two known reserves in the region.The results and plots related to prediction indicators and error rates for these two models also show high similarity,with lower error rates than the other models.Notably,the Voting model demonstrated superior performance in accurately delineating mineral deposit locations and identifying realistic mineralization trends while minimizing false anomalies.
文摘The classification for handwritten Chinese character recognition can be viewed as a transformation in discrete vector space. In this paper, from the point of discrete vector space transformation, a new 4-corner codes classifier based on decision tree inductive learning algorithm ID3 for handwritten Chinese characters is presented. With a feature extraction controller, the classifier can reduce the number of extracted features and accelerate classification speed. Experimental results show that the 4-corner codes classifier performs well on both recognition accuracy and speed.
基金Supported by the Open Subject of Key Lab of Resources Remote-sensing and Digital Agriculture in Agricultural Department(RDA1008)~~
文摘[Objective] The aim was to explore the feasibility of using single spectrum image to classify crops based on multi-spectral image and Decision Tree Method. [Method] Taking the typical agriculture plantation area in Hulunbeier area, according to field measured spectrum data, the optimum time of main crops, barley, wheat, rapeseed, based on crops spectrum characteristics, by dint of decision-making tree method, and considering spectral matching method, classification of crops was studied such as SAM. [Result] By dint of Landsat TM image gained in the first half of August, based on geographic and atmospheric proof-reading, decision-making tree was constructed. Plantation information about wheat, barley, and rapeseed and plantation grassland was extracted successfully. The general classification accuracy reached 86.90%. Kappa coefficient was 0.831 1. [Conclusion] Taking typical spectrum image as data source, and applying Decision Tree Method to get crops type's information had fine application future.
基金Supported by Financial Support of China Geological Survey(1212010916048)the Fundamental Research Funds for the Central Universities(200903046)~~
文摘With western Jilin Province as the study region, spectral characteristics and texture features of remote sensing images were taken as the classification basis to construct a Decision Tree Model and extract information about settlements in western Jilin Province, and the manually-extracted information about settlements in western Jilin Province was evaluated by confusion matrix. The results showed that Decision Tree Model was convenient for extracting settlements information by integrating spectral and texture features, and the accuracy of such a method was higher than that of the traditional Maximum Liklihood Method, in addition, calculation methods of extracting settlements information by this mean were concluded.
基金Supported by Science and Technology Plan of Mudanjiang City (G200920064)Teaching Reform Construction of Mudanjiang Normal University (10-xj11080)
文摘Based on the discuss of the basic concept of data mining technology and the decision tree method,combining with the data samples of wind and hailstorm disasters in some counties of Mudanjiang region,the forecasting model of agro-meteorological disaster grade was established by adopting the C4.5 classification algorithm of decision tree,which can forecast the direct economic loss degree to provide rational data mining model and obtain effective analysis results.
文摘[Objective] This paper aims to construct an improved fuzzy decision tree which is based on clustering,and researches into its application in the screening of maize germplasm.[Method] A new decision tree algorithm based upon clustering is adopted in this paper,which is improved against the defect that traditional decision tree algorithm fails to handle samples of no classes.Meanwhile,the improved algorithm is also applied to the screening of maize varieties.Through the indices as leaf area,plant height,dry weight,potassium(K) utilization and others,maize seeds with strong tolerance of hypokalemic are filtered out.[Result] The algorithm in the screening of maize germplasm has great applicability and good performance.[Conclusion] In the future more efforts should be made to compare improved the performance of fuzzy decision tree based upon clustering with the performance of traditional fuzzy one,and it should be applied into more realistic problems.
文摘Accurate classification of cardiac arrhythmias is a crucial task because of the non-stationary nature of electrocardiogram(ECG)signals.In a life-threatening situation,an automated system is necessary for early detection of beat abnormalities in order to reduce the mortality rate.In this paper,we propose an automatic classification system of ECG beats based on the multi-domain features derived from the ECG signals.The experimental study was evaluated on ECG signals obtained from the MIT-BIH Arrhythmia Database.The feature set comprises eight empirical mode decomposition(EMD)based features,three features from variational mode decomposition(VMD)and four features from RR intervals.In total,15 features are ranked according to a ranker search approach and then used as input to the support vector machine(SVM)and C4.5 decision tree classifiers for classifying six types of arrhythmia beats.The proposed method achieved best result in C4.5 decision tree classifier with an accuracy of 98.89%compared to cubic-SVM classifier which achieved an accuracy of 95.35%only.Besides accuracy measures,all other parameters such as sensitivity(Se),specificity(Sp)and precision rates of 95.68%,99.28%and 95.8%was achieved better in C4.5 classifier.Also the computational time of 0.65 s with an error rate of 0.11 was achieved which is very less compared to SVM.The multi-domain based features with decision tree classifier obtained the best results in classifying cardiac arrhythmias hence the system could be used efficiently in clinical practices.
文摘One of the most important methods that finds usefulness in various applications, such as searching historical manuscripts, forensic search, bank check reading, mail sorting, book and handwritten notes transcription, is handwritten character recognition. The common issues in the character recognition are often due to different writing styles, orientation angle, size variation(regarding length and height), etc. This study presents a classification model using a hybrid classifier for the character recognition by combining holoentropy enabled decision tree(HDT) and deep neural network(DNN). In feature extraction, the local gradient features that include histogram oriented gabor feature and grid level feature, and grey level co-occurrence matrix(GLCM) features are extracted. Then, the extracted features are concatenated to encode shape, color, texture, local and statistical information, for the recognition of characters in the image by applying the extracted features to the hybrid classifier. In the experimental analysis, recognition accuracy of 96% is achieved. Thus, it can be suggested that the proposed model intends to provide more accurate character recognition rate compared to that of character recognition techniques used in the literature.
基金This work was supported in part by the National Natural Science Foundation of China(61601418,41602362,61871259)in part by the Opening Foundation of Hunan Engineering and Research Center of Natural Resource Investigation and Monitoring(2020-5)+1 种基金in part by the Qilian Mountain National Park Research Center(Qinghai)(grant number:GKQ2019-01)in part by the Geomatics Technology and Application Key Laboratory of Qinghai Province,Grant No.QHDX-2019-01.
文摘This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting decision tree(GBDT), random forest(RF) and information value(InV) models, were used, and the performances were assessed and compared. In total, 202 landslides were mapped by using a series of field surveys, aerial photographs, and reviews of historical and bibliographical data. Nine causative factors were then considered in landslide susceptibility map generation by using the GBDT, RF and InV models. All of the maps of the causative factors were resampled to a resolution of 28.5 m. Of the 486289 pixels in the area,28526 pixels were landslide pixels, and 457763 pixels were non-landslide pixels. Finally, landslide susceptibility maps were generated by using the three machine learning models, and their performances were assessed through receiver operating characteristic(ROC) curves, the sensitivity, specificity,overall accuracy(OA), and kappa coefficient(KAPPA). The results showed that the GBDT, RF and In V models in overall produced reasonable accurate landslide susceptibility maps. Among these three methods, the GBDT method outperforms the other two machine learning methods, which can provide strong technical support for producing landslide susceptibility maps in TGR.
基金College of Agriculture and Natural Resources,University of Tehran for financial support of the study(Grant No.7104017/6/24 and 28)
文摘To build any spatial soil database, a set of environmental data including digital elevation model(DEM) and satellite images beside geomorphic landscape description are essentials. Such a database, integrates field observations and laboratory analyses data with the results obtained from qualitative and quantitative models. So far, various techniques have been developed for soil data processing. The performance of Artificial Neural Network(ANN) and Decision Tree(DT) models was compared to map out some soil attributes in Alborz Province, Iran. Terrain attributes derived from a DEM along with Landsat 8 ETM+, geomorphology map, and the routine laboratory analyses of the studied area were used as input data. The relationships between soil properties(including sand, silt, clay, electrical conductivity, organic carbon, and carbonates) and the environmental variables were assessed using the Pearson Correlation Coefficient and Principle Components Analysis. Slope, elevation, geomforms, carbonate index, stream network, wetness index, and the band’s number 2, 3, 4, and 5 were the most significantly correlated variables. ANN and DT did not show the same accuracy in predicting all parameters. The DT model showed higher performances in estimating sand(R^2=0.73), silt(R^2=0.70), clay(R^2=0.72), organic carbon(R^2=0.71), and carbonates(R^2=0.70). While the ANN model only showed higher performance in predicting soil electrical conductivity(R^2=0.95). The results showed that determination the best model to use, is dependent upon the relation between the considered soil properties with the environmental variables. However, the DT model showed more reasonable results than the ANN model in this study. The results showed that before using a certain model to predict variability of all soil parameters, it would be better to evaluate the efficiency of all possible models for choosing the best fitted model for each property. In other words, most of the developed models are sitespecific and may not be applicable to use for predicting other soil properties or other area.
基金Supported by Grants from the Project of the National Science and Technology Major Project,No.2012BAI06B01Postdoctoral Science Foundation funded project,No.201003781
文摘AIM: To evaluate a different decision tree for safe liver resection and verify its efficiency.METHODS: A total of 2457 patients underwent hepatic resection between January 2004 and December 2010 at the Chinese PLA General Hospital,and 634 hepatocellular carcinoma(HCC) patients were eligible for the final analyses. Post-hepatectomy liver failure(PHLF) was identified by the association of prothrombin time < 50% and serum bilirubin > 50 μmol/L(the "50-50" criteria),which were assessed at day 5 postoperatively or later. The Swiss-Clavien decision tree,Tokyo University-Makuuchi decision tree,and Chinese consensus decision tree were adopted to divide patients into two groups based on those decision trees in sequence,and the PHLF rates were recorded.RESULTS: The overall mortality and PHLF rate were 0.16% and 3.0%. A total of 19 patients experienced PHLF. The numbers of patients to whom the SwissClavien,Tokyo University-Makuuchi,and Chinese consensus decision trees were applied were 581,573,and 622,and the PHLF rates were 2.75%,2.62%,and 2.73%,respectively. Significantly more cases satisfied the Chinese consensus decision tree than the Swiss-Clavien decision tree and Tokyo University-Makuuchi decision tree(P < 0.01,P < 0.01); nevertheless,the latter two shared no difference(P = 0.147). The PHLF rate exhibited no significant difference with respect to the three decision trees.CONCLUSION: The Chinese consensus decision tree expands the indications for hepatic resection for HCC patients and does not increase the PHLF rate compared to the Swiss-Clavien and Tokyo UniversityMakuuchi decision trees. It would be a safe and effective algorithm for hepatectomy in patients with hepatocellular carcinoma.
文摘In order to improve nitrogen removal in anoxic/oxic(A/O) process effectively for treating domestic wastewaters, the influence factors, DO(dissolved oxygen), nitrate recirculation, sludge recycle, SRT(solids residence time), influent COD/TN and HRT(hydraulic retention time) were studied. Results indicated that it was possible to increase nitrogen removal by using corresponding control strategies, such as, adjusting the DO set point according to effluent ammonia concentration; manipulating nitrate recirculation flow according to nitrate concentration at the end of anoxic zone. Based on the experiments results, a knowledge-based approach for supervision of the nitrogen removal problems was considered, and decision trees for diagnosing nitrification and denitrification problems were built and successfully applied to A/O process.