Association rules and C4.5 rules can overcome the shortage of the traditional land evaluation methods and improve the intelligibility and efficiency of the land evaluation knowledge.In order to compare these two kinds...Association rules and C4.5 rules can overcome the shortage of the traditional land evaluation methods and improve the intelligibility and efficiency of the land evaluation knowledge.In order to compare these two kinds of classification rules in the application,two fuzzy classifiers were established by combining with fuzzy decision algorithm especially based on Second General Soil Survey of Guangdong Province.The results of experiments demonstrated that the fuzzy classifier based on association rules obtain a higher accuracy rate,but with more complex calculation process and more computational overhead;the fuzzy classifier based on C4.5 rules obtain a slightly lower accuracy,but with fast computation and simpler calculation.展开更多
Some properties of a class of quasi-differentiable functions(the difference of two finite convex functions) are considered in this paper. And the convergence of the steepest descent algorithm for unconstrained and c...Some properties of a class of quasi-differentiable functions(the difference of two finite convex functions) are considered in this paper. And the convergence of the steepest descent algorithm for unconstrained and constrained quasi-differentiable programming is proved.展开更多
A number of clustering algorithms were used to analyze many databases in the field of image clustering. The main objective of this research work was to perform a comparative analysis of the two of the existing partiti...A number of clustering algorithms were used to analyze many databases in the field of image clustering. The main objective of this research work was to perform a comparative analysis of the two of the existing partitions based clustering algorithms and a hybrid clustering algorithm. The results verification done by using classification algorithms via its accuracy. The perfor-mance of clustering and classification algorithms were carried out in this work based on the tumor identification, cluster quality and other parameters like run time and volume complexity. Some of the well known classification algorithms were used to find the accuracy of produced results of the clustering algorithms. The performance of the clustering algorithms proved mean-ingful in many domains, particularly k-Means, FCM. In addition, the proposed multifarious clustering technique has revealed their efficiency in terms of performance in predicting tumor affected regions in mammogram images. The color images are converted in to gray scale images and then it is processed. Finally, it is identified the best method for the analysis of finding tumor in breast images. This research would be immensely useful to physicians and radiologist to identify cancer affected area in the breast.展开更多
For the question that fuzzy c-means(FCM)clustering algorithm has the disadvantages of being too sensitive to the initial cluster centers and easily trapped in local optima,this paper introduces a new metric norm in FC...For the question that fuzzy c-means(FCM)clustering algorithm has the disadvantages of being too sensitive to the initial cluster centers and easily trapped in local optima,this paper introduces a new metric norm in FCM and particle swarm optimization(PSO)clustering algorithm,and proposes a parallel optimization algorithm using an improved fuzzy c-means method combined with particle swarm optimization(AF-APSO).The experiment shows that the AF-APSO can avoid local optima,and get the best fitness and clustering performance significantly.展开更多
Based on the data of daily precipitation in Lianyungang area from 1951 to 2012 and various climate signal data from the National Climate Center website and the NOAA website,a model for predicting whether the number of...Based on the data of daily precipitation in Lianyungang area from 1951 to 2012 and various climate signal data from the National Climate Center website and the NOAA website,a model for predicting whether the number of rainstorm days in summer in Lianyungang area is large was established by the classical C5. 0 decision tree algorithm. The data samples in 48 years( accounting for about 80% of total number of samples)was as the training set of a model,and the training accuracy rate of the model was 95. 83%. The data samples in the remaining 14 years( accounting for about 20% of total number of samples) were used as the test set of the model to test the model,and the test accuracy of the model was 85. 71%. The results showed that the prediction model of number of rainstorm days in summer constructed by C5. 0 algorithm had high accuracy and was easy to explain. Moreover,it is convenient for meteorological staff to use directly. At the same time,this study provides a new idea for short-term climate prediction of number of rainstorm days in summer.展开更多
Intrusion detection systems provide additional defense capacity to a networked information system in addition to the security measures provided by the firewalls. This paper proposes an active rule based enhancement to...Intrusion detection systems provide additional defense capacity to a networked information system in addition to the security measures provided by the firewalls. This paper proposes an active rule based enhancement to the C4.5 algorithm for network intrusion detection in order to detect misuse behaviors of internal attackers through effective classification and decision making in computer networks. This enhanced C4.5 algorithm derives a set of classification rules from network audit data and then the generated rules are used to detect network intrusions in a real-time environment. Unlike most existing decision tree based approaches, the spawned rules generated and fired in this work are more effective because the information-theoretic approach minimizes the expected number of tests needed to classify an object and guarantees that a simple (but not necessarily the simplest) tree is found. The main advantage of this proposed algorithm is that the generalization ability of enhanced C4.5 decision trees is better than that of C4.5 decision trees. We have employed data from the third international knowledge discovery and data mining tools competition (KDDcup’99) to train and test the feasibility of this proposed model. By applying the enhanced C4.5 algorithm an average detection rate of 93.28 percent and a false positive rate of 0.7 percent have respectively been obtained in this work.展开更多
As a distributed computing platform, Hadoop provides an effective way to handle big data. In Hadoop, the completion time of job will be delayed by a straggler. Although the definitive cause of the straggler is hard to...As a distributed computing platform, Hadoop provides an effective way to handle big data. In Hadoop, the completion time of job will be delayed by a straggler. Although the definitive cause of the straggler is hard to detect, speculative execution is usually used for dealing with this problem, by simply backing up those stragglers on alternative nodes. In this paper, we design a new Speculative Execution algorithm based on C4.5 Decision Tree, SECDT, for Hadoop. In SECDT, we speculate completion time of stragglers and also of backup tasks, based on a kind of decision tree method: C4.5 decision tree. After we speculate the completion time, we compare the completion time of stragglers and of the backup tasks, calculating their differential value, and selecting the straggler with the maximum differential value to start the backup task.Experiment result shows that the SECDT can predict execution time more accurately than other speculative execution methods, hence reduce the job completion time.展开更多
Many companies like credit card, insurance, bank, retail industry require direct marketing. Data mining can help those institutes to set marketing goal. Data mining techniques have good prospects in their target audie...Many companies like credit card, insurance, bank, retail industry require direct marketing. Data mining can help those institutes to set marketing goal. Data mining techniques have good prospects in their target audiences and improve the likelihood of response. In this work we have investigated two data mining techniques: the Naive Bayes and the C4.5 decision tree algorithms. The goal of this work is to predict whether a client will subscribe a term deposit. We also made comparative study of performance of those two algorithms. Publicly available UCI data is used to train and test the performance of the algorithms. Besides, we extract actionable knowledge from decision tree that focuses to take interesting and important decision in business area.展开更多
In this paper, relaxed iterative algorithms of Krasnoselskii-type and Halpern-type that approximate a solution of a system of a generalized mixed equilibrium problem anda common fixed point of a countable family of to...In this paper, relaxed iterative algorithms of Krasnoselskii-type and Halpern-type that approximate a solution of a system of a generalized mixed equilibrium problem anda common fixed point of a countable family of totally quasi-C-asymptotically nonexpansivemulti-valued maps are constructed. Strong convergence of the sequence generated by thesealgorithms is proved in uniformly smooth and strictly convex real Banach spaces with Kadec-Klee property. Furthermore, several applications of our theorems are also presented. Finally,our theorems are significant improvements on several important recent results for this classof nonlinear problems.展开更多
In the last decade, the MRI (Magnetic Resonance Imaging) image segmentation has become one of the most active research fields in the medical imaging domain. Because of the fuzzy nature of the MRI images, many research...In the last decade, the MRI (Magnetic Resonance Imaging) image segmentation has become one of the most active research fields in the medical imaging domain. Because of the fuzzy nature of the MRI images, many researchers have adopted the fuzzy clustering approach to segment them. In this work, a fast and robust multi-agent system (MAS) for MRI segmentation of the brain is proposed. This system gets its robustness from a robust c-means algorithm (RFCM) and obtains its fastness from the beneficial properties of agents, such as autonomy, social ability and reactivity. To show the efficiency of the proposed method, we test it on a normal brain brought from the BrainWeb Simulated Brain Database. The experimental results are valuable in both robustness to noise and running times standpoints.展开更多
The purpose of reoptimization using approximation methods—application of knowledge about the solution of the initial instance I, provided to achieve a better quality of approximation (approximation ratio) of an algor...The purpose of reoptimization using approximation methods—application of knowledge about the solution of the initial instance I, provided to achieve a better quality of approximation (approximation ratio) of an algorithm for determining optimal or close to it solutions of some “minor” changes of instance I. To solve the problem Ins-Max-EkCSP-P (reoptimization of Max-EkCSP-P with the addition of one constraint) with approximation resistant predicate P exists a polynomial threshold (optimal) -approximation algorithm, where the threshold “random” approximation ratio of P). When the unique games conjecture (UGC) is hold there exists a polynomial threshold (optimal) -approximation algorithm (where and the integrality gap of semidefinite relaxation of Max-EkCSP-P problem Z) to solve the problem Ins-Max-EkCSP-P.展开更多
This paper proposes a novel cargo loading algorithm applicable to automated conveyor-type loading systems.The algorithm offers improvements in computational efficiency and robustness by utilizing the concept of discre...This paper proposes a novel cargo loading algorithm applicable to automated conveyor-type loading systems.The algorithm offers improvements in computational efficiency and robustness by utilizing the concept of discrete derivatives and introducing logistics-related constraints.Optional consideration of the rotation of the cargoes was made to further enhance the optimality of the solutions,if possible to be physically implemented.Evaluation metrics were developed for accurate evaluation and enhancement of the algorithm’s ability to efficiently utilize the loading space and provide a high level of dynamic stability.Experimental results demonstrate the extensive robustness of the proposed algorithm to the diversity of cargoes present in Business-to-Consumer environments.This study contributes practical advancements in both cargo loading optimization and automation of the logistics industry,with potential applications in last-mile delivery services,warehousing,and supply chain management.展开更多
基金Supported by Science and Technology Plan Project of Guangdong Province (2009B010900026,2009CD058,2009CD078,2009CD079,2009CD080)Special Funds for Support Program of Development of Modern Information Service Industry of Guangdong Province(06120840B0370124)Funded Fund Project of South China Agricultural University (2007K017)~~
文摘Association rules and C4.5 rules can overcome the shortage of the traditional land evaluation methods and improve the intelligibility and efficiency of the land evaluation knowledge.In order to compare these two kinds of classification rules in the application,two fuzzy classifiers were established by combining with fuzzy decision algorithm especially based on Second General Soil Survey of Guangdong Province.The results of experiments demonstrated that the fuzzy classifier based on association rules obtain a higher accuracy rate,but with more complex calculation process and more computational overhead;the fuzzy classifier based on C4.5 rules obtain a slightly lower accuracy,but with fast computation and simpler calculation.
基金Supported by the State Foundations of Ph.D.Units(20020141013)Supported by the NSF of China(10001007)
文摘Some properties of a class of quasi-differentiable functions(the difference of two finite convex functions) are considered in this paper. And the convergence of the steepest descent algorithm for unconstrained and constrained quasi-differentiable programming is proved.
文摘A number of clustering algorithms were used to analyze many databases in the field of image clustering. The main objective of this research work was to perform a comparative analysis of the two of the existing partitions based clustering algorithms and a hybrid clustering algorithm. The results verification done by using classification algorithms via its accuracy. The perfor-mance of clustering and classification algorithms were carried out in this work based on the tumor identification, cluster quality and other parameters like run time and volume complexity. Some of the well known classification algorithms were used to find the accuracy of produced results of the clustering algorithms. The performance of the clustering algorithms proved mean-ingful in many domains, particularly k-Means, FCM. In addition, the proposed multifarious clustering technique has revealed their efficiency in terms of performance in predicting tumor affected regions in mammogram images. The color images are converted in to gray scale images and then it is processed. Finally, it is identified the best method for the analysis of finding tumor in breast images. This research would be immensely useful to physicians and radiologist to identify cancer affected area in the breast.
基金the China Agriculture Research System(No.CARS-49)Jiangsu College of Humanities and Social Sciences Outside Campus Research Base & Chinese Development of Strategic Research Base for Internet of Things
文摘For the question that fuzzy c-means(FCM)clustering algorithm has the disadvantages of being too sensitive to the initial cluster centers and easily trapped in local optima,this paper introduces a new metric norm in FCM and particle swarm optimization(PSO)clustering algorithm,and proposes a parallel optimization algorithm using an improved fuzzy c-means method combined with particle swarm optimization(AF-APSO).The experiment shows that the AF-APSO can avoid local optima,and get the best fitness and clustering performance significantly.
基金Support by Meteorological Open Research Foundation for the Huaihe River Basin(HRM201602)Foundation for Young Scholars of Jiangsu Meteorological Bureau(Q201708,KQ201802)+2 种基金Science and Technology Innovation Team Foundation for Marine Meteorological Forecast Technology of Lianyungang Meteorological BureauKey Technology R&D Program Project of Lianyungang City(SH1634)Special Project for Forecasters of Jiangsu Meteorological Bureau(JSYBY201811,JSYBY201812,JSYBY201810)
文摘Based on the data of daily precipitation in Lianyungang area from 1951 to 2012 and various climate signal data from the National Climate Center website and the NOAA website,a model for predicting whether the number of rainstorm days in summer in Lianyungang area is large was established by the classical C5. 0 decision tree algorithm. The data samples in 48 years( accounting for about 80% of total number of samples)was as the training set of a model,and the training accuracy rate of the model was 95. 83%. The data samples in the remaining 14 years( accounting for about 20% of total number of samples) were used as the test set of the model to test the model,and the test accuracy of the model was 85. 71%. The results showed that the prediction model of number of rainstorm days in summer constructed by C5. 0 algorithm had high accuracy and was easy to explain. Moreover,it is convenient for meteorological staff to use directly. At the same time,this study provides a new idea for short-term climate prediction of number of rainstorm days in summer.
文摘Intrusion detection systems provide additional defense capacity to a networked information system in addition to the security measures provided by the firewalls. This paper proposes an active rule based enhancement to the C4.5 algorithm for network intrusion detection in order to detect misuse behaviors of internal attackers through effective classification and decision making in computer networks. This enhanced C4.5 algorithm derives a set of classification rules from network audit data and then the generated rules are used to detect network intrusions in a real-time environment. Unlike most existing decision tree based approaches, the spawned rules generated and fired in this work are more effective because the information-theoretic approach minimizes the expected number of tests needed to classify an object and guarantees that a simple (but not necessarily the simplest) tree is found. The main advantage of this proposed algorithm is that the generalization ability of enhanced C4.5 decision trees is better than that of C4.5 decision trees. We have employed data from the third international knowledge discovery and data mining tools competition (KDDcup’99) to train and test the feasibility of this proposed model. By applying the enhanced C4.5 algorithm an average detection rate of 93.28 percent and a false positive rate of 0.7 percent have respectively been obtained in this work.
文摘As a distributed computing platform, Hadoop provides an effective way to handle big data. In Hadoop, the completion time of job will be delayed by a straggler. Although the definitive cause of the straggler is hard to detect, speculative execution is usually used for dealing with this problem, by simply backing up those stragglers on alternative nodes. In this paper, we design a new Speculative Execution algorithm based on C4.5 Decision Tree, SECDT, for Hadoop. In SECDT, we speculate completion time of stragglers and also of backup tasks, based on a kind of decision tree method: C4.5 decision tree. After we speculate the completion time, we compare the completion time of stragglers and of the backup tasks, calculating their differential value, and selecting the straggler with the maximum differential value to start the backup task.Experiment result shows that the SECDT can predict execution time more accurately than other speculative execution methods, hence reduce the job completion time.
文摘Many companies like credit card, insurance, bank, retail industry require direct marketing. Data mining can help those institutes to set marketing goal. Data mining techniques have good prospects in their target audiences and improve the likelihood of response. In this work we have investigated two data mining techniques: the Naive Bayes and the C4.5 decision tree algorithms. The goal of this work is to predict whether a client will subscribe a term deposit. We also made comparative study of performance of those two algorithms. Publicly available UCI data is used to train and test the performance of the algorithms. Besides, we extract actionable knowledge from decision tree that focuses to take interesting and important decision in business area.
文摘In this paper, relaxed iterative algorithms of Krasnoselskii-type and Halpern-type that approximate a solution of a system of a generalized mixed equilibrium problem anda common fixed point of a countable family of totally quasi-C-asymptotically nonexpansivemulti-valued maps are constructed. Strong convergence of the sequence generated by thesealgorithms is proved in uniformly smooth and strictly convex real Banach spaces with Kadec-Klee property. Furthermore, several applications of our theorems are also presented. Finally,our theorems are significant improvements on several important recent results for this classof nonlinear problems.
文摘In the last decade, the MRI (Magnetic Resonance Imaging) image segmentation has become one of the most active research fields in the medical imaging domain. Because of the fuzzy nature of the MRI images, many researchers have adopted the fuzzy clustering approach to segment them. In this work, a fast and robust multi-agent system (MAS) for MRI segmentation of the brain is proposed. This system gets its robustness from a robust c-means algorithm (RFCM) and obtains its fastness from the beneficial properties of agents, such as autonomy, social ability and reactivity. To show the efficiency of the proposed method, we test it on a normal brain brought from the BrainWeb Simulated Brain Database. The experimental results are valuable in both robustness to noise and running times standpoints.
文摘The purpose of reoptimization using approximation methods—application of knowledge about the solution of the initial instance I, provided to achieve a better quality of approximation (approximation ratio) of an algorithm for determining optimal or close to it solutions of some “minor” changes of instance I. To solve the problem Ins-Max-EkCSP-P (reoptimization of Max-EkCSP-P with the addition of one constraint) with approximation resistant predicate P exists a polynomial threshold (optimal) -approximation algorithm, where the threshold “random” approximation ratio of P). When the unique games conjecture (UGC) is hold there exists a polynomial threshold (optimal) -approximation algorithm (where and the integrality gap of semidefinite relaxation of Max-EkCSP-P problem Z) to solve the problem Ins-Max-EkCSP-P.
基金supported by the BK21 FOUR funded by the Ministry of Education of Korea and National Research Foundation of Korea,a Korea Agency for Infrastructure Technology Advancement(KAIA)grant funded by the Ministry of Land,Infrastructure,and Transport(Grant 1615013176)IITP(Institute of Information&Coummunications Technology Planning&Evaluation)-ICAN(ICT Challenge and Advanced Network of HRD)grant funded by the Korea government(Ministry of Science and ICT)(RS-2024-00438411).
文摘This paper proposes a novel cargo loading algorithm applicable to automated conveyor-type loading systems.The algorithm offers improvements in computational efficiency and robustness by utilizing the concept of discrete derivatives and introducing logistics-related constraints.Optional consideration of the rotation of the cargoes was made to further enhance the optimality of the solutions,if possible to be physically implemented.Evaluation metrics were developed for accurate evaluation and enhancement of the algorithm’s ability to efficiently utilize the loading space and provide a high level of dynamic stability.Experimental results demonstrate the extensive robustness of the proposed algorithm to the diversity of cargoes present in Business-to-Consumer environments.This study contributes practical advancements in both cargo loading optimization and automation of the logistics industry,with potential applications in last-mile delivery services,warehousing,and supply chain management.