Imbalanced data classification is an important research topic in real-world applications,like fault diagnosis in an aircraft manufacturing system.The over-sampling method is often used to solve this problem.It generat...Imbalanced data classification is an important research topic in real-world applications,like fault diagnosis in an aircraft manufacturing system.The over-sampling method is often used to solve this problem.It generates samples according to the distance between minority data.However,the traditional over-sampling method may change the original data distribution,which is harmful to the classification performance.In this paper,we propose a new method called Conditional SelfAttention Generative Adversarial Network with Differential Evolution(CSAGAN-DE)for imbalanced data classification.The new method aims at improving the classification performance of minority data by enhancing the quality of the generation of minority data.In CSAGAN-DE,the minority data are fed into the self-attention generative adversarial network to approximate the data distribution and create new data for the minority class.Then,the differential evolution algorithm is employed to automatically determine the number of generated minority data for achieving a satisfactory classification performance.Several experiments are conducted to evaluate the performance of the new CSAGAN-DE method.The results show that the new method can efficiently improve the classification performance compared with other related methods.展开更多
In the traditional pattern classification method,it usually assumes that the object to be classified must lie in one of given(known)classes of the training data set.However,the training data set may not contain the cl...In the traditional pattern classification method,it usually assumes that the object to be classified must lie in one of given(known)classes of the training data set.However,the training data set may not contain the class of some objects in practice,and this is considered as an Open-Set Recognition(OSR)problem.In this paper,we propose a new progressive open-set recognition method with adaptive probability threshold.Both the labeled training data and the test data(objects to be classified)are put into a common data set,and the k-Nearest Neighbors(k-NNs)of each object are sought in this common set.Then,we can determine the probability of object lying in the given classes.If the majority of k-NNs of the object are from labeled training data,this object quite likely belongs to one of the given classes,and the density of the object and its neighbors is taken into account here.However,when most of k-NNs are from the unlabeled test data set,the class of object is considered very uncertain because the class of test data is unknown,and this object cannot be classified in this step.Once the objects belonging to known classes with high probability are all found,we re-calculate the probability of the other uncertain objects belonging to known classes based on the labeled training data and the objects marked with the estimated probability.Such iteration will stop when the probabilities of all the objects belonging to known classes are not changed.Then,a modified Otsu’s method is employed to adaptively seek the probability threshold for the final classification.If the probability of object belonging to known classes is smaller than this threshold,it will be assigned to the ignorant(unknown)class that is not included in training data set.The other objects will be committed to a specific class.The effectiveness of the proposed method has been validated using some experiments.展开更多
The methods for combining multiple classifiers based on belief functions require to work with a common and complete(closed)Frame of Discernment(Fo D)on which the belief functions are defined before making their combin...The methods for combining multiple classifiers based on belief functions require to work with a common and complete(closed)Frame of Discernment(Fo D)on which the belief functions are defined before making their combination.This theoretical requirement is however difficult to satisfy in practice because some abnormal(or unknown)objects that do not belong to any predefined class of the Fo D can appear in real classification applications.The classifiers learnt using different attributes information can provide complementary knowledge which is very useful for making the classification but they are usually based on different Fo Ds.In order to clearly identify the specific class of the abnormal objects,we propose a new method for combination of classifiers working with incomplete frames of discernment,named CCIF for short.This is a progressive detection method that select and add the detected abnormal objects to the training data set.Because one pattern can be considered as an abnormal object by one classifier and be committed to a specific class by another one,a weighted evidence combination method is proposed to fuse the classification results of multiple classifiers.This new method offers the advantage to make a refined classification of abnormal objects,and to improve the classification accuracy thanks to the complementarity of the classifiers.Some experimental results are given to validate the effectiveness of the proposed method using real data sets.展开更多
基金partially supported by the Aeronautical Science Foundation of China(No.201920007001)National Natural Science Foundation of China(Nos.U20B2067,61790552 and 61790554)。
文摘Imbalanced data classification is an important research topic in real-world applications,like fault diagnosis in an aircraft manufacturing system.The over-sampling method is often used to solve this problem.It generates samples according to the distance between minority data.However,the traditional over-sampling method may change the original data distribution,which is harmful to the classification performance.In this paper,we propose a new method called Conditional SelfAttention Generative Adversarial Network with Differential Evolution(CSAGAN-DE)for imbalanced data classification.The new method aims at improving the classification performance of minority data by enhancing the quality of the generation of minority data.In CSAGAN-DE,the minority data are fed into the self-attention generative adversarial network to approximate the data distribution and create new data for the minority class.Then,the differential evolution algorithm is employed to automatically determine the number of generated minority data for achieving a satisfactory classification performance.Several experiments are conducted to evaluate the performance of the new CSAGAN-DE method.The results show that the new method can efficiently improve the classification performance compared with other related methods.
基金supported by the National Natural Science Foundation of China(Nos.U20B2067).
文摘In the traditional pattern classification method,it usually assumes that the object to be classified must lie in one of given(known)classes of the training data set.However,the training data set may not contain the class of some objects in practice,and this is considered as an Open-Set Recognition(OSR)problem.In this paper,we propose a new progressive open-set recognition method with adaptive probability threshold.Both the labeled training data and the test data(objects to be classified)are put into a common data set,and the k-Nearest Neighbors(k-NNs)of each object are sought in this common set.Then,we can determine the probability of object lying in the given classes.If the majority of k-NNs of the object are from labeled training data,this object quite likely belongs to one of the given classes,and the density of the object and its neighbors is taken into account here.However,when most of k-NNs are from the unlabeled test data set,the class of object is considered very uncertain because the class of test data is unknown,and this object cannot be classified in this step.Once the objects belonging to known classes with high probability are all found,we re-calculate the probability of the other uncertain objects belonging to known classes based on the labeled training data and the objects marked with the estimated probability.Such iteration will stop when the probabilities of all the objects belonging to known classes are not changed.Then,a modified Otsu’s method is employed to adaptively seek the probability threshold for the final classification.If the probability of object belonging to known classes is smaller than this threshold,it will be assigned to the ignorant(unknown)class that is not included in training data set.The other objects will be committed to a specific class.The effectiveness of the proposed method has been validated using some experiments.
基金partially supported by National Natural Science Foundation of China(Nos.U20B2067,61790552,61790554)Shaanxi Science Fund for Distinguished Young Scholars,China(No.2018JC-006)。
文摘The methods for combining multiple classifiers based on belief functions require to work with a common and complete(closed)Frame of Discernment(Fo D)on which the belief functions are defined before making their combination.This theoretical requirement is however difficult to satisfy in practice because some abnormal(or unknown)objects that do not belong to any predefined class of the Fo D can appear in real classification applications.The classifiers learnt using different attributes information can provide complementary knowledge which is very useful for making the classification but they are usually based on different Fo Ds.In order to clearly identify the specific class of the abnormal objects,we propose a new method for combination of classifiers working with incomplete frames of discernment,named CCIF for short.This is a progressive detection method that select and add the detected abnormal objects to the training data set.Because one pattern can be considered as an abnormal object by one classifier and be committed to a specific class by another one,a weighted evidence combination method is proposed to fuse the classification results of multiple classifiers.This new method offers the advantage to make a refined classification of abnormal objects,and to improve the classification accuracy thanks to the complementarity of the classifiers.Some experimental results are given to validate the effectiveness of the proposed method using real data sets.