In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables and small number of samples as well as its non-linearity. It is difficult to get satisfying result...In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables and small number of samples as well as its non-linearity. It is difficult to get satisfying results by using conventional linear sta- tistical methods. Recursive feature elimination based on support vector machine (SVM RFE) is an effective algorithm for gene selection and cancer classification, which are integrated into a consistent framework. In this paper, we propose a new method to select parameters of the aforementioned algorithm implemented with Gaussian kernel SVMs as better alternatives to the common practice of selecting the apparently best parameters by using a genetic algorithm to search for a couple of optimal parameter. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two repre- sentative hereditary breast cancer and acute leukaemia datasets. The experimental results indicate that the proposed method per- forms well in selecting genes and achieves high classification accuracies with these genes.展开更多
This paper presents a nonlinear model predictive control(NMPC) approach based on support vector machine(SVM) and genetic algorithm(GA) for multiple-input multiple-output(MIMO) nonlinear systems.Individual SVM is used ...This paper presents a nonlinear model predictive control(NMPC) approach based on support vector machine(SVM) and genetic algorithm(GA) for multiple-input multiple-output(MIMO) nonlinear systems.Individual SVM is used to approximate each output of the controlled plant Then the model is used in MPC control scheme to predict the outputs of the controlled plant.The optimal control sequence is calculated using GA with elite preserve strategy.Simulation results of a typical MIMO nonlinear system show that this method has a good ability of set points tracking and disturbance rejection.展开更多
In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(S...In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(SVM). Automatic target recognition process on the nonlinear and non-stationary of Doppler signals of military target by using automatic target recognition model can be expressed as follows. Firstly, the nonlinearity and non-stationary of Doppler signals were decomposed into a set of intrinsic mode functions(IMFs) using EMD. After the Hilbert transform of IMF, the energy ratio of each IMF to the total IMFs can be extracted as the features of military target. Then, the SVM was trained through using the energy ratio to classify the military targets, and genetic algorithm(GA) was used to optimize SVM parameters in the solution space. The experimental results show that this algorithm can achieve the recognition accuracies of 86.15%, 87.93%, and 82.28% for tank, vehicle and soldier, respectively.展开更多
Corporate net value is efficiently described on its stock price, offering investors a chance to include a potentially surplus value to the net worth of the overall investment portfolio. Financial analysis of corporati...Corporate net value is efficiently described on its stock price, offering investors a chance to include a potentially surplus value to the net worth of the overall investment portfolio. Financial analysis of corporations extracted from the accounting statements is constantly demanded to support decisions making of portfolio managers. Econometrics and Artificial Intelligence methods aim to extract hidden information from complex accounting and financial data. Support Vector Machines hybrids optimized in their components by Genetic Algorithms provide effective results in corporate financial analysis.展开更多
Parkinson’s disease (PD) is the most common disease of motor system degeneration that occurs when the dopamine-producing cells are damaged in substantia nigra. To detect PD, various signals have been investigated, in...Parkinson’s disease (PD) is the most common disease of motor system degeneration that occurs when the dopamine-producing cells are damaged in substantia nigra. To detect PD, various signals have been investigated, including EEG, gait and speech. Since approximately 90 percent of the people with PD suffer from speech disorders, speech analysis is considered as the most common technique for this aim. This paper proposes a new algorithm for diagnosing of Parkinson’s disease based on voice analysis. In the first step, genetic algorithm (GA) is undertaken for selecting optimized features from all extracted features. Afterwards a network based on support vector machine (SVM) is used for classification between healthy and people with Parkinson. The dataset of this research is composed of a range of biomedical voice signals from 31 people, 23 with Parkinson’s disease and 8 healthy people. The subjects were asked to pronounce letter “A” for 3 seconds. 22 linear and non-linear features were extracted from the signals that 14 features were based on F0 (fundamental frequency or pitch), jitter, shimmer and noise to harmonics ratio, which are main factors in voice signal. Because changing in these factors is noticeable for the people with PD, optimized features were selected among them. Of the various numbers of optimized features, the data classification was investigated. Results show that the classification accuracy percent of 94.50 per 4 optimized features, the accuracy percent of 93.66 per 7 optimized features and the accuracy percent of 94.22 per 9 optimized features, could be achieved. It can be observed that the best classification accuracy may be achieved using Fhi (Hz), Fho (Hz), jitter (RAP) and shimmer (APQ5).展开更多
Support vector machines (SVMs) have been introduced as effective methods for solving classification problems. However, due to some limitations in practical applications, their generalization performance is sometimes...Support vector machines (SVMs) have been introduced as effective methods for solving classification problems. However, due to some limitations in practical applications, their generalization performance is sometimes far from the expected level. Therefore, it is meaningful to study SVM ensemble learning. In this paper, a novel genetic algorithm based ensemble learning method, namely Direct Genetic Ensemble (DGE), is proposed. DGE adopts the predictive accuracy of ensemble as the fitness function and searches a good ensemble from the ensemble space. In essence, DGE is also a selective ensemble learning method because the base classifiers of the ensemble are selected according to the solution of genetic algorithm. In comparison with other ensemble learning methods, DGE works on a higher level and is more direct. Different strategies of constructing diverse base classifiers can be utilized in DGE. Experimental results show that SVM ensembles constructed by DGE can achieve better performance than single SVMs, hagged and boosted SVM ensembles. In addition, some valuable conclusions are obtained.展开更多
The rough set-genetic support vector machine(SVM) model is applied to supply chain performance evaluation. First, the rough set theory is used to remove the redundant factors that affect the performance evaluation of ...The rough set-genetic support vector machine(SVM) model is applied to supply chain performance evaluation. First, the rough set theory is used to remove the redundant factors that affect the performance evaluation of supply chain to obtain the core influencing factors. Then the support vector machine is used to extract the core influencing factors to predict the level of supply chain performance. In the process of SVM classification, the genetic algorithm is used to optimize the parameters of the SVM algorithm to obtain the best parameter model, and then the supply chain performance evaluation level is predicted. Finally, an example is used to predict this model, and compared with the result of using only rough set-support vector machine to predict. The results show that the method of rough set-genetic support vector machine can predict the level of supply chain performance more accurately and the prediction result is more realistic, which is a scientific and feasible method.展开更多
A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to ...A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to generate fuzzy memberships.In the algorithm,sample weights based on a distribution density function of data point and genetic algorithm (GA) are introduced to enhance the performance of FC.Then a multi-class FSVM with radial basis function kernel is established according to directed acyclic graph algorithm,the penalty factor and kernel parameter of which are optimized by GA.Finally,the model is executed for multi-class fault diagnosis of rolling element bearings.The results show that the presented model achieves high performances both in identifying fault types and fault degrees.The performance comparisons of the presented model with SVM and distance-based FSVM for noisy case demonstrate the capacity of dealing with noise and generalization.展开更多
The behavior of schools of zebrafish (Danio rerio) was studied in acute toxicity environments. Behavioral features were extracted and a method for water quality assessment using support vector machine (SVM) was de...The behavior of schools of zebrafish (Danio rerio) was studied in acute toxicity environments. Behavioral features were extracted and a method for water quality assessment using support vector machine (SVM) was de- veloped. The behavioral parameters of fish were recorded and analyzed during one hour in an environment of a 24-h half-lethal concentration (LC50) of a pollutant. The data were used to develop a method to evaluate water quality, so as 6+ 2+ to give an early indication of toxicity. Four kinds of metal ions (Cu2~, Hg2~, Cr , and Cd ) were used for toxicity testing. To enhance the efficiency and accuracy of assessment, a method combining SVM and a genetic algorithm (GA) was used. The results showed that the average prediction accuracy of the method was over 80% and the time cost was acceptable. The method gave satisfactory results for a variety of metal pollutants, demonstrating that this is an effective approach to the classification of water quality.展开更多
Support vector machine (SVM) is a popular pattern classification method with many application areas. SVM shows its outstanding performance in high-dimensional data classification. In the process of classification, SVM...Support vector machine (SVM) is a popular pattern classification method with many application areas. SVM shows its outstanding performance in high-dimensional data classification. In the process of classification, SVM kernel parameter setting during the SVM training procedure, along with the feature selection significantly influences the classification accuracy. This paper proposes two novel intelligent optimization methods, which simultaneously determines the parameter values while discovering a subset of features to increase SVM classification accuracy. The study focuses on two evolutionary computing approaches to optimize the parameters of SVM: particle swarm optimization (PSO) and genetic algorithm (GA). And we combine above the two intelligent optimization methods with SVM to choose appropriate subset features and SVM parameters, which are termed GA-FSSVM (Genetic Algorithm-Feature Selection Support Vector Machines) and PSO-FSSVM(Particle Swarm Optimization-Feature Selection Support Vector Machines) models. Experimental results demonstrate that the classification accuracy by our proposed methods outperforms traditional grid search approach and many other approaches. Moreover, the result indicates that PSO-FSSVM can obtain higher classification accuracy than GA-FSSVM classification for hyperspectral data.展开更多
On-line chatter detection can avoid unstable cutting through monitoring the machining process.In order to identify chatter in a timely manner,an improved Support Vector Machine(SVM)is developed in this paper,based on ...On-line chatter detection can avoid unstable cutting through monitoring the machining process.In order to identify chatter in a timely manner,an improved Support Vector Machine(SVM)is developed in this paper,based on extracted features.In the SVM model,the penalty factor(e)and the core parameter(g)have important influence on the classification,more than from Kernel Functions(KFs).Hence,first the classification results are conducted using different KFs.Then two methods are presented for exploring the best parameters.The chatter identification results show that the Genetic Algorithm(GA)approach is more suitable for deciding the parameters than the Grid Explore(GE)approach.展开更多
Support vector machine (SVM) technique has recently become a research focus in intrusion detection field for its better generalization performance when given less priori knowledge than other soft-computing techniques....Support vector machine (SVM) technique has recently become a research focus in intrusion detection field for its better generalization performance when given less priori knowledge than other soft-computing techniques. But the randomicity of parameter selection in its implement often prevents it achieving expected performance. By utilizing genetic algorithm (GA) to optimize the parameters in data preprocessing and the training model of SVM simultaneously, a hybrid optimization algorithm is proposed in the paper to address this problem. The experimental results demonstrate that it’s an effective method and can improve the performance of SVM-based intrusion detection system further.展开更多
We propose a systematic ECG quality classification method based on a kernel support vector machine(KSVM) and genetic algorithm(GA) to determine whether ECGs collected via mobile phone are acceptable or not. This metho...We propose a systematic ECG quality classification method based on a kernel support vector machine(KSVM) and genetic algorithm(GA) to determine whether ECGs collected via mobile phone are acceptable or not. This method includes mainly three modules, i.e., lead-fall detection, feature extraction, and intelligent classification. First, lead-fall detection is executed to make the initial classification. Then the power spectrum, baseline drifts, amplitude difference, and other time-domain features for ECGs are analyzed and quantified to form the feature matrix. Finally, the feature matrix is assessed using KSVM and GA to determine the ECG quality classification results. A Gaussian radial basis function(GRBF) is employed as the kernel function of KSVM and its performance is compared with that of the Mexican hat wavelet function(MHWF). GA is used to determine the optimal parameters of the KSVM classifier and its performance is compared with that of the grid search(GS) method. The performance of the proposed method was tested on a database from PhysioNet/Computing in Cardiology Challenge 2011, which includes 1500 12-lead ECG recordings. True positive(TP), false positive(FP), and classification accuracy were used as the assessment indices. For training database set A(1000 recordings), the optimal results were obtained using the combination of lead-fall, GA, and GRBF methods, and the corresponding results were: TP 92.89%, FP 5.68%, and classification accuracy 94.00%. For test database set B(500 recordings), the optimal results were also obtained using the combination of lead-fall, GA, and GRBF methods, and the classification accuracy was 91.80%.展开更多
A new method was proposed to extract sensitive features and to construct a monitoring model for wheat scab based on in situ hyperspectral data of wheat ears to achieve effective prevention and control and provide theo...A new method was proposed to extract sensitive features and to construct a monitoring model for wheat scab based on in situ hyperspectral data of wheat ears to achieve effective prevention and control and provide theoretical support for its large-scale monitoring.Eight sensitive features were selected through correlation analysis and wavelet transform.These features were as follows:three original bands of 350-400 nm,500-600 nm,and 720-1000 nm;three vegetation indices of modified simple ratio(MSR),normalized difference vegetation index,and structural independent pigment index;and two wavelet features of WF01 and WF02.By combining the selected sensitive features with support vector machine(SVM)and SVM optimized by genetic algorithm(GASVM),a total of 16 monitoring models were built,and the monitoring accuracies of the two types of models were compared.The ability of the monitoring models built by GASVM to identify scab was better than that of SVM algorithm under the same characteristic variables.Among the 16 models,MSR combined with GASVM had an overall accuracy of 75%and a Kappa coefficient of 0.47.GASVM can be used to monitor wheat scab and its application can improve the accuracy of disease monitoring.展开更多
Many different chicken breeds are found around the world,their features vary among them,and they are valuable resources.Currently,there is a huge lack of knowledge of the genetic determinants responsible for phenotypi...Many different chicken breeds are found around the world,their features vary among them,and they are valuable resources.Currently,there is a huge lack of knowledge of the genetic determinants responsible for phenotypic and biochemical properties of these breeds of chickens.Understanding the underlying genetic mechanisms that explain across-breed variation can help breeders develop improved chicken breeds.The whole-genomes of 140 chickens from 7 Shandong native breeds and 20 introduced recessive white chickens from China were re-sequenced.Comparative population genomics based on autosomal single nucleotide polymorphisms(SNPs)revealed geographically based clusters among the chickens.Through genome-wide scans for selective sweeps,we identified thyroid stimulating hormone receptor(TSHR,reproductive traits,circadian rhythm),erythrocyte membrane protein band 4.1 like 1(EPB41L1,body size),and alkylglycerol monooxygenase(AGMO,aggressive behavior),as major candidate breed-specific determining genes in chickens.In addition,we used a machine learning classification model to predict chicken breeds based on the SNPs significantly associated with recourse characteristics,and the prediction accuracy was 92%,which can effectively achieve the breed identification of Laiwu Black chickens.We provide the first comprehensive genomic data of the Shandong indigenous chickens.Our analyses revealed phylogeographic patterns among the Shandong indigenous chickens and candidate genes that potentially contribute to breed-specific traits of the chickens.In addition,we developed a machine learning-based prediction model using SNP data to identify chicken breeds.The genetic basis of indigenous chicken breeds revealed in this study is useful to better understand the mechanisms underlying the resource characteristics of chicken.展开更多
A hybrid feature selection and classification strategy was proposed based on the simulated annealing genetic algonthrn and multiple instance learning (MIL). The band selection method was proposed from subspace decom...A hybrid feature selection and classification strategy was proposed based on the simulated annealing genetic algonthrn and multiple instance learning (MIL). The band selection method was proposed from subspace decomposition, which combines the simulated annealing algorithm with the genetic algorithm in choosing different cross-over and mutation probabilities, as well as mutation individuals. Then MIL was combined with image segmentation, clustering and support vector machine algorithms to classify hyperspectral image. The experimental results show that this proposed method can get high classification accuracy of 93.13% at small training samples and the weaknesses of the conventional methods are overcome.展开更多
Detection and classification of the blurred and the non-blurred regions in images is a challenging task due to the limited available information about blur type,scenarios and level of blurriness.In this paper,we propo...Detection and classification of the blurred and the non-blurred regions in images is a challenging task due to the limited available information about blur type,scenarios and level of blurriness.In this paper,we propose an effective method for blur detection and segmentation based on transfer learning concept.The proposed method consists of two separate steps.In the first step,genetic programming(GP)model is developed that quantify the amount of blur for each pixel in the image.The GP model method uses the multiresolution features of the image and it provides an improved blur map.In the second phase,the blur map is segmented into blurred and non-blurred regions by using an adaptive threshold.A model based on support vector machine(SVM)is developed to compute adaptive threshold for the input blur map.The performance of the proposed method is evaluated using two different datasets and compared with various state-of-the-art methods.The comparative analysis reveals that the proposed method performs better against the state-of-the-art techniques.展开更多
Traditional geostatistical estimation techniques have been used predominantly by the mining industry for ore reserve estimation. Determination of mineral reserve has posed considerable challenge to mining engineers du...Traditional geostatistical estimation techniques have been used predominantly by the mining industry for ore reserve estimation. Determination of mineral reserve has posed considerable challenge to mining engineers due to the geological complexities of ore body formation. Extensive research over the years has resulted in the development of several state-of-the-art methods for predictive spatial mapping, which could be used for ore reserve estimation;and recent advances in the use of machine learning algorithms (MLA) have provided a new approach for solving the prob-lem of ore reserve estimation. The focus of the present study was on the use of two MLA for estimating ore reserve: namely, neural networks (NN) and support vector machines (SVM). Application of MLA and the various issues involved with using them for reserve estimation have been elaborated with the help of a complex drill-hole dataset that exhibits the typical properties of sparseness and impreciseness that might be associated with a mining dataset. To investigate the accuracy and applicability of MLA for ore reserve estimation, the generalization ability of NN and SVM was compared with the geostatistical ordinary kriging (OK) method.展开更多
Mining aeroengine operational data and developing fault diagnosis models for aeroengines are to avoid running aeroengines under undesired conditions.Because of the complexity of working environment and faults of aeroe...Mining aeroengine operational data and developing fault diagnosis models for aeroengines are to avoid running aeroengines under undesired conditions.Because of the complexity of working environment and faults of aeroengines,it is unavoidable that the monitored parameters vary widely and possess larger noise levels.This paper reports the extrapolation of a diagnosis model for 20 gas path faults of a double-spool turbofan civil aeroengine.By applying support vector machine(SVM)algorithm together with genetic algorithm(GA),the fault diagnosis model is obtained from the training set that was based on the deviations of the monitored parameters superimposed with the noise level of 10%.The SVM model(C=24.7034;γ=179.835)was extrapolated for the samples whose noise levels were larger than 10%.The accuracies of extrapolation for samples with the noise levels of 20%and 30%are 97%and 94%,respectively.Compared with the models reported on the same faults,the extrapolation results of the GASVM model are accurate.展开更多
Breast cancer diagnosis through mammography is a pivotal application within medical image-based diagnostics,integral for early detection and effective treatment.While deep learning has significantly advanced the analy...Breast cancer diagnosis through mammography is a pivotal application within medical image-based diagnostics,integral for early detection and effective treatment.While deep learning has significantly advanced the analysis of mammographic images,challenges such as low contrast,image noise,and the high dimensionality of features often degrade model performance.Addressing these challenges,our study introduces a novel method integrating Genetic Algorithms(GA)with pre-trained Convolutional Neural Network(CNN)models to enhance feature selection and classification accuracy.Our approach involves a systematic process:first,we employ widely-used CNN architectures(VGG16,VGG19,MobileNet,and DenseNet)to extract a broad range of features from the Medical Image Analysis Society(MIAS)mammography dataset.Subsequently,a GA optimizes these features by selecting the most relevant and least redundant,aiming to overcome the typical pitfalls of high dimensionality.The selected features are then utilized to train several classifiers,including Linear and Polynomial Support Vector Machines(SVMs),K-Nearest Neighbors,Decision Trees,and Random Forests,enabling a robust evaluation of the method’s effectiveness across varied learning algorithms.Our extensive experimental evaluation demonstrates that the integration of MobileNet and GA significantly improves classification accuracy,from 83.33%to 89.58%,underscoring the method’s efficacy.By detailing these steps,we highlight the innovation of our approach which not only addresses key issues in breast cancer imaging analysis but also offers a scalable solution potentially applicable to other domains within medical imaging.展开更多
基金Project supported by the National Basic Research Program (973) of China (No. 2002CB312200) and the Center for Bioinformatics Pro-gram Grant of Harvard Center of Neurodegeneration and Repair,Harvard Medical School, Harvard University, Boston, USA
文摘In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables and small number of samples as well as its non-linearity. It is difficult to get satisfying results by using conventional linear sta- tistical methods. Recursive feature elimination based on support vector machine (SVM RFE) is an effective algorithm for gene selection and cancer classification, which are integrated into a consistent framework. In this paper, we propose a new method to select parameters of the aforementioned algorithm implemented with Gaussian kernel SVMs as better alternatives to the common practice of selecting the apparently best parameters by using a genetic algorithm to search for a couple of optimal parameter. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two repre- sentative hereditary breast cancer and acute leukaemia datasets. The experimental results indicate that the proposed method per- forms well in selecting genes and achieves high classification accuracies with these genes.
基金Supported by the National Natural Science Foundation of China(21076179)the National Basic Research Program of China(2012CB720500)
文摘This paper presents a nonlinear model predictive control(NMPC) approach based on support vector machine(SVM) and genetic algorithm(GA) for multiple-input multiple-output(MIMO) nonlinear systems.Individual SVM is used to approximate each output of the controlled plant Then the model is used in MPC control scheme to predict the outputs of the controlled plant.The optimal control sequence is calculated using GA with elite preserve strategy.Simulation results of a typical MIMO nonlinear system show that this method has a good ability of set points tracking and disturbance rejection.
基金Projects(61471370,61401479)supported by the National Natural Science Foundation of China
文摘In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(SVM). Automatic target recognition process on the nonlinear and non-stationary of Doppler signals of military target by using automatic target recognition model can be expressed as follows. Firstly, the nonlinearity and non-stationary of Doppler signals were decomposed into a set of intrinsic mode functions(IMFs) using EMD. After the Hilbert transform of IMF, the energy ratio of each IMF to the total IMFs can be extracted as the features of military target. Then, the SVM was trained through using the energy ratio to classify the military targets, and genetic algorithm(GA) was used to optimize SVM parameters in the solution space. The experimental results show that this algorithm can achieve the recognition accuracies of 86.15%, 87.93%, and 82.28% for tank, vehicle and soldier, respectively.
文摘Corporate net value is efficiently described on its stock price, offering investors a chance to include a potentially surplus value to the net worth of the overall investment portfolio. Financial analysis of corporations extracted from the accounting statements is constantly demanded to support decisions making of portfolio managers. Econometrics and Artificial Intelligence methods aim to extract hidden information from complex accounting and financial data. Support Vector Machines hybrids optimized in their components by Genetic Algorithms provide effective results in corporate financial analysis.
文摘Parkinson’s disease (PD) is the most common disease of motor system degeneration that occurs when the dopamine-producing cells are damaged in substantia nigra. To detect PD, various signals have been investigated, including EEG, gait and speech. Since approximately 90 percent of the people with PD suffer from speech disorders, speech analysis is considered as the most common technique for this aim. This paper proposes a new algorithm for diagnosing of Parkinson’s disease based on voice analysis. In the first step, genetic algorithm (GA) is undertaken for selecting optimized features from all extracted features. Afterwards a network based on support vector machine (SVM) is used for classification between healthy and people with Parkinson. The dataset of this research is composed of a range of biomedical voice signals from 31 people, 23 with Parkinson’s disease and 8 healthy people. The subjects were asked to pronounce letter “A” for 3 seconds. 22 linear and non-linear features were extracted from the signals that 14 features were based on F0 (fundamental frequency or pitch), jitter, shimmer and noise to harmonics ratio, which are main factors in voice signal. Because changing in these factors is noticeable for the people with PD, optimized features were selected among them. Of the various numbers of optimized features, the data classification was investigated. Results show that the classification accuracy percent of 94.50 per 4 optimized features, the accuracy percent of 93.66 per 7 optimized features and the accuracy percent of 94.22 per 9 optimized features, could be achieved. It can be observed that the best classification accuracy may be achieved using Fhi (Hz), Fho (Hz), jitter (RAP) and shimmer (APQ5).
基金This work was supported by National Basic Research Programof China under Grant2002cb312200 01 3National Nature ScienceFoundation of China under Grant60174038.
文摘Support vector machines (SVMs) have been introduced as effective methods for solving classification problems. However, due to some limitations in practical applications, their generalization performance is sometimes far from the expected level. Therefore, it is meaningful to study SVM ensemble learning. In this paper, a novel genetic algorithm based ensemble learning method, namely Direct Genetic Ensemble (DGE), is proposed. DGE adopts the predictive accuracy of ensemble as the fitness function and searches a good ensemble from the ensemble space. In essence, DGE is also a selective ensemble learning method because the base classifiers of the ensemble are selected according to the solution of genetic algorithm. In comparison with other ensemble learning methods, DGE works on a higher level and is more direct. Different strategies of constructing diverse base classifiers can be utilized in DGE. Experimental results show that SVM ensembles constructed by DGE can achieve better performance than single SVMs, hagged and boosted SVM ensembles. In addition, some valuable conclusions are obtained.
文摘The rough set-genetic support vector machine(SVM) model is applied to supply chain performance evaluation. First, the rough set theory is used to remove the redundant factors that affect the performance evaluation of supply chain to obtain the core influencing factors. Then the support vector machine is used to extract the core influencing factors to predict the level of supply chain performance. In the process of SVM classification, the genetic algorithm is used to optimize the parameters of the SVM algorithm to obtain the best parameter model, and then the supply chain performance evaluation level is predicted. Finally, an example is used to predict this model, and compared with the result of using only rough set-support vector machine to predict. The results show that the method of rough set-genetic support vector machine can predict the level of supply chain performance more accurately and the prediction result is more realistic, which is a scientific and feasible method.
基金Supported by the joint fund of National Natural Science Foundation of China and Civil Aviation Administration Foundation of China(No.U1233201)
文摘A fault diagnosis model is proposed based on fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC).Considering the relationship between the sample point and non-self class,FC algorithm is applied to generate fuzzy memberships.In the algorithm,sample weights based on a distribution density function of data point and genetic algorithm (GA) are introduced to enhance the performance of FC.Then a multi-class FSVM with radial basis function kernel is established according to directed acyclic graph algorithm,the penalty factor and kernel parameter of which are optimized by GA.Finally,the model is executed for multi-class fault diagnosis of rolling element bearings.The results show that the presented model achieves high performances both in identifying fault types and fault degrees.The performance comparisons of the presented model with SVM and distance-based FSVM for noisy case demonstrate the capacity of dealing with noise and generalization.
基金Project supported by the Natural Science Foundation of Ningbo City (No.2010A610005)the Key Science and Technology Program of Zhejiang Province (No.2011C11049),China
文摘The behavior of schools of zebrafish (Danio rerio) was studied in acute toxicity environments. Behavioral features were extracted and a method for water quality assessment using support vector machine (SVM) was de- veloped. The behavioral parameters of fish were recorded and analyzed during one hour in an environment of a 24-h half-lethal concentration (LC50) of a pollutant. The data were used to develop a method to evaluate water quality, so as 6+ 2+ to give an early indication of toxicity. Four kinds of metal ions (Cu2~, Hg2~, Cr , and Cd ) were used for toxicity testing. To enhance the efficiency and accuracy of assessment, a method combining SVM and a genetic algorithm (GA) was used. The results showed that the average prediction accuracy of the method was over 80% and the time cost was acceptable. The method gave satisfactory results for a variety of metal pollutants, demonstrating that this is an effective approach to the classification of water quality.
文摘Support vector machine (SVM) is a popular pattern classification method with many application areas. SVM shows its outstanding performance in high-dimensional data classification. In the process of classification, SVM kernel parameter setting during the SVM training procedure, along with the feature selection significantly influences the classification accuracy. This paper proposes two novel intelligent optimization methods, which simultaneously determines the parameter values while discovering a subset of features to increase SVM classification accuracy. The study focuses on two evolutionary computing approaches to optimize the parameters of SVM: particle swarm optimization (PSO) and genetic algorithm (GA). And we combine above the two intelligent optimization methods with SVM to choose appropriate subset features and SVM parameters, which are termed GA-FSSVM (Genetic Algorithm-Feature Selection Support Vector Machines) and PSO-FSSVM(Particle Swarm Optimization-Feature Selection Support Vector Machines) models. Experimental results demonstrate that the classification accuracy by our proposed methods outperforms traditional grid search approach and many other approaches. Moreover, the result indicates that PSO-FSSVM can obtain higher classification accuracy than GA-FSSVM classification for hyperspectral data.
文摘On-line chatter detection can avoid unstable cutting through monitoring the machining process.In order to identify chatter in a timely manner,an improved Support Vector Machine(SVM)is developed in this paper,based on extracted features.In the SVM model,the penalty factor(e)and the core parameter(g)have important influence on the classification,more than from Kernel Functions(KFs).Hence,first the classification results are conducted using different KFs.Then two methods are presented for exploring the best parameters.The chatter identification results show that the Genetic Algorithm(GA)approach is more suitable for deciding the parameters than the Grid Explore(GE)approach.
基金This work was supported by the Research Grant of SEC E-Institute :Shanghai High Institution Grid and the Science Foundation ofShanghai Municipal Commission of Science and Technology No.00JC14052
文摘Support vector machine (SVM) technique has recently become a research focus in intrusion detection field for its better generalization performance when given less priori knowledge than other soft-computing techniques. But the randomicity of parameter selection in its implement often prevents it achieving expected performance. By utilizing genetic algorithm (GA) to optimize the parameters in data preprocessing and the training model of SVM simultaneously, a hybrid optimization algorithm is proposed in the paper to address this problem. The experimental results demonstrate that it’s an effective method and can improve the performance of SVM-based intrusion detection system further.
基金Project supported by the National Natural Science Foundation of China(Nos.51075243 and 61201049)the Excellent Young Scientist Awarded Foundation of Shandong Province,China(No.BS2013DX029)the China Postdoctoral Science Foundation(No.2013M530323)
文摘We propose a systematic ECG quality classification method based on a kernel support vector machine(KSVM) and genetic algorithm(GA) to determine whether ECGs collected via mobile phone are acceptable or not. This method includes mainly three modules, i.e., lead-fall detection, feature extraction, and intelligent classification. First, lead-fall detection is executed to make the initial classification. Then the power spectrum, baseline drifts, amplitude difference, and other time-domain features for ECGs are analyzed and quantified to form the feature matrix. Finally, the feature matrix is assessed using KSVM and GA to determine the ECG quality classification results. A Gaussian radial basis function(GRBF) is employed as the kernel function of KSVM and its performance is compared with that of the Mexican hat wavelet function(MHWF). GA is used to determine the optimal parameters of the KSVM classifier and its performance is compared with that of the grid search(GS) method. The performance of the proposed method was tested on a database from PhysioNet/Computing in Cardiology Challenge 2011, which includes 1500 12-lead ECG recordings. True positive(TP), false positive(FP), and classification accuracy were used as the assessment indices. For training database set A(1000 recordings), the optimal results were obtained using the combination of lead-fall, GA, and GRBF methods, and the corresponding results were: TP 92.89%, FP 5.68%, and classification accuracy 94.00%. For test database set B(500 recordings), the optimal results were also obtained using the combination of lead-fall, GA, and GRBF methods, and the classification accuracy was 91.80%.
基金This work was supported by National Natural Science Foundation of China(41571354,41871339)Natural Science Research Project of Anhui Provincial Education Department(KJ2019A0030)+2 种基金Anhui Provincial Science and Technology Project(16030701091,201904f06020038)supported by Hainan Provincial Key R&D Program of China(ZDYF2018073)National special support program for high-level personnel recruitment(Wenjiang Huang).
文摘A new method was proposed to extract sensitive features and to construct a monitoring model for wheat scab based on in situ hyperspectral data of wheat ears to achieve effective prevention and control and provide theoretical support for its large-scale monitoring.Eight sensitive features were selected through correlation analysis and wavelet transform.These features were as follows:three original bands of 350-400 nm,500-600 nm,and 720-1000 nm;three vegetation indices of modified simple ratio(MSR),normalized difference vegetation index,and structural independent pigment index;and two wavelet features of WF01 and WF02.By combining the selected sensitive features with support vector machine(SVM)and SVM optimized by genetic algorithm(GASVM),a total of 16 monitoring models were built,and the monitoring accuracies of the two types of models were compared.The ability of the monitoring models built by GASVM to identify scab was better than that of SVM algorithm under the same characteristic variables.Among the 16 models,MSR combined with GASVM had an overall accuracy of 75%and a Kappa coefficient of 0.47.GASVM can be used to monitor wheat scab and its application can improve the accuracy of disease monitoring.
基金funded by the China Agriculture Research System of MOF and MARA(CARS-41)the Agricultural Breed Project of Shandong Province,China(2019LZGC019 and 2020LZGC013)+1 种基金the Shandong Provincial Natural Science Foundation,China(ZR2020MC169)the Agricultural Scientific and Technological Innovation Project of Shandong Academy of Agricultural Sciences,China(CXGC2022C04 and CXGC2022E11).
文摘Many different chicken breeds are found around the world,their features vary among them,and they are valuable resources.Currently,there is a huge lack of knowledge of the genetic determinants responsible for phenotypic and biochemical properties of these breeds of chickens.Understanding the underlying genetic mechanisms that explain across-breed variation can help breeders develop improved chicken breeds.The whole-genomes of 140 chickens from 7 Shandong native breeds and 20 introduced recessive white chickens from China were re-sequenced.Comparative population genomics based on autosomal single nucleotide polymorphisms(SNPs)revealed geographically based clusters among the chickens.Through genome-wide scans for selective sweeps,we identified thyroid stimulating hormone receptor(TSHR,reproductive traits,circadian rhythm),erythrocyte membrane protein band 4.1 like 1(EPB41L1,body size),and alkylglycerol monooxygenase(AGMO,aggressive behavior),as major candidate breed-specific determining genes in chickens.In addition,we used a machine learning classification model to predict chicken breeds based on the SNPs significantly associated with recourse characteristics,and the prediction accuracy was 92%,which can effectively achieve the breed identification of Laiwu Black chickens.We provide the first comprehensive genomic data of the Shandong indigenous chickens.Our analyses revealed phylogeographic patterns among the Shandong indigenous chickens and candidate genes that potentially contribute to breed-specific traits of the chickens.In addition,we developed a machine learning-based prediction model using SNP data to identify chicken breeds.The genetic basis of indigenous chicken breeds revealed in this study is useful to better understand the mechanisms underlying the resource characteristics of chicken.
文摘A hybrid feature selection and classification strategy was proposed based on the simulated annealing genetic algonthrn and multiple instance learning (MIL). The band selection method was proposed from subspace decomposition, which combines the simulated annealing algorithm with the genetic algorithm in choosing different cross-over and mutation probabilities, as well as mutation individuals. Then MIL was combined with image segmentation, clustering and support vector machine algorithms to classify hyperspectral image. The experimental results show that this proposed method can get high classification accuracy of 93.13% at small training samples and the weaknesses of the conventional methods are overcome.
基金This work was supported by the BK-21 FOUR program through National Research Foundation of Korea(NRF)under Ministry of Education.
文摘Detection and classification of the blurred and the non-blurred regions in images is a challenging task due to the limited available information about blur type,scenarios and level of blurriness.In this paper,we propose an effective method for blur detection and segmentation based on transfer learning concept.The proposed method consists of two separate steps.In the first step,genetic programming(GP)model is developed that quantify the amount of blur for each pixel in the image.The GP model method uses the multiresolution features of the image and it provides an improved blur map.In the second phase,the blur map is segmented into blurred and non-blurred regions by using an adaptive threshold.A model based on support vector machine(SVM)is developed to compute adaptive threshold for the input blur map.The performance of the proposed method is evaluated using two different datasets and compared with various state-of-the-art methods.The comparative analysis reveals that the proposed method performs better against the state-of-the-art techniques.
文摘Traditional geostatistical estimation techniques have been used predominantly by the mining industry for ore reserve estimation. Determination of mineral reserve has posed considerable challenge to mining engineers due to the geological complexities of ore body formation. Extensive research over the years has resulted in the development of several state-of-the-art methods for predictive spatial mapping, which could be used for ore reserve estimation;and recent advances in the use of machine learning algorithms (MLA) have provided a new approach for solving the prob-lem of ore reserve estimation. The focus of the present study was on the use of two MLA for estimating ore reserve: namely, neural networks (NN) and support vector machines (SVM). Application of MLA and the various issues involved with using them for reserve estimation have been elaborated with the help of a complex drill-hole dataset that exhibits the typical properties of sparseness and impreciseness that might be associated with a mining dataset. To investigate the accuracy and applicability of MLA for ore reserve estimation, the generalization ability of NN and SVM was compared with the geostatistical ordinary kriging (OK) method.
基金supported by the National Natural Science Foundation of China(41701440).
文摘Mining aeroengine operational data and developing fault diagnosis models for aeroengines are to avoid running aeroengines under undesired conditions.Because of the complexity of working environment and faults of aeroengines,it is unavoidable that the monitored parameters vary widely and possess larger noise levels.This paper reports the extrapolation of a diagnosis model for 20 gas path faults of a double-spool turbofan civil aeroengine.By applying support vector machine(SVM)algorithm together with genetic algorithm(GA),the fault diagnosis model is obtained from the training set that was based on the deviations of the monitored parameters superimposed with the noise level of 10%.The SVM model(C=24.7034;γ=179.835)was extrapolated for the samples whose noise levels were larger than 10%.The accuracies of extrapolation for samples with the noise levels of 20%and 30%are 97%and 94%,respectively.Compared with the models reported on the same faults,the extrapolation results of the GASVM model are accurate.
基金The authors extend their appreciation to the Deputyship for Research&Innovation,“Ministry of Education”in Saudi Arabia for funding this research work through the project number (IFKSUDR_D127).
文摘Breast cancer diagnosis through mammography is a pivotal application within medical image-based diagnostics,integral for early detection and effective treatment.While deep learning has significantly advanced the analysis of mammographic images,challenges such as low contrast,image noise,and the high dimensionality of features often degrade model performance.Addressing these challenges,our study introduces a novel method integrating Genetic Algorithms(GA)with pre-trained Convolutional Neural Network(CNN)models to enhance feature selection and classification accuracy.Our approach involves a systematic process:first,we employ widely-used CNN architectures(VGG16,VGG19,MobileNet,and DenseNet)to extract a broad range of features from the Medical Image Analysis Society(MIAS)mammography dataset.Subsequently,a GA optimizes these features by selecting the most relevant and least redundant,aiming to overcome the typical pitfalls of high dimensionality.The selected features are then utilized to train several classifiers,including Linear and Polynomial Support Vector Machines(SVMs),K-Nearest Neighbors,Decision Trees,and Random Forests,enabling a robust evaluation of the method’s effectiveness across varied learning algorithms.Our extensive experimental evaluation demonstrates that the integration of MobileNet and GA significantly improves classification accuracy,from 83.33%to 89.58%,underscoring the method’s efficacy.By detailing these steps,we highlight the innovation of our approach which not only addresses key issues in breast cancer imaging analysis but also offers a scalable solution potentially applicable to other domains within medical imaging.