Parkinson’s disease(PD)is a debilitating neurological disorder affecting over 10 million people worldwide.PD classification models using voice signals as input are common in the literature.It is believed that using d...Parkinson’s disease(PD)is a debilitating neurological disorder affecting over 10 million people worldwide.PD classification models using voice signals as input are common in the literature.It is believed that using deep learning algorithms further enhances performance;nevertheless,it is challenging due to the nature of small-scale and imbalanced PD datasets.This paper proposed a convolutional neural network-based deep support vector machine(CNN-DSVM)to automate the feature extraction process using CNN and extend the conventional SVM to a DSVM for better classification performance in small-scale PD datasets.A customized kernel function reduces the impact of biased classification towards the majority class(healthy candidates in our consideration).An improved generative adversarial network(IGAN)was designed to generate additional training data to enhance the model’s performance.For performance evaluation,the proposed algorithm achieves a sensitivity of 97.6%and a specificity of 97.3%.The performance comparison is evaluated from five perspectives,including comparisons with different data generation algorithms,feature extraction techniques,kernel functions,and existing works.Results reveal the effectiveness of the IGAN algorithm,which improves the sensitivity and specificity by 4.05%–4.72%and 4.96%–5.86%,respectively;and the effectiveness of the CNN-DSVM algorithm,which improves the sensitivity by 1.24%–57.4%and specificity by 1.04%–163%and reduces biased detection towards the majority class.The ablation experiments confirm the effectiveness of individual components.Two future research directions have also been suggested.展开更多
Accurately estimating the State of Health(SOH)and Remaining Useful Life(RUL)of lithium-ion batteries(LIBs)is crucial for the continuous and stable operation of battery management systems.However,due to the complex int...Accurately estimating the State of Health(SOH)and Remaining Useful Life(RUL)of lithium-ion batteries(LIBs)is crucial for the continuous and stable operation of battery management systems.However,due to the complex internal chemical systems of LIBs and the nonlinear degradation of their performance,direct measurement of SOH and RUL is challenging.To address these issues,the Twin Support Vector Machine(TWSVM)method is proposed to predict SOH and RUL.Initially,the constant current charging time of the lithium battery is extracted as a health indicator(HI),decomposed using Variational Modal Decomposition(VMD),and feature correlations are computed using Importance of Random Forest Features(RF)to maximize the extraction of critical factors influencing battery performance degradation.Furthermore,to enhance the global search capability of the Convolution Optimization Algorithm(COA),improvements are made using Good Point Set theory and the Differential Evolution method.The Improved Convolution Optimization Algorithm(ICOA)is employed to optimize TWSVM parameters for constructing SOH and RUL prediction models.Finally,the proposed models are validated using NASA and CALCE lithium-ion battery datasets.Experimental results demonstrate that the proposed models achieve an RMSE not exceeding 0.007 and an MAPE not exceeding 0.0082 for SOH and RUL prediction,with a relative error in RUL prediction within the range of[-1.8%,2%].Compared to other models,the proposed model not only exhibits superior fitting capability but also demonstrates robust performance.展开更多
Open networks and heterogeneous services in the Internet of Vehicles(IoV)can lead to security and privacy challenges.One key requirement for such systems is the preservation of user privacy,ensuring a seamless experie...Open networks and heterogeneous services in the Internet of Vehicles(IoV)can lead to security and privacy challenges.One key requirement for such systems is the preservation of user privacy,ensuring a seamless experience in driving,navigation,and communication.These privacy needs are influenced by various factors,such as data collected at different intervals,trip durations,and user interactions.To address this,the paper proposes a Support Vector Machine(SVM)model designed to process large amounts of aggregated data and recommend privacy preserving measures.The model analyzes data based on user demands and interactions with service providers or neighboring infrastructure.It aims to minimize privacy risks while ensuring service continuity and sustainability.The SVMmodel helps validate the system’s reliability by creating a hyperplane that distinguishes between maximum and minimum privacy recommendations.The results demonstrate the effectiveness of the proposed SVM model in enhancing both privacy and service performance.展开更多
The total nitrogen(TN)is a major factor contributing to eutrophication and is a crucial parameter in assessing surface water quality.Accurate and rapid methods are crucial for determining the TN content in water.Herei...The total nitrogen(TN)is a major factor contributing to eutrophication and is a crucial parameter in assessing surface water quality.Accurate and rapid methods are crucial for determining the TN content in water.Herein,a fast,highly sensitive,and pollution-free approach is proposed,which combines ultraviolet(UV)absorption spectroscopy with Bayesian optimized least squares support vector machine(LSSVM)for detecting TN content in water.Water samples collected from sampling points near the Yangtze River basin in Chongqing of China were analyzed using national standard methods to measure TN content as reference values.The prediction of TN content in water was achieved by integrating the UV absorption spectra of water samples with LSSVM.To make the model quickly and accurately select the optimal parameters to improve the accuracy of the prediction model,the Bayesian optimization(BO)algorithm was used to optimize the parameters of the LSSVM.Results show that the prediction model performs well in predicting TN concentration,with a high coefficient of prediction determination(R^(2)=0.9413)and a low root mean square error of prediction(RMSE=0.0779 mg/L).Comparative analysis with previous studies indicates that the model used in this paper achieves lower prediction errors and superior predictive performance.展开更多
Accurate and robust detection of wax appearance(a medium-to high-molecular-weight component of crude oil)is crucial for the efficient operation of hydrocarbon transportation.The wax appearance temperature(WAT)is the l...Accurate and robust detection of wax appearance(a medium-to high-molecular-weight component of crude oil)is crucial for the efficient operation of hydrocarbon transportation.The wax appearance temperature(WAT)is the lowest temperature at which the wax begins to form.When crude oil cools to its WAT,wax crystals precipitate,forming deposits on pipelines as the solubility limit is reached.Therefore,WAT is a crucial quality assurance parameter,especially when dealing with modern fuel oil blends.In this study,we use machine learning via MATLAB’s Bioinformatics Toolbox to predict the WAT of marine fuel samples by correlating near-infrared spectral data with laboratory-measured values.The dataset provided by Intertek PLC-a total quality assurance provider of inspection,testing,and certification services-includes industrial data that is imbalanced,with a higher proportion of high-WAT samples compared to low-WAT samples.The objective is to predict marine fuel oil blends with unusually high WAT values(>35℃)without relying on time-consuming and irregular laboratory-based measurements.The results demonstrate that the developed model,based on the one-class support vector machine(OCSVM)algorithm,achieved a Recall of 96,accurately predicting 96%of fuel samples with WAT>35℃.For standard binary classification,the Recall was 85.7.The trained OCSVM model is expected to facilitate rapid and well-informed decision-making for logistics and storage when choosing fuel oils.展开更多
Miniature air quality sensors are widely used in urban grid-based monitoring due to their flexibility in deployment and low cost.However,the raw data collected by these devices often suffer from low accuracy caused by...Miniature air quality sensors are widely used in urban grid-based monitoring due to their flexibility in deployment and low cost.However,the raw data collected by these devices often suffer from low accuracy caused by environmental interference and sensor drift,highlighting the need for effective calibration methods to improve data reliability.This study proposes a data correction method based on Bayesian Optimization Support Vector Regression(BO-SVR),which combines the nonlinear modeling capability of Support Vector Regression(SVR)with the efficient global hyperparameter search of Bayesian Optimization.By introducing cross-validation loss as the optimization objective and using Gaussian process modeling with an Expected Improvement acquisition strategy,the approach automatically determines optimal hyperparameters for accurate pollutant concentration prediction.Experiments on real-world micro-sensor datasets demonstrate that BO-SVR outperforms traditional SVR,grid search SVR,and random forest(RF)models across multiple pollutants,including PM_(2.5),PM_(10),CO,NO_(2),SO_(2),and O_(3).The proposed method achieves lower prediction residuals,higher fitting accuracy,and better generalization,offering an efficient and practical solution for enhancing the quality of micro-sensor air monitoring data.展开更多
Multi-kernel-based support vector machine (SVM) model structure of nonlinear systems and its specific identification method is proposed, which is composed of a SVM with linear kernel function followed in series by a...Multi-kernel-based support vector machine (SVM) model structure of nonlinear systems and its specific identification method is proposed, which is composed of a SVM with linear kernel function followed in series by a SVM with spline kernel function. With the help of this model, nonlinear model predictive control can be transformed to linear model predictive control, and consequently a unified analytical solution of optimal input of multi-step-ahead predictive control is possible to derive. This algorithm does not require online iterative optimization in order to be suitable for real-time control with less calculation. The simulation results of pH neutralization process and CSTR reactor show the effectiveness and advantages of the presented algorithm.展开更多
In order to accurately identify a bearing fault on a wind turbine, a novel fault diagnosis method based on stochastic subspace identification(SSI) and multi-kernel support vector machine(MSVM) is proposed. Firstly, th...In order to accurately identify a bearing fault on a wind turbine, a novel fault diagnosis method based on stochastic subspace identification(SSI) and multi-kernel support vector machine(MSVM) is proposed. Firstly, the collected vibration signal of the wind turbine bearing is processed by the SSI method to extract fault feature vectors. Then, the MSVM is constructed based on Gauss kernel support vector machine(SVM) and polynomial kernel SVM. Finally, fault feature vectors which indicate the condition of the wind turbine bearing are inputted to the MSVM for fault pattern recognition. The results indicate that the SSI-MSVM method is effective in fault diagnosis for a wind turbine bearing and can successfully identify fault types of bearing and achieve higher diagnostic accuracy than that of K-means clustering, fuzzy means clustering and traditional SVM.展开更多
Lithofacies identification is a crucial work in reservoir characterization and modeling.The vast inter-well area can be supplemented by facies identification of seismic data.However,the relationship between lithofacie...Lithofacies identification is a crucial work in reservoir characterization and modeling.The vast inter-well area can be supplemented by facies identification of seismic data.However,the relationship between lithofacies and seismic information that is affected by many factors is complicated.Machine learning has received extensive attention in recent years,among which support vector machine(SVM) is a potential method for lithofacies classification.Lithofacies classification involves identifying various types of lithofacies and is generally a nonlinear problem,which needs to be solved by means of the kernel function.Multi-kernel learning SVM is one of the main tools for solving the nonlinear problem about multi-classification.However,it is very difficult to determine the kernel function and the parameters,which is restricted by human factors.Besides,its computational efficiency is low.A lithofacies classification method based on local deep multi-kernel learning support vector machine(LDMKL-SVM) that can consider low-dimensional global features and high-dimensional local features is developed.The method can automatically learn parameters of kernel function and SVM to build a relationship between lithofacies and seismic elastic information.The calculation speed will be expedited at no cost with respect to discriminant accuracy for multi-class lithofacies identification.Both the model data test results and the field data application results certify advantages of the method.This contribution offers an effective method for lithofacies recognition and reservoir prediction by using SVM.展开更多
Many remote sensing image classifiers are limited in their ability to combine spectral features with spatial features. Multi-kernel classifiers, however, are capable of integrating spectral features with spatial or st...Many remote sensing image classifiers are limited in their ability to combine spectral features with spatial features. Multi-kernel classifiers, however, are capable of integrating spectral features with spatial or structural features using multiple kernels and summing them for final outputs. Using a support vector machine (SVM) as classifier, different multi-kernel classifiers are constructed and tested using 64-band Operational Modular Imaging Spectrometer II hyperspectral image of Changping Area, Beijing City. Results show that by integrating spectral and wavelet texture information, multi-kernel SVM classifiers can obtain more accurate classification results than sole-kernel SVM classifiers and cross-information SVM kernel classifiers. Moreover, when the multi-kernel SVM classifier is used, the combination of the first four principal components from principal component analysis and wavelet texture provides the highest accuracy (97.06%). Multi-kernel SVM is therefore an effective approach to improve the accuracy of hyperspectral image classification and to expand possibilities for remote sensing image interpretation and application.展开更多
A new incremental support vector machine (SVM) algorithm is proposed which is based on multiple kernel learning. Through introducing multiple kernel learning into the SVM incremental learning, large scale data set l...A new incremental support vector machine (SVM) algorithm is proposed which is based on multiple kernel learning. Through introducing multiple kernel learning into the SVM incremental learning, large scale data set learning problem can be solved effectively. Furthermore, different punishments are adopted in allusion to the training subset and the acquired support vectors, which may help to improve the performance of SVM. Simulation results indicate that the proposed algorithm can not only solve the model selection problem in SVM incremental learning, but also improve the classification or prediction precision.展开更多
Aiming at the problems of the traditional method of assessing distribution of particle size in bench blasting, a support vector machines (SVMs) regression methodology was used to predict the mean particle size (X50...Aiming at the problems of the traditional method of assessing distribution of particle size in bench blasting, a support vector machines (SVMs) regression methodology was used to predict the mean particle size (X50) resulting from rock blast fragmentation in various mines based on the statistical learning theory. The data base consisted of blast design parameters, explosive parameters, modulus of elasticity and in-situ block size. The seven input independent variables used for the SVMs model for the prediction of X50 of rock blast fragmentation were the ratio of bench height to drilled burden (H/B), ratio of spacing to burden (S/B), ratio of burden to hole diameter (B/D), ratio of stemming to burden (T/B), powder factor (Pf), modulus of elasticity (E) and in-situ block size (XB). After using the 90 sets of the measured data in various mines and rock formations in the world for training and testing, the model was applied to 12 another blast data for validation of the trained support vector regression (SVR) model. The prediction results of SVR were compared with those of artificial neural network (ANN), multivariate regression analysis (MVRA) models, conventional Kuznetsov method and the measured X50 values. The proposed method shows promising results and the prediction accuracy of SVMs model is acceptable.展开更多
In order to deal with the issue of huge computational cost very well in direct numerical simulation, the traditional response surface method (RSM) as a classical regression algorithm is used to approximate a functiona...In order to deal with the issue of huge computational cost very well in direct numerical simulation, the traditional response surface method (RSM) as a classical regression algorithm is used to approximate a functional relationship between the state variable and basic variables in reliability design. The algorithm has treated successfully some problems of implicit performance function in reliability analysis. However, its theoretical basis of empirical risk minimization narrows its range of applications for...展开更多
Support vector machines (SVMs) are utilized for emotion recognition in Chinese speech in this paper. Both binary class discrimination and the multi class discrimination are discussed. It proves that the emotional fe...Support vector machines (SVMs) are utilized for emotion recognition in Chinese speech in this paper. Both binary class discrimination and the multi class discrimination are discussed. It proves that the emotional features construct a nonlinear problem in the input space, and SVMs based on nonlinear mapping can solve it more effectively than other linear methods. Multi class classification based on SVMs with a soft decision function is constructed to classify the four emotion situations. Compared with principal component analysis (PCA) method and modified PCA method, SVMs perform the best result in multi class discrimination by using nonlinear kernel mapping.展开更多
In order to solve the fatigue damage identification problem of helicopter moving components, a new approach for acoustic emission (AE) source type identification based on the harmonic wavelet packet (HWPT) feature...In order to solve the fatigue damage identification problem of helicopter moving components, a new approach for acoustic emission (AE) source type identification based on the harmonic wavelet packet (HWPT) feature extraction and the hierarchy support vector machine (H-SVM) classifier is proposed. After a four-level decomposition of the HWPT, the energy feature of AE signals in different frequency bands is extracted, which overcomes the shortcomings of the traditional wavelet packet including energy leakage, and inflexible frequency band selection and different frequency resolutions on different levels. The H-SVM classifier is trained with a subset of the experimental data for known AE source types and tested using the remaining set of data. The results of pressure-off experiments on the specimens of carbon fiber materials indicate that the proposed approach can effectively implement the AE source type identification, and has a better performance in terms of computational efficiency and identification accuracy than the wavelet packet (WPT) feature extraction.展开更多
In order to assist the design of short interfering ribonucleic acids (siRNA), 573 non-redundant siRNAs were collected from published literatures and the relationship between siRNAs sequences and RNA interference (R...In order to assist the design of short interfering ribonucleic acids (siRNA), 573 non-redundant siRNAs were collected from published literatures and the relationship between siRNAs sequences and RNA interference (RNAi) effect is analyzed by a support vector machine (SVM) based algorithm relied on a basebase correlation (BBC) feature. The results show that the proposed algorithm has the highest area under curve (AUC) value (0. 73) of the receive operating characteristic (ROC) curve and the greatest r value (0. 43) of the Pearson's correlation coefficient. This indicates that the proposed algorithm is better than the published algorithms on the collected datasets and that more attention should be paid to the base-base correlation information in future siRNA design.展开更多
In order to improve the accuracy of travel demand forecast and considering the distribution of travel behaviors within time dimension, a trip chaining pattern recognition model was established based on activity purpos...In order to improve the accuracy of travel demand forecast and considering the distribution of travel behaviors within time dimension, a trip chaining pattern recognition model was established based on activity purposes by applying three methods: the support vector machine (SVM) model, the radial basis function neural network (RBFNN) model and the multinomial logit (MNL) model. The effect of explanatory factors on trip chaining behaviors and their contribution to model performace were investigated by sensitivity analysis. Results show that the SVM model has a better performance than the RBFNN model and the MNL model due to its higher overall and partial accuracy, indicating its recognition advantage under a smai sample size scenario. It is also proved that the SVM model is capable of estimating the effect of multi-category factors on trip chaining behaviors more accurately. The different contribution of explanatory, factors to trip chaining pattern recognition reflects the importance of refining trip chaining patterns ad exploring factors that are specific to each pattern. It is shown that the SVM technology in travel demand forecast modeling and analysis of explanatory variable effects is practical.展开更多
The computational approaches of support vector machine (SVM), support vector regression (SVR) and molecular docking were widely utilized for the computation of active compounds. In this work, to improve the accura...The computational approaches of support vector machine (SVM), support vector regression (SVR) and molecular docking were widely utilized for the computation of active compounds. In this work, to improve the accuracy and reliability of prediction, the strategy of combining the above three computational approaches was applied to predict potential cytochrome P450 1A2 (CYP1A2) inhibitors. The accuracy of the optimal SVM qualitative model was 99.432%, 97.727%, and 91.667% for training set, internal test set and external test set, respectively, showing this model had high discrimination ability. The R2 and mean square error for the optimal SVR quantitative model were 0.763, 0.013 for training set, and 0.753, 0.056 for test set respectively, indicating that this SVR model has high predictive ability for the biolog-ical activities of compounds. According to the results of the SVM and SVR models, some types of descriptors were identi ed to be essential to bioactivity prediction of compounds, including the connectivity indices, constitutional descriptors and functional group counts. Moreover, molecular docking studies were used to reveal the binding poses and binding a n-ity of potential inhibitors interacting with CYP1A2. Wherein, the amino acids of THR124 and ASP320 could form key hydrogen bond interactions with active compounds. And the amino acids of ALA317 and GLY316 could form strong hydrophobic bond interactions with active compounds. The models obtained above were applied to discover potential CYP1A2 inhibitors from natural products, which could predict the CYPs-mediated drug-drug inter-actions and provide useful guidance and reference for rational drug combination therapy. A set of 20 potential CYP1A2 inhibitors were obtained. Part of the results was consistent with references, which further indicates the accuracy of these models and the reliability of this combinatorial computation strategy.展开更多
Multivariables, strong coupling, nonlinearity, and large delays characterize the boiler-turbine coordinated control systems for ship power equipment. To better deal with these conditions, a compound control strategy b...Multivariables, strong coupling, nonlinearity, and large delays characterize the boiler-turbine coordinated control systems for ship power equipment. To better deal with these conditions, a compound control strategy based on a support vector machine (SVM) with inverse identification was proposed and applied to research simulating coordinated control systems. This method combines SVM inverse control and fuzzy control, taking advantage of the merits of SVM inverse controls which can be designed easily and have high reliability, and those of fuzzy controls, which respond rapidly and have good anti-jamming capability and robustness. It ensures the controller can be controlled with near instantaneous adjustments to maintain a steady state, even if the SVM is not trained well. The simulation results show that the control quality of this fuzzy-SVM compound control algorithm is high, with good performance in dynamic response speed, static stability, restraint of overshoot, and robustness.展开更多
[Objective] The aim was to study the feature extraction of stored-grain insects based on ant colony optimization and support vector machine algorithm, and to explore the feasibility of the feature extraction of stored...[Objective] The aim was to study the feature extraction of stored-grain insects based on ant colony optimization and support vector machine algorithm, and to explore the feasibility of the feature extraction of stored-grain insects. [Method] Through the analysis of feature extraction in the image recognition of the stored-grain insects, the recognition accuracy of the cross-validation training model in support vector machine (SVM) algorithm was taken as an important factor of the evaluation principle of feature extraction of stored-grain insects. The ant colony optimization (ACO) algorithm was applied to the automatic feature extraction of stored-grain insects. [Result] The algorithm extracted the optimal feature subspace of seven features from the 17 morphological features, including area and perimeter. The ninety image samples of the stored-grain insects were automatically recognized by the optimized SVM classifier, and the recognition accuracy was over 95%. [Conclusion] The experiment shows that the application of ant colony optimization to the feature extraction of grain insects is practical and feasible.展开更多
基金The work described in this paper was fully supported by a grant from Hong Kong Metropolitan University(RIF/2021/05).
文摘Parkinson’s disease(PD)is a debilitating neurological disorder affecting over 10 million people worldwide.PD classification models using voice signals as input are common in the literature.It is believed that using deep learning algorithms further enhances performance;nevertheless,it is challenging due to the nature of small-scale and imbalanced PD datasets.This paper proposed a convolutional neural network-based deep support vector machine(CNN-DSVM)to automate the feature extraction process using CNN and extend the conventional SVM to a DSVM for better classification performance in small-scale PD datasets.A customized kernel function reduces the impact of biased classification towards the majority class(healthy candidates in our consideration).An improved generative adversarial network(IGAN)was designed to generate additional training data to enhance the model’s performance.For performance evaluation,the proposed algorithm achieves a sensitivity of 97.6%and a specificity of 97.3%.The performance comparison is evaluated from five perspectives,including comparisons with different data generation algorithms,feature extraction techniques,kernel functions,and existing works.Results reveal the effectiveness of the IGAN algorithm,which improves the sensitivity and specificity by 4.05%–4.72%and 4.96%–5.86%,respectively;and the effectiveness of the CNN-DSVM algorithm,which improves the sensitivity by 1.24%–57.4%and specificity by 1.04%–163%and reduces biased detection towards the majority class.The ablation experiments confirm the effectiveness of individual components.Two future research directions have also been suggested.
基金funded by the Pyramid Talent Training Project of Beijing University of Civil Engineering and Architecture under Grant GJZJ20220802。
文摘Accurately estimating the State of Health(SOH)and Remaining Useful Life(RUL)of lithium-ion batteries(LIBs)is crucial for the continuous and stable operation of battery management systems.However,due to the complex internal chemical systems of LIBs and the nonlinear degradation of their performance,direct measurement of SOH and RUL is challenging.To address these issues,the Twin Support Vector Machine(TWSVM)method is proposed to predict SOH and RUL.Initially,the constant current charging time of the lithium battery is extracted as a health indicator(HI),decomposed using Variational Modal Decomposition(VMD),and feature correlations are computed using Importance of Random Forest Features(RF)to maximize the extraction of critical factors influencing battery performance degradation.Furthermore,to enhance the global search capability of the Convolution Optimization Algorithm(COA),improvements are made using Good Point Set theory and the Differential Evolution method.The Improved Convolution Optimization Algorithm(ICOA)is employed to optimize TWSVM parameters for constructing SOH and RUL prediction models.Finally,the proposed models are validated using NASA and CALCE lithium-ion battery datasets.Experimental results demonstrate that the proposed models achieve an RMSE not exceeding 0.007 and an MAPE not exceeding 0.0082 for SOH and RUL prediction,with a relative error in RUL prediction within the range of[-1.8%,2%].Compared to other models,the proposed model not only exhibits superior fitting capability but also demonstrates robust performance.
基金supported by the Deanship of Graduate Studies and Scientific Research at University of Bisha for funding this research through the promising program under grant number(UB-Promising-33-1445).
文摘Open networks and heterogeneous services in the Internet of Vehicles(IoV)can lead to security and privacy challenges.One key requirement for such systems is the preservation of user privacy,ensuring a seamless experience in driving,navigation,and communication.These privacy needs are influenced by various factors,such as data collected at different intervals,trip durations,and user interactions.To address this,the paper proposes a Support Vector Machine(SVM)model designed to process large amounts of aggregated data and recommend privacy preserving measures.The model analyzes data based on user demands and interactions with service providers or neighboring infrastructure.It aims to minimize privacy risks while ensuring service continuity and sustainability.The SVMmodel helps validate the system’s reliability by creating a hyperplane that distinguishes between maximum and minimum privacy recommendations.The results demonstrate the effectiveness of the proposed SVM model in enhancing both privacy and service performance.
基金supported by the National Natural Science Foundation of China(Nos.32171627 and 62105252)the Science and Technology Research Program of Chongqing Municipal Education Commission(No.KJZD-M202200602)the Hangzhou Science and Technology Development Project(No.202204T04).
文摘The total nitrogen(TN)is a major factor contributing to eutrophication and is a crucial parameter in assessing surface water quality.Accurate and rapid methods are crucial for determining the TN content in water.Herein,a fast,highly sensitive,and pollution-free approach is proposed,which combines ultraviolet(UV)absorption spectroscopy with Bayesian optimized least squares support vector machine(LSSVM)for detecting TN content in water.Water samples collected from sampling points near the Yangtze River basin in Chongqing of China were analyzed using national standard methods to measure TN content as reference values.The prediction of TN content in water was achieved by integrating the UV absorption spectra of water samples with LSSVM.To make the model quickly and accurately select the optimal parameters to improve the accuracy of the prediction model,the Bayesian optimization(BO)algorithm was used to optimize the parameters of the LSSVM.Results show that the prediction model performs well in predicting TN concentration,with a high coefficient of prediction determination(R^(2)=0.9413)and a low root mean square error of prediction(RMSE=0.0779 mg/L).Comparative analysis with previous studies indicates that the model used in this paper achieves lower prediction errors and superior predictive performance.
基金Newcastle University and EPSRC(Grant No.2020/21 DTP:ref.EP/T517914/1).
文摘Accurate and robust detection of wax appearance(a medium-to high-molecular-weight component of crude oil)is crucial for the efficient operation of hydrocarbon transportation.The wax appearance temperature(WAT)is the lowest temperature at which the wax begins to form.When crude oil cools to its WAT,wax crystals precipitate,forming deposits on pipelines as the solubility limit is reached.Therefore,WAT is a crucial quality assurance parameter,especially when dealing with modern fuel oil blends.In this study,we use machine learning via MATLAB’s Bioinformatics Toolbox to predict the WAT of marine fuel samples by correlating near-infrared spectral data with laboratory-measured values.The dataset provided by Intertek PLC-a total quality assurance provider of inspection,testing,and certification services-includes industrial data that is imbalanced,with a higher proportion of high-WAT samples compared to low-WAT samples.The objective is to predict marine fuel oil blends with unusually high WAT values(>35℃)without relying on time-consuming and irregular laboratory-based measurements.The results demonstrate that the developed model,based on the one-class support vector machine(OCSVM)algorithm,achieved a Recall of 96,accurately predicting 96%of fuel samples with WAT>35℃.For standard binary classification,the Recall was 85.7.The trained OCSVM model is expected to facilitate rapid and well-informed decision-making for logistics and storage when choosing fuel oils.
文摘Miniature air quality sensors are widely used in urban grid-based monitoring due to their flexibility in deployment and low cost.However,the raw data collected by these devices often suffer from low accuracy caused by environmental interference and sensor drift,highlighting the need for effective calibration methods to improve data reliability.This study proposes a data correction method based on Bayesian Optimization Support Vector Regression(BO-SVR),which combines the nonlinear modeling capability of Support Vector Regression(SVR)with the efficient global hyperparameter search of Bayesian Optimization.By introducing cross-validation loss as the optimization objective and using Gaussian process modeling with an Expected Improvement acquisition strategy,the approach automatically determines optimal hyperparameters for accurate pollutant concentration prediction.Experiments on real-world micro-sensor datasets demonstrate that BO-SVR outperforms traditional SVR,grid search SVR,and random forest(RF)models across multiple pollutants,including PM_(2.5),PM_(10),CO,NO_(2),SO_(2),and O_(3).The proposed method achieves lower prediction residuals,higher fitting accuracy,and better generalization,offering an efficient and practical solution for enhancing the quality of micro-sensor air monitoring data.
基金Supported by the State Key Development Program for Basic Research of China (No.2002CB312200) and the National Natural Science Foundation of China (No.60574019).
文摘Multi-kernel-based support vector machine (SVM) model structure of nonlinear systems and its specific identification method is proposed, which is composed of a SVM with linear kernel function followed in series by a SVM with spline kernel function. With the help of this model, nonlinear model predictive control can be transformed to linear model predictive control, and consequently a unified analytical solution of optimal input of multi-step-ahead predictive control is possible to derive. This algorithm does not require online iterative optimization in order to be suitable for real-time control with less calculation. The simulation results of pH neutralization process and CSTR reactor show the effectiveness and advantages of the presented algorithm.
基金supported by National Key Technology Research and Development Program (No. 2015BAA06B03)
文摘In order to accurately identify a bearing fault on a wind turbine, a novel fault diagnosis method based on stochastic subspace identification(SSI) and multi-kernel support vector machine(MSVM) is proposed. Firstly, the collected vibration signal of the wind turbine bearing is processed by the SSI method to extract fault feature vectors. Then, the MSVM is constructed based on Gauss kernel support vector machine(SVM) and polynomial kernel SVM. Finally, fault feature vectors which indicate the condition of the wind turbine bearing are inputted to the MSVM for fault pattern recognition. The results indicate that the SSI-MSVM method is effective in fault diagnosis for a wind turbine bearing and can successfully identify fault types of bearing and achieve higher diagnostic accuracy than that of K-means clustering, fuzzy means clustering and traditional SVM.
基金financially supported by the National Natural Science Foundation of China (41774129, 41904116)the Foundation Research Project of Shaanxi Provincial Key Laboratory of Geological Support for Coal Green Exploitation (MTy2019-20)。
文摘Lithofacies identification is a crucial work in reservoir characterization and modeling.The vast inter-well area can be supplemented by facies identification of seismic data.However,the relationship between lithofacies and seismic information that is affected by many factors is complicated.Machine learning has received extensive attention in recent years,among which support vector machine(SVM) is a potential method for lithofacies classification.Lithofacies classification involves identifying various types of lithofacies and is generally a nonlinear problem,which needs to be solved by means of the kernel function.Multi-kernel learning SVM is one of the main tools for solving the nonlinear problem about multi-classification.However,it is very difficult to determine the kernel function and the parameters,which is restricted by human factors.Besides,its computational efficiency is low.A lithofacies classification method based on local deep multi-kernel learning support vector machine(LDMKL-SVM) that can consider low-dimensional global features and high-dimensional local features is developed.The method can automatically learn parameters of kernel function and SVM to build a relationship between lithofacies and seismic elastic information.The calculation speed will be expedited at no cost with respect to discriminant accuracy for multi-class lithofacies identification.Both the model data test results and the field data application results certify advantages of the method.This contribution offers an effective method for lithofacies recognition and reservoir prediction by using SVM.
基金supported by the National Natural Science Foundation of China (Nos.40401038 and 40871195)the National High-Tech Program of China (No.2007AA12Z162)+1 种基金Jiangsu Provincial Innovative Planning (No.CX08B 112Z)the Fundamental Research Funds for the Central Universities (2010QNA18)
文摘Many remote sensing image classifiers are limited in their ability to combine spectral features with spatial features. Multi-kernel classifiers, however, are capable of integrating spectral features with spatial or structural features using multiple kernels and summing them for final outputs. Using a support vector machine (SVM) as classifier, different multi-kernel classifiers are constructed and tested using 64-band Operational Modular Imaging Spectrometer II hyperspectral image of Changping Area, Beijing City. Results show that by integrating spectral and wavelet texture information, multi-kernel SVM classifiers can obtain more accurate classification results than sole-kernel SVM classifiers and cross-information SVM kernel classifiers. Moreover, when the multi-kernel SVM classifier is used, the combination of the first four principal components from principal component analysis and wavelet texture provides the highest accuracy (97.06%). Multi-kernel SVM is therefore an effective approach to improve the accuracy of hyperspectral image classification and to expand possibilities for remote sensing image interpretation and application.
基金supported by the National Natural Science Key Foundation of China(69974021)
文摘A new incremental support vector machine (SVM) algorithm is proposed which is based on multiple kernel learning. Through introducing multiple kernel learning into the SVM incremental learning, large scale data set learning problem can be solved effectively. Furthermore, different punishments are adopted in allusion to the training subset and the acquired support vectors, which may help to improve the performance of SVM. Simulation results indicate that the proposed algorithm can not only solve the model selection problem in SVM incremental learning, but also improve the classification or prediction precision.
基金Foundation item:Project (2006BAB02A02) supported by the National Key Technology R&D Program during the 11th Five-year Plan Period of ChinaProject (CX2011B119) supported by the Graduated Students' Research and Innovation Fund of Hunan Province, ChinaProject (2009ssxt230) supported by the Central South University Innovation Fund,China
文摘Aiming at the problems of the traditional method of assessing distribution of particle size in bench blasting, a support vector machines (SVMs) regression methodology was used to predict the mean particle size (X50) resulting from rock blast fragmentation in various mines based on the statistical learning theory. The data base consisted of blast design parameters, explosive parameters, modulus of elasticity and in-situ block size. The seven input independent variables used for the SVMs model for the prediction of X50 of rock blast fragmentation were the ratio of bench height to drilled burden (H/B), ratio of spacing to burden (S/B), ratio of burden to hole diameter (B/D), ratio of stemming to burden (T/B), powder factor (Pf), modulus of elasticity (E) and in-situ block size (XB). After using the 90 sets of the measured data in various mines and rock formations in the world for training and testing, the model was applied to 12 another blast data for validation of the trained support vector regression (SVR) model. The prediction results of SVR were compared with those of artificial neural network (ANN), multivariate regression analysis (MVRA) models, conventional Kuznetsov method and the measured X50 values. The proposed method shows promising results and the prediction accuracy of SVMs model is acceptable.
基金National High-tech Research and Development Pro-gram (2006AA04Z405)
文摘In order to deal with the issue of huge computational cost very well in direct numerical simulation, the traditional response surface method (RSM) as a classical regression algorithm is used to approximate a functional relationship between the state variable and basic variables in reliability design. The algorithm has treated successfully some problems of implicit performance function in reliability analysis. However, its theoretical basis of empirical risk minimization narrows its range of applications for...
文摘Support vector machines (SVMs) are utilized for emotion recognition in Chinese speech in this paper. Both binary class discrimination and the multi class discrimination are discussed. It proves that the emotional features construct a nonlinear problem in the input space, and SVMs based on nonlinear mapping can solve it more effectively than other linear methods. Multi class classification based on SVMs with a soft decision function is constructed to classify the four emotion situations. Compared with principal component analysis (PCA) method and modified PCA method, SVMs perform the best result in multi class discrimination by using nonlinear kernel mapping.
基金The Natural Science Foundation of Heilongjiang Province ( No. F201018)the National Natural Science Foundation of China( No. 60901042)
文摘In order to solve the fatigue damage identification problem of helicopter moving components, a new approach for acoustic emission (AE) source type identification based on the harmonic wavelet packet (HWPT) feature extraction and the hierarchy support vector machine (H-SVM) classifier is proposed. After a four-level decomposition of the HWPT, the energy feature of AE signals in different frequency bands is extracted, which overcomes the shortcomings of the traditional wavelet packet including energy leakage, and inflexible frequency band selection and different frequency resolutions on different levels. The H-SVM classifier is trained with a subset of the experimental data for known AE source types and tested using the remaining set of data. The results of pressure-off experiments on the specimens of carbon fiber materials indicate that the proposed approach can effectively implement the AE source type identification, and has a better performance in terms of computational efficiency and identification accuracy than the wavelet packet (WPT) feature extraction.
基金The National Natural Science Foundation of China(No60671018,60121101)
文摘In order to assist the design of short interfering ribonucleic acids (siRNA), 573 non-redundant siRNAs were collected from published literatures and the relationship between siRNAs sequences and RNA interference (RNAi) effect is analyzed by a support vector machine (SVM) based algorithm relied on a basebase correlation (BBC) feature. The results show that the proposed algorithm has the highest area under curve (AUC) value (0. 73) of the receive operating characteristic (ROC) curve and the greatest r value (0. 43) of the Pearson's correlation coefficient. This indicates that the proposed algorithm is better than the published algorithms on the collected datasets and that more attention should be paid to the base-base correlation information in future siRNA design.
基金The Fundamental Research Funds for the Central Universities,the Scientific Innovation Research of College Graduates in Jiangsu Province(No.KYLX_0177)
文摘In order to improve the accuracy of travel demand forecast and considering the distribution of travel behaviors within time dimension, a trip chaining pattern recognition model was established based on activity purposes by applying three methods: the support vector machine (SVM) model, the radial basis function neural network (RBFNN) model and the multinomial logit (MNL) model. The effect of explanatory factors on trip chaining behaviors and their contribution to model performace were investigated by sensitivity analysis. Results show that the SVM model has a better performance than the RBFNN model and the MNL model due to its higher overall and partial accuracy, indicating its recognition advantage under a smai sample size scenario. It is also proved that the SVM model is capable of estimating the effect of multi-category factors on trip chaining behaviors more accurately. The different contribution of explanatory, factors to trip chaining pattern recognition reflects the importance of refining trip chaining patterns ad exploring factors that are specific to each pattern. It is shown that the SVM technology in travel demand forecast modeling and analysis of explanatory variable effects is practical.
文摘The computational approaches of support vector machine (SVM), support vector regression (SVR) and molecular docking were widely utilized for the computation of active compounds. In this work, to improve the accuracy and reliability of prediction, the strategy of combining the above three computational approaches was applied to predict potential cytochrome P450 1A2 (CYP1A2) inhibitors. The accuracy of the optimal SVM qualitative model was 99.432%, 97.727%, and 91.667% for training set, internal test set and external test set, respectively, showing this model had high discrimination ability. The R2 and mean square error for the optimal SVR quantitative model were 0.763, 0.013 for training set, and 0.753, 0.056 for test set respectively, indicating that this SVR model has high predictive ability for the biolog-ical activities of compounds. According to the results of the SVM and SVR models, some types of descriptors were identi ed to be essential to bioactivity prediction of compounds, including the connectivity indices, constitutional descriptors and functional group counts. Moreover, molecular docking studies were used to reveal the binding poses and binding a n-ity of potential inhibitors interacting with CYP1A2. Wherein, the amino acids of THR124 and ASP320 could form key hydrogen bond interactions with active compounds. And the amino acids of ALA317 and GLY316 could form strong hydrophobic bond interactions with active compounds. The models obtained above were applied to discover potential CYP1A2 inhibitors from natural products, which could predict the CYPs-mediated drug-drug inter-actions and provide useful guidance and reference for rational drug combination therapy. A set of 20 potential CYP1A2 inhibitors were obtained. Part of the results was consistent with references, which further indicates the accuracy of these models and the reliability of this combinatorial computation strategy.
文摘Multivariables, strong coupling, nonlinearity, and large delays characterize the boiler-turbine coordinated control systems for ship power equipment. To better deal with these conditions, a compound control strategy based on a support vector machine (SVM) with inverse identification was proposed and applied to research simulating coordinated control systems. This method combines SVM inverse control and fuzzy control, taking advantage of the merits of SVM inverse controls which can be designed easily and have high reliability, and those of fuzzy controls, which respond rapidly and have good anti-jamming capability and robustness. It ensures the controller can be controlled with near instantaneous adjustments to maintain a steady state, even if the SVM is not trained well. The simulation results show that the control quality of this fuzzy-SVM compound control algorithm is high, with good performance in dynamic response speed, static stability, restraint of overshoot, and robustness.
基金Supported by the National Natural Science Foundation of China(31101085)the Program for Young Core Teachers of Colleges in Henan(2011GGJS-094)the Scientific Research Project for the High Level Talents,North China University of Water Conservancy and Hydroelectric Power~~
文摘[Objective] The aim was to study the feature extraction of stored-grain insects based on ant colony optimization and support vector machine algorithm, and to explore the feasibility of the feature extraction of stored-grain insects. [Method] Through the analysis of feature extraction in the image recognition of the stored-grain insects, the recognition accuracy of the cross-validation training model in support vector machine (SVM) algorithm was taken as an important factor of the evaluation principle of feature extraction of stored-grain insects. The ant colony optimization (ACO) algorithm was applied to the automatic feature extraction of stored-grain insects. [Result] The algorithm extracted the optimal feature subspace of seven features from the 17 morphological features, including area and perimeter. The ninety image samples of the stored-grain insects were automatically recognized by the optimized SVM classifier, and the recognition accuracy was over 95%. [Conclusion] The experiment shows that the application of ant colony optimization to the feature extraction of grain insects is practical and feasible.