Accurate and robust detection of wax appearance(a medium-to high-molecular-weight component of crude oil)is crucial for the efficient operation of hydrocarbon transportation.The wax appearance temperature(WAT)is the l...Accurate and robust detection of wax appearance(a medium-to high-molecular-weight component of crude oil)is crucial for the efficient operation of hydrocarbon transportation.The wax appearance temperature(WAT)is the lowest temperature at which the wax begins to form.When crude oil cools to its WAT,wax crystals precipitate,forming deposits on pipelines as the solubility limit is reached.Therefore,WAT is a crucial quality assurance parameter,especially when dealing with modern fuel oil blends.In this study,we use machine learning via MATLAB’s Bioinformatics Toolbox to predict the WAT of marine fuel samples by correlating near-infrared spectral data with laboratory-measured values.The dataset provided by Intertek PLC-a total quality assurance provider of inspection,testing,and certification services-includes industrial data that is imbalanced,with a higher proportion of high-WAT samples compared to low-WAT samples.The objective is to predict marine fuel oil blends with unusually high WAT values(>35℃)without relying on time-consuming and irregular laboratory-based measurements.The results demonstrate that the developed model,based on the one-class support vector machine(OCSVM)algorithm,achieved a Recall of 96,accurately predicting 96%of fuel samples with WAT>35℃.For standard binary classification,the Recall was 85.7.The trained OCSVM model is expected to facilitate rapid and well-informed decision-making for logistics and storage when choosing fuel oils.展开更多
Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.T...Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.Thus it is important to differentiate abnormal or unknown patterns from normal pattern with novelty detection methods.One-class support vector machine (OCSVM) that has been commonly used for novelty detection cannot deal well with large scale samples.In order to model the normal pattern of the turbopump with OCSVM and so as to monitor the condition of the turbopump,a monitoring method that integrates OCSVM with incremental clustering is presented.In this method,the incremental clustering is used for sample reduction by extracting representative vectors from a large training set.The representative vectors are supposed to distribute uniformly in the object region and fulfill the region.And training OCSVM on these representative vectors yields a novelty detector.By applying this method to the analysis of the turbopump's historical test data,it shows that the incremental clustering algorithm can extract 91 representative points from more than 36 000 training vectors,and the OCSVM detector trained on these 91 representative points can recognize spikes in vibration signals caused by different abnormal events such as vane shedding,rub-impact and sensor faults.This monitoring method does not need fault samples during training as classical recognition methods.The method resolves the learning problem of large samples and is an alternative method for condition monitoring of the LRE turbopump.展开更多
One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification ...One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification problem for second-order tensor data. Traditional vector-based one-class classification methods such as one-class support vector machine (OCSVM) and least squares one-class support vector machine (LSOCSVM) have limitations when tensor is used as input data, so we propose a new tensor one-class classification method, LSOCSTM, which directly uses tensor as input data. On one hand, using tensor as input data not only enables to classify tensor data, but also for vector data, classifying it after high dimensionalizing it into tensor still improves the classification accuracy and overcomes the over-fitting problem. On the other hand, different from one-class support tensor machine (OCSTM), we use squared loss instead of the original loss function so that we solve a series of linear equations instead of quadratic programming problems. Therefore, we use the distance to the hyperplane as a metric for classification, and the proposed method is more accurate and faster compared to existing methods. The experimental results show the high efficiency of the proposed method compared with several state-of-the-art methods.展开更多
为解决近年来用户行业变化特性加剧导致的难以准确辨识用户档案信息变动的问题,文中提出一种基于数据驱动的负荷特征异常辨识方法。首先,提出一种两阶段行业典型负荷形态构建方法,利用基于层次密度的含噪声应用空间聚类(hierarchical de...为解决近年来用户行业变化特性加剧导致的难以准确辨识用户档案信息变动的问题,文中提出一种基于数据驱动的负荷特征异常辨识方法。首先,提出一种两阶段行业典型负荷形态构建方法,利用基于层次密度的含噪声应用空间聚类(hierarchical density-based spatial clustering of applications with noise,HDBSCAN)提取用户在不同场景下的典型日负荷曲线,并利用改进的K-means算法对提取出的典型日负荷曲线进行聚类分析,构建行业的典型负荷形态;其次,提出一种多维场景负荷特征异常智能研判方法,通过构造用户的负荷特征,使用熵权法评估行业典型场景的相对重要性,并采用单分类支持向量机(one-class support vector machine,OCSVM)算法量化每个场景下的用户负荷特征的异常程度,通过加权计算得到用户的综合嫌疑得分并排序,从而实现对负荷特征异常用户的准确辨识。最后,采用某地区实际用户数据进行算例验证。仿真结果表明,所提方法在行业典型负荷场景构建及负荷特征异常辨识方面表现出良好的可行性与实用价值。展开更多
为了准确识别气体绝缘开关柜(gas insulated switchgear,GIS)设备的异常工况,提出了一种基于加权梅尔频率谱系数单类支持向量机(Mel frequency cestrum coefficient-one class support vector machine,MFCC-OCSVM)和贝叶斯优化的门控循...为了准确识别气体绝缘开关柜(gas insulated switchgear,GIS)设备的异常工况,提出了一种基于加权梅尔频率谱系数单类支持向量机(Mel frequency cestrum coefficient-one class support vector machine,MFCC-OCSVM)和贝叶斯优化的门控循环单元(bidirectional gate recurrent unit,BiGRU)声纹识别算法。首先,利用基于F统计量的MFCC对声纹数据进行加权特征提取,突出重要特征并减弱噪声的影响,然后利用OCSVM对加权后的特征进行异常检测并去除异常值,提高数据质量。为解决样本不平衡问题,采用合成少数类过采样技术(synthetic minority over-sampling technique,SMOTE)进行声纹样本的均衡。最后,应用基于贝叶斯优化的BiGRU模型进行声纹识别。以某气体绝缘全封闭组合电器(gas insulated switchgear,GIS)为例,采集了20类不同工况下操纵机构的声音样本,与多种经典分类模型进行对比。结果显示,所提算法取得的最高平均识别准确率达到了92.8%,相比于自适应增强、朴素贝叶斯和线性判别分析算法分别提升了30.1%、14.7%和11.5%。通过消融实验进一步评估和验证了所提算法各个流程对声纹识别的实际效果和性能影响,研究成果可为GIS设备异常工况的声纹识别提供高效技术路线。展开更多
One-class support vector machines (one-class SVMs) are powerful tools that are widely used in many applications. This paper describes a semi-supervised one-class SVM that uses supervision in terms of relative compar...One-class support vector machines (one-class SVMs) are powerful tools that are widely used in many applications. This paper describes a semi-supervised one-class SVM that uses supervision in terms of relative comparisons. The analysis uses a hypersphere version of one-class SVMs with a penalty term appended to the objective function. The method simultaneously finds the minimum sphere in the feature space that encloses most of the target points and considers the relative comparisons. The result is a standard convex quadratic programming problem, which can be solved by adapting standard methods for SVM training, i.e., sequential minimal optimization. This one-class SVM can be applied to semi-supervised clustering and multi-classification problems. Tests show that this method achieves higher accuracy and better generalization performance than previous SVMs.展开更多
One-class support vector machine (OCSVM) and support vector data description (SVDD) are two main domain-based one-class (kernel) classifiers. To reveal their relationship with density estimation in the case of t...One-class support vector machine (OCSVM) and support vector data description (SVDD) are two main domain-based one-class (kernel) classifiers. To reveal their relationship with density estimation in the case of the Gaussian kernel, OCSVM and SVDD are firstly unified into the framework of kernel density estimation, and the essential relationship between them is explicitly revealed. Then the result proves that the density estimation induced by OCSVM or SVDD is in agreement with the true density. Meanwhile, it can also reduce the integrated squared error (ISE). Finally, experiments on several simulated datasets verify the revealed relationships.展开更多
Power quality assessment is an important performance measurement in smart grids.Utility companies are interested in power quality monitoring even in the low level distribution side such as smart meters.Addressing this...Power quality assessment is an important performance measurement in smart grids.Utility companies are interested in power quality monitoring even in the low level distribution side such as smart meters.Addressing this issue,in this study,we propose segregation of the power disturbance from regular values using one-class support vector machine(OCSVM).To precisely detect the power disturbances of a voltage wave,some practical wavelet filters are applied.Considering the unlimited types of waveform abnormalities,OCSVM is picked as a semisupervised machine learning algorithm which needs to be trained solely on a relatively large sample of normal data.This model is able to automatically detect the existence of any types of disturbances in real time,even unknown types which are not available in the training time.In the case of existence,the disturbances are further classified into different types such as sag,swell,transients and unbalanced.Being light weighted and fast,the proposed technique can be integrated into smart grid devices such as smart meter in order to perform a real-time disturbance monitoring.The continuous monitoring of power quality in smart meters will give helpful insight for quality power transmission and management.展开更多
Machine learning algorithms operating in an unsupervised fashion has emerged as promising tools for detecting structural damage in an automated fashion.Its essence relies on selecting appropriate features to train the...Machine learning algorithms operating in an unsupervised fashion has emerged as promising tools for detecting structural damage in an automated fashion.Its essence relies on selecting appropriate features to train the model using the reference data set collected from the healthy structure and employing the trained model to identify outlier conditions representing the damaged state.In this paper,the coefficients and the residuals of the autoregressive model with exogenous input created using only the measured output signals are extracted as damage features.These features obtained at the baseline state for each sensor cluster are then utilized to train the one class support vector machine,an unsupervised classifier generating a decision function using only patterns belonging to this baseline state.Structural damage,once detected by the trained machine,a damage index based on comparison of the residuals between the trained class and the outlier state is implemented for localizing damage.The two-step damage assessment framework is first implemented on an eight degree-of-freedom numerical model with the effects of measurement noise integrated.Subsequently,vibration data collected from a one-story one-bay reinforced concrete frame inflicted with progressive levels of damage have been utilized to verify the accuracy and robustness of the proposed methodology.展开更多
基金Newcastle University and EPSRC(Grant No.2020/21 DTP:ref.EP/T517914/1).
文摘Accurate and robust detection of wax appearance(a medium-to high-molecular-weight component of crude oil)is crucial for the efficient operation of hydrocarbon transportation.The wax appearance temperature(WAT)is the lowest temperature at which the wax begins to form.When crude oil cools to its WAT,wax crystals precipitate,forming deposits on pipelines as the solubility limit is reached.Therefore,WAT is a crucial quality assurance parameter,especially when dealing with modern fuel oil blends.In this study,we use machine learning via MATLAB’s Bioinformatics Toolbox to predict the WAT of marine fuel samples by correlating near-infrared spectral data with laboratory-measured values.The dataset provided by Intertek PLC-a total quality assurance provider of inspection,testing,and certification services-includes industrial data that is imbalanced,with a higher proportion of high-WAT samples compared to low-WAT samples.The objective is to predict marine fuel oil blends with unusually high WAT values(>35℃)without relying on time-consuming and irregular laboratory-based measurements.The results demonstrate that the developed model,based on the one-class support vector machine(OCSVM)algorithm,achieved a Recall of 96,accurately predicting 96%of fuel samples with WAT>35℃.For standard binary classification,the Recall was 85.7.The trained OCSVM model is expected to facilitate rapid and well-informed decision-making for logistics and storage when choosing fuel oils.
基金supported by National Natural Science Foundation of China (Grant No. 50675219)Hu’nan Provincial Science Committee Excellent Youth Foundation of China (Grant No. 08JJ1008)
文摘Turbopump condition monitoring is a significant approach to ensure the safety of liquid rocket engine (LRE).Because of lack of fault samples,a monitoring system cannot be trained on all possible condition patterns.Thus it is important to differentiate abnormal or unknown patterns from normal pattern with novelty detection methods.One-class support vector machine (OCSVM) that has been commonly used for novelty detection cannot deal well with large scale samples.In order to model the normal pattern of the turbopump with OCSVM and so as to monitor the condition of the turbopump,a monitoring method that integrates OCSVM with incremental clustering is presented.In this method,the incremental clustering is used for sample reduction by extracting representative vectors from a large training set.The representative vectors are supposed to distribute uniformly in the object region and fulfill the region.And training OCSVM on these representative vectors yields a novelty detector.By applying this method to the analysis of the turbopump's historical test data,it shows that the incremental clustering algorithm can extract 91 representative points from more than 36 000 training vectors,and the OCSVM detector trained on these 91 representative points can recognize spikes in vibration signals caused by different abnormal events such as vane shedding,rub-impact and sensor faults.This monitoring method does not need fault samples during training as classical recognition methods.The method resolves the learning problem of large samples and is an alternative method for condition monitoring of the LRE turbopump.
文摘One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification problem for second-order tensor data. Traditional vector-based one-class classification methods such as one-class support vector machine (OCSVM) and least squares one-class support vector machine (LSOCSVM) have limitations when tensor is used as input data, so we propose a new tensor one-class classification method, LSOCSTM, which directly uses tensor as input data. On one hand, using tensor as input data not only enables to classify tensor data, but also for vector data, classifying it after high dimensionalizing it into tensor still improves the classification accuracy and overcomes the over-fitting problem. On the other hand, different from one-class support tensor machine (OCSTM), we use squared loss instead of the original loss function so that we solve a series of linear equations instead of quadratic programming problems. Therefore, we use the distance to the hyperplane as a metric for classification, and the proposed method is more accurate and faster compared to existing methods. The experimental results show the high efficiency of the proposed method compared with several state-of-the-art methods.
文摘为解决近年来用户行业变化特性加剧导致的难以准确辨识用户档案信息变动的问题,文中提出一种基于数据驱动的负荷特征异常辨识方法。首先,提出一种两阶段行业典型负荷形态构建方法,利用基于层次密度的含噪声应用空间聚类(hierarchical density-based spatial clustering of applications with noise,HDBSCAN)提取用户在不同场景下的典型日负荷曲线,并利用改进的K-means算法对提取出的典型日负荷曲线进行聚类分析,构建行业的典型负荷形态;其次,提出一种多维场景负荷特征异常智能研判方法,通过构造用户的负荷特征,使用熵权法评估行业典型场景的相对重要性,并采用单分类支持向量机(one-class support vector machine,OCSVM)算法量化每个场景下的用户负荷特征的异常程度,通过加权计算得到用户的综合嫌疑得分并排序,从而实现对负荷特征异常用户的准确辨识。最后,采用某地区实际用户数据进行算例验证。仿真结果表明,所提方法在行业典型负荷场景构建及负荷特征异常辨识方面表现出良好的可行性与实用价值。
基金Supported by the National Natural Science Foundation of China(No. 60872070)
文摘One-class support vector machines (one-class SVMs) are powerful tools that are widely used in many applications. This paper describes a semi-supervised one-class SVM that uses supervision in terms of relative comparisons. The analysis uses a hypersphere version of one-class SVMs with a penalty term appended to the objective function. The method simultaneously finds the minimum sphere in the feature space that encloses most of the target points and considers the relative comparisons. The result is a standard convex quadratic programming problem, which can be solved by adapting standard methods for SVM training, i.e., sequential minimal optimization. This one-class SVM can be applied to semi-supervised clustering and multi-classification problems. Tests show that this method achieves higher accuracy and better generalization performance than previous SVMs.
基金Supported by the National Natural Science Foundation of China(60603029)the Natural Science Foundation of Jiangsu Province(BK2007074)the Natural Science Foundation for Colleges and Universities in Jiangsu Province(06KJB520132)~~
文摘One-class support vector machine (OCSVM) and support vector data description (SVDD) are two main domain-based one-class (kernel) classifiers. To reveal their relationship with density estimation in the case of the Gaussian kernel, OCSVM and SVDD are firstly unified into the framework of kernel density estimation, and the essential relationship between them is explicitly revealed. Then the result proves that the density estimation induced by OCSVM or SVDD is in agreement with the true density. Meanwhile, it can also reduce the integrated squared error (ISE). Finally, experiments on several simulated datasets verify the revealed relationships.
基金supported in part through U.S.National Science Foundation(No.1553494).
文摘Power quality assessment is an important performance measurement in smart grids.Utility companies are interested in power quality monitoring even in the low level distribution side such as smart meters.Addressing this issue,in this study,we propose segregation of the power disturbance from regular values using one-class support vector machine(OCSVM).To precisely detect the power disturbances of a voltage wave,some practical wavelet filters are applied.Considering the unlimited types of waveform abnormalities,OCSVM is picked as a semisupervised machine learning algorithm which needs to be trained solely on a relatively large sample of normal data.This model is able to automatically detect the existence of any types of disturbances in real time,even unknown types which are not available in the training time.In the case of existence,the disturbances are further classified into different types such as sag,swell,transients and unbalanced.Being light weighted and fast,the proposed technique can be integrated into smart grid devices such as smart meter in order to perform a real-time disturbance monitoring.The continuous monitoring of power quality in smart meters will give helpful insight for quality power transmission and management.
基金funding provided by the Scientific and Technological Research Council of Türkiye(TÜBİTAK).
文摘Machine learning algorithms operating in an unsupervised fashion has emerged as promising tools for detecting structural damage in an automated fashion.Its essence relies on selecting appropriate features to train the model using the reference data set collected from the healthy structure and employing the trained model to identify outlier conditions representing the damaged state.In this paper,the coefficients and the residuals of the autoregressive model with exogenous input created using only the measured output signals are extracted as damage features.These features obtained at the baseline state for each sensor cluster are then utilized to train the one class support vector machine,an unsupervised classifier generating a decision function using only patterns belonging to this baseline state.Structural damage,once detected by the trained machine,a damage index based on comparison of the residuals between the trained class and the outlier state is implemented for localizing damage.The two-step damage assessment framework is first implemented on an eight degree-of-freedom numerical model with the effects of measurement noise integrated.Subsequently,vibration data collected from a one-story one-bay reinforced concrete frame inflicted with progressive levels of damage have been utilized to verify the accuracy and robustness of the proposed methodology.