In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the d...In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the data is mapped into a higher-dimensional space with kernel principal component analysis to make the data linearly separable. Then a two-layer KPCANet is built to obtain the principal components of the image. Finally, the principal components are classified with a linear classifier. Experimental results showthat the proposed KPCANet is effective in face recognition, object recognition and handwritten digit recognition. It also outperforms principal component analysis network( PCANet) generally. Besides, KPCANet is invariant to illumination and stable to occlusion and slight deformation.展开更多
In the industrial process situation, principal component analysis (PCA) is ageneral method in data reconciliation. However, PCA sometime is unfeasible to nonlinear featureanalysis and limited in application to nonline...In the industrial process situation, principal component analysis (PCA) is ageneral method in data reconciliation. However, PCA sometime is unfeasible to nonlinear featureanalysis and limited in application to nonlinear industrial process. Kernel PCA (KPCA) is extensionof PCA and can be used for nonlinear feature analysis. A nonlinear data reconciliation method basedon KPCA is proposed. The basic idea of this method is that firstly original data are mapped to highdimensional feature space by nonlinear function, and PCA is implemented in the feature space. Thennonlinear feature analysis is implemented and data are reconstructed by using the kernel. The datareconciliation method based on KPCA is applied to ternary distillation column. Simulation resultsshow that this method can filter the noise in measurements of nonlinear process and reconciliateddata can represent the true information of nonlinear process.展开更多
Principal Component Analysis(PCA)is one of the most important feature extraction methods,and Kernel Principal Component Analysis(KPCA)is a nonlinear extension of PCA based on kernel methods.In real world,each input da...Principal Component Analysis(PCA)is one of the most important feature extraction methods,and Kernel Principal Component Analysis(KPCA)is a nonlinear extension of PCA based on kernel methods.In real world,each input data may not be fully assigned to one class and it may partially belong to other classes.Based on the theory of fuzzy sets,this paper presents Fuzzy Principal Component Analysis(FPCA)and its nonlinear extension model,i.e.,Kernel-based Fuzzy Principal Component Analysis(KFPCA).The experimental results indicate that the proposed algorithms have good performances.展开更多
The existing recommendation algorithms have lower robustness in facing of shilling attacks. Considering this problem, we present a robust recommendation algorithm based on kernel principal component analysis and fuzzy...The existing recommendation algorithms have lower robustness in facing of shilling attacks. Considering this problem, we present a robust recommendation algorithm based on kernel principal component analysis and fuzzy c-means clustering. Firstly, we use kernel principal component analysis method to reduce the dimensionality of the original rating matrix, which can extract the effective features of users and items. Then, according to the dimension-reduced rating matrix and the high correlation characteristic between attack profiles, we use fuzzy c-means clustering method to cluster user profiles, which can realize the effective separation of genuine profiles and attack profiles. Finally, we construct an indicator function based on the attack detection results to decrease the influence of attack profiles on the recommendation, and incorporate it into the matrix factorization technology to design the corresponding robust recommendation algorithm. Experiment results indicate that the proposed algorithm is superior to the existing methods in both recommendation accuracy and robustness.展开更多
In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different ...In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different sources are collected at different sampling rates.To build a complete process monitoring strategy,all these multi-rate measurements should be considered for data-based modeling and monitoring.In this paper,a novel kernel multi-rate probabilistic principal component analysis(K-MPPCA)model is proposed to extract the nonlinear correlations among different sampling rates.In the proposed model,the model parameters are calibrated using the kernel trick and the expectation-maximum(EM)algorithm.Also,the corresponding fault detection methods based on the nonlinear features are developed.Finally,a simulated nonlinear case and an actual pre-decarburization unit in the ammonia synthesis process are tested to demonstrate the efficiency of the proposed method.展开更多
Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal ke...Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal kernel function parameter. We first comprehensively considered within-class scatter and between-class scatter of the sample features. Then, the fitness function of an optimized kernel function parameter is constructed, and the particle swarm optimization algorithm with adaptive acceleration (CPSO) is applied to optimizing it. It is used for gearbox condi- tion recognition, and the result is compared with the recognized results based on principal component analysis (PCA). The results show that KPCA optimized by CPSO can effectively recognize fault conditions of the gearbox by reducing bind set-up of the kernel function parameter, and its results of fault recognition outperform those of PCA. We draw the conclusion that KPCA based on CPSO has an advantage in nonlinear feature extraction of mechanical failure, and is helpful for fault condition recognition of complicated machines.展开更多
A model combining kernel principal component analysis(KPCA)and Xtreme Gradient Boosting(XGBoost)was introduced for forecasting the final oxygen content of electroslag remelting.KPCA was employed to reduce the dimensio...A model combining kernel principal component analysis(KPCA)and Xtreme Gradient Boosting(XGBoost)was introduced for forecasting the final oxygen content of electroslag remelting.KPCA was employed to reduce the dimensionality of the factors influencing the endpoint oxygen content and to eliminate any existing correlations among these factors.The resulting principal components were then utilized as input variables for the XGBoost prediction model.The KPCA-XGBoost model was trained and proven using data obtained from companies.The model structure was adapted,and hyperparameters were optimized using grid search cross-validation.The model performance of the KPCA-XGBoost model is compared with five machine learning models,including the support vector regression model.The findings demonstrated that the KPCA-XGBoost model exhibited the highest level of prediction accuracy,indicating that the incorporation of KPCA significantly enhanced the regression prediction performance of the model.The accuracy of the KPCA-XGBoost model was 82.4%,97.1%,and 100%at errors of±1.5×10^(-6),±2.0×10^(-6),and±3×10^(-6)for oxygen content,respectively.展开更多
Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC...Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC is a nonlinear system with a lot of overlapping information.In this paper,a dataset of eight roughness statistical parameters covering 112 digital joints is established.Then,the principal component analysis method is introduced to extract the significant information,which solves the information overlap problem of roughness characterization.Based on the two principal components of extracted features,the white shark optimizer algorithm was introduced to optimize the extreme gradient boosting model,and a new machine learning(ML)prediction model was established.The prediction accuracy of the new model and the other 17 models was measured using statistical metrics.The results show that the prediction result of the new model is more consistent with the real JRC value,with higher recognition accuracy and generalization ability.展开更多
Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Wa...Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Ward, Tunduma Town, Tanzania, using Principal Component Analysis (PCA) to identify the primary factors influencing groundwater contamination. Monthly samples were collected over 12 months and analysed for physical, chemical, and biological parameters. The PCA revealed between four and six principal components (PCs) for each well, explaining between 84.61% and 92.55% of the total variance in water quality data. In WW1, five PCs captured 87.53% of the variability, with PC1 (33.05%) dominated by pH, EC, TDS, and microbial contamination, suggesting significant influences from surface runoff and pit latrines. In WW2, six PCs explained 92.55% of the variance, with PC1 (36.17%) highlighting the effects of salinity, TDS, and agricultural runoff. WW3 had four PCs explaining 84.61% of the variance, with PC1 (39.63%) showing high contributions from pH, hardness, and salinity, indicating geological influences and contamination from human activities. Similarly, in WW4, six PCs explained 90.83% of the variance, where PC1 (43.53%) revealed contamination from pit latrines and fertilizers. WW5 also had six PCs, accounting for 92.51% of the variance, with PC1 (42.73%) indicating significant contamination from agricultural runoff and pit latrines. The study concludes that groundwater quality in Half-London Ward is primarily affected by a combination of surface runoff, pit latrine contamination, agricultural inputs, and geological factors. The presence of microbial contaminants and elevated nitrate and phosphate levels underscores the need for improved sanitation and sustainable agricultural practices. Recommendations include strengthening sanitation infrastructure, promoting responsible farming techniques, and implementing regular groundwater monitoring to safeguard water resources and public health in the region.展开更多
The kernel principal component analysis (KPCA) method employs the first several kernel principal components (KPCs), which indicate the most variance information of normal observations for process monitoring, but m...The kernel principal component analysis (KPCA) method employs the first several kernel principal components (KPCs), which indicate the most variance information of normal observations for process monitoring, but may not reflect the fault information. In this study, sensitive kernel principal component analysis (SKPCA) is proposed to improve process monitoring performance, i.e., to deal with the discordance of T2 statistic and squared prediction error SVE statistic and reduce missed detection rates. T2 statistic can be used to measure the variation di rectly along each KPC and analyze the detection performance as well as capture the most useful information in a process. With the calculation of the change rate of T2 statistic along each KPC, SKPCA selects the sensitive kernel principal components for process monitoring. A simulated simple system and Tennessee Eastman process are employed to demonstrate the efficiency of SKPCA on online monitoring. The results indicate that the monitoring performance is improved significantly.展开更多
Kernel independent component analysis(KICA) is a newly emerging nonlinear process monitoring method,which can extract mutually independent latent variables called independent components(ICs) from process variables. Ho...Kernel independent component analysis(KICA) is a newly emerging nonlinear process monitoring method,which can extract mutually independent latent variables called independent components(ICs) from process variables. However, when more than one IC have Gaussian distribution, it cannot extract the IC feature effectively and thus its monitoring performance will be degraded drastically. To solve such a problem, a kernel time structure independent component analysis(KTSICA) method is proposed for monitoring nonlinear process in this paper. The original process data are mapped into a feature space nonlinearly and then the whitened data are calculated in the feature space by the kernel trick. Subsequently, a time structure independent component analysis algorithm, which has no requirement for the distribution of ICs, is proposed to extract the IC feature.Finally, two monitoring statistics are built to detect process faults. When some fault is detected, a nonlinear fault identification method is developed to identify fault variables based on sensitivity analysis. The proposed monitoring method is applied in the Tennessee Eastman benchmark process. Applications demonstrate the superiority of KTSICA over KICA.展开更多
A new algorithm for linear instantaneous independent component analysis is proposed based on maximizing the log-likelihood contrast function which can be changed into a gradient equation.An iterative method is introdu...A new algorithm for linear instantaneous independent component analysis is proposed based on maximizing the log-likelihood contrast function which can be changed into a gradient equation.An iterative method is introduced to solve this equation efficiently.The unknown probability density functions as well as their first and second derivatives in the gradient equation are estimated by kernel density method.Computer simulations on artificially generated signals and gray scale natural scene images confirm the efficiency and accuracy of the proposed algorithm.展开更多
The healthy condition of the milling tool has a very high impact on the machining quality of the titanium components.Therefore,it is important to recognize the healthy condition of the tool and replace the damaged cut...The healthy condition of the milling tool has a very high impact on the machining quality of the titanium components.Therefore,it is important to recognize the healthy condition of the tool and replace the damaged cutter at the right time.In order to recognize the health condition of the milling cutter,a method based on the long short term memory(LSTM)was proposed to recognize tool health state in this paper.The various signals collected in the tool wear experiments were analyzed by time-domain statistics,and then the extracted data were generated by principal component analysis(PCA)method.The preprocessed data extracted by PCA is transmitted to the LSTM model for recognition.Compared with back propagation neural network(BPNN)and support vector machine(SVM),the proposed method can effectively utilize the time-domain regulation in the data to achieve higher recognition speed and accuracy.展开更多
Dimensionality reduction techniques play an important role in data mining. Kernel entropy component analysis( KECA) is a newly developed method for data transformation and dimensionality reduction. This paper conducte...Dimensionality reduction techniques play an important role in data mining. Kernel entropy component analysis( KECA) is a newly developed method for data transformation and dimensionality reduction. This paper conducted a comparative study of KECA with other five dimensionality reduction methods,principal component analysis( PCA),kernel PCA( KPCA),locally linear embedding( LLE),laplacian eigenmaps( LAE) and diffusion maps( DM). Three quality assessment criteria, local continuity meta-criterion( LCMC),trustworthiness and continuity measure(T&C),and mean relative rank error( MRRE) are applied as direct performance indexes to assess those dimensionality reduction methods. Moreover,the clustering accuracy is used as an indirect performance index to evaluate the quality of the representative data gotten by those methods. The comparisons are performed on six datasets and the results are analyzed by Friedman test with the corresponding post-hoc tests. The results indicate that KECA shows an excellent performance in both quality assessment criteria and clustering accuracy assessing.展开更多
The precision of the kernel independent component analysis( KICA) algorithm depends on the type and parameter values of kernel function. Therefore,it's of great significance to study the choice method of KICA'...The precision of the kernel independent component analysis( KICA) algorithm depends on the type and parameter values of kernel function. Therefore,it's of great significance to study the choice method of KICA's kernel parameters for improving its feature dimension reduction result. In this paper, a fitness function was established by use of the ideal of Fisher discrimination function firstly. Then the global optimal solution of fitness function was searched by particle swarm optimization( PSO) algorithm and a multi-state information dimension reduction algorithm based on PSO-KICA was established. Finally,the validity of this algorithm to enhance the precision of feature dimension reduction has been proven.展开更多
A novel nonlinear combination process monitoring method was proposed based on techniques with memo- ry effect (multivariate exponentially weighted moving average (MEWMA)) and kernel independent component analysis ...A novel nonlinear combination process monitoring method was proposed based on techniques with memo- ry effect (multivariate exponentially weighted moving average (MEWMA)) and kernel independent component analysis (KICA). The method was developed for dealing with nonlinear issues and detecting small or moderate drifts in one or more process variables with autocorrelation. MEWMA charts use additional information from the past history of the process for keeping the memory effect of the process behavior trend. KICA is a recently devel- oped statistical technique for revealing hidden, nonlinear statistically independent factors that underlie sets of mea- surements and it is a two-phase algorithm., whitened kernel principal component analysis (KPCA) plus indepen- dent component analysis (ICA). The application to the fluid catalytic cracking unit (FCCU) simulated process in- dicates that the proposed combined method based on MEWMA and KICA can effectively capture the nonlinear rela- tionship and detect small drifts in process variables. Its performance significantly outperforms monitoring method based on ICA, MEWMA-ICA and KICA, especially for lonu-term performance deterioration.展开更多
Due to the complexity of underground engineering geology,the tunnel boring machine(TBM)usually shows poor adaptability to the surrounding rock mass,leading to machine jamming and geological hazards.For the TBM project...Due to the complexity of underground engineering geology,the tunnel boring machine(TBM)usually shows poor adaptability to the surrounding rock mass,leading to machine jamming and geological hazards.For the TBM project of Lanzhou Water Source Construction,this study proposed a neural network called PCA-GRU,which combines principal component analysis(PCA)with gated recurrent unit(GRU)to improve the accuracy of predicting rock mass classification in TBM tunneling.The input variables from the PCA dimension reduction of nine parameters in the sample data set were utilized for establishing the PCA-GRU model.Subsequently,in order to speed up the response time of surrounding rock mass classification predictions,the PCA-GRU model was optimized.Finally,the prediction results obtained by the PCA-GRU model were compared with those of four other models and further examined using random sampling analysis.As indicated by the results,the PCA-GRU model can predict the rock mass classification in TBM tunneling rapidly,requiring about 20 s to run.It performs better than the previous four models in predicting the rock mass classification,with accuracy A,macro precision MP,and macro recall MR being 0.9667,0.963,and 0.9763,respectively.In Class II,III,and IV rock mass prediction,the PCA-GRU model demonstrates better precision P and recall R owing to the dimension reduction technique.The random sampling analysis indicates that the PCA-GRU model shows stronger generalization,making it more appropriate in situations where the distribution of various rock mass classes and lithologies change in percentage.展开更多
Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of sampl...Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of samples. As a result, the extracted features cannot provide enough useful information for distinguishing pat- tern from one another, and further resulting in degradation of classification performance. To fullly use class in- formation of samples, a novel method, called the fuzzy within-class MatPCA (F-WMatPCA)is proposed. F-WMatPCA utilizes the fuzzy K-nearest neighbor method(FKNN) to fuzzify the class membership degrees of a training sample and then performs fuzzy MatPCA within these patterns having the same class label. Due to more class information is used in feature extraction, F-WMatPCA can intuitively improve the classification perfor- mance. Experimental results in face databases and some benchmark datasets show that F-WMatPCA is effective and competitive than MatPCA. The experimental analysis on face image databases indicates that F-WMatPCA im- proves the recognition accuracy and is more stable and robust in performing classification than the existing method of fuzzy-based F-Fisherfaces.展开更多
基金The National Natural Science Foundation of China(No.6120134461271312+7 种基金6140108511301074)the Research Fund for the Doctoral Program of Higher Education(No.20120092120036)the Program for Special Talents in Six Fields of Jiangsu Province(No.DZXX-031)Industry-University-Research Cooperation Project of Jiangsu Province(No.BY2014127-11)"333"Project(No.BRA2015288)High-End Foreign Experts Recruitment Program(No.GDT20153200043)Open Fund of Jiangsu Engineering Center of Network Monitoring(No.KJR1404)
文摘In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the data is mapped into a higher-dimensional space with kernel principal component analysis to make the data linearly separable. Then a two-layer KPCANet is built to obtain the principal components of the image. Finally, the principal components are classified with a linear classifier. Experimental results showthat the proposed KPCANet is effective in face recognition, object recognition and handwritten digit recognition. It also outperforms principal component analysis network( PCANet) generally. Besides, KPCANet is invariant to illumination and stable to occlusion and slight deformation.
基金This project is supported by Special Foundation for Major State Basic Research of China (Project 973, No.G1998030415)
文摘In the industrial process situation, principal component analysis (PCA) is ageneral method in data reconciliation. However, PCA sometime is unfeasible to nonlinear featureanalysis and limited in application to nonlinear industrial process. Kernel PCA (KPCA) is extensionof PCA and can be used for nonlinear feature analysis. A nonlinear data reconciliation method basedon KPCA is proposed. The basic idea of this method is that firstly original data are mapped to highdimensional feature space by nonlinear function, and PCA is implemented in the feature space. Thennonlinear feature analysis is implemented and data are reconstructed by using the kernel. The datareconciliation method based on KPCA is applied to ternary distillation column. Simulation resultsshow that this method can filter the noise in measurements of nonlinear process and reconciliateddata can represent the true information of nonlinear process.
文摘Principal Component Analysis(PCA)is one of the most important feature extraction methods,and Kernel Principal Component Analysis(KPCA)is a nonlinear extension of PCA based on kernel methods.In real world,each input data may not be fully assigned to one class and it may partially belong to other classes.Based on the theory of fuzzy sets,this paper presents Fuzzy Principal Component Analysis(FPCA)and its nonlinear extension model,i.e.,Kernel-based Fuzzy Principal Component Analysis(KFPCA).The experimental results indicate that the proposed algorithms have good performances.
基金Supported by the Scientific Research Foundation of Liaoning Provincial Education Department(L2015240)the National Natural Science Foundation of China(61379116,61503169)the Joint Fund of the Science and Technology Department of Liaoning Province(20170540448)
文摘The existing recommendation algorithms have lower robustness in facing of shilling attacks. Considering this problem, we present a robust recommendation algorithm based on kernel principal component analysis and fuzzy c-means clustering. Firstly, we use kernel principal component analysis method to reduce the dimensionality of the original rating matrix, which can extract the effective features of users and items. Then, according to the dimension-reduced rating matrix and the high correlation characteristic between attack profiles, we use fuzzy c-means clustering method to cluster user profiles, which can realize the effective separation of genuine profiles and attack profiles. Finally, we construct an indicator function based on the attack detection results to decrease the influence of attack profiles on the recommendation, and incorporate it into the matrix factorization technology to design the corresponding robust recommendation algorithm. Experiment results indicate that the proposed algorithm is superior to the existing methods in both recommendation accuracy and robustness.
基金supported by Zhejiang Provincial Natural Science Foundation of China(LY19F030003)Key Research and Development Project of Zhejiang Province(2021C04030)+1 种基金the National Natural Science Foundation of China(62003306)Educational Commission Research Program of Zhejiang Province(Y202044842)。
文摘In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different sources are collected at different sampling rates.To build a complete process monitoring strategy,all these multi-rate measurements should be considered for data-based modeling and monitoring.In this paper,a novel kernel multi-rate probabilistic principal component analysis(K-MPPCA)model is proposed to extract the nonlinear correlations among different sampling rates.In the proposed model,the model parameters are calibrated using the kernel trick and the expectation-maximum(EM)algorithm.Also,the corresponding fault detection methods based on the nonlinear features are developed.Finally,a simulated nonlinear case and an actual pre-decarburization unit in the ammonia synthesis process are tested to demonstrate the efficiency of the proposed method.
基金supported by National Natural Science Foundation under Grant No.50875247Shanxi Province Natural Science Foundation under Grant No.2009011026-1
文摘Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal kernel function parameter. We first comprehensively considered within-class scatter and between-class scatter of the sample features. Then, the fitness function of an optimized kernel function parameter is constructed, and the particle swarm optimization algorithm with adaptive acceleration (CPSO) is applied to optimizing it. It is used for gearbox condi- tion recognition, and the result is compared with the recognized results based on principal component analysis (PCA). The results show that KPCA optimized by CPSO can effectively recognize fault conditions of the gearbox by reducing bind set-up of the kernel function parameter, and its results of fault recognition outperform those of PCA. We draw the conclusion that KPCA based on CPSO has an advantage in nonlinear feature extraction of mechanical failure, and is helpful for fault condition recognition of complicated machines.
基金Authors acknowledge the financial support by National Natural Science Foundation of China(Grant Nos.52174303,and 51874084)Fundamental Research Funds for the Central Universities(Grant No.2125026)+1 种基金Program of Introducing Talents of Discipline to Universities(Grant No.B21001)the 111 Project(Grant No.B16009).
文摘A model combining kernel principal component analysis(KPCA)and Xtreme Gradient Boosting(XGBoost)was introduced for forecasting the final oxygen content of electroslag remelting.KPCA was employed to reduce the dimensionality of the factors influencing the endpoint oxygen content and to eliminate any existing correlations among these factors.The resulting principal components were then utilized as input variables for the XGBoost prediction model.The KPCA-XGBoost model was trained and proven using data obtained from companies.The model structure was adapted,and hyperparameters were optimized using grid search cross-validation.The model performance of the KPCA-XGBoost model is compared with five machine learning models,including the support vector regression model.The findings demonstrated that the KPCA-XGBoost model exhibited the highest level of prediction accuracy,indicating that the incorporation of KPCA significantly enhanced the regression prediction performance of the model.The accuracy of the KPCA-XGBoost model was 82.4%,97.1%,and 100%at errors of±1.5×10^(-6),±2.0×10^(-6),and±3×10^(-6)for oxygen content,respectively.
基金funding from the National Natural Science Foundation of China (Grant No.42277175)the pilot project of cooperation between the Ministry of Natural Resources and Hunan Province“Research and demonstration of key technologies for comprehensive remote sensing identification of geological hazards in typical regions of Hunan Province” (Grant No.2023ZRBSHZ056)the National Key Research and Development Program of China-2023 Key Special Project (Grant No.2023YFC2907400).
文摘Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC is a nonlinear system with a lot of overlapping information.In this paper,a dataset of eight roughness statistical parameters covering 112 digital joints is established.Then,the principal component analysis method is introduced to extract the significant information,which solves the information overlap problem of roughness characterization.Based on the two principal components of extracted features,the white shark optimizer algorithm was introduced to optimize the extreme gradient boosting model,and a new machine learning(ML)prediction model was established.The prediction accuracy of the new model and the other 17 models was measured using statistical metrics.The results show that the prediction result of the new model is more consistent with the real JRC value,with higher recognition accuracy and generalization ability.
文摘Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Ward, Tunduma Town, Tanzania, using Principal Component Analysis (PCA) to identify the primary factors influencing groundwater contamination. Monthly samples were collected over 12 months and analysed for physical, chemical, and biological parameters. The PCA revealed between four and six principal components (PCs) for each well, explaining between 84.61% and 92.55% of the total variance in water quality data. In WW1, five PCs captured 87.53% of the variability, with PC1 (33.05%) dominated by pH, EC, TDS, and microbial contamination, suggesting significant influences from surface runoff and pit latrines. In WW2, six PCs explained 92.55% of the variance, with PC1 (36.17%) highlighting the effects of salinity, TDS, and agricultural runoff. WW3 had four PCs explaining 84.61% of the variance, with PC1 (39.63%) showing high contributions from pH, hardness, and salinity, indicating geological influences and contamination from human activities. Similarly, in WW4, six PCs explained 90.83% of the variance, where PC1 (43.53%) revealed contamination from pit latrines and fertilizers. WW5 also had six PCs, accounting for 92.51% of the variance, with PC1 (42.73%) indicating significant contamination from agricultural runoff and pit latrines. The study concludes that groundwater quality in Half-London Ward is primarily affected by a combination of surface runoff, pit latrine contamination, agricultural inputs, and geological factors. The presence of microbial contaminants and elevated nitrate and phosphate levels underscores the need for improved sanitation and sustainable agricultural practices. Recommendations include strengthening sanitation infrastructure, promoting responsible farming techniques, and implementing regular groundwater monitoring to safeguard water resources and public health in the region.
基金Supported by the 973 project of China (2013CB733600), the National Natural Science Foundation (21176073), the Doctoral Fund of Ministry of Education (20090074110005), the New Century Excellent Talents in University (NCET-09-0346), "Shu Guang" project (09SG29) and the Fundamental Research Funds for the Central Universities.
文摘The kernel principal component analysis (KPCA) method employs the first several kernel principal components (KPCs), which indicate the most variance information of normal observations for process monitoring, but may not reflect the fault information. In this study, sensitive kernel principal component analysis (SKPCA) is proposed to improve process monitoring performance, i.e., to deal with the discordance of T2 statistic and squared prediction error SVE statistic and reduce missed detection rates. T2 statistic can be used to measure the variation di rectly along each KPC and analyze the detection performance as well as capture the most useful information in a process. With the calculation of the change rate of T2 statistic along each KPC, SKPCA selects the sensitive kernel principal components for process monitoring. A simulated simple system and Tennessee Eastman process are employed to demonstrate the efficiency of SKPCA on online monitoring. The results indicate that the monitoring performance is improved significantly.
基金Supported by the National Natural Science Foundation of China(61273160)the Natural Science Foundation of Shandong Province of China(ZR2011FM014)+1 种基金the Doctoral Fund of Shandong Province(BS2012ZZ011)the Postgraduate Innovation Funds of China University of Petroleum(CX2013060)
文摘Kernel independent component analysis(KICA) is a newly emerging nonlinear process monitoring method,which can extract mutually independent latent variables called independent components(ICs) from process variables. However, when more than one IC have Gaussian distribution, it cannot extract the IC feature effectively and thus its monitoring performance will be degraded drastically. To solve such a problem, a kernel time structure independent component analysis(KTSICA) method is proposed for monitoring nonlinear process in this paper. The original process data are mapped into a feature space nonlinearly and then the whitened data are calculated in the feature space by the kernel trick. Subsequently, a time structure independent component analysis algorithm, which has no requirement for the distribution of ICs, is proposed to extract the IC feature.Finally, two monitoring statistics are built to detect process faults. When some fault is detected, a nonlinear fault identification method is developed to identify fault variables based on sensitivity analysis. The proposed monitoring method is applied in the Tennessee Eastman benchmark process. Applications demonstrate the superiority of KTSICA over KICA.
文摘A new algorithm for linear instantaneous independent component analysis is proposed based on maximizing the log-likelihood contrast function which can be changed into a gradient equation.An iterative method is introduced to solve this equation efficiently.The unknown probability density functions as well as their first and second derivatives in the gradient equation are estimated by kernel density method.Computer simulations on artificially generated signals and gray scale natural scene images confirm the efficiency and accuracy of the proposed algorithm.
基金National Natural Science Foundation of China(No.51805079)Shanghai Natural Science Foundation,China(No.17ZR1400600)Fundamental Research Funds for the Central Universities,China(No.16D110309)
文摘The healthy condition of the milling tool has a very high impact on the machining quality of the titanium components.Therefore,it is important to recognize the healthy condition of the tool and replace the damaged cutter at the right time.In order to recognize the health condition of the milling cutter,a method based on the long short term memory(LSTM)was proposed to recognize tool health state in this paper.The various signals collected in the tool wear experiments were analyzed by time-domain statistics,and then the extracted data were generated by principal component analysis(PCA)method.The preprocessed data extracted by PCA is transmitted to the LSTM model for recognition.Compared with back propagation neural network(BPNN)and support vector machine(SVM),the proposed method can effectively utilize the time-domain regulation in the data to achieve higher recognition speed and accuracy.
基金Climbing Peak Discipline Project of Shanghai Dianji University,China(No.15DFXK02)Hi-Tech Research and Development Programs of China(No.2007AA041600)
文摘Dimensionality reduction techniques play an important role in data mining. Kernel entropy component analysis( KECA) is a newly developed method for data transformation and dimensionality reduction. This paper conducted a comparative study of KECA with other five dimensionality reduction methods,principal component analysis( PCA),kernel PCA( KPCA),locally linear embedding( LLE),laplacian eigenmaps( LAE) and diffusion maps( DM). Three quality assessment criteria, local continuity meta-criterion( LCMC),trustworthiness and continuity measure(T&C),and mean relative rank error( MRRE) are applied as direct performance indexes to assess those dimensionality reduction methods. Moreover,the clustering accuracy is used as an indirect performance index to evaluate the quality of the representative data gotten by those methods. The comparisons are performed on six datasets and the results are analyzed by Friedman test with the corresponding post-hoc tests. The results indicate that KECA shows an excellent performance in both quality assessment criteria and clustering accuracy assessing.
文摘The precision of the kernel independent component analysis( KICA) algorithm depends on the type and parameter values of kernel function. Therefore,it's of great significance to study the choice method of KICA's kernel parameters for improving its feature dimension reduction result. In this paper, a fitness function was established by use of the ideal of Fisher discrimination function firstly. Then the global optimal solution of fitness function was searched by particle swarm optimization( PSO) algorithm and a multi-state information dimension reduction algorithm based on PSO-KICA was established. Finally,the validity of this algorithm to enhance the precision of feature dimension reduction has been proven.
基金The National Natural Science Foundation ofChina(No60504033)
文摘A novel nonlinear combination process monitoring method was proposed based on techniques with memo- ry effect (multivariate exponentially weighted moving average (MEWMA)) and kernel independent component analysis (KICA). The method was developed for dealing with nonlinear issues and detecting small or moderate drifts in one or more process variables with autocorrelation. MEWMA charts use additional information from the past history of the process for keeping the memory effect of the process behavior trend. KICA is a recently devel- oped statistical technique for revealing hidden, nonlinear statistically independent factors that underlie sets of mea- surements and it is a two-phase algorithm., whitened kernel principal component analysis (KPCA) plus indepen- dent component analysis (ICA). The application to the fluid catalytic cracking unit (FCCU) simulated process in- dicates that the proposed combined method based on MEWMA and KICA can effectively capture the nonlinear rela- tionship and detect small drifts in process variables. Its performance significantly outperforms monitoring method based on ICA, MEWMA-ICA and KICA, especially for lonu-term performance deterioration.
基金State Key Laboratory of Hydroscience and Hydraulic Engineering of Tsinghua University,Grant/Award Number:2019-KY-03Key Technology of Intelligent Construction of Urban Underground Space of North China University of Technology,Grant/Award Number:110051360022XN108-19+3 种基金Research Start-up Fund Project of North China University of Technology,Grant/Award Number:110051360002Yujie Project of North China University of Technology,Grant/Award Number:216051360020XN199/006National Natural Science Foundation of China,Grant/Award Numbers:51522903,51774184National Key R&D Program of China,Grant/Award Numbers:2018YFC1504801,2018YFC1504902。
文摘Due to the complexity of underground engineering geology,the tunnel boring machine(TBM)usually shows poor adaptability to the surrounding rock mass,leading to machine jamming and geological hazards.For the TBM project of Lanzhou Water Source Construction,this study proposed a neural network called PCA-GRU,which combines principal component analysis(PCA)with gated recurrent unit(GRU)to improve the accuracy of predicting rock mass classification in TBM tunneling.The input variables from the PCA dimension reduction of nine parameters in the sample data set were utilized for establishing the PCA-GRU model.Subsequently,in order to speed up the response time of surrounding rock mass classification predictions,the PCA-GRU model was optimized.Finally,the prediction results obtained by the PCA-GRU model were compared with those of four other models and further examined using random sampling analysis.As indicated by the results,the PCA-GRU model can predict the rock mass classification in TBM tunneling rapidly,requiring about 20 s to run.It performs better than the previous four models in predicting the rock mass classification,with accuracy A,macro precision MP,and macro recall MR being 0.9667,0.963,and 0.9763,respectively.In Class II,III,and IV rock mass prediction,the PCA-GRU model demonstrates better precision P and recall R owing to the dimension reduction technique.The random sampling analysis indicates that the PCA-GRU model shows stronger generalization,making it more appropriate in situations where the distribution of various rock mass classes and lithologies change in percentage.
文摘Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of samples. As a result, the extracted features cannot provide enough useful information for distinguishing pat- tern from one another, and further resulting in degradation of classification performance. To fullly use class in- formation of samples, a novel method, called the fuzzy within-class MatPCA (F-WMatPCA)is proposed. F-WMatPCA utilizes the fuzzy K-nearest neighbor method(FKNN) to fuzzify the class membership degrees of a training sample and then performs fuzzy MatPCA within these patterns having the same class label. Due to more class information is used in feature extraction, F-WMatPCA can intuitively improve the classification perfor- mance. Experimental results in face databases and some benchmark datasets show that F-WMatPCA is effective and competitive than MatPCA. The experimental analysis on face image databases indicates that F-WMatPCA im- proves the recognition accuracy and is more stable and robust in performing classification than the existing method of fuzzy-based F-Fisherfaces.