The Savitzky-Golay(SG)filter,which employs polynomial least-squares approximations to smooth data and estimate derivatives,is widely used for processing noisy data.However,noise suppression by the SG filter is recogni...The Savitzky-Golay(SG)filter,which employs polynomial least-squares approximations to smooth data and estimate derivatives,is widely used for processing noisy data.However,noise suppression by the SG filter is recognized to be limited at data boundaries and high frequencies,which can significantly reduce the signal-to-noise ratio(SNR).To solve this problem,a novel method synergistically integrating Principal Component Analysis(PCA)with SG filtering is proposed in this paper.This approach avoids the is-sue of excessive smoothing associated with larger window sizes.The proposed PCA-SG filtering algorithm was applied to a CO gas sensing system based on Cavity Ring-Down Spectroscopy(CRDS).The perform-ance of the PCA-SG filtering algorithm is demonstrated through comparison with Moving Average Filtering(MAF),Wavelet Transformation(WT),Kalman Filtering(KF),and the SG filter.The results demonstrate that the proposed algorithm exhibits superior noise reduction capabilities compared to the other algorithms evaluated.The SNR of the ring-down signal was improved from 11.8612 dB to 29.0913 dB,and the stand-ard deviation of the extracted ring-down time constant was reduced from 0.037μs to 0.018μs.These results confirm that the proposed PCA-SG filtering algorithm effectively improves the smoothness of the ring-down curve data,demonstrating its feasibility.展开更多
Crassostrea gigas has good taste and high nutritional value;however,there are few assessments of comprehensive and panoramic analyses of the nutritional quality of the northern oyster.To study the nutritional characte...Crassostrea gigas has good taste and high nutritional value;however,there are few assessments of comprehensive and panoramic analyses of the nutritional quality of the northern oyster.To study the nutritional characteristics of C.gigas from different sources(ploidy,region,size,and culture mode),C.gigas from various ploidy(diploid and triploid),regions(Rushan,Off-site fattening,and Rongcheng),sizes(small,medium,and large)and culture modes(nearshore and offshore)were selected for comparative analyses.The nutritional components(moisture,protein,fat,and mineral),flavor substances(taste amino acids,nucleotides,and succinic acid),and functional indices(eicosapentaenoic acid(EPA),docosahexaenoic acid(DHA),and taurine)of C.gigas were determined.Principal component analysis(PCA)was used to comprehensively evaluate the oysters and investigate the variations in nutritional quality.The PCA results indicate that protein,essential fatty acids,selenium,zinc,taste amino acids,taurine,EPA,and DHA were core components contributing to 82.25%of the cumulative variance,providing a more comprehensive reflection of the nutrient composition of C.gigas.The extensive quality rankings for the C.gigas were as follows:diploid>triploid,Rushan>fattening>Rongcheng,medium>large>small,and offshore>nearshore.The score rank revealed that diploid oysters of medium-size from Rushan demonstrated superior nutritional quality compared to other tested samples.This is the first comprehensive and systematic investigation of C.gigas in northern China to reveal the feature of nutrients,flavor,and functional components.The study provided data support for the culture,consumption,processing,research,and nutritional quality improvement of oyster industry.展开更多
Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC...Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC is a nonlinear system with a lot of overlapping information.In this paper,a dataset of eight roughness statistical parameters covering 112 digital joints is established.Then,the principal component analysis method is introduced to extract the significant information,which solves the information overlap problem of roughness characterization.Based on the two principal components of extracted features,the white shark optimizer algorithm was introduced to optimize the extreme gradient boosting model,and a new machine learning(ML)prediction model was established.The prediction accuracy of the new model and the other 17 models was measured using statistical metrics.The results show that the prediction result of the new model is more consistent with the real JRC value,with higher recognition accuracy and generalization ability.展开更多
The evaluation model was established to estimate the number of houses collapsed during typhoon disaster for Zhejiang Province.The factor leading to disaster,the environment fostering disaster and the exposure of build...The evaluation model was established to estimate the number of houses collapsed during typhoon disaster for Zhejiang Province.The factor leading to disaster,the environment fostering disaster and the exposure of buildings were processed by Principal Component Analysis.The key factor was extracted to support input of vector machine model and to build an evaluation model;the historical fitting result kept in line with the fact.In the real evaluation of two typhoons landed in Zhejiang Province in 2008 and 2009,the coincidence of evaluating result and actual value proved the feasibility of this model.展开更多
Restoration of phase aberrations is crucial for addressing atmospheric turbulence in light propagation.Traditional restoration algorithms based on Zernike polynomials(ZPs)often encounter challenges related to high com...Restoration of phase aberrations is crucial for addressing atmospheric turbulence in light propagation.Traditional restoration algorithms based on Zernike polynomials(ZPs)often encounter challenges related to high computational complexity and insufficient capture of high-frequency phase aberration components,so we proposed a Principal-Component-Analysis-based method for representing phase aberrations.This paper discusses the factors influencing the accuracy of restoration,mainly including the sample space size and the sampling interval of D/r_(0),on the basis of characterizing phase aberrations by Principal Components(PCs).The experimental results show that a larger D/r_(0)sampling interval can ensure the generalization ability and robustness of the principal components in the case of a limited amount of original data,which can help to achieve high-precision deployment of the model in practical applications quickly.In the environment with relatively strong turbulence in the test set of D/r_(0)=24,the use of 34 terms of PCs can improve the corrected Strehl ratio(SR)from 0.007 to 0.1585,while the Strehl ratio of the light spot after restoration using 34 terms of ZPs is only 0.0215,demonstrating almost no correction effect.The results indicate that PCs can serve as a better alternative in representing and restoring the characteristics of atmospheric turbulence induced phase aberrations.These findings pave the way to use PCs of phase aberrations with fewer terms than traditional ZPs to achieve data dimensionality reduction,and offer a reference to accelerate and stabilize the model and deep learning based adaptive optics correction.展开更多
Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Wa...Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Ward, Tunduma Town, Tanzania, using Principal Component Analysis (PCA) to identify the primary factors influencing groundwater contamination. Monthly samples were collected over 12 months and analysed for physical, chemical, and biological parameters. The PCA revealed between four and six principal components (PCs) for each well, explaining between 84.61% and 92.55% of the total variance in water quality data. In WW1, five PCs captured 87.53% of the variability, with PC1 (33.05%) dominated by pH, EC, TDS, and microbial contamination, suggesting significant influences from surface runoff and pit latrines. In WW2, six PCs explained 92.55% of the variance, with PC1 (36.17%) highlighting the effects of salinity, TDS, and agricultural runoff. WW3 had four PCs explaining 84.61% of the variance, with PC1 (39.63%) showing high contributions from pH, hardness, and salinity, indicating geological influences and contamination from human activities. Similarly, in WW4, six PCs explained 90.83% of the variance, where PC1 (43.53%) revealed contamination from pit latrines and fertilizers. WW5 also had six PCs, accounting for 92.51% of the variance, with PC1 (42.73%) indicating significant contamination from agricultural runoff and pit latrines. The study concludes that groundwater quality in Half-London Ward is primarily affected by a combination of surface runoff, pit latrine contamination, agricultural inputs, and geological factors. The presence of microbial contaminants and elevated nitrate and phosphate levels underscores the need for improved sanitation and sustainable agricultural practices. Recommendations include strengthening sanitation infrastructure, promoting responsible farming techniques, and implementing regular groundwater monitoring to safeguard water resources and public health in the region.展开更多
The Global Navigation Satellite System(GNSS)is vital for monitoring terrestrial water storage(TWS).However,effectively extracting hydrological load deformation from GNSS observations poses a significant challenge.This...The Global Navigation Satellite System(GNSS)is vital for monitoring terrestrial water storage(TWS).However,effectively extracting hydrological load deformation from GNSS observations poses a significant challenge.This study proposes a novel strategy;the seasonal hydrological load signals are removed from the raw data,and the remaining signals use principal component analysis(PCA).Simulation results from Yunnan Province demonstrate that the spatial distribution of the root mean square error(RMSE)is improved by approximately 15% compared with traditional PCA extraction from raw data.From January 2013 to December 2022,TWS was inverted from 24 GNSS stations in Yunnan Province.The spatial distribution and time series of TWS inverted from GNSS align well with those TWS inferred from the Gravity Recovery and Climate Experiment(GRACE),GRACE Follow-On(GFO),and the Global Land Data Assimilation System(GLDAS)land surface model.However,the amplitude of the GNSS-inverted TWS is slightly higher.Since GNSS ground stations are more sensitive to hydrological load signals,they show correlations with precipitation data that are 8.6%and 6.0%higher than those of GRACE and GLDAS,respectively.In the power spectral density analysis of GRACE/GFO,GLDAS,and GNSS,the signal strength of GNSS is much higher than that of GRACE/GFO and GLDAS in the June and February cycles.These findings suggest that the new data extraction strategy can capture higher frequency hydrological signals in TWS,and GNSS observations can help address limitations in GRACE/GFO observations.This study demonstrates the potential of GNSS TWS in capturing higher-frequency hydrological signals and climate extremes application.展开更多
This study employs Principal Component Analysis(PCA)and 13 years of SD-WACCM-X model data(2007-2019)to investigate the characteristics and mechanisms of Inter-hemispheric Coupling(IHC)triggered by sudden stratospheric...This study employs Principal Component Analysis(PCA)and 13 years of SD-WACCM-X model data(2007-2019)to investigate the characteristics and mechanisms of Inter-hemispheric Coupling(IHC)triggered by sudden stratospheric warming(SSW)events.IHC in both hemispheres leads to a cold anomaly in the equatorial stratosphere,a warm anomaly in the equatorial mesosphere,and increased temperatures in the mesosphere and lower thermosphere(MLT)region of the summer hemisphere.However,the IHC features during boreal winter period are significantly weaker than during the austral winter period,primarily due to weaker stationary planetary wave activity in the Southern Hemisphere(SH).During the austral winter period,IHC results in a warm anomaly in the polar mesosphere of the SH,which does not occur in the NH during boreal winter period.This study also examines the possible influence of quasi-two-day waves(QTDWs)on IHC.We found that the largest temperature anomaly in the summer polar MLT region is associated with a large wind instability area,and a well-developed critical layer structure of QTDW in January.In contrast,during July,despite favorable conditions for QTDW propagation in the Northern Hemisphere,weaker IHC response is observed,suggesting that IHC features and the relationship with QTDWs during July would be more complex than during January.展开更多
Traditional beamforming techniques may not accurately locate sources in scenarios with both stationary and rotating sound sources.The existence of rotating sound sources can cause blurring in the stationary beamformin...Traditional beamforming techniques may not accurately locate sources in scenarios with both stationary and rotating sound sources.The existence of rotating sound sources can cause blurring in the stationary beamforming map.Current algorithms for separating different moving sound sources have limited effectiveness,leading to significant residual noise,especially when the rotating source is strong enough to mask stationary sources completely.To overcome these challenges,a novel solution utilizing a virtual rotating array in the modal domain combined with robust principal component analysis is proposed to separate sound sources with different rotational speeds.This approach,named Robust Principal Component Analysis in the Modal domain(RPCA-M),investigates the performance of convex nuclear norm and non-convex Schatten-p norm to distinguish stationary and rotating sources.By comparing the errors in Cross-Spectral Matrix(CSM)recovery and acoustic imaging across different algorithms,the effectiveness of RPCA-M in separating stationary and moving sound sources is demonstrated.Importantly,this method effectively separates sound sources,even when there are significant variations in their amplitudes at different rotation speeds.展开更多
We study the influence of disorder on the Moore–Read state by principal component analysis(PCA),which is one of the ground state candidates for the 5/2 fractional Hall state.By using PCA,the topological features of t...We study the influence of disorder on the Moore–Read state by principal component analysis(PCA),which is one of the ground state candidates for the 5/2 fractional Hall state.By using PCA,the topological features of the ground state wave functions with different disorder strengths can be distilled.As the disorder strength increases,the Moore–Read state will be destroyed.We explore the phase transition by analyzing the overlaps between the random sample wave functions and the topologically distilled state.The cross-point between the amplitudes of the principal component and its counterpart is the phase transition point.Additionally,the origin of the second component comes from the excited states,which is different from the Laughlin state.展开更多
With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistica...With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.展开更多
A combined model based on principal components analysis (PCA) and generalized regression neural network (GRNN) was adopted to forecast electricity price in day-ahead electricity market. PCA was applied to mine the mai...A combined model based on principal components analysis (PCA) and generalized regression neural network (GRNN) was adopted to forecast electricity price in day-ahead electricity market. PCA was applied to mine the main influence on day-ahead price, avoiding the strong correlation between the input factors that might influence electricity price, such as the load of the forecasting hour, other history loads and prices, weather and temperature; then GRNN was employed to forecast electricity price according to the main information extracted by PCA. To prove the efficiency of the combined model, a case from PJM (Pennsylvania-New Jersey-Maryland) day-ahead electricity market was evaluated. Compared to back-propagation (BP) neural network and standard GRNN, the combined method reduces the mean absolute percentage error about 3%.展开更多
Water quality monitoring has one of the highest priorities in surface water protection policy. Many variety approaches are being used to interpret and analyze the concealed variables that determine the variance of obs...Water quality monitoring has one of the highest priorities in surface water protection policy. Many variety approaches are being used to interpret and analyze the concealed variables that determine the variance of observed water quality of various source points. A considerable proportion of these approaches are mainly based on statistical methods, multivariate statistical techniques in particular. In the present study, the use of multivariate techniques is required to reduce the large variables number of Nile River water quality upstream Cairo Drinking Water Plants (CDWPs) and determination of relationships among them for easy and robust evaluation. By means of multivariate statistics of principal components analysis (PCA), Fuzzy C-Means (FCM) and K-means algorithm for clustering analysis, this study attempted to determine the major dominant factors responsible for the variations of Nile River water quality upstream Cairo Drinking Water Plants (CDWPs). Furthermore, cluster analysis classified 21 sampling stations into three clusters based on similarities of water quality features. The result of PCA shows that 6 principal components contain the key variables and account for 75.82% of total variance of the study area surface water quality and the dominant water quality parameters were: Conductivity, Iron, Biological Oxygen Demand (BOD), Total Coliform (TC), Ammonia (NH3), and pH. However, the results from both of FCM clustering and K-means algorithm, based on the dominant parameters concentrations, determined 3 cluster groups and produced cluster centers (prototypes). Based on clustering classification, a noted water quality deteriorating as the cluster number increased from 1 to 3. However the cluster grouping can be used to identify the physical, chemical and biological processes creating the variations in the water quality parameters. This study revealed that multivariate analysis techniques, as the extracted water quality dominant parameters and clustered information can be used in reducing the number of sampling parameters on the Nile River in a cost effective and efficient way instead of using a large set of parameters without missing much information. These techniques can be helpful for decision makers to obtain a global view on the water quality in any surface water or other water bodies when analyzing large data sets especially without a priori knowledge about relationships between them.展开更多
5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging.And these characteristics are correlated with each other.So the principal components analysis...5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging.And these characteristics are correlated with each other.So the principal components analysis(PCA)is used in the analysis of the sample data firstly.And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information.Furthermore,a software tool is developed for this kind of problems.And with sample data from a surface mounting device(SMD)process,it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA,but the problem is transformed from high-dimensional one to a lower dimensional one,i.e.,from 5 to 2 in this demonstration.展开更多
AIM: To investigate the metabolic profiles of xenograft pancreatic cancer before and after radiotherapy by high-resolution magic angle spinning proton magnetic resonance spectroscopy (HRMAS 1H NMR) combined with princ...AIM: To investigate the metabolic profiles of xenograft pancreatic cancer before and after radiotherapy by high-resolution magic angle spinning proton magnetic resonance spectroscopy (HRMAS 1H NMR) combined with principal components analysis (PCA) and evaluate the radiotherapeutic effect. METHODS: The nude mouse xenograft model of human pancreatic cancer was established by injecting human pancreatic cancer cell SW1990 subcutaneously into the nude mice. When the tumors volume reached 800 mm3 , the mice received various radiation doses. Two weeks later, tumor tissue sections were prepared for running the NMR measurements. 1H NMR and PCA were used to determine the changes in the metabolic profiles of tumor tissues after radiotherapy. Metabolic profiles of normal pancreas, pancreatic tumor tissues, and radiationtreated pancreatic tumor tissues were compared. RESULTS: Compared with 1H NMR spectra of the normal nude mouse pancreas, the levels of choline, taurine, alanine, isoleucine, leucine, valine, lactate, and glutamic acid of the pancreatic cancer group were increased, whereas an opposite trend for phosphocholine, glycerophosphocholine, and betaine was observed. The ratio of phosphocholine to creatine, and glycerophosphocholine to creatine showed noticeable decrease in the pancreatic cancer group. After further evaluation of the tissue metabolic profile after treatment with three different radiation doses, no significant change in metabolites was observed in the 1H NMR spectra, while the inhibition of tumor growth was in proportion to the radiation doses. However, PCA results showed that the levels of choline and betaine were decreased with the increased radiation dose, and conversely, the level of acetic acid was dramatically increased. CONCLUSION: The combined methods were demonstrated to have the potential for allowing early diagnosis and assessment of pancreatic cancer response to radiotherapy.展开更多
[Objective] This study was conducted to provide certain theoretical reference for the comprehensive evaluation and breeding of new fresh waxy corn vari- eties. [Method] With 5 good fresh waxy corn varieties as experim...[Objective] This study was conducted to provide certain theoretical reference for the comprehensive evaluation and breeding of new fresh waxy corn vari- eties. [Method] With 5 good fresh waxy corn varieties as experimental materials, correlation analysis and principal component anatysis were performed on 13 agronomic traits, i.e., plant height, ear position, ear weight, ear diameter, axis diameter, ear length, bald tip length, ear row number, number of grains per row, 100-kernel weight, fresh ear yield, tassel length, and tassel branch number. [Result] The principal component analysis performed to the 13 agronomic traits showed that the first three principal components, i.e., the fresh ear yield factors, the tassel factors and the bald top factors, had an accumulative contribution rate over 87.2767%, and could basically represent the genetic information represented by the 13 traits. The first principal component is the main index for the selection and evaluation of good corn varieties which should have large ear, large ear diameter but small axis diameter, i.e., longer grains, larger number of grains per ear, higher, 100-grain weight and higher plant height. As to the second principal component, the plants of fresh corn varieties are best to have longer tassel and not too many branches, and under the premise of ensuring enough pollen for the female spike, the varieties with fewer tassel branches shoud be selected as far as possible. From the point of the third principal component, bald tip length affects the marketing quality of fresh corn, and during fariety evaluation and breeding, the bald top length should be control at the Iowest standard. [Conclusion] The fresh ear yield of corn is in close positive correlation with ear weight, 100-grain weight, ear diameter, number of grains per row and ear length, and plant height also affects fresh ear yield.展开更多
[Objective] This study aimed to explore the related mechanisms of the breaking of flue-cured tobacco leaves. [Method] Anti-breaking models of the main veins of flue-cured tobacco leaves were constructed for principal ...[Objective] This study aimed to explore the related mechanisms of the breaking of flue-cured tobacco leaves. [Method] Anti-breaking models of the main veins of flue-cured tobacco leaves were constructed for principal component analysis on the anti-breaking index, leaf traits and cellulose contents. [Result] The results showed that the growth traits had certain relevance with the cellulose contents while the leaf weight assumed a significant negative correlation with the anti-breaking index, indicating that the heavier the leaf weight was, the weaker the anti-breaking capacity of flue-cured tobacco would be; the cross-sectional area of main veins and the cellulose contents had shown a positive correlation with the anti-breaking index, indicating that the thicker the main vein of flue-cured tobacco was, the higher the cellulose contents would be, and the stronger the anti-breaking capacity of flue-cured tobacco leaves would be. [Conclusion] This study provided theoretical basis and reference to improve tobacco production and enhance the quality of flue-cured tobacco.展开更多
In principal component analysis (PCA) algorithms for face recognition, to reduce the influence of the eigenvectors which relate to the changes of the illumination on abstract features, a modified PCA (MPCA) algori...In principal component analysis (PCA) algorithms for face recognition, to reduce the influence of the eigenvectors which relate to the changes of the illumination on abstract features, a modified PCA (MPCA) algorithm is proposed. The method is based on the idea of reducing the influence of the eigenvectors associated with the large eigenvalues by normalizing the feature vector element by its corresponding standard deviation. The Yale face database and Yale face database B are used to verify the method. The simulation results show that, for front face and even under the condition of limited variation in the facial poses, the proposed method results in better performance than the conventional PCA and linear discriminant analysis (LDA) approaches, and the computational cost remains the same as that of the PCA, and much less than that of the LDA.展开更多
In order to overcome the shortcomings that the reconstructed spectral reflectance may be negative when using the classic principal component analysis (PCA)to reduce the dimensions of the multi-spectral data, a nonne...In order to overcome the shortcomings that the reconstructed spectral reflectance may be negative when using the classic principal component analysis (PCA)to reduce the dimensions of the multi-spectral data, a nonnegative constrained principal component analysis method is proposed to construct a low-dimensional multi-spectral space and accomplish the conversion between the new constructed space and the multispectral space. First, the reason behind the negative data is analyzed and a nonnegative constraint is imposed on the classic PCA. Then a set of nonnegative linear independence weight vectors of principal components is obtained, by which a lowdimensional space is constructed. Finally, a nonlinear optimization technique is used to determine the projection vectors of the high-dimensional multi-spectral data in the constructed space. Experimental results show that the proposed method can keep the reconstructed spectral data in [ 0, 1 ]. The precision of the space created by the proposed method is equivalent to or even higher than that by the PCA.展开更多
Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of sampl...Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of samples. As a result, the extracted features cannot provide enough useful information for distinguishing pat- tern from one another, and further resulting in degradation of classification performance. To fullly use class in- formation of samples, a novel method, called the fuzzy within-class MatPCA (F-WMatPCA)is proposed. F-WMatPCA utilizes the fuzzy K-nearest neighbor method(FKNN) to fuzzify the class membership degrees of a training sample and then performs fuzzy MatPCA within these patterns having the same class label. Due to more class information is used in feature extraction, F-WMatPCA can intuitively improve the classification perfor- mance. Experimental results in face databases and some benchmark datasets show that F-WMatPCA is effective and competitive than MatPCA. The experimental analysis on face image databases indicates that F-WMatPCA im- proves the recognition accuracy and is more stable and robust in performing classification than the existing method of fuzzy-based F-Fisherfaces.展开更多
文摘The Savitzky-Golay(SG)filter,which employs polynomial least-squares approximations to smooth data and estimate derivatives,is widely used for processing noisy data.However,noise suppression by the SG filter is recognized to be limited at data boundaries and high frequencies,which can significantly reduce the signal-to-noise ratio(SNR).To solve this problem,a novel method synergistically integrating Principal Component Analysis(PCA)with SG filtering is proposed in this paper.This approach avoids the is-sue of excessive smoothing associated with larger window sizes.The proposed PCA-SG filtering algorithm was applied to a CO gas sensing system based on Cavity Ring-Down Spectroscopy(CRDS).The perform-ance of the PCA-SG filtering algorithm is demonstrated through comparison with Moving Average Filtering(MAF),Wavelet Transformation(WT),Kalman Filtering(KF),and the SG filter.The results demonstrate that the proposed algorithm exhibits superior noise reduction capabilities compared to the other algorithms evaluated.The SNR of the ring-down signal was improved from 11.8612 dB to 29.0913 dB,and the stand-ard deviation of the extracted ring-down time constant was reduced from 0.037μs to 0.018μs.These results confirm that the proposed PCA-SG filtering algorithm effectively improves the smoothness of the ring-down curve data,demonstrating its feasibility.
基金Supported by the Central Public-interest Scientific Institution Basal Research Fund,YSFRI,CAFS(No.20603022024016)the Central Public-interest Scientific Institution Basal Research Fund,CAFS(Nos.2023TD52,2023TD76)the earmarked fund for CARS(No.CARS-49)。
文摘Crassostrea gigas has good taste and high nutritional value;however,there are few assessments of comprehensive and panoramic analyses of the nutritional quality of the northern oyster.To study the nutritional characteristics of C.gigas from different sources(ploidy,region,size,and culture mode),C.gigas from various ploidy(diploid and triploid),regions(Rushan,Off-site fattening,and Rongcheng),sizes(small,medium,and large)and culture modes(nearshore and offshore)were selected for comparative analyses.The nutritional components(moisture,protein,fat,and mineral),flavor substances(taste amino acids,nucleotides,and succinic acid),and functional indices(eicosapentaenoic acid(EPA),docosahexaenoic acid(DHA),and taurine)of C.gigas were determined.Principal component analysis(PCA)was used to comprehensively evaluate the oysters and investigate the variations in nutritional quality.The PCA results indicate that protein,essential fatty acids,selenium,zinc,taste amino acids,taurine,EPA,and DHA were core components contributing to 82.25%of the cumulative variance,providing a more comprehensive reflection of the nutrient composition of C.gigas.The extensive quality rankings for the C.gigas were as follows:diploid>triploid,Rushan>fattening>Rongcheng,medium>large>small,and offshore>nearshore.The score rank revealed that diploid oysters of medium-size from Rushan demonstrated superior nutritional quality compared to other tested samples.This is the first comprehensive and systematic investigation of C.gigas in northern China to reveal the feature of nutrients,flavor,and functional components.The study provided data support for the culture,consumption,processing,research,and nutritional quality improvement of oyster industry.
基金funding from the National Natural Science Foundation of China (Grant No.42277175)the pilot project of cooperation between the Ministry of Natural Resources and Hunan Province“Research and demonstration of key technologies for comprehensive remote sensing identification of geological hazards in typical regions of Hunan Province” (Grant No.2023ZRBSHZ056)the National Key Research and Development Program of China-2023 Key Special Project (Grant No.2023YFC2907400).
文摘Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC is a nonlinear system with a lot of overlapping information.In this paper,a dataset of eight roughness statistical parameters covering 112 digital joints is established.Then,the principal component analysis method is introduced to extract the significant information,which solves the information overlap problem of roughness characterization.Based on the two principal components of extracted features,the white shark optimizer algorithm was introduced to optimize the extreme gradient boosting model,and a new machine learning(ML)prediction model was established.The prediction accuracy of the new model and the other 17 models was measured using statistical metrics.The results show that the prediction result of the new model is more consistent with the real JRC value,with higher recognition accuracy and generalization ability.
基金Supported by Scientific Research Project for Commonwealth (GYHY200806017)Innovation Project for Graduate of Jiangsu Province (CX09S-018Z)
文摘The evaluation model was established to estimate the number of houses collapsed during typhoon disaster for Zhejiang Province.The factor leading to disaster,the environment fostering disaster and the exposure of buildings were processed by Principal Component Analysis.The key factor was extracted to support input of vector machine model and to build an evaluation model;the historical fitting result kept in line with the fact.In the real evaluation of two typhoons landed in Zhejiang Province in 2008 and 2009,the coincidence of evaluating result and actual value proved the feasibility of this model.
文摘Restoration of phase aberrations is crucial for addressing atmospheric turbulence in light propagation.Traditional restoration algorithms based on Zernike polynomials(ZPs)often encounter challenges related to high computational complexity and insufficient capture of high-frequency phase aberration components,so we proposed a Principal-Component-Analysis-based method for representing phase aberrations.This paper discusses the factors influencing the accuracy of restoration,mainly including the sample space size and the sampling interval of D/r_(0),on the basis of characterizing phase aberrations by Principal Components(PCs).The experimental results show that a larger D/r_(0)sampling interval can ensure the generalization ability and robustness of the principal components in the case of a limited amount of original data,which can help to achieve high-precision deployment of the model in practical applications quickly.In the environment with relatively strong turbulence in the test set of D/r_(0)=24,the use of 34 terms of PCs can improve the corrected Strehl ratio(SR)from 0.007 to 0.1585,while the Strehl ratio of the light spot after restoration using 34 terms of ZPs is only 0.0215,demonstrating almost no correction effect.The results indicate that PCs can serve as a better alternative in representing and restoring the characteristics of atmospheric turbulence induced phase aberrations.These findings pave the way to use PCs of phase aberrations with fewer terms than traditional ZPs to achieve data dimensionality reduction,and offer a reference to accelerate and stabilize the model and deep learning based adaptive optics correction.
文摘Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Ward, Tunduma Town, Tanzania, using Principal Component Analysis (PCA) to identify the primary factors influencing groundwater contamination. Monthly samples were collected over 12 months and analysed for physical, chemical, and biological parameters. The PCA revealed between four and six principal components (PCs) for each well, explaining between 84.61% and 92.55% of the total variance in water quality data. In WW1, five PCs captured 87.53% of the variability, with PC1 (33.05%) dominated by pH, EC, TDS, and microbial contamination, suggesting significant influences from surface runoff and pit latrines. In WW2, six PCs explained 92.55% of the variance, with PC1 (36.17%) highlighting the effects of salinity, TDS, and agricultural runoff. WW3 had four PCs explaining 84.61% of the variance, with PC1 (39.63%) showing high contributions from pH, hardness, and salinity, indicating geological influences and contamination from human activities. Similarly, in WW4, six PCs explained 90.83% of the variance, where PC1 (43.53%) revealed contamination from pit latrines and fertilizers. WW5 also had six PCs, accounting for 92.51% of the variance, with PC1 (42.73%) indicating significant contamination from agricultural runoff and pit latrines. The study concludes that groundwater quality in Half-London Ward is primarily affected by a combination of surface runoff, pit latrine contamination, agricultural inputs, and geological factors. The presence of microbial contaminants and elevated nitrate and phosphate levels underscores the need for improved sanitation and sustainable agricultural practices. Recommendations include strengthening sanitation infrastructure, promoting responsible farming techniques, and implementing regular groundwater monitoring to safeguard water resources and public health in the region.
基金supported by the Natural Science Foundation of China(Grant Nos.42374032,42174103,42004073)Provincial Natural Science Foundation(2024JJ8348)the Key Laboratory of Natural Resources Monitoring and Supervision in Southern Hilly Region,Ministry of Natural Resources(NRMSSHR2023Y01)。
文摘The Global Navigation Satellite System(GNSS)is vital for monitoring terrestrial water storage(TWS).However,effectively extracting hydrological load deformation from GNSS observations poses a significant challenge.This study proposes a novel strategy;the seasonal hydrological load signals are removed from the raw data,and the remaining signals use principal component analysis(PCA).Simulation results from Yunnan Province demonstrate that the spatial distribution of the root mean square error(RMSE)is improved by approximately 15% compared with traditional PCA extraction from raw data.From January 2013 to December 2022,TWS was inverted from 24 GNSS stations in Yunnan Province.The spatial distribution and time series of TWS inverted from GNSS align well with those TWS inferred from the Gravity Recovery and Climate Experiment(GRACE),GRACE Follow-On(GFO),and the Global Land Data Assimilation System(GLDAS)land surface model.However,the amplitude of the GNSS-inverted TWS is slightly higher.Since GNSS ground stations are more sensitive to hydrological load signals,they show correlations with precipitation data that are 8.6%and 6.0%higher than those of GRACE and GLDAS,respectively.In the power spectral density analysis of GRACE/GFO,GLDAS,and GNSS,the signal strength of GNSS is much higher than that of GRACE/GFO and GLDAS in the June and February cycles.These findings suggest that the new data extraction strategy can capture higher frequency hydrological signals in TWS,and GNSS observations can help address limitations in GRACE/GFO observations.This study demonstrates the potential of GNSS TWS in capturing higher-frequency hydrological signals and climate extremes application.
基金supported by the National Natural Science Foundation of China(Grant Numbers 42374195 and 42188101)the fellowship of China National Postdoctoral Program for Innovative Talents(Grant Number BX20230273)+1 种基金the Hubei Provincial Natural Science Foundation of China(Grant Number 2024AFB-097)the Postdoctor Project of Hubei Province(Grant Number 2024HBBHCXA054).
文摘This study employs Principal Component Analysis(PCA)and 13 years of SD-WACCM-X model data(2007-2019)to investigate the characteristics and mechanisms of Inter-hemispheric Coupling(IHC)triggered by sudden stratospheric warming(SSW)events.IHC in both hemispheres leads to a cold anomaly in the equatorial stratosphere,a warm anomaly in the equatorial mesosphere,and increased temperatures in the mesosphere and lower thermosphere(MLT)region of the summer hemisphere.However,the IHC features during boreal winter period are significantly weaker than during the austral winter period,primarily due to weaker stationary planetary wave activity in the Southern Hemisphere(SH).During the austral winter period,IHC results in a warm anomaly in the polar mesosphere of the SH,which does not occur in the NH during boreal winter period.This study also examines the possible influence of quasi-two-day waves(QTDWs)on IHC.We found that the largest temperature anomaly in the summer polar MLT region is associated with a large wind instability area,and a well-developed critical layer structure of QTDW in January.In contrast,during July,despite favorable conditions for QTDW propagation in the Northern Hemisphere,weaker IHC response is observed,suggesting that IHC features and the relationship with QTDWs during July would be more complex than during January.
基金supported by the National Key Research and Development Plan of China(No.2023YFB3406500)the National Natural Science Foundation of China(No.52475132)+2 种基金the Aeronautical Science Foundation of China(No.20200015053001)the Shaanxi Key Research Program Project,China(No.2024GX-ZDCYL-01–16)the Xi’an Key Industrial Chain Technology Research Project,China(No.2023JH-RGZNGG-0033)。
文摘Traditional beamforming techniques may not accurately locate sources in scenarios with both stationary and rotating sound sources.The existence of rotating sound sources can cause blurring in the stationary beamforming map.Current algorithms for separating different moving sound sources have limited effectiveness,leading to significant residual noise,especially when the rotating source is strong enough to mask stationary sources completely.To overcome these challenges,a novel solution utilizing a virtual rotating array in the modal domain combined with robust principal component analysis is proposed to separate sound sources with different rotational speeds.This approach,named Robust Principal Component Analysis in the Modal domain(RPCA-M),investigates the performance of convex nuclear norm and non-convex Schatten-p norm to distinguish stationary and rotating sources.By comparing the errors in Cross-Spectral Matrix(CSM)recovery and acoustic imaging across different algorithms,the effectiveness of RPCA-M in separating stationary and moving sound sources is demonstrated.Importantly,this method effectively separates sound sources,even when there are significant variations in their amplitudes at different rotation speeds.
基金supported by the National Natural Science Foundation of China(Grant Nos.12104075 and 12347101).
文摘We study the influence of disorder on the Moore–Read state by principal component analysis(PCA),which is one of the ground state candidates for the 5/2 fractional Hall state.By using PCA,the topological features of the ground state wave functions with different disorder strengths can be distilled.As the disorder strength increases,the Moore–Read state will be destroyed.We explore the phase transition by analyzing the overlaps between the random sample wave functions and the topologically distilled state.The cross-point between the amplitudes of the principal component and its counterpart is the phase transition point.Additionally,the origin of the second component comes from the excited states,which is different from the Laughlin state.
基金founded by the National Natural Science Foundation of China(81202283,81473070,81373102 and81202267)Key Grant of Natural Science Foundation of the Jiangsu Higher Education Institutions of China(10KJA330034 and11KJA330001)+1 种基金the Research Fund for the Doctoral Program of Higher Education of China(20113234110002)the Priority Academic Program for the Development of Jiangsu Higher Education Institutions(Public Health and Preventive Medicine)
文摘With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.
基金Project(70671039) supported by the National Natural Science Foundation of China
文摘A combined model based on principal components analysis (PCA) and generalized regression neural network (GRNN) was adopted to forecast electricity price in day-ahead electricity market. PCA was applied to mine the main influence on day-ahead price, avoiding the strong correlation between the input factors that might influence electricity price, such as the load of the forecasting hour, other history loads and prices, weather and temperature; then GRNN was employed to forecast electricity price according to the main information extracted by PCA. To prove the efficiency of the combined model, a case from PJM (Pennsylvania-New Jersey-Maryland) day-ahead electricity market was evaluated. Compared to back-propagation (BP) neural network and standard GRNN, the combined method reduces the mean absolute percentage error about 3%.
文摘Water quality monitoring has one of the highest priorities in surface water protection policy. Many variety approaches are being used to interpret and analyze the concealed variables that determine the variance of observed water quality of various source points. A considerable proportion of these approaches are mainly based on statistical methods, multivariate statistical techniques in particular. In the present study, the use of multivariate techniques is required to reduce the large variables number of Nile River water quality upstream Cairo Drinking Water Plants (CDWPs) and determination of relationships among them for easy and robust evaluation. By means of multivariate statistics of principal components analysis (PCA), Fuzzy C-Means (FCM) and K-means algorithm for clustering analysis, this study attempted to determine the major dominant factors responsible for the variations of Nile River water quality upstream Cairo Drinking Water Plants (CDWPs). Furthermore, cluster analysis classified 21 sampling stations into three clusters based on similarities of water quality features. The result of PCA shows that 6 principal components contain the key variables and account for 75.82% of total variance of the study area surface water quality and the dominant water quality parameters were: Conductivity, Iron, Biological Oxygen Demand (BOD), Total Coliform (TC), Ammonia (NH3), and pH. However, the results from both of FCM clustering and K-means algorithm, based on the dominant parameters concentrations, determined 3 cluster groups and produced cluster centers (prototypes). Based on clustering classification, a noted water quality deteriorating as the cluster number increased from 1 to 3. However the cluster grouping can be used to identify the physical, chemical and biological processes creating the variations in the water quality parameters. This study revealed that multivariate analysis techniques, as the extracted water quality dominant parameters and clustered information can be used in reducing the number of sampling parameters on the Nile River in a cost effective and efficient way instead of using a large set of parameters without missing much information. These techniques can be helpful for decision makers to obtain a global view on the water quality in any surface water or other water bodies when analyzing large data sets especially without a priori knowledge about relationships between them.
基金This project is supported by National Natural Science Foundation of China(No.70372062)Hi-Tech Program of Tianjin city,China(No.04310881R).
文摘5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging.And these characteristics are correlated with each other.So the principal components analysis(PCA)is used in the analysis of the sample data firstly.And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information.Furthermore,a software tool is developed for this kind of problems.And with sample data from a surface mounting device(SMD)process,it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA,but the problem is transformed from high-dimensional one to a lower dimensional one,i.e.,from 5 to 2 in this demonstration.
基金Supported by The Medical Imageology Special Purpose Foundation of Cancer Hospital/Cancer Institute Fudan University, No.YX200802
文摘AIM: To investigate the metabolic profiles of xenograft pancreatic cancer before and after radiotherapy by high-resolution magic angle spinning proton magnetic resonance spectroscopy (HRMAS 1H NMR) combined with principal components analysis (PCA) and evaluate the radiotherapeutic effect. METHODS: The nude mouse xenograft model of human pancreatic cancer was established by injecting human pancreatic cancer cell SW1990 subcutaneously into the nude mice. When the tumors volume reached 800 mm3 , the mice received various radiation doses. Two weeks later, tumor tissue sections were prepared for running the NMR measurements. 1H NMR and PCA were used to determine the changes in the metabolic profiles of tumor tissues after radiotherapy. Metabolic profiles of normal pancreas, pancreatic tumor tissues, and radiationtreated pancreatic tumor tissues were compared. RESULTS: Compared with 1H NMR spectra of the normal nude mouse pancreas, the levels of choline, taurine, alanine, isoleucine, leucine, valine, lactate, and glutamic acid of the pancreatic cancer group were increased, whereas an opposite trend for phosphocholine, glycerophosphocholine, and betaine was observed. The ratio of phosphocholine to creatine, and glycerophosphocholine to creatine showed noticeable decrease in the pancreatic cancer group. After further evaluation of the tissue metabolic profile after treatment with three different radiation doses, no significant change in metabolites was observed in the 1H NMR spectra, while the inhibition of tumor growth was in proportion to the radiation doses. However, PCA results showed that the levels of choline and betaine were decreased with the increased radiation dose, and conversely, the level of acetic acid was dramatically increased. CONCLUSION: The combined methods were demonstrated to have the potential for allowing early diagnosis and assessment of pancreatic cancer response to radiotherapy.
文摘[Objective] This study was conducted to provide certain theoretical reference for the comprehensive evaluation and breeding of new fresh waxy corn vari- eties. [Method] With 5 good fresh waxy corn varieties as experimental materials, correlation analysis and principal component anatysis were performed on 13 agronomic traits, i.e., plant height, ear position, ear weight, ear diameter, axis diameter, ear length, bald tip length, ear row number, number of grains per row, 100-kernel weight, fresh ear yield, tassel length, and tassel branch number. [Result] The principal component analysis performed to the 13 agronomic traits showed that the first three principal components, i.e., the fresh ear yield factors, the tassel factors and the bald top factors, had an accumulative contribution rate over 87.2767%, and could basically represent the genetic information represented by the 13 traits. The first principal component is the main index for the selection and evaluation of good corn varieties which should have large ear, large ear diameter but small axis diameter, i.e., longer grains, larger number of grains per ear, higher, 100-grain weight and higher plant height. As to the second principal component, the plants of fresh corn varieties are best to have longer tassel and not too many branches, and under the premise of ensuring enough pollen for the female spike, the varieties with fewer tassel branches shoud be selected as far as possible. From the point of the third principal component, bald tip length affects the marketing quality of fresh corn, and during fariety evaluation and breeding, the bald top length should be control at the Iowest standard. [Conclusion] The fresh ear yield of corn is in close positive correlation with ear weight, 100-grain weight, ear diameter, number of grains per row and ear length, and plant height also affects fresh ear yield.
基金Supported by the Fund of Anhui Provincial Tobacco Monopoly Bureau(AHKJ2008-03)Anhui Provincial University Key Project of Natural Science(KJ2010A114)Undergraduate Student Science and Technology Innovation Fund of Anhui Agricultural University(2010233)~~
文摘[Objective] This study aimed to explore the related mechanisms of the breaking of flue-cured tobacco leaves. [Method] Anti-breaking models of the main veins of flue-cured tobacco leaves were constructed for principal component analysis on the anti-breaking index, leaf traits and cellulose contents. [Result] The results showed that the growth traits had certain relevance with the cellulose contents while the leaf weight assumed a significant negative correlation with the anti-breaking index, indicating that the heavier the leaf weight was, the weaker the anti-breaking capacity of flue-cured tobacco would be; the cross-sectional area of main veins and the cellulose contents had shown a positive correlation with the anti-breaking index, indicating that the thicker the main vein of flue-cured tobacco was, the higher the cellulose contents would be, and the stronger the anti-breaking capacity of flue-cured tobacco leaves would be. [Conclusion] This study provided theoretical basis and reference to improve tobacco production and enhance the quality of flue-cured tobacco.
文摘In principal component analysis (PCA) algorithms for face recognition, to reduce the influence of the eigenvectors which relate to the changes of the illumination on abstract features, a modified PCA (MPCA) algorithm is proposed. The method is based on the idea of reducing the influence of the eigenvectors associated with the large eigenvalues by normalizing the feature vector element by its corresponding standard deviation. The Yale face database and Yale face database B are used to verify the method. The simulation results show that, for front face and even under the condition of limited variation in the facial poses, the proposed method results in better performance than the conventional PCA and linear discriminant analysis (LDA) approaches, and the computational cost remains the same as that of the PCA, and much less than that of the LDA.
基金The Pre-Research Foundation of National Ministries andCommissions (No9140A16050109DZ01)the Scientific Research Program of the Education Department of Shanxi Province (No09JK701)
文摘In order to overcome the shortcomings that the reconstructed spectral reflectance may be negative when using the classic principal component analysis (PCA)to reduce the dimensions of the multi-spectral data, a nonnegative constrained principal component analysis method is proposed to construct a low-dimensional multi-spectral space and accomplish the conversion between the new constructed space and the multispectral space. First, the reason behind the negative data is analyzed and a nonnegative constraint is imposed on the classic PCA. Then a set of nonnegative linear independence weight vectors of principal components is obtained, by which a lowdimensional space is constructed. Finally, a nonlinear optimization technique is used to determine the projection vectors of the high-dimensional multi-spectral data in the constructed space. Experimental results show that the proposed method can keep the reconstructed spectral data in [ 0, 1 ]. The precision of the space created by the proposed method is equivalent to or even higher than that by the PCA.
文摘Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of samples. As a result, the extracted features cannot provide enough useful information for distinguishing pat- tern from one another, and further resulting in degradation of classification performance. To fullly use class in- formation of samples, a novel method, called the fuzzy within-class MatPCA (F-WMatPCA)is proposed. F-WMatPCA utilizes the fuzzy K-nearest neighbor method(FKNN) to fuzzify the class membership degrees of a training sample and then performs fuzzy MatPCA within these patterns having the same class label. Due to more class information is used in feature extraction, F-WMatPCA can intuitively improve the classification perfor- mance. Experimental results in face databases and some benchmark datasets show that F-WMatPCA is effective and competitive than MatPCA. The experimental analysis on face image databases indicates that F-WMatPCA im- proves the recognition accuracy and is more stable and robust in performing classification than the existing method of fuzzy-based F-Fisherfaces.