Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC...Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC is a nonlinear system with a lot of overlapping information.In this paper,a dataset of eight roughness statistical parameters covering 112 digital joints is established.Then,the principal component analysis method is introduced to extract the significant information,which solves the information overlap problem of roughness characterization.Based on the two principal components of extracted features,the white shark optimizer algorithm was introduced to optimize the extreme gradient boosting model,and a new machine learning(ML)prediction model was established.The prediction accuracy of the new model and the other 17 models was measured using statistical metrics.The results show that the prediction result of the new model is more consistent with the real JRC value,with higher recognition accuracy and generalization ability.展开更多
The evaluation model was established to estimate the number of houses collapsed during typhoon disaster for Zhejiang Province.The factor leading to disaster,the environment fostering disaster and the exposure of build...The evaluation model was established to estimate the number of houses collapsed during typhoon disaster for Zhejiang Province.The factor leading to disaster,the environment fostering disaster and the exposure of buildings were processed by Principal Component Analysis.The key factor was extracted to support input of vector machine model and to build an evaluation model;the historical fitting result kept in line with the fact.In the real evaluation of two typhoons landed in Zhejiang Province in 2008 and 2009,the coincidence of evaluating result and actual value proved the feasibility of this model.展开更多
With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistica...With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.展开更多
5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analy...5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analysis(PCA) is used in the analysis of the sample data firstly. And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information. Furthermore, a software tool is developed for this kind of problems. And with sample data from a surface mounting device(SMD) process, it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA, but the problem is transformed from high-dimensional one to a lower dimensional one, i.e., from 5 to 2 in this demonstration.展开更多
The aim of this work is to describe and compare three exploratory chemometrical tools,principal components analysis,independent components analysis and common components analysis,the last one being a modification of t...The aim of this work is to describe and compare three exploratory chemometrical tools,principal components analysis,independent components analysis and common components analysis,the last one being a modification of the multi-block statistical method known as common components and specific weights analysis.The three methods were applied to a set of data to show the differences and similarities of the results obtained,highlighting their complementarity.展开更多
Restoration of phase aberrations is crucial for addressing atmospheric turbulence in light propagation.Traditional restoration algorithms based on Zernike polynomials(ZPs)often encounter challenges related to high com...Restoration of phase aberrations is crucial for addressing atmospheric turbulence in light propagation.Traditional restoration algorithms based on Zernike polynomials(ZPs)often encounter challenges related to high computational complexity and insufficient capture of high-frequency phase aberration components,so we proposed a Principal-Component-Analysis-based method for representing phase aberrations.This paper discusses the factors influencing the accuracy of restoration,mainly including the sample space size and the sampling interval of D/r_(0),on the basis of characterizing phase aberrations by Principal Components(PCs).The experimental results show that a larger D/r_(0)sampling interval can ensure the generalization ability and robustness of the principal components in the case of a limited amount of original data,which can help to achieve high-precision deployment of the model in practical applications quickly.In the environment with relatively strong turbulence in the test set of D/r_(0)=24,the use of 34 terms of PCs can improve the corrected Strehl ratio(SR)from 0.007 to 0.1585,while the Strehl ratio of the light spot after restoration using 34 terms of ZPs is only 0.0215,demonstrating almost no correction effect.The results indicate that PCs can serve as a better alternative in representing and restoring the characteristics of atmospheric turbulence induced phase aberrations.These findings pave the way to use PCs of phase aberrations with fewer terms than traditional ZPs to achieve data dimensionality reduction,and offer a reference to accelerate and stabilize the model and deep learning based adaptive optics correction.展开更多
The Global Navigation Satellite System(GNSS)is vital for monitoring terrestrial water storage(TWS).However,effectively extracting hydrological load deformation from GNSS observations poses a significant challenge.This...The Global Navigation Satellite System(GNSS)is vital for monitoring terrestrial water storage(TWS).However,effectively extracting hydrological load deformation from GNSS observations poses a significant challenge.This study proposes a novel strategy;the seasonal hydrological load signals are removed from the raw data,and the remaining signals use principal component analysis(PCA).Simulation results from Yunnan Province demonstrate that the spatial distribution of the root mean square error(RMSE)is improved by approximately 15% compared with traditional PCA extraction from raw data.From January 2013 to December 2022,TWS was inverted from 24 GNSS stations in Yunnan Province.The spatial distribution and time series of TWS inverted from GNSS align well with those TWS inferred from the Gravity Recovery and Climate Experiment(GRACE),GRACE Follow-On(GFO),and the Global Land Data Assimilation System(GLDAS)land surface model.However,the amplitude of the GNSS-inverted TWS is slightly higher.Since GNSS ground stations are more sensitive to hydrological load signals,they show correlations with precipitation data that are 8.6%and 6.0%higher than those of GRACE and GLDAS,respectively.In the power spectral density analysis of GRACE/GFO,GLDAS,and GNSS,the signal strength of GNSS is much higher than that of GRACE/GFO and GLDAS in the June and February cycles.These findings suggest that the new data extraction strategy can capture higher frequency hydrological signals in TWS,and GNSS observations can help address limitations in GRACE/GFO observations.This study demonstrates the potential of GNSS TWS in capturing higher-frequency hydrological signals and climate extremes application.展开更多
It is of critical importance to determine the endpoint of stabilization for landfills that are stabilized by aeration acceleration.Current stabilization evaluation methods are inconsistent and mostly fail to account f...It is of critical importance to determine the endpoint of stabilization for landfills that are stabilized by aeration acceleration.Current stabilization evaluation methods are inconsistent and mostly fail to account for the effect of oxygen concentration.In this study,degradation experiments were conducted to quantitatively analyze the impact of oxygen on microbial communities and metabolic functions.High-throughput sequencing analysis demonstrated that an oxygen concentration exceeding 10%significantly enhances amino acid metabolism,secondary metabolite biosynthesis,and exogenous biodegradation.Three-dimensional fluorescence data were analyzed using the PARAFAC method,and a novel fluorescence-based stabilization indicator was proposed based on the ratio of fulvic-like to tyrosine-like substances.When the growth multiples of the fluorescence index exceed 10-fold,it can be inferred that degradation has been met the stabilization endpoint.Principal component analysis was employed to establish multiple regression equations between the physicochemical parameters of landfill waste and dissolved fluorescent substants,offering an innovative insight to evaluate the stabilization process of aerated landfills.展开更多
Traditional beamforming techniques may not accurately locate sources in scenarios with both stationary and rotating sound sources.The existence of rotating sound sources can cause blurring in the stationary beamformin...Traditional beamforming techniques may not accurately locate sources in scenarios with both stationary and rotating sound sources.The existence of rotating sound sources can cause blurring in the stationary beamforming map.Current algorithms for separating different moving sound sources have limited effectiveness,leading to significant residual noise,especially when the rotating source is strong enough to mask stationary sources completely.To overcome these challenges,a novel solution utilizing a virtual rotating array in the modal domain combined with robust principal component analysis is proposed to separate sound sources with different rotational speeds.This approach,named Robust Principal Component Analysis in the Modal domain(RPCA-M),investigates the performance of convex nuclear norm and non-convex Schatten-p norm to distinguish stationary and rotating sources.By comparing the errors in Cross-Spectral Matrix(CSM)recovery and acoustic imaging across different algorithms,the effectiveness of RPCA-M in separating stationary and moving sound sources is demonstrated.Importantly,this method effectively separates sound sources,even when there are significant variations in their amplitudes at different rotation speeds.展开更多
Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Wa...Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Ward, Tunduma Town, Tanzania, using Principal Component Analysis (PCA) to identify the primary factors influencing groundwater contamination. Monthly samples were collected over 12 months and analysed for physical, chemical, and biological parameters. The PCA revealed between four and six principal components (PCs) for each well, explaining between 84.61% and 92.55% of the total variance in water quality data. In WW1, five PCs captured 87.53% of the variability, with PC1 (33.05%) dominated by pH, EC, TDS, and microbial contamination, suggesting significant influences from surface runoff and pit latrines. In WW2, six PCs explained 92.55% of the variance, with PC1 (36.17%) highlighting the effects of salinity, TDS, and agricultural runoff. WW3 had four PCs explaining 84.61% of the variance, with PC1 (39.63%) showing high contributions from pH, hardness, and salinity, indicating geological influences and contamination from human activities. Similarly, in WW4, six PCs explained 90.83% of the variance, where PC1 (43.53%) revealed contamination from pit latrines and fertilizers. WW5 also had six PCs, accounting for 92.51% of the variance, with PC1 (42.73%) indicating significant contamination from agricultural runoff and pit latrines. The study concludes that groundwater quality in Half-London Ward is primarily affected by a combination of surface runoff, pit latrine contamination, agricultural inputs, and geological factors. The presence of microbial contaminants and elevated nitrate and phosphate levels underscores the need for improved sanitation and sustainable agricultural practices. Recommendations include strengthening sanitation infrastructure, promoting responsible farming techniques, and implementing regular groundwater monitoring to safeguard water resources and public health in the region.展开更多
This study employs Principal Component Analysis(PCA)and 13 years of SD-WACCM-X model data(2007-2019)to investigate the characteristics and mechanisms of Inter-hemispheric Coupling(IHC)triggered by sudden stratospheric...This study employs Principal Component Analysis(PCA)and 13 years of SD-WACCM-X model data(2007-2019)to investigate the characteristics and mechanisms of Inter-hemispheric Coupling(IHC)triggered by sudden stratospheric warming(SSW)events.IHC in both hemispheres leads to a cold anomaly in the equatorial stratosphere,a warm anomaly in the equatorial mesosphere,and increased temperatures in the mesosphere and lower thermosphere(MLT)region of the summer hemisphere.However,the IHC features during boreal winter period are significantly weaker than during the austral winter period,primarily due to weaker stationary planetary wave activity in the Southern Hemisphere(SH).During the austral winter period,IHC results in a warm anomaly in the polar mesosphere of the SH,which does not occur in the NH during boreal winter period.This study also examines the possible influence of quasi-two-day waves(QTDWs)on IHC.We found that the largest temperature anomaly in the summer polar MLT region is associated with a large wind instability area,and a well-developed critical layer structure of QTDW in January.In contrast,during July,despite favorable conditions for QTDW propagation in the Northern Hemisphere,weaker IHC response is observed,suggesting that IHC features and the relationship with QTDWs during July would be more complex than during January.展开更多
We study the influence of disorder on the Moore–Read state by principal component analysis(PCA),which is one of the ground state candidates for the 5/2 fractional Hall state.By using PCA,the topological features of t...We study the influence of disorder on the Moore–Read state by principal component analysis(PCA),which is one of the ground state candidates for the 5/2 fractional Hall state.By using PCA,the topological features of the ground state wave functions with different disorder strengths can be distilled.As the disorder strength increases,the Moore–Read state will be destroyed.We explore the phase transition by analyzing the overlaps between the random sample wave functions and the topologically distilled state.The cross-point between the amplitudes of the principal component and its counterpart is the phase transition point.Additionally,the origin of the second component comes from the excited states,which is different from the Laughlin state.展开更多
[Objective] This study was conducted to provide certain theoretical reference for the comprehensive evaluation and breeding of new fresh waxy corn vari- eties. [Method] With 5 good fresh waxy corn varieties as experim...[Objective] This study was conducted to provide certain theoretical reference for the comprehensive evaluation and breeding of new fresh waxy corn vari- eties. [Method] With 5 good fresh waxy corn varieties as experimental materials, correlation analysis and principal component anatysis were performed on 13 agronomic traits, i.e., plant height, ear position, ear weight, ear diameter, axis diameter, ear length, bald tip length, ear row number, number of grains per row, 100-kernel weight, fresh ear yield, tassel length, and tassel branch number. [Result] The principal component analysis performed to the 13 agronomic traits showed that the first three principal components, i.e., the fresh ear yield factors, the tassel factors and the bald top factors, had an accumulative contribution rate over 87.2767%, and could basically represent the genetic information represented by the 13 traits. The first principal component is the main index for the selection and evaluation of good corn varieties which should have large ear, large ear diameter but small axis diameter, i.e., longer grains, larger number of grains per ear, higher, 100-grain weight and higher plant height. As to the second principal component, the plants of fresh corn varieties are best to have longer tassel and not too many branches, and under the premise of ensuring enough pollen for the female spike, the varieties with fewer tassel branches shoud be selected as far as possible. From the point of the third principal component, bald tip length affects the marketing quality of fresh corn, and during fariety evaluation and breeding, the bald top length should be control at the Iowest standard. [Conclusion] The fresh ear yield of corn is in close positive correlation with ear weight, 100-grain weight, ear diameter, number of grains per row and ear length, and plant height also affects fresh ear yield.展开更多
[Objective] This study aimed to explore the related mechanisms of the breaking of flue-cured tobacco leaves. [Method] Anti-breaking models of the main veins of flue-cured tobacco leaves were constructed for principal ...[Objective] This study aimed to explore the related mechanisms of the breaking of flue-cured tobacco leaves. [Method] Anti-breaking models of the main veins of flue-cured tobacco leaves were constructed for principal component analysis on the anti-breaking index, leaf traits and cellulose contents. [Result] The results showed that the growth traits had certain relevance with the cellulose contents while the leaf weight assumed a significant negative correlation with the anti-breaking index, indicating that the heavier the leaf weight was, the weaker the anti-breaking capacity of flue-cured tobacco would be; the cross-sectional area of main veins and the cellulose contents had shown a positive correlation with the anti-breaking index, indicating that the thicker the main vein of flue-cured tobacco was, the higher the cellulose contents would be, and the stronger the anti-breaking capacity of flue-cured tobacco leaves would be. [Conclusion] This study provided theoretical basis and reference to improve tobacco production and enhance the quality of flue-cured tobacco.展开更多
In principal component analysis (PCA) algorithms for face recognition, to reduce the influence of the eigenvectors which relate to the changes of the illumination on abstract features, a modified PCA (MPCA) algori...In principal component analysis (PCA) algorithms for face recognition, to reduce the influence of the eigenvectors which relate to the changes of the illumination on abstract features, a modified PCA (MPCA) algorithm is proposed. The method is based on the idea of reducing the influence of the eigenvectors associated with the large eigenvalues by normalizing the feature vector element by its corresponding standard deviation. The Yale face database and Yale face database B are used to verify the method. The simulation results show that, for front face and even under the condition of limited variation in the facial poses, the proposed method results in better performance than the conventional PCA and linear discriminant analysis (LDA) approaches, and the computational cost remains the same as that of the PCA, and much less than that of the LDA.展开更多
In order to overcome the shortcomings that the reconstructed spectral reflectance may be negative when using the classic principal component analysis (PCA)to reduce the dimensions of the multi-spectral data, a nonne...In order to overcome the shortcomings that the reconstructed spectral reflectance may be negative when using the classic principal component analysis (PCA)to reduce the dimensions of the multi-spectral data, a nonnegative constrained principal component analysis method is proposed to construct a low-dimensional multi-spectral space and accomplish the conversion between the new constructed space and the multispectral space. First, the reason behind the negative data is analyzed and a nonnegative constraint is imposed on the classic PCA. Then a set of nonnegative linear independence weight vectors of principal components is obtained, by which a lowdimensional space is constructed. Finally, a nonlinear optimization technique is used to determine the projection vectors of the high-dimensional multi-spectral data in the constructed space. Experimental results show that the proposed method can keep the reconstructed spectral data in [ 0, 1 ]. The precision of the space created by the proposed method is equivalent to or even higher than that by the PCA.展开更多
The fruits of leguminous plants Cercis Chinensis Bunge are still overlooked although they have been reported to be antioxidative because of the limited information on the phytochemicals of C.chinensis fruits.A simple,...The fruits of leguminous plants Cercis Chinensis Bunge are still overlooked although they have been reported to be antioxidative because of the limited information on the phytochemicals of C.chinensis fruits.A simple,rapid and sensitive HPLC-MS/MS method was developed for the identification and quantitation of the major bioactive components in C.chinensis fruits.Eighteen polyphenols were identified,which are first reported in C.chinensis fruits.Moreover,ten components were simultaneously quantified.The validated quantitative method was proved to be sensitive,reproducible and accurate.Then,it was applied to analyze batches of C.chinensis fruits from different phytomorph and areas.The principal components analysis(PCA)realized visualization and reduction of data set dimension while the hierarchical cluster analysis(HCA)indicated that the content of phenolic acids or all ten components might be used to differentiate C.chinensis fruits of different phytomorph.展开更多
Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of sampl...Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of samples. As a result, the extracted features cannot provide enough useful information for distinguishing pat- tern from one another, and further resulting in degradation of classification performance. To fullly use class in- formation of samples, a novel method, called the fuzzy within-class MatPCA (F-WMatPCA)is proposed. F-WMatPCA utilizes the fuzzy K-nearest neighbor method(FKNN) to fuzzify the class membership degrees of a training sample and then performs fuzzy MatPCA within these patterns having the same class label. Due to more class information is used in feature extraction, F-WMatPCA can intuitively improve the classification perfor- mance. Experimental results in face databases and some benchmark datasets show that F-WMatPCA is effective and competitive than MatPCA. The experimental analysis on face image databases indicates that F-WMatPCA im- proves the recognition accuracy and is more stable and robust in performing classification than the existing method of fuzzy-based F-Fisherfaces.展开更多
Referring to GB5618-1995 about heavy metal pollution,and using statistical analysis SPSS,the major pollutants of mine area farmland heavy metal pollution were identified by variable clustering analysis.Assessment and ...Referring to GB5618-1995 about heavy metal pollution,and using statistical analysis SPSS,the major pollutants of mine area farmland heavy metal pollution were identified by variable clustering analysis.Assessment and classification were done to the mine area farmland heavy metal pollution situation by synthetic principal components analysis (PCA).The results show that variable clustering analysis is efficient to identify the principal components of mine area farmland heavy metal pollution.Sort and clustering were done to the synthetic principal components scores of soil sample,which is given by synthetic principal components analysis.Data structure of soil heavy metal contaminations relationships and pollution level of different soil samples are discovered.The results of mine area farmland heavy metal pollution quality assessed and classified with synthetic component scores reflect the influence of both the major and compound heavy metal pol- lutants.Identification and assessment results of mine area farmland heavy metal pollution can provide reference and guide to propose control measures of mine area farmland heavy metal pollution and focus on the key treatment region.展开更多
Survey and analysis were conducted on water quality of offshore seas in eastern region of Shenzhen by principal component analysis with SPSS. Then, 8 pollutants indices were then reduced to 5. Based on weighted analys...Survey and analysis were conducted on water quality of offshore seas in eastern region of Shenzhen by principal component analysis with SPSS. Then, 8 pollutants indices were then reduced to 5. Based on weighted analysis of principal component weights, comprehensive scores of different monitored stations were com- puted and sequenced in order to make evaluation on sea quality of eastern region of Shenzhen.展开更多
基金funding from the National Natural Science Foundation of China (Grant No.42277175)the pilot project of cooperation between the Ministry of Natural Resources and Hunan Province“Research and demonstration of key technologies for comprehensive remote sensing identification of geological hazards in typical regions of Hunan Province” (Grant No.2023ZRBSHZ056)the National Key Research and Development Program of China-2023 Key Special Project (Grant No.2023YFC2907400).
文摘Joint roughness coefficient(JRC)is the most commonly used parameter for quantifying surface roughness of rock discontinuities in practice.The system composed of multiple roughness statistical parameters to measure JRC is a nonlinear system with a lot of overlapping information.In this paper,a dataset of eight roughness statistical parameters covering 112 digital joints is established.Then,the principal component analysis method is introduced to extract the significant information,which solves the information overlap problem of roughness characterization.Based on the two principal components of extracted features,the white shark optimizer algorithm was introduced to optimize the extreme gradient boosting model,and a new machine learning(ML)prediction model was established.The prediction accuracy of the new model and the other 17 models was measured using statistical metrics.The results show that the prediction result of the new model is more consistent with the real JRC value,with higher recognition accuracy and generalization ability.
基金Supported by Scientific Research Project for Commonwealth (GYHY200806017)Innovation Project for Graduate of Jiangsu Province (CX09S-018Z)
文摘The evaluation model was established to estimate the number of houses collapsed during typhoon disaster for Zhejiang Province.The factor leading to disaster,the environment fostering disaster and the exposure of buildings were processed by Principal Component Analysis.The key factor was extracted to support input of vector machine model and to build an evaluation model;the historical fitting result kept in line with the fact.In the real evaluation of two typhoons landed in Zhejiang Province in 2008 and 2009,the coincidence of evaluating result and actual value proved the feasibility of this model.
基金founded by the National Natural Science Foundation of China(81202283,81473070,81373102 and81202267)Key Grant of Natural Science Foundation of the Jiangsu Higher Education Institutions of China(10KJA330034 and11KJA330001)+1 种基金the Research Fund for the Doctoral Program of Higher Education of China(20113234110002)the Priority Academic Program for the Development of Jiangsu Higher Education Institutions(Public Health and Preventive Medicine)
文摘With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.
基金This project is supported by National Natural Science Foundation of China (No.70372062)Hi-Tech Program of Tianjin city,China (No.04310881R).
文摘5 critical quality characteristics must be controlled in the surface mount and wire-bond process in semiconductor packaging. And these characteristics are correlated with each other. So the principal components analysis(PCA) is used in the analysis of the sample data firstly. And then the process is controlled with hotelling T^2 control chart for the first several principal components which contain sufficient information. Furthermore, a software tool is developed for this kind of problems. And with sample data from a surface mounting device(SMD) process, it is demonstrated that the T^2 control chart with PCA gets the same conclusion as without PCA, but the problem is transformed from high-dimensional one to a lower dimensional one, i.e., from 5 to 2 in this demonstration.
文摘The aim of this work is to describe and compare three exploratory chemometrical tools,principal components analysis,independent components analysis and common components analysis,the last one being a modification of the multi-block statistical method known as common components and specific weights analysis.The three methods were applied to a set of data to show the differences and similarities of the results obtained,highlighting their complementarity.
文摘Restoration of phase aberrations is crucial for addressing atmospheric turbulence in light propagation.Traditional restoration algorithms based on Zernike polynomials(ZPs)often encounter challenges related to high computational complexity and insufficient capture of high-frequency phase aberration components,so we proposed a Principal-Component-Analysis-based method for representing phase aberrations.This paper discusses the factors influencing the accuracy of restoration,mainly including the sample space size and the sampling interval of D/r_(0),on the basis of characterizing phase aberrations by Principal Components(PCs).The experimental results show that a larger D/r_(0)sampling interval can ensure the generalization ability and robustness of the principal components in the case of a limited amount of original data,which can help to achieve high-precision deployment of the model in practical applications quickly.In the environment with relatively strong turbulence in the test set of D/r_(0)=24,the use of 34 terms of PCs can improve the corrected Strehl ratio(SR)from 0.007 to 0.1585,while the Strehl ratio of the light spot after restoration using 34 terms of ZPs is only 0.0215,demonstrating almost no correction effect.The results indicate that PCs can serve as a better alternative in representing and restoring the characteristics of atmospheric turbulence induced phase aberrations.These findings pave the way to use PCs of phase aberrations with fewer terms than traditional ZPs to achieve data dimensionality reduction,and offer a reference to accelerate and stabilize the model and deep learning based adaptive optics correction.
基金supported by the Natural Science Foundation of China(Grant Nos.42374032,42174103,42004073)Provincial Natural Science Foundation(2024JJ8348)the Key Laboratory of Natural Resources Monitoring and Supervision in Southern Hilly Region,Ministry of Natural Resources(NRMSSHR2023Y01)。
文摘The Global Navigation Satellite System(GNSS)is vital for monitoring terrestrial water storage(TWS).However,effectively extracting hydrological load deformation from GNSS observations poses a significant challenge.This study proposes a novel strategy;the seasonal hydrological load signals are removed from the raw data,and the remaining signals use principal component analysis(PCA).Simulation results from Yunnan Province demonstrate that the spatial distribution of the root mean square error(RMSE)is improved by approximately 15% compared with traditional PCA extraction from raw data.From January 2013 to December 2022,TWS was inverted from 24 GNSS stations in Yunnan Province.The spatial distribution and time series of TWS inverted from GNSS align well with those TWS inferred from the Gravity Recovery and Climate Experiment(GRACE),GRACE Follow-On(GFO),and the Global Land Data Assimilation System(GLDAS)land surface model.However,the amplitude of the GNSS-inverted TWS is slightly higher.Since GNSS ground stations are more sensitive to hydrological load signals,they show correlations with precipitation data that are 8.6%and 6.0%higher than those of GRACE and GLDAS,respectively.In the power spectral density analysis of GRACE/GFO,GLDAS,and GNSS,the signal strength of GNSS is much higher than that of GRACE/GFO and GLDAS in the June and February cycles.These findings suggest that the new data extraction strategy can capture higher frequency hydrological signals in TWS,and GNSS observations can help address limitations in GRACE/GFO observations.This study demonstrates the potential of GNSS TWS in capturing higher-frequency hydrological signals and climate extremes application.
基金supported by the National Natural Science Foundation of China(No.42177177).
文摘It is of critical importance to determine the endpoint of stabilization for landfills that are stabilized by aeration acceleration.Current stabilization evaluation methods are inconsistent and mostly fail to account for the effect of oxygen concentration.In this study,degradation experiments were conducted to quantitatively analyze the impact of oxygen on microbial communities and metabolic functions.High-throughput sequencing analysis demonstrated that an oxygen concentration exceeding 10%significantly enhances amino acid metabolism,secondary metabolite biosynthesis,and exogenous biodegradation.Three-dimensional fluorescence data were analyzed using the PARAFAC method,and a novel fluorescence-based stabilization indicator was proposed based on the ratio of fulvic-like to tyrosine-like substances.When the growth multiples of the fluorescence index exceed 10-fold,it can be inferred that degradation has been met the stabilization endpoint.Principal component analysis was employed to establish multiple regression equations between the physicochemical parameters of landfill waste and dissolved fluorescent substants,offering an innovative insight to evaluate the stabilization process of aerated landfills.
基金supported by the National Key Research and Development Plan of China(No.2023YFB3406500)the National Natural Science Foundation of China(No.52475132)+2 种基金the Aeronautical Science Foundation of China(No.20200015053001)the Shaanxi Key Research Program Project,China(No.2024GX-ZDCYL-01–16)the Xi’an Key Industrial Chain Technology Research Project,China(No.2023JH-RGZNGG-0033)。
文摘Traditional beamforming techniques may not accurately locate sources in scenarios with both stationary and rotating sound sources.The existence of rotating sound sources can cause blurring in the stationary beamforming map.Current algorithms for separating different moving sound sources have limited effectiveness,leading to significant residual noise,especially when the rotating source is strong enough to mask stationary sources completely.To overcome these challenges,a novel solution utilizing a virtual rotating array in the modal domain combined with robust principal component analysis is proposed to separate sound sources with different rotational speeds.This approach,named Robust Principal Component Analysis in the Modal domain(RPCA-M),investigates the performance of convex nuclear norm and non-convex Schatten-p norm to distinguish stationary and rotating sources.By comparing the errors in Cross-Spectral Matrix(CSM)recovery and acoustic imaging across different algorithms,the effectiveness of RPCA-M in separating stationary and moving sound sources is demonstrated.Importantly,this method effectively separates sound sources,even when there are significant variations in their amplitudes at different rotation speeds.
文摘Groundwater is a crucial water source for urban areas in Africa, particularly where surface water is insufficient to meet demand. This study analyses the water quality of five shallow wells (WW1-WW5) in Half-London Ward, Tunduma Town, Tanzania, using Principal Component Analysis (PCA) to identify the primary factors influencing groundwater contamination. Monthly samples were collected over 12 months and analysed for physical, chemical, and biological parameters. The PCA revealed between four and six principal components (PCs) for each well, explaining between 84.61% and 92.55% of the total variance in water quality data. In WW1, five PCs captured 87.53% of the variability, with PC1 (33.05%) dominated by pH, EC, TDS, and microbial contamination, suggesting significant influences from surface runoff and pit latrines. In WW2, six PCs explained 92.55% of the variance, with PC1 (36.17%) highlighting the effects of salinity, TDS, and agricultural runoff. WW3 had four PCs explaining 84.61% of the variance, with PC1 (39.63%) showing high contributions from pH, hardness, and salinity, indicating geological influences and contamination from human activities. Similarly, in WW4, six PCs explained 90.83% of the variance, where PC1 (43.53%) revealed contamination from pit latrines and fertilizers. WW5 also had six PCs, accounting for 92.51% of the variance, with PC1 (42.73%) indicating significant contamination from agricultural runoff and pit latrines. The study concludes that groundwater quality in Half-London Ward is primarily affected by a combination of surface runoff, pit latrine contamination, agricultural inputs, and geological factors. The presence of microbial contaminants and elevated nitrate and phosphate levels underscores the need for improved sanitation and sustainable agricultural practices. Recommendations include strengthening sanitation infrastructure, promoting responsible farming techniques, and implementing regular groundwater monitoring to safeguard water resources and public health in the region.
基金supported by the National Natural Science Foundation of China(Grant Numbers 42374195 and 42188101)the fellowship of China National Postdoctoral Program for Innovative Talents(Grant Number BX20230273)+1 种基金the Hubei Provincial Natural Science Foundation of China(Grant Number 2024AFB-097)the Postdoctor Project of Hubei Province(Grant Number 2024HBBHCXA054).
文摘This study employs Principal Component Analysis(PCA)and 13 years of SD-WACCM-X model data(2007-2019)to investigate the characteristics and mechanisms of Inter-hemispheric Coupling(IHC)triggered by sudden stratospheric warming(SSW)events.IHC in both hemispheres leads to a cold anomaly in the equatorial stratosphere,a warm anomaly in the equatorial mesosphere,and increased temperatures in the mesosphere and lower thermosphere(MLT)region of the summer hemisphere.However,the IHC features during boreal winter period are significantly weaker than during the austral winter period,primarily due to weaker stationary planetary wave activity in the Southern Hemisphere(SH).During the austral winter period,IHC results in a warm anomaly in the polar mesosphere of the SH,which does not occur in the NH during boreal winter period.This study also examines the possible influence of quasi-two-day waves(QTDWs)on IHC.We found that the largest temperature anomaly in the summer polar MLT region is associated with a large wind instability area,and a well-developed critical layer structure of QTDW in January.In contrast,during July,despite favorable conditions for QTDW propagation in the Northern Hemisphere,weaker IHC response is observed,suggesting that IHC features and the relationship with QTDWs during July would be more complex than during January.
基金supported by the National Natural Science Foundation of China(Grant Nos.12104075 and 12347101).
文摘We study the influence of disorder on the Moore–Read state by principal component analysis(PCA),which is one of the ground state candidates for the 5/2 fractional Hall state.By using PCA,the topological features of the ground state wave functions with different disorder strengths can be distilled.As the disorder strength increases,the Moore–Read state will be destroyed.We explore the phase transition by analyzing the overlaps between the random sample wave functions and the topologically distilled state.The cross-point between the amplitudes of the principal component and its counterpart is the phase transition point.Additionally,the origin of the second component comes from the excited states,which is different from the Laughlin state.
文摘[Objective] This study was conducted to provide certain theoretical reference for the comprehensive evaluation and breeding of new fresh waxy corn vari- eties. [Method] With 5 good fresh waxy corn varieties as experimental materials, correlation analysis and principal component anatysis were performed on 13 agronomic traits, i.e., plant height, ear position, ear weight, ear diameter, axis diameter, ear length, bald tip length, ear row number, number of grains per row, 100-kernel weight, fresh ear yield, tassel length, and tassel branch number. [Result] The principal component analysis performed to the 13 agronomic traits showed that the first three principal components, i.e., the fresh ear yield factors, the tassel factors and the bald top factors, had an accumulative contribution rate over 87.2767%, and could basically represent the genetic information represented by the 13 traits. The first principal component is the main index for the selection and evaluation of good corn varieties which should have large ear, large ear diameter but small axis diameter, i.e., longer grains, larger number of grains per ear, higher, 100-grain weight and higher plant height. As to the second principal component, the plants of fresh corn varieties are best to have longer tassel and not too many branches, and under the premise of ensuring enough pollen for the female spike, the varieties with fewer tassel branches shoud be selected as far as possible. From the point of the third principal component, bald tip length affects the marketing quality of fresh corn, and during fariety evaluation and breeding, the bald top length should be control at the Iowest standard. [Conclusion] The fresh ear yield of corn is in close positive correlation with ear weight, 100-grain weight, ear diameter, number of grains per row and ear length, and plant height also affects fresh ear yield.
基金Supported by the Fund of Anhui Provincial Tobacco Monopoly Bureau(AHKJ2008-03)Anhui Provincial University Key Project of Natural Science(KJ2010A114)Undergraduate Student Science and Technology Innovation Fund of Anhui Agricultural University(2010233)~~
文摘[Objective] This study aimed to explore the related mechanisms of the breaking of flue-cured tobacco leaves. [Method] Anti-breaking models of the main veins of flue-cured tobacco leaves were constructed for principal component analysis on the anti-breaking index, leaf traits and cellulose contents. [Result] The results showed that the growth traits had certain relevance with the cellulose contents while the leaf weight assumed a significant negative correlation with the anti-breaking index, indicating that the heavier the leaf weight was, the weaker the anti-breaking capacity of flue-cured tobacco would be; the cross-sectional area of main veins and the cellulose contents had shown a positive correlation with the anti-breaking index, indicating that the thicker the main vein of flue-cured tobacco was, the higher the cellulose contents would be, and the stronger the anti-breaking capacity of flue-cured tobacco leaves would be. [Conclusion] This study provided theoretical basis and reference to improve tobacco production and enhance the quality of flue-cured tobacco.
文摘In principal component analysis (PCA) algorithms for face recognition, to reduce the influence of the eigenvectors which relate to the changes of the illumination on abstract features, a modified PCA (MPCA) algorithm is proposed. The method is based on the idea of reducing the influence of the eigenvectors associated with the large eigenvalues by normalizing the feature vector element by its corresponding standard deviation. The Yale face database and Yale face database B are used to verify the method. The simulation results show that, for front face and even under the condition of limited variation in the facial poses, the proposed method results in better performance than the conventional PCA and linear discriminant analysis (LDA) approaches, and the computational cost remains the same as that of the PCA, and much less than that of the LDA.
基金The Pre-Research Foundation of National Ministries andCommissions (No9140A16050109DZ01)the Scientific Research Program of the Education Department of Shanxi Province (No09JK701)
文摘In order to overcome the shortcomings that the reconstructed spectral reflectance may be negative when using the classic principal component analysis (PCA)to reduce the dimensions of the multi-spectral data, a nonnegative constrained principal component analysis method is proposed to construct a low-dimensional multi-spectral space and accomplish the conversion between the new constructed space and the multispectral space. First, the reason behind the negative data is analyzed and a nonnegative constraint is imposed on the classic PCA. Then a set of nonnegative linear independence weight vectors of principal components is obtained, by which a lowdimensional space is constructed. Finally, a nonlinear optimization technique is used to determine the projection vectors of the high-dimensional multi-spectral data in the constructed space. Experimental results show that the proposed method can keep the reconstructed spectral data in [ 0, 1 ]. The precision of the space created by the proposed method is equivalent to or even higher than that by the PCA.
基金supported by the National Natural Science Foundation of China(Grant Nos.82073808,81872828,and 81573384)。
文摘The fruits of leguminous plants Cercis Chinensis Bunge are still overlooked although they have been reported to be antioxidative because of the limited information on the phytochemicals of C.chinensis fruits.A simple,rapid and sensitive HPLC-MS/MS method was developed for the identification and quantitation of the major bioactive components in C.chinensis fruits.Eighteen polyphenols were identified,which are first reported in C.chinensis fruits.Moreover,ten components were simultaneously quantified.The validated quantitative method was proved to be sensitive,reproducible and accurate.Then,it was applied to analyze batches of C.chinensis fruits from different phytomorph and areas.The principal components analysis(PCA)realized visualization and reduction of data set dimension while the hierarchical cluster analysis(HCA)indicated that the content of phenolic acids or all ten components might be used to differentiate C.chinensis fruits of different phytomorph.
文摘Matrix principal component analysis (MatPCA), as an effective feature extraction method, can deal with the matrix pattern and the vector pattern. However, like PCA, MatPCA does not use the class information of samples. As a result, the extracted features cannot provide enough useful information for distinguishing pat- tern from one another, and further resulting in degradation of classification performance. To fullly use class in- formation of samples, a novel method, called the fuzzy within-class MatPCA (F-WMatPCA)is proposed. F-WMatPCA utilizes the fuzzy K-nearest neighbor method(FKNN) to fuzzify the class membership degrees of a training sample and then performs fuzzy MatPCA within these patterns having the same class label. Due to more class information is used in feature extraction, F-WMatPCA can intuitively improve the classification perfor- mance. Experimental results in face databases and some benchmark datasets show that F-WMatPCA is effective and competitive than MatPCA. The experimental analysis on face image databases indicates that F-WMatPCA im- proves the recognition accuracy and is more stable and robust in performing classification than the existing method of fuzzy-based F-Fisherfaces.
文摘Referring to GB5618-1995 about heavy metal pollution,and using statistical analysis SPSS,the major pollutants of mine area farmland heavy metal pollution were identified by variable clustering analysis.Assessment and classification were done to the mine area farmland heavy metal pollution situation by synthetic principal components analysis (PCA).The results show that variable clustering analysis is efficient to identify the principal components of mine area farmland heavy metal pollution.Sort and clustering were done to the synthetic principal components scores of soil sample,which is given by synthetic principal components analysis.Data structure of soil heavy metal contaminations relationships and pollution level of different soil samples are discovered.The results of mine area farmland heavy metal pollution quality assessed and classified with synthetic component scores reflect the influence of both the major and compound heavy metal pol- lutants.Identification and assessment results of mine area farmland heavy metal pollution can provide reference and guide to propose control measures of mine area farmland heavy metal pollution and focus on the key treatment region.
文摘Survey and analysis were conducted on water quality of offshore seas in eastern region of Shenzhen by principal component analysis with SPSS. Then, 8 pollutants indices were then reduced to 5. Based on weighted analysis of principal component weights, comprehensive scores of different monitored stations were com- puted and sequenced in order to make evaluation on sea quality of eastern region of Shenzhen.