Many complex traits are highly correlated rather than independent. By taking the correlation structure of multiple traits into account, joint association analyses can achieve both higher statistical power and more acc...Many complex traits are highly correlated rather than independent. By taking the correlation structure of multiple traits into account, joint association analyses can achieve both higher statistical power and more accurate estimation. To develop a statistical approach to joint association analysis that includes allele detection and genetic effect estimation, we combined multivariate partial least squares regression with variable selection strategies and selected the optimal model using the Bayesian Information Criterion(BIC). We then performed extensive simulations under varying heritabilities and sample sizes to compare the performance achieved using our method with those obtained by single-trait multilocus methods. Joint association analysis has measurable advantages over single-trait methods, as it exhibits superior gene detection power, especially for pleiotropic genes. Sample size, heritability,polymorphic information content(PIC), and magnitude of gene effects influence the statistical power, accuracy and precision of effect estimation by the joint association analysis.展开更多
Partial least squares discriminant analysis (PLS-DA) with integrated moving-window (MW) waveband screening was applied to the discriminant analysis of liquor brands with near-infrared (NIR) spectroscopy. Luzhou Laojia...Partial least squares discriminant analysis (PLS-DA) with integrated moving-window (MW) waveband screening was applied to the discriminant analysis of liquor brands with near-infrared (NIR) spectroscopy. Luzhou Laojiao, a popular liquor with strong fragrant flavor, was used as the identified liquor brand (160 samples, negative, 52 vol alcoholicity). Liquors of 10 other brands with strong fragrant flavor were used as the interferential brands (200 samples, positive, 52 vol alcoholicity). The Kennard-Stone algorithm was used for the division of modeling samples to achieve uniformity and representativeness. Based on the MW-PLS-DA, a simplified optimal model set with 157 wavebands was further proposed. This set contained five types of wavebands corresponding to the NIR absorption bands of water, ethanol, and other micronutrients (i.e., acids, aldehydes, phenols, and aromatic compounds) in liquor for practical choice. Using five selected simple models with 4775 - 4239, 7804 - 6569, 6264 - 5844, 9435 - 7896, and 12066 - 10373 cm-1, the validation recognition rates were obtained as 99.3% or higher. Results show good prediction performance and low model complexity, and also provided a valuable reference for designing small dedicated instruments. The proposed method is a promising tool for large-scale inspection of liquor food safety.展开更多
This paper proposes an integrative framework for network-structured analytic network process (ANP) modeling. The underlying rationales include: 1) creating the measuring items for the complex decision problems;2) appl...This paper proposes an integrative framework for network-structured analytic network process (ANP) modeling. The underlying rationales include: 1) creating the measuring items for the complex decision problems;2) applying factor analysis to reduce the complex measuring items into fewer constructs;3) employing Bayesian network classifier technique to discover the causal directions among constructs;4) using partial least squares path modeling to test the causal relationships among the items-constructs. The proposed framework is implemented for knowledge discovery to a case of high-tech companies’ enterprise resource planning (ERP) benefits and satisfaction in Hsinchu Science Park,Taiwan. The results show that the proposed framework for ANP modeling can reach a satisfactory level of convergent reliability and validity. Based on the findings, pragmatic implications to the ERP venders are discussed. This study has shed new light on the long neglected, yet critical, issue on decision structures and knowledge discovery for ANP modeling.展开更多
The objective of this paper is to present a review of different calibration and classification methods for functional data in the context of chemometric applications. In chemometric, it is usual to measure certain par...The objective of this paper is to present a review of different calibration and classification methods for functional data in the context of chemometric applications. In chemometric, it is usual to measure certain parameters in terms of a set of spectrometric curves that are observed in a finite set of points (functional data). Although the predictor variable is clearly functional, this problem is usually solved by using multivariate calibration techniques that consider it as a finite set of variables associated with the observed points (wavelengths or times). But these explicative variables are highly correlated and it is therefore more informative to reconstruct first the true functional form of the predictor curves. Although it has been published in several articles related to the implementation of functional data analysis techniques in chemometric, their power to solve real problems is not yet well known. Because of this the extension of multivariate calibration techniques (linear regression, principal component regression and partial least squares) and classification methods (linear discriminant analysis and logistic regression) to the functional domain and some relevant chemometric applications are reviewed in this paper.展开更多
目的利用近红外光谱技术结合化学计量学方法对苦杏仁生品和不同火候炒苦杏仁进行定性和定量分析,以期建立不同火候炒苦杏仁的快速鉴定方法。方法利用HPLC方法测定样品苦杏仁苷含量,采集50批苦杏仁生品及其不同火候炒制品的近红外光谱信...目的利用近红外光谱技术结合化学计量学方法对苦杏仁生品和不同火候炒苦杏仁进行定性和定量分析,以期建立不同火候炒苦杏仁的快速鉴定方法。方法利用HPLC方法测定样品苦杏仁苷含量,采集50批苦杏仁生品及其不同火候炒制品的近红外光谱信息,经最优预处理算法预处理后利用偏最小二乘法判别分析(partial least square-discriminant analysis,PLS-DA)建立苦杏仁生品及其不同火候炒制品的定性及定量分析模型。结果标准正态变量变换(standard normal variate transformation,SNV)处理方法结合PLS-DA在模型建立中显示出良好的区分能力。在一维的九点平滑1D9S预处理方法的基础上,使用竞争性自适应重加权采样(competitive adaptive reweighted sampling,CARS)提取特征建立的偏最小二乘回归(partial least squares regressi,PLSR)模型效果最佳。结论基于近红外光谱技术结合化学计量学方法所建立的苦杏仁生品和不同炒制程度的炒苦杏仁定性判别模型和含量预测模型具有较高的准确性,可用于苦杏仁和不同火候炒苦杏仁的快速鉴别。展开更多
基金supported by grants from the National Program on the Development of Basic Research (2011CB100100)the Priority Academic Program Development of Jiangsu Higher Education Institutions, the National Natural Science Foundations (31391632, 31200943, 31171187, and 91535103)+3 种基金the National High-tech R&D Program (863 Program) (2014AA10A601-5)the Natural Science Foundations of Jiangsu Province (BK20150010)the Natural Science Foundation of the Jiangsu Higher Education Institutions (14KJA210005)the Innovative Research Team of Universities in Jiangsu Province (KYLX_1352)
文摘Many complex traits are highly correlated rather than independent. By taking the correlation structure of multiple traits into account, joint association analyses can achieve both higher statistical power and more accurate estimation. To develop a statistical approach to joint association analysis that includes allele detection and genetic effect estimation, we combined multivariate partial least squares regression with variable selection strategies and selected the optimal model using the Bayesian Information Criterion(BIC). We then performed extensive simulations under varying heritabilities and sample sizes to compare the performance achieved using our method with those obtained by single-trait multilocus methods. Joint association analysis has measurable advantages over single-trait methods, as it exhibits superior gene detection power, especially for pleiotropic genes. Sample size, heritability,polymorphic information content(PIC), and magnitude of gene effects influence the statistical power, accuracy and precision of effect estimation by the joint association analysis.
文摘Partial least squares discriminant analysis (PLS-DA) with integrated moving-window (MW) waveband screening was applied to the discriminant analysis of liquor brands with near-infrared (NIR) spectroscopy. Luzhou Laojiao, a popular liquor with strong fragrant flavor, was used as the identified liquor brand (160 samples, negative, 52 vol alcoholicity). Liquors of 10 other brands with strong fragrant flavor were used as the interferential brands (200 samples, positive, 52 vol alcoholicity). The Kennard-Stone algorithm was used for the division of modeling samples to achieve uniformity and representativeness. Based on the MW-PLS-DA, a simplified optimal model set with 157 wavebands was further proposed. This set contained five types of wavebands corresponding to the NIR absorption bands of water, ethanol, and other micronutrients (i.e., acids, aldehydes, phenols, and aromatic compounds) in liquor for practical choice. Using five selected simple models with 4775 - 4239, 7804 - 6569, 6264 - 5844, 9435 - 7896, and 12066 - 10373 cm-1, the validation recognition rates were obtained as 99.3% or higher. Results show good prediction performance and low model complexity, and also provided a valuable reference for designing small dedicated instruments. The proposed method is a promising tool for large-scale inspection of liquor food safety.
文摘This paper proposes an integrative framework for network-structured analytic network process (ANP) modeling. The underlying rationales include: 1) creating the measuring items for the complex decision problems;2) applying factor analysis to reduce the complex measuring items into fewer constructs;3) employing Bayesian network classifier technique to discover the causal directions among constructs;4) using partial least squares path modeling to test the causal relationships among the items-constructs. The proposed framework is implemented for knowledge discovery to a case of high-tech companies’ enterprise resource planning (ERP) benefits and satisfaction in Hsinchu Science Park,Taiwan. The results show that the proposed framework for ANP modeling can reach a satisfactory level of convergent reliability and validity. Based on the findings, pragmatic implications to the ERP venders are discussed. This study has shed new light on the long neglected, yet critical, issue on decision structures and knowledge discovery for ANP modeling.
文摘The objective of this paper is to present a review of different calibration and classification methods for functional data in the context of chemometric applications. In chemometric, it is usual to measure certain parameters in terms of a set of spectrometric curves that are observed in a finite set of points (functional data). Although the predictor variable is clearly functional, this problem is usually solved by using multivariate calibration techniques that consider it as a finite set of variables associated with the observed points (wavelengths or times). But these explicative variables are highly correlated and it is therefore more informative to reconstruct first the true functional form of the predictor curves. Although it has been published in several articles related to the implementation of functional data analysis techniques in chemometric, their power to solve real problems is not yet well known. Because of this the extension of multivariate calibration techniques (linear regression, principal component regression and partial least squares) and classification methods (linear discriminant analysis and logistic regression) to the functional domain and some relevant chemometric applications are reviewed in this paper.
文摘目的利用近红外光谱技术结合化学计量学方法对苦杏仁生品和不同火候炒苦杏仁进行定性和定量分析,以期建立不同火候炒苦杏仁的快速鉴定方法。方法利用HPLC方法测定样品苦杏仁苷含量,采集50批苦杏仁生品及其不同火候炒制品的近红外光谱信息,经最优预处理算法预处理后利用偏最小二乘法判别分析(partial least square-discriminant analysis,PLS-DA)建立苦杏仁生品及其不同火候炒制品的定性及定量分析模型。结果标准正态变量变换(standard normal variate transformation,SNV)处理方法结合PLS-DA在模型建立中显示出良好的区分能力。在一维的九点平滑1D9S预处理方法的基础上,使用竞争性自适应重加权采样(competitive adaptive reweighted sampling,CARS)提取特征建立的偏最小二乘回归(partial least squares regressi,PLSR)模型效果最佳。结论基于近红外光谱技术结合化学计量学方法所建立的苦杏仁生品和不同炒制程度的炒苦杏仁定性判别模型和含量预测模型具有较高的准确性,可用于苦杏仁和不同火候炒苦杏仁的快速鉴别。