Background: Different production systems and climates could lead to genotype-by-environment(G × E) interactions between populations, and the inclusion of G × E interactions is becoming essential in breeding ...Background: Different production systems and climates could lead to genotype-by-environment(G × E) interactions between populations, and the inclusion of G × E interactions is becoming essential in breeding decisions. The objective of this study was to investigate the performance of multi-trait models in genomic prediction in a limited number of environments with G × E interactions.Results: In total, 2,688 and 1,384 individuals with growth and reproduction phenotypes, respectively, from two Yorkshire pig populations with similar genetic backgrounds were genotyped with the PorcineSNP80 panel.Single-and multi-trait models with genomic best linear unbiased prediction(GBLUP) and BayesC π were implemented to investigate their genomic prediction abilities with 20 replicates of five-fold cross-validation.Our results regarding between-environment genetic correlations of growth and reproductive traits(ranging from 0.618 to 0.723) indicated the existence of G × E interactions between these two Yorkshire pig populations. For single-trait models, genomic prediction with GBLUP was only 1.1% more accurate on average in the combined population than in single populations, and no significant improvements were obtained by BayesC π for most traits. In addition, single-trait models with either GBLUP or BayesC π produced greater bias for the combined population than for single populations. However, multi-trait models with GBLUP and BayesC π better accommodated G × E interactions,yielding 2.2% – 3.8% and 1.0% – 2.5% higher prediction accuracies for growth and reproductive traits, respectively,compared to those for single-trait models of single populations and the combined population. The multi-trait models also yielded lower bias and larger gains in the case of a small reference population. The smaller improvement in prediction accuracy and larger bias obtained by the single-trait models in the combined population was mainly due to the low consistency of linkage disequilibrium between the two populations, which also caused the BayesC π method to always produce the largest standard error in marker effect estimation for the combined population.Conclusions: In conclusion, our findings confirmed that directly combining populations to enlarge the reference population is not efficient in improving the accuracy of genomic prediction in the presence of G × E interactions, while multi-trait models perform better in a limited number of environments with G × E interactions.展开更多
Yield prediction is the primary goal of genomic selection(GS)-assisted crop breeding.Because yield is a complex quantitative trait,making predictions from genotypic data is challenging.Transfer learning can produce an...Yield prediction is the primary goal of genomic selection(GS)-assisted crop breeding.Because yield is a complex quantitative trait,making predictions from genotypic data is challenging.Transfer learning can produce an effective model for a target task by leveraging knowledge from a different,but related,source domain and is considered a great potential method for improving yield prediction by integrating multi-trait data.However,it has not previously been applied to genotype-to-phenotype prediction owing to the lack of an efficient implementation framework.We therefore developed TrG2P,a transfer-learning-based framework.TrG2P first employs convolutional neural networks(CNN)to train models using non-yield-trait phenotypic and genotypic data,thus obtaining pre-trained models.Subsequently,the convolutional layer parameters from these pre-trained models are transferred to the yield prediction task,and the fully connected layers are retrained,thus obtaining fine-tuned models.Finally,the convolutional layer and the first fully connected layer of the fine-tuned models are fused,and the last fully connected layer is trained to enhance prediction performance.We applied TrG2P to five sets of genotypic and phenotypic data from maize(Zea mays),rice(Oryza sativa),and wheat(Triticum aestivum)and compared its model precision to that of seven other popular GS tools:ridge regression best linear unbiased prediction(rrBLUP),random forest,support vector regression,light gradient boosting machine(LightGBM),CNN,DeepGS,and deep neural network for genomic prediction(DNNGP).TrG2P improved the accuracy of yield prediction by 39.9%,6.8%,and 1.8%in rice,maize,and wheat,respectively,compared with predictions generated by the best-performing comparison model.Our work therefore demonstrates that transfer learning is an effective strategy for improving yield prediction by integrating information from non-yield-trait data.We attribute its enhanced prediction accuracy to the valuable information available from traits associated with yield and to training dataset augmentation.The Python implementation of TrG2P is available at https://github.com/lijinlong1991/TrG2P.The web-based tool is available at http://trg2p.ebreed.cn:81.展开更多
Two clonal trial stands of Chinese Fir (Cunninghamia lanceolata) were used in this study, one was 19-year-old stand which included 38 clones, and the other was 17-year-old stand including 102 clones.The statistical ...Two clonal trial stands of Chinese Fir (Cunninghamia lanceolata) were used in this study, one was 19-year-old stand which included 38 clones, and the other was 17-year-old stand including 102 clones.The statistical analyses showed that there were very significant genetic variations in height, DBH,volume and ratio of heartwood(R<sub>hw</sub>),wood basic density(ρ<sub>b</sub> ) of the clones in the two stands. The repeatability of clones was in median to high level,and the genetic CV was different over the all five traits.There were very significant phenotypic and genetic correlations among height,DBH and volume,and negative correlations among growth, R<sub>hw</sub> andρ<sub>b</sub>.The selection method experiment indicated that index selection could improve volume, R<sub>hw</sub> andρ<sub>b</sub>,showing synthetically superior selection effects compared to any individual trait selection methods.展开更多
京津冀地区作为我国重要的早熟夏玉米生产区,对该区域早熟夏玉米品种开展多性状综合评价与筛选,对促进夏玉米品种资源的科学利用具有重要意义。本研究在品种-产量×性状组合(GYT,genotype by yield×trait)双标图模型基础上,将...京津冀地区作为我国重要的早熟夏玉米生产区,对该区域早熟夏玉米品种开展多性状综合评价与筛选,对促进夏玉米品种资源的科学利用具有重要意义。本研究在品种-产量×性状组合(GYT,genotype by yield×trait)双标图模型基础上,将“产量”拓展为“主要目标性状”,创新性提出品种-主性状×多性状(GMT,genotype by major trait×multi-trait)双标图方法。以2017-2024年完成京津冀地区京科联合体早熟夏玉米品种试验程序的72个参试品种为材料,分别基于主性状(产量、蛋白质含量、脂肪含量和赖氨酸含量)与产量、生育期、株高、百粒重、出籽率、籽粒含水量、容重、淀粉含量、蛋白质含量、脂肪含量、赖氨酸含量、抗病指数共12个目标性状的组合水平,采用新提出的GMT双标图方法对参试品种进行综合评价与选择。结果表明:(1)以产量为主要目标性状的品种-产量×性状组合双标图筛选出京农科458、京科628、MC921、鑫玉农812、京科383、京科938、京科597和京农科809共8个产量理想指数表现优秀的品种;(2)以蛋白质含量为主要目标性状的品种-蛋白质×性状组合(GPT,genotype by protein×trait)双标图筛选出综合表现突出的品种京农科836和MC921;(3)以脂肪含量为主要目标性状的品种-脂肪×性状组合(GFT,genotype by fat×trait)双标图筛选出表现优异的品种京农科458、京科383和MC616;(4)以赖氨酸含量为主要目标性状的品种-赖氨酸×性状组合(GLT,genotype by lysine×trait)双标图筛选出表现较好的品种MC921、MC167、京农科836和京农科801;(5)蛋白质理想指数和赖氨酸理想指数呈极显著正相关,以蛋白质含量和赖氨酸含量为共同目标性状时,筛选出“蛋白质-赖氨酸特专型”优秀品种MC921、京农科836和京农科458;产量理想指数和脂肪理想指数也呈极显著正相关,以产量和脂肪含量为共同目标性状时,筛选出京农科458和京科383“产量-脂肪特专型”优秀品种;基于上述4个主性状的理想指数同步筛选,选出“全能型”核心品种农科458和MC921。本研究提出的GMT双标图方法为多目标性状协同评价提供了新工具,筛选出的特专型品种和全能型品种可为京津冀地区玉米品种高效利用和高品质育种提供参考。展开更多
目的:检验中文版植入心律转复除颤器患者自我效能期望及结局期望量表(Self-Efficacy Expectations and Outcome Expectations after ICD Implantation Scales,SEOE-ICD)在心脏电子设备植入人群中应用的信度和效度,为该人群期望的评估提...目的:检验中文版植入心律转复除颤器患者自我效能期望及结局期望量表(Self-Efficacy Expectations and Outcome Expectations after ICD Implantation Scales,SEOE-ICD)在心脏电子设备植入人群中应用的信度和效度,为该人群期望的评估提供参考。方法:遵循量表引进原则,对量表进行汉化;于2024年3月—12月采用便利抽样法纳入205例北京市某三级甲等医院心律失常病房行心脏电子设备植入术的患者为调查对象,使用中文版SEOE-ICD、一般自我效能量表、治疗期望问卷、Florida患者接受度调查表进行测试;采用验证性因子分析、多质多法、内部一致性分析对量表进行信度和效度评价。结果:中文版SEOE-ICD共包括自我效能期望与结局期望两个子量表,13个条目;验证性因子模型适配度指标表现良好(χ^(2)/df=2.929,TLI=0.951,CFI=0.961,SRMR=0.042,RMSEA=0.097);多质多法分析显示量表有较好的聚合效度及区分效度;两个子量表的Cronbach’sα系数分别为0.949、0.878。结论:中文版SEOE-ICD在心脏电子设备植入人群中应用的信度、效度良好,可作为该人群期望的评估工具,为健康策略制定提供有效依据。展开更多
The objectives of this study were to estimate genetic parameters of lactation average somatic cell scores (LSCS) and examine genetic associations between LSCS and production traits in the first three lactations of C...The objectives of this study were to estimate genetic parameters of lactation average somatic cell scores (LSCS) and examine genetic associations between LSCS and production traits in the first three lactations of Chinese Holstein cows using single-parity multi-trait animal model and multi-trait repeatability animal model. There were totally 273605 lactation records of Chinese Holstein cows with first calving from 2001 to 2012. Heritability estimates for LSCS ranged from 0.144 to 0.187. Genetic correlations between LSCS and 305 days milk, protein percentage and fat percentage were -0.079, -0.082 and -0.135, respectively. Phenotypic correlation between LSCS and 305 days milk yield was negative (-0.103 to -0.190). Genetic correlation between 305 days milk and fat percentage or protein percentage was highly negative. Genetic correlation between milk fat percentage and milk protein percentage was highly favorable. Heritabilities of production traits decreased with increase of parity, whereas heritability of LSCS increased with increase of parity.展开更多
基金supported by grants from the earmarked fund for China Agriculture Research System (CARS-35)Modern Agriculture Science and Technology Key Project of Hebei Province (19226376D)+2 种基金the National Key Research and Development Project (SQ2019YFE00771)the National Natural Science Foundation of China (31671327)Major Project of Selection for New Livestock and Poultry Breeds of Zhejiang Province (2016C02054–5)。
文摘Background: Different production systems and climates could lead to genotype-by-environment(G × E) interactions between populations, and the inclusion of G × E interactions is becoming essential in breeding decisions. The objective of this study was to investigate the performance of multi-trait models in genomic prediction in a limited number of environments with G × E interactions.Results: In total, 2,688 and 1,384 individuals with growth and reproduction phenotypes, respectively, from two Yorkshire pig populations with similar genetic backgrounds were genotyped with the PorcineSNP80 panel.Single-and multi-trait models with genomic best linear unbiased prediction(GBLUP) and BayesC π were implemented to investigate their genomic prediction abilities with 20 replicates of five-fold cross-validation.Our results regarding between-environment genetic correlations of growth and reproductive traits(ranging from 0.618 to 0.723) indicated the existence of G × E interactions between these two Yorkshire pig populations. For single-trait models, genomic prediction with GBLUP was only 1.1% more accurate on average in the combined population than in single populations, and no significant improvements were obtained by BayesC π for most traits. In addition, single-trait models with either GBLUP or BayesC π produced greater bias for the combined population than for single populations. However, multi-trait models with GBLUP and BayesC π better accommodated G × E interactions,yielding 2.2% – 3.8% and 1.0% – 2.5% higher prediction accuracies for growth and reproductive traits, respectively,compared to those for single-trait models of single populations and the combined population. The multi-trait models also yielded lower bias and larger gains in the case of a small reference population. The smaller improvement in prediction accuracy and larger bias obtained by the single-trait models in the combined population was mainly due to the low consistency of linkage disequilibrium between the two populations, which also caused the BayesC π method to always produce the largest standard error in marker effect estimation for the combined population.Conclusions: In conclusion, our findings confirmed that directly combining populations to enlarge the reference population is not efficient in improving the accuracy of genomic prediction in the presence of G × E interactions, while multi-trait models perform better in a limited number of environments with G × E interactions.
基金This research was funded by the STI2030-Major Projects(no.2023ZD0406104)the Beijing Postdoctoral Research Foundation(no.2023-ZZ-116).
文摘Yield prediction is the primary goal of genomic selection(GS)-assisted crop breeding.Because yield is a complex quantitative trait,making predictions from genotypic data is challenging.Transfer learning can produce an effective model for a target task by leveraging knowledge from a different,but related,source domain and is considered a great potential method for improving yield prediction by integrating multi-trait data.However,it has not previously been applied to genotype-to-phenotype prediction owing to the lack of an efficient implementation framework.We therefore developed TrG2P,a transfer-learning-based framework.TrG2P first employs convolutional neural networks(CNN)to train models using non-yield-trait phenotypic and genotypic data,thus obtaining pre-trained models.Subsequently,the convolutional layer parameters from these pre-trained models are transferred to the yield prediction task,and the fully connected layers are retrained,thus obtaining fine-tuned models.Finally,the convolutional layer and the first fully connected layer of the fine-tuned models are fused,and the last fully connected layer is trained to enhance prediction performance.We applied TrG2P to five sets of genotypic and phenotypic data from maize(Zea mays),rice(Oryza sativa),and wheat(Triticum aestivum)and compared its model precision to that of seven other popular GS tools:ridge regression best linear unbiased prediction(rrBLUP),random forest,support vector regression,light gradient boosting machine(LightGBM),CNN,DeepGS,and deep neural network for genomic prediction(DNNGP).TrG2P improved the accuracy of yield prediction by 39.9%,6.8%,and 1.8%in rice,maize,and wheat,respectively,compared with predictions generated by the best-performing comparison model.Our work therefore demonstrates that transfer learning is an effective strategy for improving yield prediction by integrating information from non-yield-trait data.We attribute its enhanced prediction accuracy to the valuable information available from traits associated with yield and to training dataset augmentation.The Python implementation of TrG2P is available at https://github.com/lijinlong1991/TrG2P.The web-based tool is available at http://trg2p.ebreed.cn:81.
文摘Two clonal trial stands of Chinese Fir (Cunninghamia lanceolata) were used in this study, one was 19-year-old stand which included 38 clones, and the other was 17-year-old stand including 102 clones.The statistical analyses showed that there were very significant genetic variations in height, DBH,volume and ratio of heartwood(R<sub>hw</sub>),wood basic density(ρ<sub>b</sub> ) of the clones in the two stands. The repeatability of clones was in median to high level,and the genetic CV was different over the all five traits.There were very significant phenotypic and genetic correlations among height,DBH and volume,and negative correlations among growth, R<sub>hw</sub> andρ<sub>b</sub>.The selection method experiment indicated that index selection could improve volume, R<sub>hw</sub> andρ<sub>b</sub>,showing synthetically superior selection effects compared to any individual trait selection methods.
文摘京津冀地区作为我国重要的早熟夏玉米生产区,对该区域早熟夏玉米品种开展多性状综合评价与筛选,对促进夏玉米品种资源的科学利用具有重要意义。本研究在品种-产量×性状组合(GYT,genotype by yield×trait)双标图模型基础上,将“产量”拓展为“主要目标性状”,创新性提出品种-主性状×多性状(GMT,genotype by major trait×multi-trait)双标图方法。以2017-2024年完成京津冀地区京科联合体早熟夏玉米品种试验程序的72个参试品种为材料,分别基于主性状(产量、蛋白质含量、脂肪含量和赖氨酸含量)与产量、生育期、株高、百粒重、出籽率、籽粒含水量、容重、淀粉含量、蛋白质含量、脂肪含量、赖氨酸含量、抗病指数共12个目标性状的组合水平,采用新提出的GMT双标图方法对参试品种进行综合评价与选择。结果表明:(1)以产量为主要目标性状的品种-产量×性状组合双标图筛选出京农科458、京科628、MC921、鑫玉农812、京科383、京科938、京科597和京农科809共8个产量理想指数表现优秀的品种;(2)以蛋白质含量为主要目标性状的品种-蛋白质×性状组合(GPT,genotype by protein×trait)双标图筛选出综合表现突出的品种京农科836和MC921;(3)以脂肪含量为主要目标性状的品种-脂肪×性状组合(GFT,genotype by fat×trait)双标图筛选出表现优异的品种京农科458、京科383和MC616;(4)以赖氨酸含量为主要目标性状的品种-赖氨酸×性状组合(GLT,genotype by lysine×trait)双标图筛选出表现较好的品种MC921、MC167、京农科836和京农科801;(5)蛋白质理想指数和赖氨酸理想指数呈极显著正相关,以蛋白质含量和赖氨酸含量为共同目标性状时,筛选出“蛋白质-赖氨酸特专型”优秀品种MC921、京农科836和京农科458;产量理想指数和脂肪理想指数也呈极显著正相关,以产量和脂肪含量为共同目标性状时,筛选出京农科458和京科383“产量-脂肪特专型”优秀品种;基于上述4个主性状的理想指数同步筛选,选出“全能型”核心品种农科458和MC921。本研究提出的GMT双标图方法为多目标性状协同评价提供了新工具,筛选出的特专型品种和全能型品种可为京津冀地区玉米品种高效利用和高品质育种提供参考。
文摘目的:检验中文版植入心律转复除颤器患者自我效能期望及结局期望量表(Self-Efficacy Expectations and Outcome Expectations after ICD Implantation Scales,SEOE-ICD)在心脏电子设备植入人群中应用的信度和效度,为该人群期望的评估提供参考。方法:遵循量表引进原则,对量表进行汉化;于2024年3月—12月采用便利抽样法纳入205例北京市某三级甲等医院心律失常病房行心脏电子设备植入术的患者为调查对象,使用中文版SEOE-ICD、一般自我效能量表、治疗期望问卷、Florida患者接受度调查表进行测试;采用验证性因子分析、多质多法、内部一致性分析对量表进行信度和效度评价。结果:中文版SEOE-ICD共包括自我效能期望与结局期望两个子量表,13个条目;验证性因子模型适配度指标表现良好(χ^(2)/df=2.929,TLI=0.951,CFI=0.961,SRMR=0.042,RMSEA=0.097);多质多法分析显示量表有较好的聚合效度及区分效度;两个子量表的Cronbach’sα系数分别为0.949、0.878。结论:中文版SEOE-ICD在心脏电子设备植入人群中应用的信度、效度良好,可作为该人群期望的评估工具,为健康策略制定提供有效依据。
基金fundings from the National Natural Science Foundation of China (31200927)the National Modern Agricultural Industry Technology Fund for Scientists in Sheep Industry System, China (CARS-39-04B)+1 种基金the National Key Technology Research and Development Program of the Ministry of Science and Technology of China (2011BAD28B02, 2012BAD12B06)the Chinese Academy of Agricultural Sciences Foundation (2012cj-2)
文摘The objectives of this study were to estimate genetic parameters of lactation average somatic cell scores (LSCS) and examine genetic associations between LSCS and production traits in the first three lactations of Chinese Holstein cows using single-parity multi-trait animal model and multi-trait repeatability animal model. There were totally 273605 lactation records of Chinese Holstein cows with first calving from 2001 to 2012. Heritability estimates for LSCS ranged from 0.144 to 0.187. Genetic correlations between LSCS and 305 days milk, protein percentage and fat percentage were -0.079, -0.082 and -0.135, respectively. Phenotypic correlation between LSCS and 305 days milk yield was negative (-0.103 to -0.190). Genetic correlation between 305 days milk and fat percentage or protein percentage was highly negative. Genetic correlation between milk fat percentage and milk protein percentage was highly favorable. Heritabilities of production traits decreased with increase of parity, whereas heritability of LSCS increased with increase of parity.