Oil and protein content and fatty acid composition are quality traits in peanut.Elucidating the genetic mechanisms underlying these traits may help researchers to obtain improved cultivars by molecular breeding.Whole-...Oil and protein content and fatty acid composition are quality traits in peanut.Elucidating the genetic mechanisms underlying these traits may help researchers to obtain improved cultivars by molecular breeding.Whole-genome resequencing of a recombinant inbred population of 318 lines was performed to construct a high-density linkage map and identify QTL for peanut quality.The map,containing 4561 bin markers,covered 2032 c M with a mean marker density of 0.45 c M.A total of 110 QTL for oil and protein content,and fatty acid composition were mapped on the 18 peanut chromosomes.The QTL q A05.1 was detected in four environments and showed a major phenotypic effect on the contents of oil,protein,and six fatty acids.The genomic region spanned by q A05.1,corresponding to a physical interval of approximately 1.5 Mb,contains two SNPs polymorphic between the parents that could cause missense mutations.The two SNP sites were employed as KASP markers and validated using lines with extremely high and low oil contents.These sites may be useful in the marker-assisted breeding of peanut cultivars with high oil contents.展开更多
Largemouth bass(Micropterus salmoides) is an economically important fish species in North America, Europe, and China. Various genetic improvement programs and domestication processes have modified its genome sequence ...Largemouth bass(Micropterus salmoides) is an economically important fish species in North America, Europe, and China. Various genetic improvement programs and domestication processes have modified its genome sequence through selective pressure, leaving nucleotide signals that can be detected at the genomic level. In this study,we sequenced 149 largemouth bass fish, including protospecies(imported from the US) and improved breeds(four domestic breeding populations from China). We detected genomic regions harboring certain genes associated with improved traits, which may be useful molecular markers for practical domestication, breeding, and selection. Subsequent analyses of genetic diversity and population structure revealed that the improved breeds have undergone more rigorous genetic changes. Through selective signal analysis, we identified hundreds of putative selective sweep regions in each largemouth bass line. Interestingly, we predicted 103 putative candidate genes potentially subjected to selection,including several associated with growth(psst1 and grb10), early development(klf9, sp4, and sp8), and immune traits(pkn2, sept2, bcl6, and ripk2). These candidate genes represent potential genomic landmarks that could be used to improve important traits of biological and commercial interest. In summary, this study provides a genome-wide map of genetic variations and selection footprints in largemouth bass, which may benefit genetic studies and accelerate genetic improvement of this economically important fish.展开更多
Grape berry shape is an important agricultural trait.Clarifying its genetic basis is significant for cultivating grape varieties that meet market demands.However,the current study by forward genetics has not achieved ...Grape berry shape is an important agricultural trait.Clarifying its genetic basis is significant for cultivating grape varieties that meet market demands.However,the current study by forward genetics has not achieved in-depth results.Here,a high-density map was constructed to identify quantitative trait loci(QTLs)for berry shape.A total of 358709 polymorphic SNPs were obtained using whole-genome resequencing(WGS)based on 208 F2 individuals derived from round grape‘E42-6’and oblong grape‘Rizamat’.The 1635.65 cM high-density map was divided into 19 linkage groups with an average distance of 0.37 cM.Using this map,three significant QTLs for fruit shape index(ShI:ratio of berry length to berry width)identified over three years were mapped onto LG4 and LG5,including one stable QTL on Chr5 with the genomic region of 0.47–1.94 Mb.Combining with gene annotation and expression patterns based on RNA-seq data from two contrasting F2 individuals with round and oblong berry(their average ShI was 1.89 and 1.10,respectively)at four developmental stages,four candidate genes were selected from the above QTLs.They were mainly involved in DNA replication,cell wall modification,and phytohormone biosynthesis.Further analysis of RNA-seq data revealed that several important phytohormone synthesis and metabolic pathways were enriched based on differentially expressed genes(DEGs),which was consistent with the results of QTL mapping for genes related to plant hormone biosynthesis in the F2 population.Furthermore,a comparison of plant hormone content showed that there were significant differences in IAA and tZ content between the two contrasting F2 individuals at different developmental stages.Our findings provide molecular insights into the genetic variation in grape berry shape.Stable QTLs and their tightly linked markers offer the possibility of marker-assisted selection to accelerate berry shape breeding.展开更多
The abundance of domesticated sheep varieties and phenotypes is largely the result of long-term natural and artificial selection. However, there is limited information regarding the genetic mechanisms underlying pheno...The abundance of domesticated sheep varieties and phenotypes is largely the result of long-term natural and artificial selection. However, there is limited information regarding the genetic mechanisms underlying phenotypic variation induced by the domestication and improvement of sheep. In this study, to explore genomic diversity and selective regions at the genome level, we sequenced the genomes of 100 sheep across 10 breeds and combined these results with publicly available genomic data from 225 individuals, including improved breeds, Chinese indigenous breeds,African indigenous breeds, and their Asian mouflon ancestor. Based on population structure, the domesticated sheep formed a monophyletic group,while the Chinese indigenous sheep showed a clear geographical distribution trend. Comparative genomic analysis of domestication identified several selective signatures, including IFI44 and IFI44L genes and PANK2 and RNF24 genes, associated with immune response and visual function.Population genomic analysis of improvement demonstrated that candidate genes of selected regions were mainly associated with pigmentation,energy metabolism, and growth development.Furthermore, the IFI44 and IFI44L genes showed a common selection signature in the genomes of 30domesticated sheep breeds. The IFI44 c. 54413058C>G mutation was selected for genotyping and population genetic validation. Results showed that the IFI44 polymorphism was significantly associated with partial immune traits. Our findings identified the population genetic basis of domesticated sheep at the whole-genome level, providing theoretical insights into the molecular mechanism underlying breed characteristics and phenotypic changes during sheep domestication and improvement.展开更多
The phenotypic diversity resulting from artificial or natural selection of sheep has made a significant contribution to human civilization.Hu sheep are a local sheep breed unique to China with high reproductive rates ...The phenotypic diversity resulting from artificial or natural selection of sheep has made a significant contribution to human civilization.Hu sheep are a local sheep breed unique to China with high reproductive rates and rapid growth.Genomic selection signatures have been widely used to investigate the genetic mechanisms underlying phenotypic variation in livestock.Here,we conduct whole-genome sequencing of 207 Hu sheep and compare them with the wild ancestors of domestic sheep(Asiatic mouflon)to investigate the genetic characteristics and selection signatures of Hu sheep.Based on six signatures of selection approaches,we detect genomic regions containing genes related to reproduction(BMPR1B,BMP2,PGFS,CYP19,CAMK4,GGT5,and GNAQ),vision(ALDH1A2,SAG,and PDE6B),nervous system(NAV1),and immune response(GPR35,SH2B2,PIK3R3,and HRAS).Association analysis with a population of 1299 Hu sheep reveals that those missense mutations in the GPR35(GPR35 g.952651 A>G;GPR35 g.952496 C>T)and NAV1(NAV1 g.84216190 C>T;NAV1 g.84227412 G>A)genes are significantly associated(P<0.05)with immune and growth traits in Hu sheep,respectively.This research offers unique insights into the selection characteristics of Hu sheep and facilitates further genetic improvement and molecular investigations.展开更多
Rapeseed(Brassica napus)supplies about half of the vegetable oil in China.Increasing oil production and searching for genes that control oil content in the crop are research goals.In our previous studies,four major QT...Rapeseed(Brassica napus)supplies about half of the vegetable oil in China.Increasing oil production and searching for genes that control oil content in the crop are research goals.In our previous studies,four major QTL for oil content located on A08,A09,C03 and C06 in the Ken C-8×N53-2(KN DH)mapping population were detected.The parental lines were resequenced to identify structural variations and candidate genes affecting oil content in these four major QTL regions.Insertion-deletion(In Del)markers were developed and used to narrow the regions.Differentially expressed genes located in the regions were investigated.GO and KEGG analysis showed that several genes were associated with lipid metabolism.Several transcription factors with higher expression in N53-2 than in Ken C-8 were identified.These results shed light on the genetic control of oil content and may be helpful for the development of highoil-content cultivars.展开更多
Clubroot caused by Plasmodiophora brassicae is a devastating disease of Cruciferous crops.Developing cultivars with clubroot resistance(CR)is the most effective control measure.For the two major Brassica vegetable spe...Clubroot caused by Plasmodiophora brassicae is a devastating disease of Cruciferous crops.Developing cultivars with clubroot resistance(CR)is the most effective control measure.For the two major Brassica vegetable species B.rapa and B.oleracea,several commercial cultivars with unclear CR pedigrees have been intensively used as CR donors in breeding.However,the continuous occurrence of CR-breaking makes the CR pedigree underlying these cultivars one of the breeders'most urgent concerns.The complex intraspecific diversity of these two major Brassica vegetables has also limited the applicability of CR markers in different breeding programs.Here we first traced the pedigree underlying two kinds of CR that have been widely applied in breeding by linkage and introgression analyses based on public resequencing data.In B.rapa,a major locus CRzi8 underlying the CR of the commercial CR donor‘DegaoCR117’was identified.CRzi8 was further shown to have been introgressed from turnip(B.rapa ssp.rapifera)and that it carried a potential functional allele of Crr1a.The turnip introgression carried CRb^(c),sharing the same coding sequence with the CRb that was also identified from chromosome C07 of B.oleracea CR cultivars with different morphotypes.Within natural populations,variation analysis of linkage intervals of CRzi8,PbBa8.1,CRb,and CRb^(c)yielded easily resolved InDel markers(>20 bp)for these fundamental CR genes.The specificity of these markers was tested in diverse cultivars panels,and each exhibited high reliability in breeding.Our research demonstrates the value of the practice of applying resequencing big data to solve urgent concerns in breeding programs.展开更多
The genetic adaptations of various organisms to heterogeneous environments in the northwestern Pacific remain poorly understood.Heterogeneous genomic divergence among populations may reflect environmentalselection.Adv...The genetic adaptations of various organisms to heterogeneous environments in the northwestern Pacific remain poorly understood.Heterogeneous genomic divergence among populations may reflect environmentalselection.Advancingour understanding of the mechanisms by which organisms adapt to different temperatures in response to climate change and predicting the adaptive potential and ecological consequences of anthropogenic global warming are critical.We sequenced the whole genomes of Japanese whiting(Sillago japonica)specimens collected from different latitudinal locations along the coastal waters of China and Japan to detect possible thermal adaptations.Using population genomics,a total of 5.48 million single nucleotide polymorphisms(SNPs)from five populations revealed a complete genetic break between the Chinese and Japanese groups,which was attributed to both geographic distance and local adaptation.The shared natural selection genes between two isolated populations(i.e.,Zhoushan and Ise Bay/Tokyo Bay)indicated possible parallel evolution at the genetic level induced by temperature.These genes also indicated that the process of temperature selection on isolated populations is repeatable.Moreover,we observed natural candidate genes related to membrane fluidity,possibly underlying adaptation to cold environmental stress.These findings advance our understanding of the genetic mechanisms underlying the rapid adaptations of fish species.Species distribution projection models suggested that the Chinese and Japanese groups may have different responses to future climate change,with the former expanding and the latter contracting.The findings of this study enhance our understanding of genetic differentiation and adaptation to changing environments.展开更多
Trichomes are specialized structures developed from epidermal cells and can protect plants against biotic and abiotic stresses.Trichomes cover carrots during the generative phase.However,the morphology of the carrot t...Trichomes are specialized structures developed from epidermal cells and can protect plants against biotic and abiotic stresses.Trichomes cover carrots during the generative phase.However,the morphology of the carrot trichomes and candidate genes controlling the formation of trichomes are still unclear.This study found that carrot trichomes were nonglandular and unbranched hairs distributed on the stem,leaf,petiole,pedicel,and seed of carrot.Resequencing analysis of a trichome mutant with sparse and short trichomes(sst)and a wild type(wt)with long and dense trichomes on carrot stems was conducted.A total of 15396 genes containing nonsynonymous mutations in sst were obtained,including 42 trichomerelated genes.We also analyzed the transcriptome of the trichomes on secondary branches when these secondary branches were 10 cm long between wt and sst and obtained 6576 differentially expressed genes(DEGs),including 24 trichome-related genes.qRT-PCR validation exhibited three significantly up-regulated DEGs,20 significantly downregulated,and one with no difference.We considered both the resequencing and transcriptome sequencing analyses and found that 12 trichome-related genes that were grouped into five transcription factor families containing nonsynonymous mutations and significantly down-regulated in sst.Therefore,these genes are potentially promising candidate genes whose nonsynonymous mutations and down-regulation may result in scarce and short trichomes mutation on carrot stems in sst.展开更多
Breeding hybrids with nuclear malesterile lines is an important method for the cross-breeding of sweet peppers. To date, few reports have been published on the nuclear malesterility gene of sweet pepper. Yet, there ar...Breeding hybrids with nuclear malesterile lines is an important method for the cross-breeding of sweet peppers. To date, few reports have been published on the nuclear malesterility gene of sweet pepper. Yet, there are approximately 20 pepper nuclear malesterility lines in the world. Using the self-developed testing material, sweet pepper nuclear malesterile dual-purpose line AB91, the genome-wide resequencing technique was applied to find that the mutation site causing the abortion of sweet pepper nuclear malesterility AB91 is on chromosome #5. The mutation gene Capana05g000747 was filtered out and validated by the flight mass spectrometry genotyping and quantitative realtime PCR method and determined to be the gene causing the abortion of sweet pepper nuclear male sterility AB91. The gene Capana05g000747 mutation site is a non-synonymous mutation site located at the 6th exon, the base C mutated into A, and the amino acid changed from alanine to serine. The three-dimensional protein structure of fertile and sterile plant Capana05g000747 was predicted. The results showed that the three-dimensional structure of the two proteins differed significantly. Sequence alignment analysis showed that the gene Capana05g000747 has a similar function to gene At2g02148. The gene At2g02148 contains a pentatricopeptide repeat protein which has important physiological functions in the gene expression process of organelles and is closely related to the performance of malesterility genes. Therefore, Capana05g000747 was selected as an important candidate gene for sweet pepper nuclear male sterile testing material AB91.展开更多
The leopard coral grouper(Plectropomus leopardus)is a species of significant economic importance.Although artificial cultivation of P.leopardus has thrived in recent decades,the advancement of selective breeding has b...The leopard coral grouper(Plectropomus leopardus)is a species of significant economic importance.Although artificial cultivation of P.leopardus has thrived in recent decades,the advancement of selective breeding has been hindered by the lack of comprehensive population genomic data.In this study,we identified over 8.73 million single nucleotide polymorphisms(SNPs)through whole-genome resequencing of 326 individuals spanning six distinct groups.Furthermore,we categorized 226 individuals with high-coverage sequencing depth(≥14×)into eight clusters based on their genetic profiles and phylogenetic relationships.Notably,four of these clusters exhibited pronounced genetic differentiation compared with the other populations.To identify potentially advantageous loci for P.leopardus,we examined genomic regions exhibiting selective sweeps by analyzing the nucleotide diversity(θπ)and fixation index(FST)in these four clusters.Using these high-coverage resequencing data,we successfully constructed the first haplotype reference panel specific to P.leopardus.This achievement holds promise for enabling high-quality,cost-effectiveimputationmethods.Additionally,we combined low-coverage sequencing data with imputation techniques for a genome-wide association study,aiming to identify candidate SNP loci and genes associated with growth traits.A significant concentration of these genes was observed on chromosome 17,which is primarily involved in skeletal muscle and embryonic development and cell proliferation.Notably,our detailed investigation of growth-related SNPs across the eight clusters revealed that cluster 5 harbored the most promising candidate SNPs,showing potential for genetic selective breeding efforts.These findings provide a robust toolkit and valuable insights into the management of germplasm resources and genome-driven breeding initiatives targeting P.leopardus.展开更多
In order to save manpower and time costs,and to achieve simultaneous detection of multiple animal-derived components in meat and meat products,this study used multiple nucleotide polymorphism(MNP)marker technology bas...In order to save manpower and time costs,and to achieve simultaneous detection of multiple animal-derived components in meat and meat products,this study used multiple nucleotide polymorphism(MNP)marker technology based on the principle of high-throughput sequencing,and established a multi-locus 10 animalderived components identification method of cattle,goat,sheep,donkey,horse,chicken,duck,goose,pigeon,quail in meat and meat products.The specific loci of each species could be detected and the species could be accurately identified,including 5 loci for cattle and duck,3 loci for sheep,9 loci for chicken and horse,10 loci for goose and pigeon,6 loci for quail and 1 locus for donkey and goat,and an adulteration model was established to simulate commercially available samples.The results showed that the method established in this study had high throughput,good repeatability and accuracy,and was able to identify 10 animalderived components simultaneously with 100%repeatability accuracy.The detection limit was 0.1%(m/m)in simulated samples of chicken,duck and horse.Using the method established in this study to test commercially available samples,4 samples from 14 commercially available samples were detected to be inconsistent with the labels,of which 2 did not contain the target ingredient and 2 were adulterated with small amounts of other ingredients.展开更多
The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF str...The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF structure database is available.In this study,we report a machine learning model for high-throughput screening of MOF catalysts for the CO_(2) cycloaddition reaction.The descriptors for model training were judiciously chosen according to the reaction mechanism,which leads to high accuracy up to 97%for the 75%quantile of the training set as the classification criterion.The feature contribution was further evaluated with SHAP and PDP analysis to provide a certain physical understanding.12,415 hypothetical MOF structures and 100 reported MOFs were evaluated under 100℃ and 1 bar within one day using the model,and 239 potentially efficient catalysts were discovered.Among them,MOF-76(Y)achieved the top performance experimentally among reported MOFs,in good agreement with the prediction.展开更多
The real-time screening of biomolecules and single cells in biochips is extremely important for disease prediction and diagnosis,cellular analysis,and life science research.Barcode biochip technology,which is integrat...The real-time screening of biomolecules and single cells in biochips is extremely important for disease prediction and diagnosis,cellular analysis,and life science research.Barcode biochip technology,which is integrated with microfluidics,typically comprises barcode array,sample loading,and reaction unit array chips.Here,we present a review of microfluidics barcode biochip analytical approaches for the high-throughput screening of biomolecules and single cells,including protein biomarkers,microRNA(miRNA),circulating tumor DNA(ctDNA),single-cell secreted proteins,single-cell exosomes,and cell interactions.We begin with an overview of current high-throughput detection and analysis approaches.Following this,we outline recent improvements in microfluidic devices for biomolecule and single-cell detection,highlighting the benefits and limitations of these devices.This paper focuses on the research and development of microfluidic barcode biochips,covering their self-assembly substrate materials and their specific applications with biomolecules and single cells.Looking forward,we explore the prospects and challenges of this technology,with the aim of contributing toward the use of microfluidic barcode detection biochips in medical diagnostics and therapies,and their large-scale commercialization.展开更多
In recent years,intensive human activities have increased the intensity of desertification,driving continual desertification process of peripheral meadows.To investigate the effects of restoration on soil microbial co...In recent years,intensive human activities have increased the intensity of desertification,driving continual desertification process of peripheral meadows.To investigate the effects of restoration on soil microbial communities,we analyzed vegetation-soil relationships in the Hulun Buir Sandy Land,northern China.Through the use of high-throughput sequencing,we examined the structure and diversity in the bacterial and fungal communities within the 0-20 cm soil layer after 9-15 a of restoration.Different slope positions were analyzed and spatial heterogeneity was assessed.The results showed progressive improvements in soil properties and vegetation with the increase of restoration duration,and the following order was as follows:bottom slope>middle slope>crest slope.During the restoration in the Hulun Buir Sandy Land,the bacterial communities were dominated by Proteobacteria,Actinobacteria,and Acidobacteria,whereas the fungal communities were dominated by Ascomycota and Basidiomycota.Eutrophic bacterial abundance increased with the restoration duration,whereas oligotrophic bacterial and fungal abundance levels decreased.The soil bacterial abundance significantly increased with the increasing restoration duration,whereas the fungal diversity decreased after 11 a of restoration,except that at the crest slope.Redundancy analysis showed that pH,soil moisture content,total nitrogen,and vegetation-related factors affected the bacterial community structure(45.43%of the total variance explained).Canonical correspondence analysis indicated that pH,total phosphorus,and vegetation-related factors shaped the bacterial community structure(31.82%of the total variance explained).Structural equation modeling highlighted greater bacterial responses(R^(2)=0.49-0.79)to changes in environmental factors than those of fungi(R^(2)=0.20-0.48).The soil bacterial community was driven mainly by pH,soil moisture content,electrical conductivity,plant coverage,and litter dry weight.The abundance and diversity of the soil fungal community were mainly driven by plant coverage,litter dry weight,and herbaceous aboveground biomass,while there was no significant correlation between the soil fungal community structure and environmental factors.These findings highlighted divergent microbial succession patterns and environmental sensitivities during sandy grassland restoration.展开更多
Designing high-performance high-entropy alloys(HEAs)with transformation-induced plasticity(TRIP)or twinning-induced plasticity(TWIP)effects requires precise control over stacking fault energy(SFE)and phase stability.H...Designing high-performance high-entropy alloys(HEAs)with transformation-induced plasticity(TRIP)or twinning-induced plasticity(TWIP)effects requires precise control over stacking fault energy(SFE)and phase stability.However,the vast complexity of multicomponent systems poses a major challenge for identifying promising candidates through conventional experimental or computational methods.A high-throughput CALPHAD framework is developed to identify compositions with potential TWIP/TRIP behaviors in the Cr-Co-Ni and Cr-Co-Ni-Fe systems through systematic screening of stacking fault energy(SFE),FCC phase stability,and FCC-to-HCP transition temperatures(T0).The approach combines TC-Python automation with parallel Gibbs energy calculations across hundreds of thousands of compositions,enabling efficient extraction of metastable FCC-dominant alloys.The high-throughput results find 214 compositions with desired properties from 160,000 candidates.Detailed analysis of the Gibbs energy distributions,phase fraction trends,and temperature-dependent SFE evolution reveals critical insights into the thermodynamic landscape governing plasticity mechanisms in HEAs.The results show that only a narrow region of the compositional space satisfies all screening criteria,emphasizing the necessity of an integrated approach.The screened compositions and trends provide a foundation for targeted experimental validation.Furthermore,this work demonstrates a scalable,composition-resolved strategy for predicting deformation mechanisms in multicomponent alloys and offers a blueprint for integrating thermodynamic screening with mechanistic understanding in HEA design.展开更多
The bandgap is a key parameter for understanding and designing hybrid perovskite material properties,as well as developing photovoltaic devices.Traditional bandgap calculation methods like ultravioletvisible spectrosc...The bandgap is a key parameter for understanding and designing hybrid perovskite material properties,as well as developing photovoltaic devices.Traditional bandgap calculation methods like ultravioletvisible spectroscopy and first-principles calculations are time-and power-consuming,not to mention capturing bandgap change mechanisms for hybrid perovskite materials across a wide range of unknown space.In the present work,an artificial intelligence ensemble comprising two classifiers(with F1 scores of 0.9125 and 0.925)and a regressor(with mean squared error of 0.0014 eV)is constructed to achieve high-precision prediction of the bandgap.The bandgap perovskite dataset is established through highthroughput prediction of bandgaps by the ensemble.Based on the self-built dataset,partial dependence analysis(PDA)is developed to interpret the bandgap influential mechanism.Meanwhile,an interpretable mathematical model with an R^(2)of 0.8417 is generated using the genetic programming symbolic regression(GPSR)technique.The constructed PDA maps agree well with the Shapley Additive exPlanations,the GPSR model,and experiment verification.Through PDA,we reveal the boundary effect,the bowing effect,and their evolution trends with key descriptors.展开更多
The integration of artificial intelligence (AI) with high-throughput experimentation (HTE) techniques is revolutionizing catalyst design, addressing challenges in efficiency, cost, and scalability. This review explore...The integration of artificial intelligence (AI) with high-throughput experimentation (HTE) techniques is revolutionizing catalyst design, addressing challenges in efficiency, cost, and scalability. This review explores the synergistic application of AI and HTE, highlighting their role in accelerating catalyst discovery, optimizing reaction parameters, and understanding structure-performance relationships. HTE facilitates the rapid preparation, characterization, and evaluation of diverse catalyst formulations, generating large datasets essential for AI model training. Machine learning algorithms, including regression models, neural networks, and active learning frameworks, analyze these datasets to uncover the underlying relationships between the data, predict performance, and optimize experimental workflows in real-time. Case studies across heterogeneous, homogeneous, and electrocatalysis demonstrate significant advancements, including improved reaction selectivity, enhanced material stability, and shorten discovery cycles. The integration of AI with HTE has significantly accelerated discovery cycles, enabling the optimization of catalyst formulations and reaction conditions. Despite these achievements, challenges remain, including reliance on researcher expertise, real-time adaptability, and the complexity of large-scale data analysis. Addressing these limitations through refined experimental protocols, standardized datasets, and interpretable AI models will unlock the full potential of AI-HTE integration.展开更多
This study leverages machine learning to perform high-throughput computational screening of n-hexane cracking initiators.Artificial neural networks are applied to predict the chemical performance of initiators,using s...This study leverages machine learning to perform high-throughput computational screening of n-hexane cracking initiators.Artificial neural networks are applied to predict the chemical performance of initiators,using simulated pyrolysis data as the training dataset.Various feature extraction methods are utilized,and five neural network architectures are developed to predict the co-cracking product distribution based on molecular structures.High-throughput screening of 12946 molecules outside the training dataset identifies the top 10 initiators for each target product—ethylene,propylene,and butadiene.The relative error between predicted and simulated values is less than 7%.Additionally,reaction pathway analysis elucidates the mechanisms by which initiators influence the distribution of cracking products.The proposed framework provides a practical and efficient approach for the rapid identification and evaluation of high-performance cracking initiators.展开更多
The labels of VU1 and VU2 in Fig.1(b)of the paper[Chin.Phys.B 34046801(2025)]were not correctly placed.The correct figure is provided.This modification does not affect the result presented in the paper.
基金supported by the National Basic Research Program of ChinaSpecial Project for National Supercomputing Zhengzhou Center Innovation Ecosystem Construction(201400210600)+4 种基金Outstanding Young Scientists of Henan Academy of Agricultural Sciences(2020YQ08)Fund for Distinguished Young Scholars from Henan Academy of Agricultural Sciences(2019JQ02)China Agriculture Research System(CARS-13)Henan Provincial Agriculture Research System,China(S2012-5)Henan Provincial Young Talents Supporting Project(2020HYTP044)。
文摘Oil and protein content and fatty acid composition are quality traits in peanut.Elucidating the genetic mechanisms underlying these traits may help researchers to obtain improved cultivars by molecular breeding.Whole-genome resequencing of a recombinant inbred population of 318 lines was performed to construct a high-density linkage map and identify QTL for peanut quality.The map,containing 4561 bin markers,covered 2032 c M with a mean marker density of 0.45 c M.A total of 110 QTL for oil and protein content,and fatty acid composition were mapped on the 18 peanut chromosomes.The QTL q A05.1 was detected in four environments and showed a major phenotypic effect on the contents of oil,protein,and six fatty acids.The genomic region spanned by q A05.1,corresponding to a physical interval of approximately 1.5 Mb,contains two SNPs polymorphic between the parents that could cause missense mutations.The two SNP sites were employed as KASP markers and validated using lines with extremely high and low oil contents.These sites may be useful in the marker-assisted breeding of peanut cultivars with high oil contents.
基金supported by the Key-Area Research and Development Program of Guangdong Province(2021B0202020001)China Agriculture Research System of MOF and MARA(CARS-46)+2 种基金Central Public-interest Scientific Institution Basal Research Fund of CAFS(2020TD23,2020ZJTD-02)Project of Construction of Guangdong Aquatic Seed Industry Demonstration Base 2021Special Funds for Science Technology Innovation and Industrial Development of Shenzhen Dapeng New District(KJYF202101-02)。
文摘Largemouth bass(Micropterus salmoides) is an economically important fish species in North America, Europe, and China. Various genetic improvement programs and domestication processes have modified its genome sequence through selective pressure, leaving nucleotide signals that can be detected at the genomic level. In this study,we sequenced 149 largemouth bass fish, including protospecies(imported from the US) and improved breeds(four domestic breeding populations from China). We detected genomic regions harboring certain genes associated with improved traits, which may be useful molecular markers for practical domestication, breeding, and selection. Subsequent analyses of genetic diversity and population structure revealed that the improved breeds have undergone more rigorous genetic changes. Through selective signal analysis, we identified hundreds of putative selective sweep regions in each largemouth bass line. Interestingly, we predicted 103 putative candidate genes potentially subjected to selection,including several associated with growth(psst1 and grb10), early development(klf9, sp4, and sp8), and immune traits(pkn2, sept2, bcl6, and ripk2). These candidate genes represent potential genomic landmarks that could be used to improve important traits of biological and commercial interest. In summary, this study provides a genome-wide map of genetic variations and selection footprints in largemouth bass, which may benefit genetic studies and accelerate genetic improvement of this economically important fish.
基金financially supported by National Key R&D Program of China(Grant No.2019YFD1001401)Project of Construction of Grape Germplasm Resources Sharing Platform(Grant No.PT2029)+2 种基金Zhengzhou Major Scientific and Technological Innovation Projects(Grant No.2020CXZX0082)National Modern Agricultural Industry Technology System Construction Special Project(Grant No.CARS-29-yc-1)Special Project of Science,Technology Innovation Project of Chinese Academy of Agricultural Sciences(Grant No.CAAS-ASTIP-2019-ZFRI).
文摘Grape berry shape is an important agricultural trait.Clarifying its genetic basis is significant for cultivating grape varieties that meet market demands.However,the current study by forward genetics has not achieved in-depth results.Here,a high-density map was constructed to identify quantitative trait loci(QTLs)for berry shape.A total of 358709 polymorphic SNPs were obtained using whole-genome resequencing(WGS)based on 208 F2 individuals derived from round grape‘E42-6’and oblong grape‘Rizamat’.The 1635.65 cM high-density map was divided into 19 linkage groups with an average distance of 0.37 cM.Using this map,three significant QTLs for fruit shape index(ShI:ratio of berry length to berry width)identified over three years were mapped onto LG4 and LG5,including one stable QTL on Chr5 with the genomic region of 0.47–1.94 Mb.Combining with gene annotation and expression patterns based on RNA-seq data from two contrasting F2 individuals with round and oblong berry(their average ShI was 1.89 and 1.10,respectively)at four developmental stages,four candidate genes were selected from the above QTLs.They were mainly involved in DNA replication,cell wall modification,and phytohormone biosynthesis.Further analysis of RNA-seq data revealed that several important phytohormone synthesis and metabolic pathways were enriched based on differentially expressed genes(DEGs),which was consistent with the results of QTL mapping for genes related to plant hormone biosynthesis in the F2 population.Furthermore,a comparison of plant hormone content showed that there were significant differences in IAA and tZ content between the two contrasting F2 individuals at different developmental stages.Our findings provide molecular insights into the genetic variation in grape berry shape.Stable QTLs and their tightly linked markers offer the possibility of marker-assisted selection to accelerate berry shape breeding.
基金supported by the National Key R&D Program of China (2021YFD1300901)National Natural Science Foundation of China (31960653)+1 种基金West Light Foundation of the Chinese Academy of SciencesNational Joint Research on Improved Breeds of Livestock and Poultry (19210365)。
文摘The abundance of domesticated sheep varieties and phenotypes is largely the result of long-term natural and artificial selection. However, there is limited information regarding the genetic mechanisms underlying phenotypic variation induced by the domestication and improvement of sheep. In this study, to explore genomic diversity and selective regions at the genome level, we sequenced the genomes of 100 sheep across 10 breeds and combined these results with publicly available genomic data from 225 individuals, including improved breeds, Chinese indigenous breeds,African indigenous breeds, and their Asian mouflon ancestor. Based on population structure, the domesticated sheep formed a monophyletic group,while the Chinese indigenous sheep showed a clear geographical distribution trend. Comparative genomic analysis of domestication identified several selective signatures, including IFI44 and IFI44L genes and PANK2 and RNF24 genes, associated with immune response and visual function.Population genomic analysis of improvement demonstrated that candidate genes of selected regions were mainly associated with pigmentation,energy metabolism, and growth development.Furthermore, the IFI44 and IFI44L genes showed a common selection signature in the genomes of 30domesticated sheep breeds. The IFI44 c. 54413058C>G mutation was selected for genotyping and population genetic validation. Results showed that the IFI44 polymorphism was significantly associated with partial immune traits. Our findings identified the population genetic basis of domesticated sheep at the whole-genome level, providing theoretical insights into the molecular mechanism underlying breed characteristics and phenotypic changes during sheep domestication and improvement.
基金supported by the National Key Research and Development Program of China(2021YFD1300901,2022YFD1302000)National Natural Science Foundation of China(32260818,31960653)。
文摘The phenotypic diversity resulting from artificial or natural selection of sheep has made a significant contribution to human civilization.Hu sheep are a local sheep breed unique to China with high reproductive rates and rapid growth.Genomic selection signatures have been widely used to investigate the genetic mechanisms underlying phenotypic variation in livestock.Here,we conduct whole-genome sequencing of 207 Hu sheep and compare them with the wild ancestors of domestic sheep(Asiatic mouflon)to investigate the genetic characteristics and selection signatures of Hu sheep.Based on six signatures of selection approaches,we detect genomic regions containing genes related to reproduction(BMPR1B,BMP2,PGFS,CYP19,CAMK4,GGT5,and GNAQ),vision(ALDH1A2,SAG,and PDE6B),nervous system(NAV1),and immune response(GPR35,SH2B2,PIK3R3,and HRAS).Association analysis with a population of 1299 Hu sheep reveals that those missense mutations in the GPR35(GPR35 g.952651 A>G;GPR35 g.952496 C>T)and NAV1(NAV1 g.84216190 C>T;NAV1 g.84227412 G>A)genes are significantly associated(P<0.05)with immune and growth traits in Hu sheep,respectively.This research offers unique insights into the selection characteristics of Hu sheep and facilitates further genetic improvement and molecular investigations.
基金the National Natural Science Foundation of China(31871656 and 32072098)。
文摘Rapeseed(Brassica napus)supplies about half of the vegetable oil in China.Increasing oil production and searching for genes that control oil content in the crop are research goals.In our previous studies,four major QTL for oil content located on A08,A09,C03 and C06 in the Ken C-8×N53-2(KN DH)mapping population were detected.The parental lines were resequenced to identify structural variations and candidate genes affecting oil content in these four major QTL regions.Insertion-deletion(In Del)markers were developed and used to narrow the regions.Differentially expressed genes located in the regions were investigated.GO and KEGG analysis showed that several genes were associated with lipid metabolism.Several transcription factors with higher expression in N53-2 than in Ken C-8 were identified.These results shed light on the genetic control of oil content and may be helpful for the development of highoil-content cultivars.
基金supported by the China Agriculture Research System(Grant No.CARS-23-A13)Hubei Agrotechnical Major Project(Grant No.2021-620-000-001-01)+1 种基金Wuhan Major Project of Key Technologies in Biological Breeding and New Variety Cultivation(Grant No.2022021302024852)HZAU-AGIS Cooperation Fund(Grant No.SZYJY2023022).
文摘Clubroot caused by Plasmodiophora brassicae is a devastating disease of Cruciferous crops.Developing cultivars with clubroot resistance(CR)is the most effective control measure.For the two major Brassica vegetable species B.rapa and B.oleracea,several commercial cultivars with unclear CR pedigrees have been intensively used as CR donors in breeding.However,the continuous occurrence of CR-breaking makes the CR pedigree underlying these cultivars one of the breeders'most urgent concerns.The complex intraspecific diversity of these two major Brassica vegetables has also limited the applicability of CR markers in different breeding programs.Here we first traced the pedigree underlying two kinds of CR that have been widely applied in breeding by linkage and introgression analyses based on public resequencing data.In B.rapa,a major locus CRzi8 underlying the CR of the commercial CR donor‘DegaoCR117’was identified.CRzi8 was further shown to have been introgressed from turnip(B.rapa ssp.rapifera)and that it carried a potential functional allele of Crr1a.The turnip introgression carried CRb^(c),sharing the same coding sequence with the CRb that was also identified from chromosome C07 of B.oleracea CR cultivars with different morphotypes.Within natural populations,variation analysis of linkage intervals of CRzi8,PbBa8.1,CRb,and CRb^(c)yielded easily resolved InDel markers(>20 bp)for these fundamental CR genes.The specificity of these markers was tested in diverse cultivars panels,and each exhibited high reliability in breeding.Our research demonstrates the value of the practice of applying resequencing big data to solve urgent concerns in breeding programs.
基金supported by the National Natural Science Foundation of China(41976083,41776171 and 32072980)。
文摘The genetic adaptations of various organisms to heterogeneous environments in the northwestern Pacific remain poorly understood.Heterogeneous genomic divergence among populations may reflect environmentalselection.Advancingour understanding of the mechanisms by which organisms adapt to different temperatures in response to climate change and predicting the adaptive potential and ecological consequences of anthropogenic global warming are critical.We sequenced the whole genomes of Japanese whiting(Sillago japonica)specimens collected from different latitudinal locations along the coastal waters of China and Japan to detect possible thermal adaptations.Using population genomics,a total of 5.48 million single nucleotide polymorphisms(SNPs)from five populations revealed a complete genetic break between the Chinese and Japanese groups,which was attributed to both geographic distance and local adaptation.The shared natural selection genes between two isolated populations(i.e.,Zhoushan and Ise Bay/Tokyo Bay)indicated possible parallel evolution at the genetic level induced by temperature.These genes also indicated that the process of temperature selection on isolated populations is repeatable.Moreover,we observed natural candidate genes related to membrane fluidity,possibly underlying adaptation to cold environmental stress.These findings advance our understanding of the genetic mechanisms underlying the rapid adaptations of fish species.Species distribution projection models suggested that the Chinese and Japanese groups may have different responses to future climate change,with the former expanding and the latter contracting.The findings of this study enhance our understanding of genetic differentiation and adaptation to changing environments.
基金the Research Project Supported by Shanxi Scholarship Council of China(2021-066)the National Natural Science Foundation of China(31601751)+2 种基金the Key Research and Development Plan of Shanxi Province,China(201903D221063)the Fundamental Research Program of Shanxi Province,China(20210302123412)the Science and Technology Innovation Project of Shanxi Agricultural University,China(2016ZZ02).
文摘Trichomes are specialized structures developed from epidermal cells and can protect plants against biotic and abiotic stresses.Trichomes cover carrots during the generative phase.However,the morphology of the carrot trichomes and candidate genes controlling the formation of trichomes are still unclear.This study found that carrot trichomes were nonglandular and unbranched hairs distributed on the stem,leaf,petiole,pedicel,and seed of carrot.Resequencing analysis of a trichome mutant with sparse and short trichomes(sst)and a wild type(wt)with long and dense trichomes on carrot stems was conducted.A total of 15396 genes containing nonsynonymous mutations in sst were obtained,including 42 trichomerelated genes.We also analyzed the transcriptome of the trichomes on secondary branches when these secondary branches were 10 cm long between wt and sst and obtained 6576 differentially expressed genes(DEGs),including 24 trichome-related genes.qRT-PCR validation exhibited three significantly up-regulated DEGs,20 significantly downregulated,and one with no difference.We considered both the resequencing and transcriptome sequencing analyses and found that 12 trichome-related genes that were grouped into five transcription factor families containing nonsynonymous mutations and significantly down-regulated in sst.Therefore,these genes are potentially promising candidate genes whose nonsynonymous mutations and down-regulation may result in scarce and short trichomes mutation on carrot stems in sst.
文摘Breeding hybrids with nuclear malesterile lines is an important method for the cross-breeding of sweet peppers. To date, few reports have been published on the nuclear malesterility gene of sweet pepper. Yet, there are approximately 20 pepper nuclear malesterility lines in the world. Using the self-developed testing material, sweet pepper nuclear malesterile dual-purpose line AB91, the genome-wide resequencing technique was applied to find that the mutation site causing the abortion of sweet pepper nuclear malesterility AB91 is on chromosome #5. The mutation gene Capana05g000747 was filtered out and validated by the flight mass spectrometry genotyping and quantitative realtime PCR method and determined to be the gene causing the abortion of sweet pepper nuclear male sterility AB91. The gene Capana05g000747 mutation site is a non-synonymous mutation site located at the 6th exon, the base C mutated into A, and the amino acid changed from alanine to serine. The three-dimensional protein structure of fertile and sterile plant Capana05g000747 was predicted. The results showed that the three-dimensional structure of the two proteins differed significantly. Sequence alignment analysis showed that the gene Capana05g000747 has a similar function to gene At2g02148. The gene At2g02148 contains a pentatricopeptide repeat protein which has important physiological functions in the gene expression process of organelles and is closely related to the performance of malesterility genes. Therefore, Capana05g000747 was selected as an important candidate gene for sweet pepper nuclear male sterile testing material AB91.
基金supported by the National Key Research and Development Program of China (2022YFD2400501)Key R&D Project of Hainan Province (ZDYF2021XDNY133)+2 种基金Project of Sanya Yazhouwan Science and Technology City Management Foundation (SKJC-2020-02-009)PhD Scientific Research and Innovation Foundation of Sanya Yazhou Bay Science and Technology City (HSPHDSRF-2022-02-007)Young Elite Scientists Sponsorship Program by CAST (2023QNRC001)。
文摘The leopard coral grouper(Plectropomus leopardus)is a species of significant economic importance.Although artificial cultivation of P.leopardus has thrived in recent decades,the advancement of selective breeding has been hindered by the lack of comprehensive population genomic data.In this study,we identified over 8.73 million single nucleotide polymorphisms(SNPs)through whole-genome resequencing of 326 individuals spanning six distinct groups.Furthermore,we categorized 226 individuals with high-coverage sequencing depth(≥14×)into eight clusters based on their genetic profiles and phylogenetic relationships.Notably,four of these clusters exhibited pronounced genetic differentiation compared with the other populations.To identify potentially advantageous loci for P.leopardus,we examined genomic regions exhibiting selective sweeps by analyzing the nucleotide diversity(θπ)and fixation index(FST)in these four clusters.Using these high-coverage resequencing data,we successfully constructed the first haplotype reference panel specific to P.leopardus.This achievement holds promise for enabling high-quality,cost-effectiveimputationmethods.Additionally,we combined low-coverage sequencing data with imputation techniques for a genome-wide association study,aiming to identify candidate SNP loci and genes associated with growth traits.A significant concentration of these genes was observed on chromosome 17,which is primarily involved in skeletal muscle and embryonic development and cell proliferation.Notably,our detailed investigation of growth-related SNPs across the eight clusters revealed that cluster 5 harbored the most promising candidate SNPs,showing potential for genetic selective breeding efforts.These findings provide a robust toolkit and valuable insights into the management of germplasm resources and genome-driven breeding initiatives targeting P.leopardus.
基金financially supported by National Key R&D Program(2021YFF0701905)。
文摘In order to save manpower and time costs,and to achieve simultaneous detection of multiple animal-derived components in meat and meat products,this study used multiple nucleotide polymorphism(MNP)marker technology based on the principle of high-throughput sequencing,and established a multi-locus 10 animalderived components identification method of cattle,goat,sheep,donkey,horse,chicken,duck,goose,pigeon,quail in meat and meat products.The specific loci of each species could be detected and the species could be accurately identified,including 5 loci for cattle and duck,3 loci for sheep,9 loci for chicken and horse,10 loci for goose and pigeon,6 loci for quail and 1 locus for donkey and goat,and an adulteration model was established to simulate commercially available samples.The results showed that the method established in this study had high throughput,good repeatability and accuracy,and was able to identify 10 animalderived components simultaneously with 100%repeatability accuracy.The detection limit was 0.1%(m/m)in simulated samples of chicken,duck and horse.Using the method established in this study to test commercially available samples,4 samples from 14 commercially available samples were detected to be inconsistent with the labels,of which 2 did not contain the target ingredient and 2 were adulterated with small amounts of other ingredients.
基金financial support from the National Key Research and Development Program of China(2021YFB 3501501)the National Natural Science Foundation of China(No.22225803,22038001,22108007 and 22278011)+1 种基金Beijing Natural Science Foundation(No.Z230023)Beijing Science and Technology Commission(No.Z211100004321001).
文摘The high porosity and tunable chemical functionality of metal-organic frameworks(MOFs)make it a promising catalyst design platform.High-throughput screening of catalytic performance is feasible since the large MOF structure database is available.In this study,we report a machine learning model for high-throughput screening of MOF catalysts for the CO_(2) cycloaddition reaction.The descriptors for model training were judiciously chosen according to the reaction mechanism,which leads to high accuracy up to 97%for the 75%quantile of the training set as the classification criterion.The feature contribution was further evaluated with SHAP and PDP analysis to provide a certain physical understanding.12,415 hypothetical MOF structures and 100 reported MOFs were evaluated under 100℃ and 1 bar within one day using the model,and 239 potentially efficient catalysts were discovered.Among them,MOF-76(Y)achieved the top performance experimentally among reported MOFs,in good agreement with the prediction.
基金supported by the National Key Research and Development Plan of China(2023YFB3210400)the Natural Science Innovation Group Foundation of China(T2321004)+3 种基金the National Natural Science Foundation of China(62174101)Shandong University Integrated Research and Cultivation Project(2022JC001)Key Research and Development Plan of Shandong Province(Major Science and Technology Innovation Project2022CXGC020501).
文摘The real-time screening of biomolecules and single cells in biochips is extremely important for disease prediction and diagnosis,cellular analysis,and life science research.Barcode biochip technology,which is integrated with microfluidics,typically comprises barcode array,sample loading,and reaction unit array chips.Here,we present a review of microfluidics barcode biochip analytical approaches for the high-throughput screening of biomolecules and single cells,including protein biomarkers,microRNA(miRNA),circulating tumor DNA(ctDNA),single-cell secreted proteins,single-cell exosomes,and cell interactions.We begin with an overview of current high-throughput detection and analysis approaches.Following this,we outline recent improvements in microfluidic devices for biomolecule and single-cell detection,highlighting the benefits and limitations of these devices.This paper focuses on the research and development of microfluidic barcode biochips,covering their self-assembly substrate materials and their specific applications with biomolecules and single cells.Looking forward,we explore the prospects and challenges of this technology,with the aim of contributing toward the use of microfluidic barcode detection biochips in medical diagnostics and therapies,and their large-scale commercialization.
基金supported by the National Ecological Environment Survey and Assessment(2024-vertical-0107)the Fundamental Research Funds for the Central Public-interest Scientific Institution(2023YSKY-26)the Hulun Buir Grassland Ecological Restoration Comprehensive Survey Project(DD20230474).
文摘In recent years,intensive human activities have increased the intensity of desertification,driving continual desertification process of peripheral meadows.To investigate the effects of restoration on soil microbial communities,we analyzed vegetation-soil relationships in the Hulun Buir Sandy Land,northern China.Through the use of high-throughput sequencing,we examined the structure and diversity in the bacterial and fungal communities within the 0-20 cm soil layer after 9-15 a of restoration.Different slope positions were analyzed and spatial heterogeneity was assessed.The results showed progressive improvements in soil properties and vegetation with the increase of restoration duration,and the following order was as follows:bottom slope>middle slope>crest slope.During the restoration in the Hulun Buir Sandy Land,the bacterial communities were dominated by Proteobacteria,Actinobacteria,and Acidobacteria,whereas the fungal communities were dominated by Ascomycota and Basidiomycota.Eutrophic bacterial abundance increased with the restoration duration,whereas oligotrophic bacterial and fungal abundance levels decreased.The soil bacterial abundance significantly increased with the increasing restoration duration,whereas the fungal diversity decreased after 11 a of restoration,except that at the crest slope.Redundancy analysis showed that pH,soil moisture content,total nitrogen,and vegetation-related factors affected the bacterial community structure(45.43%of the total variance explained).Canonical correspondence analysis indicated that pH,total phosphorus,and vegetation-related factors shaped the bacterial community structure(31.82%of the total variance explained).Structural equation modeling highlighted greater bacterial responses(R^(2)=0.49-0.79)to changes in environmental factors than those of fungi(R^(2)=0.20-0.48).The soil bacterial community was driven mainly by pH,soil moisture content,electrical conductivity,plant coverage,and litter dry weight.The abundance and diversity of the soil fungal community were mainly driven by plant coverage,litter dry weight,and herbaceous aboveground biomass,while there was no significant correlation between the soil fungal community structure and environmental factors.These findings highlighted divergent microbial succession patterns and environmental sensitivities during sandy grassland restoration.
基金supported by the U.S.Army Research Laboratory through their award#W911NF-22-2-0040the Ministry of Education,Youth and Sports of the Czech Republic through the e-INFRA CZ(ID:90254).
文摘Designing high-performance high-entropy alloys(HEAs)with transformation-induced plasticity(TRIP)or twinning-induced plasticity(TWIP)effects requires precise control over stacking fault energy(SFE)and phase stability.However,the vast complexity of multicomponent systems poses a major challenge for identifying promising candidates through conventional experimental or computational methods.A high-throughput CALPHAD framework is developed to identify compositions with potential TWIP/TRIP behaviors in the Cr-Co-Ni and Cr-Co-Ni-Fe systems through systematic screening of stacking fault energy(SFE),FCC phase stability,and FCC-to-HCP transition temperatures(T0).The approach combines TC-Python automation with parallel Gibbs energy calculations across hundreds of thousands of compositions,enabling efficient extraction of metastable FCC-dominant alloys.The high-throughput results find 214 compositions with desired properties from 160,000 candidates.Detailed analysis of the Gibbs energy distributions,phase fraction trends,and temperature-dependent SFE evolution reveals critical insights into the thermodynamic landscape governing plasticity mechanisms in HEAs.The results show that only a narrow region of the compositional space satisfies all screening criteria,emphasizing the necessity of an integrated approach.The screened compositions and trends provide a foundation for targeted experimental validation.Furthermore,this work demonstrates a scalable,composition-resolved strategy for predicting deformation mechanisms in multicomponent alloys and offers a blueprint for integrating thermodynamic screening with mechanistic understanding in HEA design.
基金supported by the National Research Foundation of Korea(NRF)funded by the Korean government(MSIT)(Grant number:RS-2025-02316700,and RS-2025-00522430)the China Scholarship Council Program。
文摘The bandgap is a key parameter for understanding and designing hybrid perovskite material properties,as well as developing photovoltaic devices.Traditional bandgap calculation methods like ultravioletvisible spectroscopy and first-principles calculations are time-and power-consuming,not to mention capturing bandgap change mechanisms for hybrid perovskite materials across a wide range of unknown space.In the present work,an artificial intelligence ensemble comprising two classifiers(with F1 scores of 0.9125 and 0.925)and a regressor(with mean squared error of 0.0014 eV)is constructed to achieve high-precision prediction of the bandgap.The bandgap perovskite dataset is established through highthroughput prediction of bandgaps by the ensemble.Based on the self-built dataset,partial dependence analysis(PDA)is developed to interpret the bandgap influential mechanism.Meanwhile,an interpretable mathematical model with an R^(2)of 0.8417 is generated using the genetic programming symbolic regression(GPSR)technique.The constructed PDA maps agree well with the Shapley Additive exPlanations,the GPSR model,and experiment verification.Through PDA,we reveal the boundary effect,the bowing effect,and their evolution trends with key descriptors.
基金supported by the Special Project of National Natural Science Foundation(42341204)the the National Natural Science Foundation of China(W2411009).
文摘The integration of artificial intelligence (AI) with high-throughput experimentation (HTE) techniques is revolutionizing catalyst design, addressing challenges in efficiency, cost, and scalability. This review explores the synergistic application of AI and HTE, highlighting their role in accelerating catalyst discovery, optimizing reaction parameters, and understanding structure-performance relationships. HTE facilitates the rapid preparation, characterization, and evaluation of diverse catalyst formulations, generating large datasets essential for AI model training. Machine learning algorithms, including regression models, neural networks, and active learning frameworks, analyze these datasets to uncover the underlying relationships between the data, predict performance, and optimize experimental workflows in real-time. Case studies across heterogeneous, homogeneous, and electrocatalysis demonstrate significant advancements, including improved reaction selectivity, enhanced material stability, and shorten discovery cycles. The integration of AI with HTE has significantly accelerated discovery cycles, enabling the optimization of catalyst formulations and reaction conditions. Despite these achievements, challenges remain, including reliance on researcher expertise, real-time adaptability, and the complexity of large-scale data analysis. Addressing these limitations through refined experimental protocols, standardized datasets, and interpretable AI models will unlock the full potential of AI-HTE integration.
基金The financial support provided by the Project of the National Natural Science Foundation of China (22308314,U22A20415)the Natural Science Foundation of Zhejiang Province (LQ24B060001)+1 种基金the "Pioneer" and "Leading Goose" Research & Development Program of Zhejiang (2022C01SA442617)the SINOPEC Technology Development Project (224244)
文摘This study leverages machine learning to perform high-throughput computational screening of n-hexane cracking initiators.Artificial neural networks are applied to predict the chemical performance of initiators,using simulated pyrolysis data as the training dataset.Various feature extraction methods are utilized,and five neural network architectures are developed to predict the co-cracking product distribution based on molecular structures.High-throughput screening of 12946 molecules outside the training dataset identifies the top 10 initiators for each target product—ethylene,propylene,and butadiene.The relative error between predicted and simulated values is less than 7%.Additionally,reaction pathway analysis elucidates the mechanisms by which initiators influence the distribution of cracking products.The proposed framework provides a practical and efficient approach for the rapid identification and evaluation of high-performance cracking initiators.
文摘The labels of VU1 and VU2 in Fig.1(b)of the paper[Chin.Phys.B 34046801(2025)]were not correctly placed.The correct figure is provided.This modification does not affect the result presented in the paper.