期刊文献+
共找到15篇文章
< 1 >
每页显示 20 50 100
Survey of Distributed Computing Frameworks for Supporting Big Data Analysis 被引量:5
1
作者 Xudong Sun Yulin He +1 位作者 Dingming Wu Joshua Zhexue Huang 《Big Data Mining and Analytics》 EI CSCD 2023年第2期154-169,共16页
Distributed computing frameworks are the fundamental component of distributed computing systems.They provide an essential way to support the efficient processing of big data on clusters or cloud.The size of big data i... Distributed computing frameworks are the fundamental component of distributed computing systems.They provide an essential way to support the efficient processing of big data on clusters or cloud.The size of big data increases at a pace that is faster than the increase in the big data processing capacity of clusters.Thus,distributed computing frameworks based on the MapReduce computing model are not adequate to support big data analysis tasks which often require running complex analytical algorithms on extremely big data sets in terabytes.In performing such tasks,these frameworks face three challenges:computational inefficiency due to high I/O and communication costs,non-scalability to big data due to memory limit,and limited analytical algorithms because many serial algorithms cannot be implemented in the MapReduce programming model.New distributed computing frameworks need to be developed to conquer these challenges.In this paper,we review MapReduce-type distributed computing frameworks that are currently used in handling big data and discuss their problems when conducting big data analysis.In addition,we present a non-MapReduce distributed computing framework that has the potential to overcome big data analysis challenges. 展开更多
关键词 distributed computing frameworks big data analysis approximate computing MapReduce computing model
原文传递
A computational framework for improving genetic variants identification from 5,061 sheep sequencing data
2
作者 Shangqian Xie Karissa Isaacs +1 位作者 Gabrielle Becker Brenda M.Murdoch 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2023年第6期2332-2344,共13页
Background Pan-genomics is a recently emerging strategy that can be utilized to provide a more comprehensive characterization of genetic variation.Joint calling is routinely used to combine identified variants across ... Background Pan-genomics is a recently emerging strategy that can be utilized to provide a more comprehensive characterization of genetic variation.Joint calling is routinely used to combine identified variants across multiple related samples.However,the improvement of variants identification using the mutual support information from mul-tiple samples remains quite limited for population-scale genotyping.Results In this study,we developed a computational framework for joint calling genetic variants from 5,061 sheep by incorporating the sequencing error and optimizing mutual support information from multiple samples’data.The variants were accurately identified from multiple samples by using four steps:(1)Probabilities of variants from two widely used algorithms,GATK and Freebayes,were calculated by Poisson model incorporating base sequencing error potential;(2)The variants with high mapping quality or consistently identified from at least two samples by GATK and Freebayes were used to construct the raw high-confidence identification(rHID)variants database;(3)The high confidence variants identified in single sample were ordered by probability value and controlled by false discovery rate(FDR)using rHID database;(4)To avoid the elimination of potentially true variants from rHID database,the vari-ants that failed FDR were reexamined to rescued potential true variants and ensured high accurate identification variants.The results indicated that the percent of concordant SNPs and Indels from Freebayes and GATK after our new method were significantly improved 12%-32%compared with raw variants and advantageously found low frequency variants of individual sheep involved several traits including nipples number(GPC5),scrapie pathology(PAPSS2),sea-sonal reproduction and litter size(GRM1),coat color(RAB27A),and lentivirus susceptibility(TMEM154).Conclusion The new method used the computational strategy to reduce the number of false positives,and simulta-neously improve the identification of genetic variants.This strategy did not incur any extra cost by using any addi-tional samples or sequencing data information and advantageously identified rare variants which can be important for practical applications of animal breeding. 展开更多
关键词 Computational framework Genetic variants Multiple samples SHEEP
在线阅读 下载PDF
PAMPHLET:PAM Prediction HomoLogous-Enhancement Toolkit for precise PAM prediction in CRISPR-Cas systems
3
作者 Chen Qi Xuechun Shen +6 位作者 Baitao Li Chuan Liu Lei Huang Hongxia Lan Donglong Chen Yuan Jiang Dan Wang 《Journal of Genetics and Genomics》 2025年第2期258-268,共11页
CRISPR-Cas technology has revolutionized our ability to understand and engineer organisms,evolving from a singular Cas9 model to a diverse CRISPR toolbox.A critical bottleneck in developing new Cas proteins is identif... CRISPR-Cas technology has revolutionized our ability to understand and engineer organisms,evolving from a singular Cas9 model to a diverse CRISPR toolbox.A critical bottleneck in developing new Cas proteins is identifying protospacer adjacent motif(PAM)sequences.Due to the limitations of experimental methods,bioinformatics approaches have become essential.However,existing PAM prediction programs are limited by the small number of spacers in CRISPR-Cas systems,resulting in low accuracy.To address this,we develop PAMPHLET,a pipeline that uses homology searches to identify additional spacers,significantly increasing the number of spacers up to 18-fold.PAMPHLET is validated on 20 CRISPR-Cas systems and successfully predicts PAM sequences for 18 protospacers.These predictions are further validated using the DocMF platform,which characterizes protein-DNA recognition patterns via next-generation sequencing.The high consistency between PAMPHLET predictions and DocMF results for Cas proteins demonstrates the potential of PAMPHLET to enhance PAM sequence prediction accuracy,expedite the discovery process,and accelerate the development of CRISPR tools. 展开更多
关键词 CRISPR-Cas Protospacer adjacentmotif Genome editing PAM prediction Computational framework
原文传递
Hypergraph Computation
4
作者 Yue Gao Shuyi Ji +1 位作者 Xiangmin Han Qionghai Dai 《Engineering》 SCIE EI CAS CSCD 2024年第9期188-201,共14页
Practical real-world scenarios such as the Internet,social networks,and biological networks present the challenges of data scarcity and complex correlations,which limit the applications of artificial intelligence.The ... Practical real-world scenarios such as the Internet,social networks,and biological networks present the challenges of data scarcity and complex correlations,which limit the applications of artificial intelligence.The graph structure is a typical tool used to formulate such correlations,it is incapable of modeling highorder correlations among different objects in systems;thus,the graph structure cannot fully convey the intricate correlations among objects.Confronted with the aforementioned two challenges,hypergraph computation models high-order correlations among data,knowledge,and rules through hyperedges and leverages these high-order correlations to enhance the data.Additionally,hypergraph computation achieves collaborative computation using data and high-order correlations,thereby offering greater modeling flexibility.In particular,we introduce three types of hypergraph computation methods:①hypergraph structure modeling,②hypergraph semantic computing,and③efficient hypergraph computing.We then specify how to adopt hypergraph computation in practice by focusing on specific tasks such as three-dimensional(3D)object recognition,revealing that hypergraph computation can reduce the data requirement by 80%while achieving comparable performance or improve the performance by 52%given the same data,compared with a traditional data-based method.A comprehensive overview of the applications of hypergraph computation in diverse domains,such as intelligent medicine and computer vision,is also provided.Finally,we introduce an open-source deep learning library,DeepHypergraph(DHG),which can serve as a tool for the practical usage of hypergraph computation. 展开更多
关键词 High-order correlation Hypergraph structure modeling Hypergraph semantic computing Efficient hypergraph computing Hypergraph computation framework
在线阅读 下载PDF
A machine-learning framework for accelerating spin-lattice relaxation simulations
5
作者 Valerio Briganti Alessandro Lunghi 《npj Computational Materials》 2025年第1期621-629,共9页
Molecular and lattice vibrations are able to couple to the spin of electrons and lead to their relaxation and decoherence.Ab initio simulations have played a fundamental role in shaping our understanding of this proce... Molecular and lattice vibrations are able to couple to the spin of electrons and lead to their relaxation and decoherence.Ab initio simulations have played a fundamental role in shaping our understanding of this process but further progress is hindered by their high computational cost.Here we present an accelerated computational framework based on machine-learning models for the prediction of molecular vibrations and spin-phonon coupling coefficients.We apply this method to three open-shell coordination compounds exhibiting long relaxation times and show that this approach achieves semito-full quantitative agreement with ab initio methods reducing the computational cost by about 80%.Moreover,we show that this framework naturally extends to molecular dynamics simulations,paving the way to the study of spin relaxation in condensed matter beyond simple equilibrium harmonic thermal baths. 展开更多
关键词 molecular vibrations machine learning molecular lattice vibrations couple spin accelerated computational framework spin lattice relaxation relaxation decoherenceab initio simulations spin phonon coupling
原文传递
High-throughput computational framework for high-order anharmonic thermal transport in cubic and tetragonal crystals
6
作者 Zhi Li Huiju Lee +1 位作者 Chris Wolverton Yi Xia 《npj Computational Materials》 2025年第1期4656-4671,共16页
Accurate first-principles prediction of lattice thermal conductivity(κ_(L))remains challenging in identifying materials with extreme thermal behavior.While the harmonic approximation with threephonon scattering(HA+3p... Accurate first-principles prediction of lattice thermal conductivity(κ_(L))remains challenging in identifying materials with extreme thermal behavior.While the harmonic approximation with threephonon scattering(HA+3ph)is now routine,reliableκ_(L)prediction often requires higher-order anharmonic effects,including self-consistent phonon renormalization,three-and four-phonon scattering,and off-diagonal heat flux(SCPH+3,4ph+OD).We present a state-of-the-art highthroughput workflow that unifies these effects and apply it to 773 cubic and tetragonal crystals spanning diverse chemistries and structures.From 562 dynamically stable compounds,weassess the hierarchical impacts of higher-order anharmonicity.For around 60%of materials,HA+3ph predictions closely match those from SCPH+3,4ph+OD.SCPH generally increasesκ_(L),by over 8 times in extreme cases,whereas four-phonon scattering universally suppressesκ_(L),sometimes to 15%of the HA+3ph value.Off-diagonal contributions are negligible in high-κ_(L)systems but can rival diagonal terms in highly anharmonic low-κ_(L)compounds.We highlight four case studies,Rb_(2)TlAlH_(6),Cu_(3)VSe_(4),CuBr,and KTlCl_(4),that exhibit distinct extreme behaviors.This work delivers not only a robust workflow for high-fidelityκ_(L)dataset but also a quantitative framework to determine when higher-order effects are essential.The hierarchy ofκ_(L)results,from the HA+3ph to SCPH+3,4ph+OD level,offers a scalable,interpretable route to discovering next-generation extreme thermal materials. 展开更多
关键词 thermal transport high throughput computational framework identifying materials extreme thermal behaviorwhile high order anharmonicity harmonic approximation threephonon scattering ha ph lattice thermal conductivity l remains first principles prediction
原文传递
A location-based fog computing optimization of energy management in smart buildings:DEVS modeling and design of connected objects
7
作者 Abdelfettah MAATOUG Ghalem BELALEM Saïd MAHMOUDI 《Frontiers of Computer Science》 SCIE EI CSCD 2023年第2期179-195,共17页
Nowadays,smart buildings rely on Internet of things(loT)technology derived from the cloud and fog computing paradigms to coordinate and collaborate between connected objects.Fog is characterized by low latency with a ... Nowadays,smart buildings rely on Internet of things(loT)technology derived from the cloud and fog computing paradigms to coordinate and collaborate between connected objects.Fog is characterized by low latency with a wider spread and geographically distributed nodes to support mobility,real-time interaction,and location-based services.To provide optimum quality of user life in moderm buildings,we rely on a holistic Framework,designed in a way that decreases latency and improves energy saving and services efficiency with different capabilities.Discrete EVent system Specification(DEVS)is a formalism used to describe simulation models in a modular way.In this work,the sub-models of connected objects in the building are accurately and independently designed,and after installing them together,we easily get an integrated model which is subject to the fog computing Framework.Simulation results show that this new approach significantly,improves energy efficiency of buildings and reduces latency.Additionally,with DEVS,we can easily add or remove sub-models to or from the overall model,allowing us to continually improve our designs. 展开更多
关键词 smart building energy consumption IOT fog computing framework DEVS simulation models
原文传递
Network Diffusion Framework to Simulate Spreading Processes in Complex Networks
8
作者 MichałCzuba Mateusz Nurek +5 位作者 Damian Serwata Yu-Xuan Qiu Mingshan Jia Katarzyna Musial Radosław Michalski Piotr Bródka 《Big Data Mining and Analytics》 EI CSCD 2024年第3期637-654,共18页
With the advancement of computational network science,its research scope has significantly expanded beyond static graphs to encompass more complex structures.The introduction of streaming,temporal,multilayer,and hyper... With the advancement of computational network science,its research scope has significantly expanded beyond static graphs to encompass more complex structures.The introduction of streaming,temporal,multilayer,and hypernetwork approaches has brought new possibilities and imposed additional requirements.For instance,by utilising these advancements,one can model structures such as social networks in a much more refined manner,which is particularly relevant in simulations of the spreading processes.Unfortunately,the pace of advancement is often too rapid for existing computational packages to keep up with the functionality updates.This results in a significant proliferation of tools used by researchers and,consequently,a lack of a universally accepted technological stack that would standardise experimental methods(as seen,e.g.,in machine learning).This article addresses that issue by presenting an extended version of the Network Diffusion library.First,a survey of the existing approaches and toolkits for simulating spreading phenomena is shown,and then,an overview of the framework functionalities.Finally,we report four case studies conducted with the package to demonstrate its usefulness:the impact of sanitary measures on the spread of COVID-19,the comparison of information diffusion on two temporal network models,and the effectiveness of seed selection methods in the task of influence maximisation in multilayer networks.We conclude the paper with a critical assessment of the library and the outline of still awaiting challenges to standardise research environments in computational network science. 展开更多
关键词 computational framework seed selection influence maximisation spreading models temporal networks multilayer networks network science network control
原文传递
Comparative study of heat transfer in MHD multilayer flows
9
作者 Mahesha Rudrappa Nalinakshi Narasappa Sravan Kumar Thavadaa 《International Journal of Fluid Engineering》 2025年第3期71-82,共12页
This paper investigates mixed convection heat transfer in vertical multilayer flow in a system consisting of a viscous fluid flanked by nanofluids in a porous medium,taking account of magnetohydrodynamic(MHD)and radia... This paper investigates mixed convection heat transfer in vertical multilayer flow in a system consisting of a viscous fluid flanked by nanofluids in a porous medium,taking account of magnetohydrodynamic(MHD)and radiation effects and internal heat generation.The thermal conductivity of the nanofluids is analyzed using the Maxwell-Garnett and Patel models.A computational framework for solving the governing nonlinear differential equations using an analytical and perturbative approach is established,to provide accurate predictions of heat transfer characteristics.The interplay between the viscous fluid and the nanofluids in the presence of MHD effects introduces complex thermal and fluid dynamic interactions,highlighting the need for innovative modeling approaches.The results obtained provides enhanced understanding of multiphase flow behavior in the presence of internal heat generation and external magnetic fields.They will contribute to the development of methods for optimizing heat transfer in advanced thermal management applications such as nuclear reactor cooling,medical management of hyperthermia,and industrial energy systems. 展开更多
关键词 computational framework heat transfer analytical perturbative approach viscous fluid mixed convection mixed convection heat transfer porous mediumtaking thermal conductivity
在线阅读 下载PDF
Machine learning accelerated descriptor design for catalyst discovery in CO_(2)to methanol conversion
10
作者 Prajwal Pisal Ondřej Krejčí Patrick Rinke 《npj Computational Materials》 2025年第1期2260-2268,共9页
Transforming CO_(2)into methanol represents a crucial step towards closing the carbon cycle,with thermoreduction technology nearing industrial application.However,obtaining high methanol yields and ensuring the stabil... Transforming CO_(2)into methanol represents a crucial step towards closing the carbon cycle,with thermoreduction technology nearing industrial application.However,obtaining high methanol yields and ensuring the stability of heterocatalysts remain significant challenges.Herein,we present a sophisticated computational framework to accelerate the discovery of thermal heterogeneous catalysts,using machine-learned force fields.We propose a new catalytic descriptor,termed adsorption energy distribution,that aggregates the binding energies for different catalyst facets,binding sites,and adsorbates.The descriptor is versatile and can be adjusted to a specific reaction through careful choice of the key-step reactants and reaction intermediates.By applying unsupervised machine learning and statistical analysis to a dataset comprising nearly 160 metallic alloys,we offer a powerful tool for catalyst discovery.We propose new promising candidates such as ZnRh and ZnPt_(3),which to our knowledge,have not yet been tested,and discuss their possible advantage in terms of stability. 展开更多
关键词 computational framework catalytic descriptortermed adsorption energy distributionthat CO methanol conversion catalyst discovery thermoreduction technology heterogeneous catalystsusing closing carbon cyclewith
原文传递
Generative active learning across polymer architectures and solvophobicities for targeted rheological behavior
11
作者 Shengli Jiang Michael A.Webb 《npj Computational Materials》 2025年第1期4457-4470,共14页
Modifying solution viscosity is a key functional application of polymers,yet the interplay of molecular chemistry,polymer architecture,and intermolecular interactions makes tailoring precise rheological responses chal... Modifying solution viscosity is a key functional application of polymers,yet the interplay of molecular chemistry,polymer architecture,and intermolecular interactions makes tailoring precise rheological responses challenging.We introduce a computational framework coupling topology-aware generative machine learning,Gaussian process modeling,and multiparticle collision dynamics to design polymers yielding prescribed shear-rate-dependent viscosity profiles.Targeting thirty rheological profiles of varying difficulty,Bayesian optimization identifies polymers that satisfy all lowand most medium-difficulty targets by modifying topology and solvophobicity,with other variables fixed.In these regimes,wefind and explain design degeneracy,where distinct polymers produce nearidentical rheological profiles.However,satisfying high-difficulty targets requires extrapolation beyond the initial constrained design space;this is rationally guided by physical scaling theories.This integrated framework establishes a data-driven yet mechanistic route to rational polymer design. 展开更多
关键词 multiparticle collision dynamics thirty rheological profiles computational framework process modelingand tailoring precise rheological responses intermolecular interactions modifying solution viscosity molecular chemistrypolymer
原文传递
EMFF-2025:a general neural network potential for energeticmaterials with C,H,N,and O elements
12
作者 Mingjie Wen Jiahe Han +3 位作者 Wenjuan Li Xiaoya Chang Qingzhao Chu Dongping Chen 《npj Computational Materials》 2025年第1期3619-3634,共16页
The discovery and optimization of high-energy materials(HEMs)face challenges due to the computational expense and slow iteration of traditional methods.Neural network potentials(NNPs)have emerged as an efficient alter... The discovery and optimization of high-energy materials(HEMs)face challenges due to the computational expense and slow iteration of traditional methods.Neural network potentials(NNPs)have emerged as an efficient alternative to first-principles simulations.This study presents EMFF-2025,a general NNP model for C,H,N,and O-based HEMs,leveraging transfer learning with minimal data from DFT calculations.The model achieves DFT-level accuracy,predicting the structure,mechanical properties,and decomposition characteristics of 20 HEMs.Integrating EMFF-2025 with PCA and correlation heatmaps,we map the chemical space and structural evolution of these HEMs across temperatures.Surprisingly,EMFF-2025 uncovers that most HEMs follow similar hightemperature decomposition mechanisms,challenging the conventional view of material-specific behavior.EMFF-2025 offers a versatile computational framework for accelerating HEM design and optimization. 展开更多
关键词 transfer learning traditional methodsneural network potentials nnps computational framework energetic materials decomposition mechanisms neural network potential high energy materials
原文传递
Generalized first-principles prediction of hydrogen para-equilibrium thermodynamics in metal hydrides
13
作者 Peter Hannappel Matthew T.Curnan +4 位作者 Geun Ho Gu Mauro Palumbo Mateusz Balcerzak Thomas Weißgärber Felix Heubner 《npj Computational Materials》 2025年第1期4353-4362,共10页
Accurate first-principles-based prediction of the pressure-composition-temperature(PCT)relationships of metal hydrides can enable predictive optimization of hydrogen capacities and pressures.In this work,we introduce ... Accurate first-principles-based prediction of the pressure-composition-temperature(PCT)relationships of metal hydrides can enable predictive optimization of hydrogen capacities and pressures.In this work,we introduce a novel computational framework that integrates density functional theory(DFT)with a Python-based PCT Simulation Toolkit to predict PCT diagrams with high accuracy.By using only structural input data from the metallic phase,this toolkit automates the detection of interstitial voids,generates input files for DFT calculations,and constructs thermodynamic models based on para-equilibrium principles.We validate this approach across five major metal-hydride classes–BCC and FCC alloys,AB_(5),AB_(2),and AB compounds-and demonstrate that even with minimal computational effort,key hydrogen sorption characteristics can be reliably determined.Using the PBE functional without vibrational contribution,our results show that hydrogen capacity predictions achieve a mean accuracy of 95%,while sorption pressures are modeled within one order of magnitude of experimental values.Furthermore,our method can implicitly account for the phase transition in metal hydrides and can reliably model multicomponent alloys with representative alloys of lesser chemical complexity.This framework enables rapid and accurate exploration of metal hydrides to design alloys for new applications. 展开更多
关键词 density functional theory dft computational framework metal hydrides detection interstitial voidsgenerates hydrogen thermodynamics pressure composition temperature relationships density functional theory
原文传递
Multiscale modeling of vacancy-cluster interactions and solute clustering kinetics in multicomponent alloys
14
作者 Zhucong Xi Louis G.Hector Jr +1 位作者 Amit Misra Liang Qi 《npj Computational Materials》 2025年第1期4036-4052,共17页
Prediction of solute clustering kinetics in aged multicomponent alloys requires a quantitative understanding of complex vacancy-cluster interactions across multiple scales.Here,we develop an integrated computational f... Prediction of solute clustering kinetics in aged multicomponent alloys requires a quantitative understanding of complex vacancy-cluster interactions across multiple scales.Here,we develop an integrated computational framework combining on-lattice kinetic Monte Carlo(KMC)simulations,absorbing Markov chain models,and mesoscale cluster dynamics(CD)to investigate these interactions in Al-Mg-Zn alloys.The Markov chain model yields vacancy escape times from solute clusters and identifies a two-stage behavior of the vacancy-cluster binding energy.These binding energies are used to estimate residual vacancy concentrations in the Al matrix after quenching,which serve as critical inputs to CD simulations to predict long-term cluster evolution kinetics during natural aging.Our results quantitatively demonstrate the significant impact of quench rate on natural aging kinetics.Results provide insights to guide alloy chemistry,quench rates,and aging time at finite temperatures to control the evolution of solute clusters and eventual precipitates in aged multicomponent alloys. 展开更多
关键词 vacancy escape times solute clusters mesoscale cluster dynamics cd prediction solute clustering kinetics multiscale modeling markov chain model solute clustering kinetics integrated computational framework vacancy cluster interactions
原文传递
Design and implementation of a software architecture for 3D-DDA 被引量:1
15
作者 CHENG XiaoLong XIAO Jun +1 位作者 MIAO QingHai WANG Ying 《Science China(Technological Sciences)》 SCIE EI CAS CSCD 2015年第9期1604-1608,共5页
The three-dimensional discontinuous deformation analysis(3D-DDA) is a promising numerical method for both static and dynamic analyses of rock systems. Lacking mature software, its popularity is far behind its ability.... The three-dimensional discontinuous deformation analysis(3D-DDA) is a promising numerical method for both static and dynamic analyses of rock systems. Lacking mature software, its popularity is far behind its ability. To address this problem, this paper presents a new software architecture from a software engineering viewpoint. Based on 3D-DDA characteristics, the implementation of the proposed architecture has the following merits. Firstly, the software architecture separates data, computing, visualization, and signal control into individual modules. Secondly, data storage and parallel access are fully considered for different conditions. Thirdly, an open computing framework is provided which supports most numerical computing methods; common tools for equation solving and parallel computing are provided for further development. Fourthly, efficient visualization functions are provided by integrating a variety of visualization algorithms. A user-friendly graphical user interface is designed to improve the user experience. Finally, through a set of examples, the software is verified against both analytical solutions and the original code by Dr. Shi Gen Hua. 展开更多
关键词 3D-DDA software architecture 3D-DDA data structure open computing framework efficient visualization
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部