Despite recent encouragement to follow the FAIR principles,the day-to-day research practices have not changed substantially.Due to new developments and the increasing pressure to apply best practices,initiatives to im...Despite recent encouragement to follow the FAIR principles,the day-to-day research practices have not changed substantially.Due to new developments and the increasing pressure to apply best practices,initiatives to improve the efficiency and reproducibility of scientific workflows are becoming more prevalent.In this article,we discuss the importance of well-annotated tools and the specific requirements to ensure reproducible research with FAIR outputs.We detail how Galaxy,an open-source workflow management system with a web-based interface,has implemented the concepts that are put forward by the Canonical Workflow Framework for Research(CWFR),whilst minimising changes to the practices of scientific communities.Although we showcase concrete applications from two different domains,this approach is generalisable to any domain and particularly useful in interdisciplinary research and science-based applications.展开更多
Data harmonization and documentation of the data processing are essential prerequisites for enabling Canonical Analysis Workflows.The recently revised Terabyte-scale air quality database system,which the Tropospheric ...Data harmonization and documentation of the data processing are essential prerequisites for enabling Canonical Analysis Workflows.The recently revised Terabyte-scale air quality database system,which the Tropospheric Ozone Assessment Report(TOAR)created,contains one of the world's largest collections of near-surface air quality measurements and considers FAIR data principles as an integral part.A special feature of our data service is the on-demand processing and product generation of several air quality metrics directly from the underlying database.In this paper,we show that the necessary data harmonization for establishing such online analysis services goes much deeper than the obvious issues of common data formats,variable names,and measurement units,and we explore how the generation of FAIR Digital Objects(FDO)in combination with automatically generateddocumentation may support Canonical Analysis Workflows for airquality and related data.展开更多
We introduce the concept of Canonical Workflow Building Blocks(CWBB),a methodology of describing and wrapping computational tools,in order for them to be utilised in a reproducible manner from multiple workflow langua...We introduce the concept of Canonical Workflow Building Blocks(CWBB),a methodology of describing and wrapping computational tools,in order for them to be utilised in a reproducible manner from multiple workflow languages and execution platforms.The concept is implemented and demonstrated with the BioExcel Building Blocks library(BioBB),a collection of tool wrappers in the field of computational biomolecular simulation.Interoperability across different workflow languages is showcased through a protein Molecular Dynamics setup transversal workflow,built using this library and run with 5 different Workflow Manager Systems(WfMS).We argue such practice is a necessary requirement for FAIR Computational Workflows and an element of Canonical Workflow Frameworks for Research(CWFR)in order to improve widespread adoption and reuse of computational methods across workflow language barriers.展开更多
基金supported by the European Union's Horizon 2020 programme(No.857652,EOSCNordic,and No.101017501,RELIANCE)supported by BMBF grants 031 A538A/A538C(de.NBI-RBC),NFDI7/1-42077441(DataPLANT)and 031L0101C(de.NBI-epi)+2 种基金supported by Research Foundation-Flanders(FWO)for ELIXIR Belgium(I002819N)the European Union's Horizon 2020 programme(No.824087,EOSC-Life)partially supported by the Research Council of Norway's grant 270068 for ELIXIR Norway.
文摘Despite recent encouragement to follow the FAIR principles,the day-to-day research practices have not changed substantially.Due to new developments and the increasing pressure to apply best practices,initiatives to improve the efficiency and reproducibility of scientific workflows are becoming more prevalent.In this article,we discuss the importance of well-annotated tools and the specific requirements to ensure reproducible research with FAIR outputs.We detail how Galaxy,an open-source workflow management system with a web-based interface,has implemented the concepts that are put forward by the Canonical Workflow Framework for Research(CWFR),whilst minimising changes to the practices of scientific communities.Although we showcase concrete applications from two different domains,this approach is generalisable to any domain and particularly useful in interdisciplinary research and science-based applications.
文摘Data harmonization and documentation of the data processing are essential prerequisites for enabling Canonical Analysis Workflows.The recently revised Terabyte-scale air quality database system,which the Tropospheric Ozone Assessment Report(TOAR)created,contains one of the world's largest collections of near-surface air quality measurements and considers FAIR data principles as an integral part.A special feature of our data service is the on-demand processing and product generation of several air quality metrics directly from the underlying database.In this paper,we show that the necessary data harmonization for establishing such online analysis services goes much deeper than the obvious issues of common data formats,variable names,and measurement units,and we explore how the generation of FAIR Digital Objects(FDO)in combination with automatically generateddocumentation may support Canonical Analysis Workflows for airquality and related data.
基金a project funded by the European Union contracts H2020-INFRAEDI-02-2018823830,and H2020-EINFRA-2015-1675728funded through EOSC-Life(https://www.eosc-life.eu)contract H2020-INFRAEOSC-2018-2824087ELIXIR-CONVERGE(https://elixir-europe.org)contract H2020-INFRADEV-2019-2871075.
文摘We introduce the concept of Canonical Workflow Building Blocks(CWBB),a methodology of describing and wrapping computational tools,in order for them to be utilised in a reproducible manner from multiple workflow languages and execution platforms.The concept is implemented and demonstrated with the BioExcel Building Blocks library(BioBB),a collection of tool wrappers in the field of computational biomolecular simulation.Interoperability across different workflow languages is showcased through a protein Molecular Dynamics setup transversal workflow,built using this library and run with 5 different Workflow Manager Systems(WfMS).We argue such practice is a necessary requirement for FAIR Computational Workflows and an element of Canonical Workflow Frameworks for Research(CWFR)in order to improve widespread adoption and reuse of computational methods across workflow language barriers.