摘要
随着大数据技术和人工智能技术的高速发展,科学数据已成为推动科研范式变革的核心科技资源。生物多样性科学数据与生物资源本身一样,已成为国家的重要战略资源,是国际科技与产业的竞争热点和战略制高点。针对当前国内植物多样性科学数据在开放共享方面与国际主流数据库之间存在差距的问题,本研究提出并实施建设植物多样性科学数据应用程序编程接口(Application Programming Interfaces,APIs)。基于中国科学院昆明植物研究所科学数据中心已收集并整理的植物名称、物种信息、图片元数据、种质元数据、DNA条形码、标本元数据、图书和文献元数据等植物科学数据,通过对各类植物科学数据进行编目,设计了APIs的业务流程和标准规范,采用了模块化的开发方法研发建成了植物多样性科学数据APIs,实现了对各类植物科学数据的任意属性字段组合查询、数据高效检索、数据安全以及接口权限控制等功能,从而助力推进国内植物多样性科学数据接口标准的建立与完善,提升我国植物多样性科学数据的共享水平,促进植物多样性的研究,为生物多样性保护提供数据和技术支撑,同时也可以为其他学科领域的科学数据APIs的建设提供参考。
With the rapid development of big data technology and artificial intelligence technology,scientific data have become one of the core scientific and technological resources driving the transformation of scientific research paradigm.Biodiversity scientific data,like biological resources themselves,have become an important national strategic and a focal point of international competition in science,technology,and industry.To address the gap between domestic plant diversity scientific data and international mainstream databases in terms of openness and sharing,Application Programming Interfaces(APIs)for plant diversity scientific data are proposed and implemented in this study.Based on plant scientific data,including plant name,species information,picture metadata,germplasm metadata,DNA barcode,specimen metadata,as well as book and literature metadata collected and organized by the Scientific Data Center,Kunming Institute of Botany,Chinese Academy of Sciences,the APIs business process and standard specifications were designed through cataloging of diverse plant scientific data.Using a modular development method,the APIs for plant diversity scientific data were developed and built,enabling functions such as flexible combination queries of arbitrary attribute fields,efficient data retrieval,data security assurance,and interface access control.These APIs contribute to the establishment and improvement of domestic standards for plant diversity scientific data interfaces,enhance the level of data sharing level of plant diversity scientific data in China,promote research on plant diversity,and provide data and technical support for biodiversity conservation.In addition,the proposed framework can serve as a reference for the construction of scientific data APIs in other disciplinary domines.
作者
邱金水
张建文
金涛
王朋
杜宁
黄蓉
庄会富
QIU Jinshui;ZHANG Jianwen;JIN Tao;WANG Peng;DU Ning;HUANG Rong;ZHUANG Huifu(Scientific Data Center,Kunming Institute of Botany,Chinese Academy of Sciences,Kunming 650201,P.R.China;State Key Laboratory of Phytochemistry and Natural Medicines,Kunming 650201,P.R.China;National Wild Plant Germplasm Resource Center,Kunming 650201,P.R.China)
基金
云南省技术创新人才(202405AD350053)
中国科学院技术支撑人才(KIB202303)
中国科学院网络安全和信息化专项(CAS-WX2022SDC-SJ01)
中国科学院青年创新促进会会员支持项目(2022397)资助
云南省科技人才与平台计划(202305AH340005)
云南省重大科技专项(202402AE090039)。