Research data infrastructures form the cornerstone in both cyber and physical spaces,driving the progression of the data-intensive scientific research paradigm.This opinion paper presents an overview of global researc...Research data infrastructures form the cornerstone in both cyber and physical spaces,driving the progression of the data-intensive scientific research paradigm.This opinion paper presents an overview of global research data infrastructure,drawing insights from national roadmaps and strategic documents related to research data infrastructure.It emphasizes the pivotal role of research data infrastructures by delineating four new missions aimed at positioning them at the core of the current scientific research and communication ecosystem.The four new missions of research data infrastructures are:(1)as a pioneer,to transcend the disciplinary border and address complex,cutting-edge scientific and social challenges with problem-and data-oriented insights;(2)as an architect,to establish a digital,intelligent,flexible research and knowledge services environment;(3)as a platform,to foster the high-end academic communication;(4)as a coordinator,to balance scientific openness with ethics needs.展开更多
Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a ...Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a library-based university-wide open research data repository project and related RDM services implementation process including project kickoff,needs assessment,partnerships establishment,software investigation and selection,software customization,as well as data curation services and training.Through the review,some issues revealed during the stages of the implementation process are also discussed and addressed in the paper such as awareness of research data,demands from data providers and users,data policies and requirements from home institution,requirements from funding agencies and publishers,the collaboration between administrative units and libraries,and concerns from data providers and users.The significance of the study is that the paper shows an example of creating an Open Data repository and RDM services for other Chinese academic libraries planning to implement their RDM services for their home institutions.The authors of the paper have also observed since the PKU-ORDR and RDM services implemented in 2015,the Peking University Library(PKUL)has helped numerous researchers to support the entire research life cycle and enhanced Open Science(OS)practices on campus,as well as impacted the national OS movement in China through various national events and activities hosted by the PKUL.展开更多
The increased number of data repositories has greatly increased the availability of open data.To enable broad discovery and access to research dataset,some data repositories have begun leveraging the web architecture ...The increased number of data repositories has greatly increased the availability of open data.To enable broad discovery and access to research dataset,some data repositories have begun leveraging the web architecture by embedding structured metadata markup in dataset web landing pages using vocabularies from Schema.org and extensions.This paper aims to examine metadata interoperability for supporting global data discovery.Specifically,the paper reports a survey on which metadata schema has been adopted by participating data repositories,and presents an analysis of crosswalks from fourteen research data schemas to Schema.org.The analysis indicates most descriptive metadata are interoperable among the schemas,the most inconsistent mapping is the rights metadata,and a large gap exists in the structural metadata and controlled vocabularies to specify various property values.The analysis and collated crosswalks can serve as a reference for data repositories when they develop crosswalks from their own schemas to Schema.org,and provide the research data community a benchmark of structured metadata implementation.展开更多
The UK Catalysis Hub(UKCH)is designing a virtual research environment to support data processing and analysis,the Catalysis Research Workbench(CRW).The development of this platform requires identifying the processing ...The UK Catalysis Hub(UKCH)is designing a virtual research environment to support data processing and analysis,the Catalysis Research Workbench(CRW).The development of this platform requires identifying the processing and analysis needs of the UKCH members and mapping them to potential solutions.This paper presents a proposal for a demonstrator to analyse the use of scientific workflows for large scale data processing.The demonstrator provides a concrete target to promote further discussion of the processing and analysis needs of the UKCH community.In this paper,we will discuss the main requirements for data processing elicited and the proposed adaptations that will be incorporated in the design of the CRW and how to integrate the proposed solutions with existing practices of the UKCH.The demonstrator has been used in discussion with researchers and in presentations to the UKCH community,generating increased interest and motivating furtherdevelopment.展开更多
Federated Research Data Infrastructures aim to provide seamless access to research data along with services to facilitate the researchers in performing their data management tasks.During our research on Open Science(O...Federated Research Data Infrastructures aim to provide seamless access to research data along with services to facilitate the researchers in performing their data management tasks.During our research on Open Science(OS),we have built cross-disciplinary federated infrastructures for different types of(open)digital resources:Open Data(OD),Open Educational Resources(OER),and open access documents.In each case,our approach targeted only the resource“metadata”.Based on this experience,we identified some challenges that we had to overcome again and again:lack of(i)harvesters,(ii)common metadata models and(iii)metadata mapping tools.In this paper,we report on the challenges we faced in the federated infrastructure projects we were involved with.We structure the report based on the three challenges listed above.展开更多
Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the...Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the definition, classification, and establishment of databases. Afterward, the workflow for establishing databases, inputting data, verifying data, and managing databases is presented. Meanwhile, by discussing the application of databases in clinical research, we illuminate the important role of databases in clinical research practice. Lastly, we introduce the reanalysis of randomized controlled trials(RCTs) and cloud computing techniques, showing the most recent advancements of databases in clinical research.展开更多
The research value and market potentials of "big data" in real estate industry are well acknowledged with the development of technologies. But research in this area is far away from systematic and thorough c...The research value and market potentials of "big data" in real estate industry are well acknowledged with the development of technologies. But research in this area is far away from systematic and thorough context. Aiming at this issue, we systematically examined the research outcomes related to real estate big data. It gives a comment to current research status and proposes the future directions in this area from the four aspects, i.e. the hierarchical structuring of real estate "Big Data", integrated implementation, exchange and pricing systems, and the market operation system, in order to assist the researchers for their future works.展开更多
This article has explored the relationships between data and theory in qualitative research from an enthnographic perspective. It has also explicated the discovery of theory from data systematically obtained from enth...This article has explored the relationships between data and theory in qualitative research from an enthnographic perspective. It has also explicated the discovery of theory from data systematically obtained from enthnographic research, the Participant Observation approach based on inductive logic, that is, grounded theory.展开更多
Unlike consumers in the mall or supermarkets, online consumers are “intangible” and their purchasing behaviors are affected by multiple factors, including product pricing, promotion and discounts, quality of product...Unlike consumers in the mall or supermarkets, online consumers are “intangible” and their purchasing behaviors are affected by multiple factors, including product pricing, promotion and discounts, quality of products and brands, and the platforms where they search for the product. In this research, I study the relationship between product sales and consumer characteristics, the relationship between product sales and product qualities, demand curve analysis, and the search friction effect for different platforms. I utilized data from a randomized field experiment involving more than 400 thousand customers and 30 thousand products on JD.com, one of the world’s largest online retailing platforms. There are two focuses of the research: 1) how different consumer characteristics affect sales;2) how to set price and possible search friction for different channels. I find that JD plus membership, education level and age have no significant relationship with product sales, and higher user level leads to higher sales. Sales are highly skewed, with very high numbers of products sold making up only a small percentage of the total. Consumers living in more industrialized cities have more purchasing power. Women and singles lead to higher spending. Also, the better the product performs, the more it sells. Moderate pricing can increase product sales. Based on the research results of search volume in different channels, it is suggested that it is better to focus on app sales. By knowing the results, producers can adjust target consumers for different products and do target advertisements in order to maximize the sales. Also, an appropriate price for a product is also crucial to a seller. By the way, knowing the search friction of different channels can help producers to rearrange platform layout so that search friction can be reduced and more potential deals may be made.展开更多
背景:接受全膝关节置换患者人数在全球范围内逐年增加,全膝关节置换后的疼痛管理是一个重要的方面,因为有效的疼痛控制可以促进患者早期活动,减少并发症,提高患者满意度,并加快康复进程。目的:构建全膝关节置换后疼痛的可视化图谱,了解...背景:接受全膝关节置换患者人数在全球范围内逐年增加,全膝关节置换后的疼痛管理是一个重要的方面,因为有效的疼痛控制可以促进患者早期活动,减少并发症,提高患者满意度,并加快康复进程。目的:构建全膝关节置换后疼痛的可视化图谱,了解该领域的国际研究现状及趋势,为日后研究提供参考依据。方法:在中国知网、万方数据库、Web of Science核心数据库检索从2000年1月至2023年12月有关全膝关节置换患者术后疼痛的研究文献。使用CiteSpace(6.2.3版本)可视化软件分析文献年发文量、作者、机构、国家、关键词、参考文献等内容。并使用R语言(4.4.1版本)软件建立数据库进行折线图、条形图的绘制。结果与结论:①共纳入3796篇文献,包括中文文献3509篇、英文文献287篇。②在英文数据库中,美国为发文量最高的国家,哈佛大学则是发文量最多的机构;中文数据库中,广州中医药大学发文量居首。③通过关键词聚类分析,中文文献近5年突现关键词包括“生活质量”“恐动症”和“针刺”;英文文献的突现关键词则为“满意度”和“心理因素”。共现图和聚类图分析显示,机构-作者-文献呈现内部集体联系紧密,外部合作松散。④由于可视化分析方法排除了部分影响力较低的文献,研究结果可能存在一定的偏倚。⑤研究热点方面,国内外存在差异:国内研究侧重于镇痛手段的有效性论证和方式探索,而国外研究则更关注疼痛机制的细分及镇痛药物的改良与替代。未来研究的发展趋势预计将集中在手术后疼痛的中医药治疗、多模式镇痛联合应用以及疼痛分型机制的探究和预防。展开更多
基金the National Social Science Fund of China(Grant No.22CTQ031)Special Project on Library Capacity Building of the Chinese Academy of Sciences(Grant No.E2290431).
文摘Research data infrastructures form the cornerstone in both cyber and physical spaces,driving the progression of the data-intensive scientific research paradigm.This opinion paper presents an overview of global research data infrastructure,drawing insights from national roadmaps and strategic documents related to research data infrastructure.It emphasizes the pivotal role of research data infrastructures by delineating four new missions aimed at positioning them at the core of the current scientific research and communication ecosystem.The four new missions of research data infrastructures are:(1)as a pioneer,to transcend the disciplinary border and address complex,cutting-edge scientific and social challenges with problem-and data-oriented insights;(2)as an architect,to establish a digital,intelligent,flexible research and knowledge services environment;(3)as a platform,to foster the high-end academic communication;(4)as a coordinator,to balance scientific openness with ethics needs.
文摘Research Data Management(RDM)has become increasingly important for more and more academic institutions.Using the Peking University Open Research Data Repository(PKU-ORDR)project as an example,this paper will review a library-based university-wide open research data repository project and related RDM services implementation process including project kickoff,needs assessment,partnerships establishment,software investigation and selection,software customization,as well as data curation services and training.Through the review,some issues revealed during the stages of the implementation process are also discussed and addressed in the paper such as awareness of research data,demands from data providers and users,data policies and requirements from home institution,requirements from funding agencies and publishers,the collaboration between administrative units and libraries,and concerns from data providers and users.The significance of the study is that the paper shows an example of creating an Open Data repository and RDM services for other Chinese academic libraries planning to implement their RDM services for their home institutions.The authors of the paper have also observed since the PKU-ORDR and RDM services implemented in 2015,the Peking University Library(PKUL)has helped numerous researchers to support the entire research life cycle and enhanced Open Science(OS)practices on campus,as well as impacted the national OS movement in China through various national events and activities hosted by the PKUL.
文摘The increased number of data repositories has greatly increased the availability of open data.To enable broad discovery and access to research dataset,some data repositories have begun leveraging the web architecture by embedding structured metadata markup in dataset web landing pages using vocabularies from Schema.org and extensions.This paper aims to examine metadata interoperability for supporting global data discovery.Specifically,the paper reports a survey on which metadata schema has been adopted by participating data repositories,and presents an analysis of crosswalks from fourteen research data schemas to Schema.org.The analysis indicates most descriptive metadata are interoperable among the schemas,the most inconsistent mapping is the rights metadata,and a large gap exists in the structural metadata and controlled vocabularies to specify various property values.The analysis and collated crosswalks can serve as a reference for data repositories when they develop crosswalks from their own schemas to Schema.org,and provide the research data community a benchmark of structured metadata implementation.
基金funded by EPSRC grant:EP/R026939/1,EP/R026815/1,EP/R026645/1,EP/R027129/1 or EP/M013219/1(biocatalysis)part-funded by the European Regional Development Fund(ERDF)via Welsh Government.
文摘The UK Catalysis Hub(UKCH)is designing a virtual research environment to support data processing and analysis,the Catalysis Research Workbench(CRW).The development of this platform requires identifying the processing and analysis needs of the UKCH members and mapping them to potential solutions.This paper presents a proposal for a demonstrator to analyse the use of scientific workflows for large scale data processing.The demonstrator provides a concrete target to promote further discussion of the processing and analysis needs of the UKCH community.In this paper,we will discuss the main requirements for data processing elicited and the proposed adaptations that will be incorporated in the design of the CRW and how to integrate the proposed solutions with existing practices of the UKCH.The demonstrator has been used in discussion with researchers and in presentations to the UKCH community,generating increased interest and motivating furtherdevelopment.
文摘Federated Research Data Infrastructures aim to provide seamless access to research data along with services to facilitate the researchers in performing their data management tasks.During our research on Open Science(OS),we have built cross-disciplinary federated infrastructures for different types of(open)digital resources:Open Data(OD),Open Educational Resources(OER),and open access documents.In each case,our approach targeted only the resource“metadata”.Based on this experience,we identified some challenges that we had to overcome again and again:lack of(i)harvesters,(ii)common metadata models and(iii)metadata mapping tools.In this paper,we report on the challenges we faced in the federated infrastructure projects we were involved with.We structure the report based on the three challenges listed above.
基金supported by Fundamental Research Funds of State Key Laboratory of Ophthalmology (Grant No.2015QN01)Young Teacher Top-Support project of Sun Yat-sen University(Grant No.2015ykzd11)+4 种基金the Cultivation Projects for Young Teaching Staff of Sun Yat-sen University(Grant No.12ykpy61) from the Fundamental Research Funds for the Central Universitiesthe Pearl River Science and Technology New Star(Grant No.2014J2200060)Project of Guangzhou City,the Guangdong Provincial Natural Science Foundation for Distinguished Young Scholars of China(Grant No. 2014A030306030)Youth Science and Technology Innovation Talents Funds in Special Support Plan for High Level Talents in Guangdong Province(Grant No. 2014TQ01R573)Key Research Plan for National Natural Science Foundation of China in Cultivation Project (No.91546101)
文摘Widely used in clinical research, the database is a new type of data management automation technology and the most efficient tool for data management. In this article, we first explain some basic concepts, such as the definition, classification, and establishment of databases. Afterward, the workflow for establishing databases, inputting data, verifying data, and managing databases is presented. Meanwhile, by discussing the application of databases in clinical research, we illuminate the important role of databases in clinical research practice. Lastly, we introduce the reanalysis of randomized controlled trials(RCTs) and cloud computing techniques, showing the most recent advancements of databases in clinical research.
基金Funded partly by the Post-graduate Students’ ducation and Teaching Reform Program of Chongqing Education Committee(No.Yjg123089)
文摘The research value and market potentials of "big data" in real estate industry are well acknowledged with the development of technologies. But research in this area is far away from systematic and thorough context. Aiming at this issue, we systematically examined the research outcomes related to real estate big data. It gives a comment to current research status and proposes the future directions in this area from the four aspects, i.e. the hierarchical structuring of real estate "Big Data", integrated implementation, exchange and pricing systems, and the market operation system, in order to assist the researchers for their future works.
文摘This article has explored the relationships between data and theory in qualitative research from an enthnographic perspective. It has also explicated the discovery of theory from data systematically obtained from enthnographic research, the Participant Observation approach based on inductive logic, that is, grounded theory.
文摘Unlike consumers in the mall or supermarkets, online consumers are “intangible” and their purchasing behaviors are affected by multiple factors, including product pricing, promotion and discounts, quality of products and brands, and the platforms where they search for the product. In this research, I study the relationship between product sales and consumer characteristics, the relationship between product sales and product qualities, demand curve analysis, and the search friction effect for different platforms. I utilized data from a randomized field experiment involving more than 400 thousand customers and 30 thousand products on JD.com, one of the world’s largest online retailing platforms. There are two focuses of the research: 1) how different consumer characteristics affect sales;2) how to set price and possible search friction for different channels. I find that JD plus membership, education level and age have no significant relationship with product sales, and higher user level leads to higher sales. Sales are highly skewed, with very high numbers of products sold making up only a small percentage of the total. Consumers living in more industrialized cities have more purchasing power. Women and singles lead to higher spending. Also, the better the product performs, the more it sells. Moderate pricing can increase product sales. Based on the research results of search volume in different channels, it is suggested that it is better to focus on app sales. By knowing the results, producers can adjust target consumers for different products and do target advertisements in order to maximize the sales. Also, an appropriate price for a product is also crucial to a seller. By the way, knowing the search friction of different channels can help producers to rearrange platform layout so that search friction can be reduced and more potential deals may be made.
文摘背景:接受全膝关节置换患者人数在全球范围内逐年增加,全膝关节置换后的疼痛管理是一个重要的方面,因为有效的疼痛控制可以促进患者早期活动,减少并发症,提高患者满意度,并加快康复进程。目的:构建全膝关节置换后疼痛的可视化图谱,了解该领域的国际研究现状及趋势,为日后研究提供参考依据。方法:在中国知网、万方数据库、Web of Science核心数据库检索从2000年1月至2023年12月有关全膝关节置换患者术后疼痛的研究文献。使用CiteSpace(6.2.3版本)可视化软件分析文献年发文量、作者、机构、国家、关键词、参考文献等内容。并使用R语言(4.4.1版本)软件建立数据库进行折线图、条形图的绘制。结果与结论:①共纳入3796篇文献,包括中文文献3509篇、英文文献287篇。②在英文数据库中,美国为发文量最高的国家,哈佛大学则是发文量最多的机构;中文数据库中,广州中医药大学发文量居首。③通过关键词聚类分析,中文文献近5年突现关键词包括“生活质量”“恐动症”和“针刺”;英文文献的突现关键词则为“满意度”和“心理因素”。共现图和聚类图分析显示,机构-作者-文献呈现内部集体联系紧密,外部合作松散。④由于可视化分析方法排除了部分影响力较低的文献,研究结果可能存在一定的偏倚。⑤研究热点方面,国内外存在差异:国内研究侧重于镇痛手段的有效性论证和方式探索,而国外研究则更关注疼痛机制的细分及镇痛药物的改良与替代。未来研究的发展趋势预计将集中在手术后疼痛的中医药治疗、多模式镇痛联合应用以及疼痛分型机制的探究和预防。