Purpose: This paper aims to assess if the extent of openness and the coverage of data sets released by European governments have a significant impact on citizen trust in public institutions.Design/methodology/approach...Purpose: This paper aims to assess if the extent of openness and the coverage of data sets released by European governments have a significant impact on citizen trust in public institutions.Design/methodology/approach: Data for openness and coverage have been collected from the Open Data Inventory 2018(ODIN), by Open Data Watch;institutional trust is built up as a formative construct based on the European Social Survey(ESS), Round 9. The relations between the open government data features and trust have been tested on the basis of structural equation modelling(SEM).Findings: The paper reveals that as European governments improve data openness, disaggregation, and time coverage, people tend to trust them more. However, the size of the effect is still small and, comparatively, data coverage effect on citizens' confidence is more than twice than the impact of openness.Research limitations: This paper analyzes the causal effect of Open Government Data(OGD) features captured in a certain moment of time. In upcoming years, as OGD is implemented and a more consistent effect on people is expected, time series analysis will provide with a deeper insight.Practical implications: Public officers should continue working in the development of a technological framework that contributes to make OGD truly open. They should improve the added value of the increasing amount of open data currently available in order to boost internal and external innovations valuable both for public agencies and citizens.Originality/value: In a field of knowledge with little quantitative empirical evidence, this paper provides updated support for the positive effect of OGD strategies and it also points out areas of improvement in terms of the value that citizens can get from OGD coverage and openness.展开更多
In the process of implementing data openness between banks and fin-tech companies,as the breadth and depth of cooperation between banks and enterprises continue to increase,there is a risk of“too much correlation to ...In the process of implementing data openness between banks and fin-tech companies,as the breadth and depth of cooperation between banks and enterprises continue to increase,there is a risk of“too much correlation to fail”and“too many links to fail”.There are problems with the implementation of financial data openness by regulatory agencies for banks and fin-tech enterprises,such as the ambiguity of regulatory responsibilities,the emphasis on financial regulatory goals,and the lag in regulatory methods.To address these issues,it is necessary to clarify the responsibilities of financial regulatory agencies,establish a collaborative mechanism for financial regulation,coordinate the types of risks in bank enterprise cooperation,achieve the technical implementation of financial regulatory measures and the design of regulatory systems,obtain regulatory data in real time,establish a hierarchical regulatory system for bank enterprise cooperation to improve the regulatory path,and ensure the rational and legal use of financial data in bank enterprise cooperation.展开更多
With advancements in information technology and the increasing demand for data-driven governance,the openness of public data has become essential for global governance and social innovation.However,legal risks related...With advancements in information technology and the increasing demand for data-driven governance,the openness of public data has become essential for global governance and social innovation.However,legal risks related to privacy protection,data security,intellectual property,liability allocation,and legal adaptability pose significant challenges to data governance in China.This paper analyzes these risks and proposes three strategies:enhancing the legal framework through clear data classification and accountability mechanisms,establishing regulatory bodies to monitor data usage,and promoting public education on data privacy.These strategies aim to address gaps in legal discourse and guide effective data governance,contributing to the secure development of open data initiatives in China and beyond.展开更多
Open data strategies are being adopted in disaster-related data particularly because of the need to provide information on global targets and indicators for implementation of the Sendai Framework for Disaster Risk Red...Open data strategies are being adopted in disaster-related data particularly because of the need to provide information on global targets and indicators for implementation of the Sendai Framework for Disaster Risk Reduction 2015–2030.In all phases of disaster risk management including forecasting,emergency response and post-disaster reconstruction,the need for interconnected multidisciplinary open data for collaborative reporting as well as study and analysis are apparent,in order to determine disaster impact data in timely and reportable manner.The extraordinary progress in computing and information technology in the past decade,such as broad local and wide-area network connectivity(e.g.Internet),highperformance computing,service and cloud computing,big data methods and mobile devices,provides the technical foundation for connecting open data to support disaster risk research.A new generation of disaster data infrastructure based on interconnected open data is evolving rapidly.There are two levels in the conceptual model of Linked Open Data for Global Disaster Risk Research(LODGD)Working Group of the Committee on Data for Science and Technology(CODATA),which is the Committee on Data of the International Council for Science(ICSU):data characterization and data connection.In data characterization,the knowledge about disaster taxonomy and data dependency on disaster events requires specific scientific study as it aims to understand and present the correlation between specific disaster events and scientific data through the integration of literature analysis and semantic knowledge discovery.Data connection concepts deal with technical methods to connect distributed data resources identified by data characterization of disaster type.In the science community,interconnected open data for disaster risk impact assessment are beginning to influence how disaster data are shared,and this will need to extend data coverage and provide better ways of utilizing data across domains where innovation and integration are now necessarily needed.展开更多
Purpose:To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the f...Purpose:To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the functions and usages of LOD KOS products.Design/methodology/approach:Data collection and analysis were conducted at three time periods in 2015–16,2017 and 2019.The sample data used in the comprehensive data analysis comprises all datasets tagged as types of KOS in the Datahub and extracted through their respective SPARQL endpoints.A comparative study of the LOD KOS collected from terminology services Linked Open Vocabularies(LOV)and BioPortal was also performed.Findings:The study proposes a set of Functional,Impactful and Transformable(FIT)metrics for LOD KOS as value vocabularies.The FAIR principles,with additional recommendations,are presented for LOD KOS as open data.Research limitations:The metrics need to be further tested and aligned with the best practices and international standards of both open data and various types of KOS.Practical implications:Assessment performed with FAIR and FIT metrics support the creation and delivery of user-friendly,discoverable and interoperable LOD KOS datasets which can be used for innovative applications,act as a knowledge base,become a foundation of semantic analysis and entity extractions and enhance research in science and the humanities.Originality/value:Our research provides best practice guidelines for LOD KOS as value vocabularies.展开更多
Purpose: Our work seeks to overcome data quality issues related to incomplete author affiliation data in bibliographic records in order to support accurate and reliable measurement of international research collaborat...Purpose: Our work seeks to overcome data quality issues related to incomplete author affiliation data in bibliographic records in order to support accurate and reliable measurement of international research collaboration(IRC).Design/methodology/approch: We propose, implement, and evaluate a method that leverages the Web-based knowledge graph Wikidata to resolve publication affiliation data to particular countries. The method is tested with general and domain-specific data sets.Findings: Our evaluation covers the magnitude of improvement, accuracy, and consistency. Results suggest the method is beneficial, reliable, and consistent, and thus a viable and improved approach to measuring IRC.Research limitations: Though our evaluation suggests the method works with both general and domain-specific bibliographic data sets, it may perform differently with data sets not tested here. Further limitations stem from the use of the R programming language and R libraries for country identification as well as imbalanced data coverage and quality in Wikidata that may also change over time.Practical implications: The new method helps to increase the accuracy in IRC studies and provides a basis for further development into a general tool that enriches bibliographic data using the Wikidata knowledge graph.Originality: This is the first attempt to enrich bibliographic data using a peer-produced, Webbased knowledge graph like Wikidata.展开更多
Purpose: The purpose of this exploratory study is to provide modern local governments with potential use cases for their open data, in order to help inform related future policies and decision-making. The concrete con...Purpose: The purpose of this exploratory study is to provide modern local governments with potential use cases for their open data, in order to help inform related future policies and decision-making. The concrete context was that of the Vaxjo municipality located in southeastern Sweden.Design/methodology/approach: The methodology was two-fold: 1) a survey of potential end users(n=151) from a local university;and, 2) analysis of survey results using a theoretical model regarding local strategies for implementing open government data.Findings: Most datasets predicted to be useful were on: sustainability and environment;preschool and school;municipality and politics. The use context given is primarily research and development, informing policies and decision making;but also education, informing personal choices, informing citizens and creating services based on open data. Not the least, the need for educating target user groups on data literacy emerged. A tentative pattern comprising a technical perspective on open data and a social perspective on open government was identified. Research limitations: In line with available funding, the nature of the study was exploratory and implemented as an anonymous web-based survey of employees and students at the local university. Further research involving(qualitative) surveys with all stakeholders would allow for creating a more complete picture of the matter. Practical implications: The study determines potential use cases and use contexts for open government data, in order to help inform related future policies and decision-making.Originality/value: Modern local governments, and especially in Sweden, are faced with a challenge of how to make their data open, how to learn about which types of data will be most relevant for their end users and what will be different societal purposes. The paper contributes to knowledge that modern local governments can resort to when it comes to attitudes of local citizens to open government data in the context of an open government data perspective.展开更多
Systematically analyze the composition of post-marketing adverse drug reaction data and the open mode in the EU, and summarize its characteristics. EU post-marketing ADR data is open to six categories of stakeholders:...Systematically analyze the composition of post-marketing adverse drug reaction data and the open mode in the EU, and summarize its characteristics. EU post-marketing ADR data is open to six categories of stakeholders: EMA, EC, medicines regulatory authorities in EEA member states, healthcare professionals and the public, Marketing Authorization Holders, academia, WHO and medicines regulatory authorities in third countries. The EU has implemented hierarchical opening for ADRs, with different levels containing different data and facing different stakeholders. Openness is divided into active and passive openness. In opening up data, the EU complies with relevant personal data protection laws to protect the privacy of individuals. The EU’s post-marketing adverse drug reaction data openness is characterized by a combination of data openness and privacy protection, active and passive openness, and a hierarchy of data openness. It is hoped that this can provide a reference for the opening up of post-marketing adverse drug reaction data in China.展开更多
Lane change prediction is critical for crash avoidance but challenging as it requires the understanding of the instantaneous driving environment.With cutting-edge artificial intelligence and sensing technologies,auton...Lane change prediction is critical for crash avoidance but challenging as it requires the understanding of the instantaneous driving environment.With cutting-edge artificial intelligence and sensing technologies,autonomous vehicles(AVs)are expected to have exceptional perception systems to capture instantaneously their driving environments for predicting lane changes.By exploring the Waymo open motion dataset,this study proposes a framework to explore autonomous driving data and investigate lane change behaviors.In the framework,this study develops a Long Short-Term Memory(LSTM)model to predict lane changing behaviors.The concept of Vehicle Operating Space(VOS)is introduced to quantify a vehicle's instantaneous driving environment as an important indicator used to predict vehicle lane changes.To examine the robustness of the model,a series of sensitivity analysis are conducted by varying the feature selection,prediction horizon,and training data balancing ratios.The test results show that including VOS into modeling can speed up the loss decay in the training process and lead to higher accuracy and recall for predicting lane-change behaviors.This study offers an example along with a methodological framework for transportation researchers to use emerging autonomous driving data to investigate driving behaviors and traffic environments.展开更多
This research describes a quantitative,rapid,and low-cost methodology for debris flow susceptibility evaluation at the basin scale using open-access data and geodatabases.The proposed approach can aid decision makers ...This research describes a quantitative,rapid,and low-cost methodology for debris flow susceptibility evaluation at the basin scale using open-access data and geodatabases.The proposed approach can aid decision makers in land management and territorial planning,by first screening for areas with a higher debris flow susceptibility.Five environmental predisposing factors,namely,bedrock lithology,fracture network,quaternary deposits,slope inclination,and hydrographic network,were selected as independent parameters and their mutual interactions were described and quantified using the Rock Engineering System(RES)methodology.For each parameter,specific indexes were proposed,aiming to provide a final synthetic and representative index of debris flow susceptibility at the basin scale.The methodology was tested in four basins located in the Upper Susa Valley(NW Italian Alps)where debris flow events are the predominant natural hazard.The proposed matrix can represent a useful standardized tool,universally applicable,since it is independent of type and characteristic of the basin.展开更多
In recent years, transparency and accountability seem to find new impulse, with the development of ICT (information and communication technology) and the prospective of open data that invest the public system at a n...In recent years, transparency and accountability seem to find new impulse, with the development of ICT (information and communication technology) and the prospective of open data that invest the public system at a national and supranational level. Public institutions tend to make available to the public, more data and information concerning the administration, the manner of use of public goods and resources. At the same time, each institution is called upon to deal with the demand of transparency and participation by citizens who increasingly use Internet 2.0 and social media. After a reflection on how public administrations acted in the phase of Web 1.0 to practice transparency and accountability in terms of communication, this paper considers the elements of continuity and the new opportunities linked to the advent of Web 2.0 and open data. At the end of this analysis, the focus is on the strengths and weaknesses of this process, with a particular attention to the role of the public communication.展开更多
This paper aims to present the experience gathered in the Italian alpine city of Bolzano within the project“Bolzano Traffic”whose goal is the introduction of an experimental open ITS platform for local service provi...This paper aims to present the experience gathered in the Italian alpine city of Bolzano within the project“Bolzano Traffic”whose goal is the introduction of an experimental open ITS platform for local service providers,fostering the diffusion of advanced traveller information services and the future deployment of cooperative mobility systems in the region.Several end-users applications targeted to the needs of different user groups have been developed in collaboration with local companies and research centers;a partnership with the EU Co-Cities project has been activated as well.The implemented services rely on real-time travel and traffic information collected by urban traffic monitoring systems or published by local stakeholders(e.g.public transportation operators).An active involvement of end-users,who have recently started testing these demo applications for free,is actually on-going.展开更多
Minerals,like many other natural materials of geological origin(i.e.,geomaterials),face the challenge of name variations.This in turn hinders the data-intensive geoscience research,which often needs to integrate data ...Minerals,like many other natural materials of geological origin(i.e.,geomaterials),face the challenge of name variations.This in turn hinders the data-intensive geoscience research,which often needs to integrate data from multiple sources.It is clear that mineral name is not an appropriate identifier to connect records within and amongst data sources.The Mindat database,as one of the biggest resources for open data in mineralogy,has received significant volume of feedback on the heterogeneity of mineral and rock names.To address that issue,we established a persistent identifier service on Mindat to provide persistent and meaningful access to the records of geomaterials(mineral/rock/variety),localities,mineral occurrences,references,photos,and specimens.A key development was the long-form identifier,which adds contextual information such as identifier authorities and data types into the identifier structure.Moreover,a UUID service was built along with the long-form identifier to further increase the interoperability.The identifier service has been successfully implemented to mint millions of identifiers to different types of data objects on Mindat.Several use case scenarios were developed to illustrate the utility of the identifiers in the real world.We believe the persistent identifier will help address the challenges caused by name variations,and we welcome Mindat users to test the identifiers and send feedback to us for future extensions.展开更多
随着地球观测进入大数据时代,传统的数据管理技术已经难以适应大数据需求,Open Data Cube(ODC)作为新型的开源的地球观测数据管理与分析平台,适合进行时间序列数据的高性能计算和探索性数据分析,是亚大区域综合地球观测系统AOGEOSS的重...随着地球观测进入大数据时代,传统的数据管理技术已经难以适应大数据需求,Open Data Cube(ODC)作为新型的开源的地球观测数据管理与分析平台,适合进行时间序列数据的高性能计算和探索性数据分析,是亚大区域综合地球观测系统AOGEOSS的重要技术支撑平台。但当前ODC对国产卫星影像支持不友好,缺乏自动化管理和数据组织工具,使用ODC进行国产卫星影像数据管理的技术不成熟。因此,本文以高分一号卫星影像为例,通过开发ODC_GFTool中间件和自定义高分卫星影像产品格式探索ODC框架下国产影像数据自动化管理流程,最后以鄱阳湖为试验区,进行了基于ODC框架的水体提取应用实验,论证了ODC框架下国产卫星数据存取的可行性,研究结果表明相较于传统的数据处理工具ODC具有明显的效率优势,能够为AOGEOSS基础设施建设和国产卫星影像数据管理提供一定的参考。展开更多
Microblog is a social platform with huge user community and mass data. We propose a semantic recommendation mechanism based on sentiment analysis for microblog. Firstly, the keywords and sensibility words in this mech...Microblog is a social platform with huge user community and mass data. We propose a semantic recommendation mechanism based on sentiment analysis for microblog. Firstly, the keywords and sensibility words in this mechanism are extracted by natural language processing including segmentation, lexical analysis and strategy selection. Then, we query the background knowledge base based on linked open data (LOD) with the basic information of users. The experiment result shows that the accuracy of recommendation is within the range of 70% -89% with sentiment analysis and semantic query. Compared with traditional recommendation method, this method can satisfy users' requirement greatly.展开更多
After a systematic review of 38 current intelligent city evaluation systems (ICESs) from around the world, this research analyzes the secondary and tertiary indicators of these 38 ICESs from the perspec- tives of sc...After a systematic review of 38 current intelligent city evaluation systems (ICESs) from around the world, this research analyzes the secondary and tertiary indicators of these 38 ICESs from the perspec- tives of scale structuring, approaches and indicator selection, and determines their common base. From this base, the fundamentals of the City Intelligence Quotient (City IOD Evaluation System are developed and five dimensions are selected after a clustering analysis. The basic version, City IQ Evaluation System 1.0, involves 275 experts from 14 high-end research institutions, which include the Chinese Academy of Engineering, the National Academy of Science and Engineering (Germany), the Royal Swedish Academy of Engineering Sciences, the Planning Management Center of the Ministry of Housing and Urban-Rural Development of China, and the Development Research Center of the State Council of China. City IQ Evaluation System 2.0 is further developed, with improvements in its universality, openness, and dy- namic adjustment capability. After employing deviation evaluation methods in the IQ assessment, City IQ Evaluation System 3.0 was conceived. The research team has conducted a repeated assessment of 41 intelligent cities around the world using City IQ Evaluation System 3.0. The results have proved that the City IQ Evaluation System, developed on the basis of intelligent life, features more rational indicators selected from data sources that can offer better universality, openness, and dynamics, and is more sen- sitive and precise.展开更多
The rapid increase in the publication of knowledge bases as linked open data (LOD) warrants serious consideration from all concerned, as this phenomenon will potentially scale exponentially. This paper will briefly ...The rapid increase in the publication of knowledge bases as linked open data (LOD) warrants serious consideration from all concerned, as this phenomenon will potentially scale exponentially. This paper will briefly describe the evolution of the LOD, the emerging world-wide semantic web (WWSW), and explore the scalability and performance features Of the service oriented architecture that forms the foundation of the semantic technology platform developed at MIMOS Bhd., for addressing the challenges posed by the intelligent future internet. This paper" concludes with a review of the current status of the agriculture linked open data.展开更多
The rising awareness of environmental issues and the increase of renewable energy sources(RESs)has led to a shift in energy production toward RES,such as photovoltaic(PV)systems,and toward a distributed generation(DG)...The rising awareness of environmental issues and the increase of renewable energy sources(RESs)has led to a shift in energy production toward RES,such as photovoltaic(PV)systems,and toward a distributed generation(DG)model of energy production that requires systems in which energy is generated,stored,and consumed locally.In this work,we present a methodology that integrates geographic information system(GIS)-based PV potential assessment procedures with models for the estimation of both energy generation and consumption profiles.In particular,we have created an innovative infrastructure that co-simulates PV integration on building rooftops together with an analysis of households’electricity demand.Our model relies on high spatiotemporal resolution and considers both shadowing effects and real-sky conditions for solar radiation estimation.It integrates methodologies to estimate energy demand with a high temporal resolution,accounting for realistic populations with realistic consumption profiles.Such a solution enables concrete recommendations to be drawn in order to promote an understanding of urban energy systems and the integration of RES in the context of future smart cities.The proposed methodology is tested and validated within the municipality of Turin,Italy.For the whole municipality,we estimate both the electricity absorbed from the residential sector(simulating a realistic population)and the electrical energy that could be produced by installing PV systems on buildings’rooftops(considering two different scenarios,with the former using only the rooftops of residential buildings and the latter using all available rooftops).The capabilities of the platform are explored through an in-depth analysis of the obtained results.Generated power and energy profiles are presented,emphasizing the flexibility of the resolution of the spatial and temporal results.Additional energy indicators are presented for the self-consumption of produced energy and the avoidance of CO_(2) emissions.展开更多
The Semantic Web seems finally close to maintaining its promise about a real world-wide graph of interconnected resources. The SPARQL query language and protocols and the Linked Open Data initiative have laid the way ...The Semantic Web seems finally close to maintaining its promise about a real world-wide graph of interconnected resources. The SPARQL query language and protocols and the Linked Open Data initiative have laid the way for endless data endpoints sparse around the globe. However, for the Semantic Web to really happen, it does not suffice to get billions of triples out there: these must be shareable, interlinked and conform to widely accepted vocabularies. While more and more data are converted from already available large knowledge repositories of companies and organizations, the question whether these should be carefully converted to semantically consistent ontology vocabularies or find other shallow representations for their content naturally arises. The danger is to come up with massive amounts of useless data, a boomerang which could result to be contradictory for the success of the web of data. In this paper, I provide some insights on common problems which may arise when porting huge amount of existing data or conceptual schemes (very common in the agriculture domain) to resource description framwork (RDF), and will address different modeling choices, by discussing in particular the relationship between the two main modeling vocabularies offered by W3C: OWL and SKOS.展开更多
Urban barriers are often a consequence of the unplanned expansion of transport infrastructures,housing growth nearby existing transport infrastructures,and the vehicular traffic itself.This process,also known as“comm...Urban barriers are often a consequence of the unplanned expansion of transport infrastructures,housing growth nearby existing transport infrastructures,and the vehicular traffic itself.This process,also known as“community severance”,is a problem that affects the travel patterns of the population and causes negative impacts to adjacent communities,but it is not easy to assess.As an alternative to the costly technologies and restricted-access data sometimes used to analyze urban barriers,we propose an analytical approach that assesses community severance levels by classifying the quality of Pedestrian Crossings on Urban Streets(the PeCUS index).Additionally,the approach is used to identify possible inequities nearby the different classification groups regarding demographic data.We used the chi-square(χ^(2))test of independence and the standardized Pearson residuals to indicate deficits and excesses of residents regarding the demographic characteristics studied surrounding the urban streets.We found the following evidences of associations:residents with low-income or those aged up to 19 tend to live close to the worst crossings,whereas residents with permanent mobility constraints or the elderly tend to live near the crossings with the best classifications.Therefore,the study shows that the assessment of community severance by the analysis of the PeCUS index is feasible.Moreover,it reveals that the distribution of residents surrounding the distinct classes of quality of crossings is equitable for vulnerable social groups,except for low-income or young residents.展开更多
文摘Purpose: This paper aims to assess if the extent of openness and the coverage of data sets released by European governments have a significant impact on citizen trust in public institutions.Design/methodology/approach: Data for openness and coverage have been collected from the Open Data Inventory 2018(ODIN), by Open Data Watch;institutional trust is built up as a formative construct based on the European Social Survey(ESS), Round 9. The relations between the open government data features and trust have been tested on the basis of structural equation modelling(SEM).Findings: The paper reveals that as European governments improve data openness, disaggregation, and time coverage, people tend to trust them more. However, the size of the effect is still small and, comparatively, data coverage effect on citizens' confidence is more than twice than the impact of openness.Research limitations: This paper analyzes the causal effect of Open Government Data(OGD) features captured in a certain moment of time. In upcoming years, as OGD is implemented and a more consistent effect on people is expected, time series analysis will provide with a deeper insight.Practical implications: Public officers should continue working in the development of a technological framework that contributes to make OGD truly open. They should improve the added value of the increasing amount of open data currently available in order to boost internal and external innovations valuable both for public agencies and citizens.Originality/value: In a field of knowledge with little quantitative empirical evidence, this paper provides updated support for the positive effect of OGD strategies and it also points out areas of improvement in terms of the value that citizens can get from OGD coverage and openness.
文摘In the process of implementing data openness between banks and fin-tech companies,as the breadth and depth of cooperation between banks and enterprises continue to increase,there is a risk of“too much correlation to fail”and“too many links to fail”.There are problems with the implementation of financial data openness by regulatory agencies for banks and fin-tech enterprises,such as the ambiguity of regulatory responsibilities,the emphasis on financial regulatory goals,and the lag in regulatory methods.To address these issues,it is necessary to clarify the responsibilities of financial regulatory agencies,establish a collaborative mechanism for financial regulation,coordinate the types of risks in bank enterprise cooperation,achieve the technical implementation of financial regulatory measures and the design of regulatory systems,obtain regulatory data in real time,establish a hierarchical regulatory system for bank enterprise cooperation to improve the regulatory path,and ensure the rational and legal use of financial data in bank enterprise cooperation.
基金Tianjin Education Commission Research Program in Humanities and Social Sciences(Project No.2022SK064)。
文摘With advancements in information technology and the increasing demand for data-driven governance,the openness of public data has become essential for global governance and social innovation.However,legal risks related to privacy protection,data security,intellectual property,liability allocation,and legal adaptability pose significant challenges to data governance in China.This paper analyzes these risks and proposes three strategies:enhancing the legal framework through clear data classification and accountability mechanisms,establishing regulatory bodies to monitor data usage,and promoting public education on data privacy.These strategies aim to address gaps in legal discourse and guide effective data governance,contributing to the secure development of open data initiatives in China and beyond.
基金This work was supported by the Strategic Priority Research Program of Chinese Academy of Sciences[grant number XDA19020201].
文摘Open data strategies are being adopted in disaster-related data particularly because of the need to provide information on global targets and indicators for implementation of the Sendai Framework for Disaster Risk Reduction 2015–2030.In all phases of disaster risk management including forecasting,emergency response and post-disaster reconstruction,the need for interconnected multidisciplinary open data for collaborative reporting as well as study and analysis are apparent,in order to determine disaster impact data in timely and reportable manner.The extraordinary progress in computing and information technology in the past decade,such as broad local and wide-area network connectivity(e.g.Internet),highperformance computing,service and cloud computing,big data methods and mobile devices,provides the technical foundation for connecting open data to support disaster risk research.A new generation of disaster data infrastructure based on interconnected open data is evolving rapidly.There are two levels in the conceptual model of Linked Open Data for Global Disaster Risk Research(LODGD)Working Group of the Committee on Data for Science and Technology(CODATA),which is the Committee on Data of the International Council for Science(ICSU):data characterization and data connection.In data characterization,the knowledge about disaster taxonomy and data dependency on disaster events requires specific scientific study as it aims to understand and present the correlation between specific disaster events and scientific data through the integration of literature analysis and semantic knowledge discovery.Data connection concepts deal with technical methods to connect distributed data resources identified by data characterization of disaster type.In the science community,interconnected open data for disaster risk impact assessment are beginning to influence how disaster data are shared,and this will need to extend data coverage and provide better ways of utilizing data across domains where innovation and integration are now necessarily needed.
基金College of Communication and Information(CCI)Research and Creative Activity Fund,Kent State University
文摘Purpose:To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the functions and usages of LOD KOS products.Design/methodology/approach:Data collection and analysis were conducted at three time periods in 2015–16,2017 and 2019.The sample data used in the comprehensive data analysis comprises all datasets tagged as types of KOS in the Datahub and extracted through their respective SPARQL endpoints.A comparative study of the LOD KOS collected from terminology services Linked Open Vocabularies(LOV)and BioPortal was also performed.Findings:The study proposes a set of Functional,Impactful and Transformable(FIT)metrics for LOD KOS as value vocabularies.The FAIR principles,with additional recommendations,are presented for LOD KOS as open data.Research limitations:The metrics need to be further tested and aligned with the best practices and international standards of both open data and various types of KOS.Practical implications:Assessment performed with FAIR and FIT metrics support the creation and delivery of user-friendly,discoverable and interoperable LOD KOS datasets which can be used for innovative applications,act as a knowledge base,become a foundation of semantic analysis and entity extractions and enhance research in science and the humanities.Originality/value:Our research provides best practice guidelines for LOD KOS as value vocabularies.
文摘Purpose: Our work seeks to overcome data quality issues related to incomplete author affiliation data in bibliographic records in order to support accurate and reliable measurement of international research collaboration(IRC).Design/methodology/approch: We propose, implement, and evaluate a method that leverages the Web-based knowledge graph Wikidata to resolve publication affiliation data to particular countries. The method is tested with general and domain-specific data sets.Findings: Our evaluation covers the magnitude of improvement, accuracy, and consistency. Results suggest the method is beneficial, reliable, and consistent, and thus a viable and improved approach to measuring IRC.Research limitations: Though our evaluation suggests the method works with both general and domain-specific bibliographic data sets, it may perform differently with data sets not tested here. Further limitations stem from the use of the R programming language and R libraries for country identification as well as imbalanced data coverage and quality in Wikidata that may also change over time.Practical implications: The new method helps to increase the accuracy in IRC studies and provides a basis for further development into a general tool that enriches bibliographic data using the Wikidata knowledge graph.Originality: This is the first attempt to enrich bibliographic data using a peer-produced, Webbased knowledge graph like Wikidata.
文摘Purpose: The purpose of this exploratory study is to provide modern local governments with potential use cases for their open data, in order to help inform related future policies and decision-making. The concrete context was that of the Vaxjo municipality located in southeastern Sweden.Design/methodology/approach: The methodology was two-fold: 1) a survey of potential end users(n=151) from a local university;and, 2) analysis of survey results using a theoretical model regarding local strategies for implementing open government data.Findings: Most datasets predicted to be useful were on: sustainability and environment;preschool and school;municipality and politics. The use context given is primarily research and development, informing policies and decision making;but also education, informing personal choices, informing citizens and creating services based on open data. Not the least, the need for educating target user groups on data literacy emerged. A tentative pattern comprising a technical perspective on open data and a social perspective on open government was identified. Research limitations: In line with available funding, the nature of the study was exploratory and implemented as an anonymous web-based survey of employees and students at the local university. Further research involving(qualitative) surveys with all stakeholders would allow for creating a more complete picture of the matter. Practical implications: The study determines potential use cases and use contexts for open government data, in order to help inform related future policies and decision-making.Originality/value: Modern local governments, and especially in Sweden, are faced with a challenge of how to make their data open, how to learn about which types of data will be most relevant for their end users and what will be different societal purposes. The paper contributes to knowledge that modern local governments can resort to when it comes to attitudes of local citizens to open government data in the context of an open government data perspective.
文摘Systematically analyze the composition of post-marketing adverse drug reaction data and the open mode in the EU, and summarize its characteristics. EU post-marketing ADR data is open to six categories of stakeholders: EMA, EC, medicines regulatory authorities in EEA member states, healthcare professionals and the public, Marketing Authorization Holders, academia, WHO and medicines regulatory authorities in third countries. The EU has implemented hierarchical opening for ADRs, with different levels containing different data and facing different stakeholders. Openness is divided into active and passive openness. In opening up data, the EU complies with relevant personal data protection laws to protect the privacy of individuals. The EU’s post-marketing adverse drug reaction data openness is characterized by a combination of data openness and privacy protection, active and passive openness, and a hierarchy of data openness. It is hoped that this can provide a reference for the opening up of post-marketing adverse drug reaction data in China.
文摘Lane change prediction is critical for crash avoidance but challenging as it requires the understanding of the instantaneous driving environment.With cutting-edge artificial intelligence and sensing technologies,autonomous vehicles(AVs)are expected to have exceptional perception systems to capture instantaneously their driving environments for predicting lane changes.By exploring the Waymo open motion dataset,this study proposes a framework to explore autonomous driving data and investigate lane change behaviors.In the framework,this study develops a Long Short-Term Memory(LSTM)model to predict lane changing behaviors.The concept of Vehicle Operating Space(VOS)is introduced to quantify a vehicle's instantaneous driving environment as an important indicator used to predict vehicle lane changes.To examine the robustness of the model,a series of sensitivity analysis are conducted by varying the feature selection,prediction horizon,and training data balancing ratios.The test results show that including VOS into modeling can speed up the loss decay in the training process and lead to higher accuracy and recall for predicting lane-change behaviors.This study offers an example along with a methodological framework for transportation researchers to use emerging autonomous driving data to investigate driving behaviors and traffic environments.
文摘This research describes a quantitative,rapid,and low-cost methodology for debris flow susceptibility evaluation at the basin scale using open-access data and geodatabases.The proposed approach can aid decision makers in land management and territorial planning,by first screening for areas with a higher debris flow susceptibility.Five environmental predisposing factors,namely,bedrock lithology,fracture network,quaternary deposits,slope inclination,and hydrographic network,were selected as independent parameters and their mutual interactions were described and quantified using the Rock Engineering System(RES)methodology.For each parameter,specific indexes were proposed,aiming to provide a final synthetic and representative index of debris flow susceptibility at the basin scale.The methodology was tested in four basins located in the Upper Susa Valley(NW Italian Alps)where debris flow events are the predominant natural hazard.The proposed matrix can represent a useful standardized tool,universally applicable,since it is independent of type and characteristic of the basin.
文摘In recent years, transparency and accountability seem to find new impulse, with the development of ICT (information and communication technology) and the prospective of open data that invest the public system at a national and supranational level. Public institutions tend to make available to the public, more data and information concerning the administration, the manner of use of public goods and resources. At the same time, each institution is called upon to deal with the demand of transparency and participation by citizens who increasingly use Internet 2.0 and social media. After a reflection on how public administrations acted in the phase of Web 1.0 to practice transparency and accountability in terms of communication, this paper considers the elements of continuity and the new opportunities linked to the advent of Web 2.0 and open data. At the end of this analysis, the focus is on the strengths and weaknesses of this process, with a particular attention to the role of the public communication.
文摘This paper aims to present the experience gathered in the Italian alpine city of Bolzano within the project“Bolzano Traffic”whose goal is the introduction of an experimental open ITS platform for local service providers,fostering the diffusion of advanced traveller information services and the future deployment of cooperative mobility systems in the region.Several end-users applications targeted to the needs of different user groups have been developed in collaboration with local companies and research centers;a partnership with the EU Co-Cities project has been activated as well.The implemented services rely on real-time travel and traffic information collected by urban traffic monitoring systems or published by local stakeholders(e.g.public transportation operators).An active involvement of end-users,who have recently started testing these demo applications for free,is actually on-going.
基金funded by the U.S.National Science Foundation(Grant No.2126315).
文摘Minerals,like many other natural materials of geological origin(i.e.,geomaterials),face the challenge of name variations.This in turn hinders the data-intensive geoscience research,which often needs to integrate data from multiple sources.It is clear that mineral name is not an appropriate identifier to connect records within and amongst data sources.The Mindat database,as one of the biggest resources for open data in mineralogy,has received significant volume of feedback on the heterogeneity of mineral and rock names.To address that issue,we established a persistent identifier service on Mindat to provide persistent and meaningful access to the records of geomaterials(mineral/rock/variety),localities,mineral occurrences,references,photos,and specimens.A key development was the long-form identifier,which adds contextual information such as identifier authorities and data types into the identifier structure.Moreover,a UUID service was built along with the long-form identifier to further increase the interoperability.The identifier service has been successfully implemented to mint millions of identifiers to different types of data objects on Mindat.Several use case scenarios were developed to illustrate the utility of the identifiers in the real world.We believe the persistent identifier will help address the challenges caused by name variations,and we welcome Mindat users to test the identifiers and send feedback to us for future extensions.
文摘随着地球观测进入大数据时代,传统的数据管理技术已经难以适应大数据需求,Open Data Cube(ODC)作为新型的开源的地球观测数据管理与分析平台,适合进行时间序列数据的高性能计算和探索性数据分析,是亚大区域综合地球观测系统AOGEOSS的重要技术支撑平台。但当前ODC对国产卫星影像支持不友好,缺乏自动化管理和数据组织工具,使用ODC进行国产卫星影像数据管理的技术不成熟。因此,本文以高分一号卫星影像为例,通过开发ODC_GFTool中间件和自定义高分卫星影像产品格式探索ODC框架下国产影像数据自动化管理流程,最后以鄱阳湖为试验区,进行了基于ODC框架的水体提取应用实验,论证了ODC框架下国产卫星数据存取的可行性,研究结果表明相较于传统的数据处理工具ODC具有明显的效率优势,能够为AOGEOSS基础设施建设和国产卫星影像数据管理提供一定的参考。
基金Supported by the National Natural Science Foundation of China(60803160 and 61272110)the Key Projects of National Social Science Foundation of China(11&ZD189)+4 种基金the Natural Science Foundation of Hubei Province(2013CFB334)the Natural Science Foundation of Educational Agency of Hubei Province(Q20101110)the State Key Lab of Software Engineering Open Foundation of Wuhan University(SKLSE2012-09-07)the Teaching Research Project of Hubei Province(2011s005)the Wuhan Key Technology Support Program(2013010602010216)
文摘Microblog is a social platform with huge user community and mass data. We propose a semantic recommendation mechanism based on sentiment analysis for microblog. Firstly, the keywords and sensibility words in this mechanism are extracted by natural language processing including segmentation, lexical analysis and strategy selection. Then, we query the background knowledge base based on linked open data (LOD) with the basic information of users. The experiment result shows that the accuracy of recommendation is within the range of 70% -89% with sentiment analysis and semantic query. Compared with traditional recommendation method, this method can satisfy users' requirement greatly.
文摘After a systematic review of 38 current intelligent city evaluation systems (ICESs) from around the world, this research analyzes the secondary and tertiary indicators of these 38 ICESs from the perspec- tives of scale structuring, approaches and indicator selection, and determines their common base. From this base, the fundamentals of the City Intelligence Quotient (City IOD Evaluation System are developed and five dimensions are selected after a clustering analysis. The basic version, City IQ Evaluation System 1.0, involves 275 experts from 14 high-end research institutions, which include the Chinese Academy of Engineering, the National Academy of Science and Engineering (Germany), the Royal Swedish Academy of Engineering Sciences, the Planning Management Center of the Ministry of Housing and Urban-Rural Development of China, and the Development Research Center of the State Council of China. City IQ Evaluation System 2.0 is further developed, with improvements in its universality, openness, and dy- namic adjustment capability. After employing deviation evaluation methods in the IQ assessment, City IQ Evaluation System 3.0 was conceived. The research team has conducted a repeated assessment of 41 intelligent cities around the world using City IQ Evaluation System 3.0. The results have proved that the City IQ Evaluation System, developed on the basis of intelligent life, features more rational indicators selected from data sources that can offer better universality, openness, and dynamics, and is more sen- sitive and precise.
文摘The rapid increase in the publication of knowledge bases as linked open data (LOD) warrants serious consideration from all concerned, as this phenomenon will potentially scale exponentially. This paper will briefly describe the evolution of the LOD, the emerging world-wide semantic web (WWSW), and explore the scalability and performance features Of the service oriented architecture that forms the foundation of the semantic technology platform developed at MIMOS Bhd., for addressing the challenges posed by the intelligent future internet. This paper" concludes with a review of the current status of the agriculture linked open data.
文摘The rising awareness of environmental issues and the increase of renewable energy sources(RESs)has led to a shift in energy production toward RES,such as photovoltaic(PV)systems,and toward a distributed generation(DG)model of energy production that requires systems in which energy is generated,stored,and consumed locally.In this work,we present a methodology that integrates geographic information system(GIS)-based PV potential assessment procedures with models for the estimation of both energy generation and consumption profiles.In particular,we have created an innovative infrastructure that co-simulates PV integration on building rooftops together with an analysis of households’electricity demand.Our model relies on high spatiotemporal resolution and considers both shadowing effects and real-sky conditions for solar radiation estimation.It integrates methodologies to estimate energy demand with a high temporal resolution,accounting for realistic populations with realistic consumption profiles.Such a solution enables concrete recommendations to be drawn in order to promote an understanding of urban energy systems and the integration of RES in the context of future smart cities.The proposed methodology is tested and validated within the municipality of Turin,Italy.For the whole municipality,we estimate both the electricity absorbed from the residential sector(simulating a realistic population)and the electrical energy that could be produced by installing PV systems on buildings’rooftops(considering two different scenarios,with the former using only the rooftops of residential buildings and the latter using all available rooftops).The capabilities of the platform are explored through an in-depth analysis of the obtained results.Generated power and energy profiles are presented,emphasizing the flexibility of the resolution of the spatial and temporal results.Additional energy indicators are presented for the self-consumption of produced energy and the avoidance of CO_(2) emissions.
文摘The Semantic Web seems finally close to maintaining its promise about a real world-wide graph of interconnected resources. The SPARQL query language and protocols and the Linked Open Data initiative have laid the way for endless data endpoints sparse around the globe. However, for the Semantic Web to really happen, it does not suffice to get billions of triples out there: these must be shareable, interlinked and conform to widely accepted vocabularies. While more and more data are converted from already available large knowledge repositories of companies and organizations, the question whether these should be carefully converted to semantically consistent ontology vocabularies or find other shallow representations for their content naturally arises. The danger is to come up with massive amounts of useless data, a boomerang which could result to be contradictory for the success of the web of data. In this paper, I provide some insights on common problems which may arise when porting huge amount of existing data or conceptual schemes (very common in the agriculture domain) to resource description framwork (RDF), and will address different modeling choices, by discussing in particular the relationship between the two main modeling vocabularies offered by W3C: OWL and SKOS.
基金This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil(CAPES)-Finance Code[001].
文摘Urban barriers are often a consequence of the unplanned expansion of transport infrastructures,housing growth nearby existing transport infrastructures,and the vehicular traffic itself.This process,also known as“community severance”,is a problem that affects the travel patterns of the population and causes negative impacts to adjacent communities,but it is not easy to assess.As an alternative to the costly technologies and restricted-access data sometimes used to analyze urban barriers,we propose an analytical approach that assesses community severance levels by classifying the quality of Pedestrian Crossings on Urban Streets(the PeCUS index).Additionally,the approach is used to identify possible inequities nearby the different classification groups regarding demographic data.We used the chi-square(χ^(2))test of independence and the standardized Pearson residuals to indicate deficits and excesses of residents regarding the demographic characteristics studied surrounding the urban streets.We found the following evidences of associations:residents with low-income or those aged up to 19 tend to live close to the worst crossings,whereas residents with permanent mobility constraints or the elderly tend to live near the crossings with the best classifications.Therefore,the study shows that the assessment of community severance by the analysis of the PeCUS index is feasible.Moreover,it reveals that the distribution of residents surrounding the distinct classes of quality of crossings is equitable for vulnerable social groups,except for low-income or young residents.