Funding: National Natural Science Foundation of China, No. 41525004, No. 41421001.
Abstract: This paper discusses the objective, connotations, and research issues of big geodata mining to address its significance to geographical research. Big geodata may be categorized into two domains: big earth observation data and big human behavior data. A description of big geodata includes, in addition to the "5Vs" (volume, velocity, value, variety, and veracity), a further five features: granularity, scope, density, skewness, and precision. Based on this approach, the essence of mining big geodata includes four aspects. First, flow space, where flows replace points in traditional space, will become the new presentation form for big human behavior data. Second, the objectives for mining big geodata are spatial patterns and spatial relationships. Third, the spatiotemporal distributions of big geodata can be viewed as overlays of multiple geographic patterns, and the characteristics of the data, namely heterogeneity and homogeneity, may change with scale. Fourth, data mining can be seen as a tool for discovering geographic patterns, and the patterns revealed may be attributed to human-land relationships. Big geodata mining methods may be categorized into two types according to the mining objective: classification mining and relationship mining. Future research will face a number of issues, including the aggregation and connection of big geodata, the effective evaluation of mining results, and the challenge of revealing "non-trivial" knowledge through mining.
Abstract: The emergence of "Big Data" has been a dramatic development in recent years. Alongside it, a lesser-known but equally important set of concepts and practices has also come into being: "Smart Data." This paper shares the author's understanding of what, why, how, who, where, and which data in relation to Smart Data and digital humanities. It concludes that challenges and opportunities coexist, but it is certain that Smart Data, the ability to achieve big insights from trusted, contextualized, relevant, cognitive, predictive, and consumable data at any scale, will continue to have extraordinary value in digital humanities.
Funding: Supported by the Specialized Research Fund for the Doctoral Program of Higher Education (20114307120032) and the National Natural Science Foundation of China (71201167).
Abstract: A Bayesian method for estimating human error probability (HEP) is presented. The main idea of the method is to incorporate human performance data into the HEP estimation process. By integrating human performance data with prior information about human performance, a more accurate and specific HEP estimate can be achieved. For a time-unrelated task without a rigorous time restriction, the HEP estimated by commonly used human reliability analysis (HRA) methods or expert judgments is collected as the source of prior information. For a time-related task with a rigorous time restriction, human error is expressed as non-response; therefore, HEP is the time curve of the non-response probability (NRP). The prior information is collected from system safety and reliability specifications or from expert judgments. The (joint) posterior distribution of the HEP or of the NRP-related parameter(s) is constructed after the prior information has been collected. Based on the posterior distribution, a point or interval estimate of the HEP/NRP is obtained. Two illustrative examples are introduced to demonstrate the practicality of the approach.
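The abstract does not state which prior or likelihood the authors use. As a minimal sketch of the general idea for the time-unrelated case, the example below assumes a conjugate Beta prior on the HEP and binomially distributed error counts. The function names, the prior strength, and all numbers are hypothetical and chosen only for illustration.

```python
import random


def beta_prior_from_mean(mean, strength):
    """Return Beta(a, b) with the given mean.

    `strength` = a + b acts as a pseudo-trial count: the larger it is,
    the more weight the prior (e.g. an HRA estimate) carries.
    """
    return mean * strength, (1.0 - mean) * strength


def update(a, b, errors, trials):
    """Conjugate Beta-Binomial update with observed performance data."""
    return a + errors, b + (trials - errors)


def posterior_mean(a, b):
    """Point estimate of the HEP: mean of Beta(a, b)."""
    return a / (a + b)


def credible_interval(a, b, level=0.90, draws=20000, seed=0):
    """Equal-tailed interval estimate, approximated by Monte Carlo sampling."""
    rng = random.Random(seed)
    samples = sorted(rng.betavariate(a, b) for _ in range(draws))
    lo = samples[int((1.0 - level) / 2 * draws)]
    hi = samples[int((1.0 + level) / 2 * draws) - 1]
    return lo, hi


# Hypothetical inputs: a prior HEP of 0.01 worth 100 pseudo-trials,
# followed by 3 observed errors in 200 trials.
a0, b0 = beta_prior_from_mean(0.01, 100)
a1, b1 = update(a0, b0, errors=3, trials=200)

print("posterior mean HEP:", posterior_mean(a1, b1))
print("90% credible interval:", credible_interval(a1, b1))
```

Under these assumptions the posterior mean moves from the prior value toward the observed error rate (3/200) as data accumulate, which is the sense in which performance data make the estimate more specific. The time-related (NRP) case would need a time-dependent model and is not covered by this sketch.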
Funding: Supported by the "100-Talent Program" of the Chinese Academy of Sciences, the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDB13040500), and the National High-tech R&D Program (863 Program; Grant No. 2012AA020409) by the Ministry of Science and Technology of China, awarded to ZZ.
Abstract: The completion of the Human Genome Project lays a foundation for systematically studying the human genome, from evolutionary history to precision medicine against diseases. With the explosive growth of biological data, an increasing number of biological databases have been developed in aid of human-related research. Here we present a collection of human-related biological databases and provide a mini-review by classifying them into different categories according to their data types. As human-related databases continue to grow not only in count but also in volume, challenges lie ahead in big data storage, processing, exchange, and curation.
Funding: Supported by the Chuanxin Foundation (No. 110109001) and the Basic Research Foundation of Tsinghua National Laboratory for Information Science and Technology (TNList).
Abstract: A high-capacity data hiding technique was developed for compressed digital audio. As perceptual audio coding has become the accepted technology for storage and transmission of audio signals, compressed audio information hiding enables robust, imperceptible transmission of data within audio signals, thus allowing valuable information to be attached to the content, such as the song title, lyrics, composer's name, and artist or property rights related data. This paper describes simultaneous low-bitrate encoding and information hiding for highly compressed audio signals. The information hiding is implemented in the quantization process of the audio content, which improves robustness, signal quality, and security. The imperceptibility of the embedded data is ensured based on the masking property of the human auditory system (HAS). The robustness and security are evaluated by various attacking algorithms. Tests with an extended MPEG4 advanced audio coding (AAC) encoder confirm that the method is robust to the regular and singular groups method (RS) and sample pair analysis (SPA) attacks as well as other statistical steganalysis attacks.
Abstract: The compressed sensing (CS) of acceleration data has been drawing increasing attention in gait telemonitoring applications. In such applications, some challenging issues remain, including the high energy consumption of body-worn devices for acceleration data acquisition and poor reconstruction performance due to the non-sparsity of acceleration data. Thus, a novel compressed sensing scheme for acceleration data is urgently needed to address these issues.