In an asynchronous cooperative editing workflow of a structured document, each of the co-authors receives in the different phases of the editing process, a copy of the document to insert its contribution. For confiden...In an asynchronous cooperative editing workflow of a structured document, each of the co-authors receives in the different phases of the editing process, a copy of the document to insert its contribution. For confidentiality reasons, this copy may be only a partial replica containing only parts of the (global) document which are of demonstrated interest for the considered co-author. Note that some parts may be a demonstrated interest over a co-author;they will therefore be accessible concurrently. When it’s synchronization time (e.g. at the end of an asynchronous editing phase of the process), we want to merge all contributions of all authors in a single document. Due to the asynchronism of edition and to the potential existence of the document parts offering concurrent access, conflicts may arise and make partial replicas unmergeable in their entirety: they are inconsistent, meaning that they contain conflictual parts. The purpose of this paper is to propose a merging approach said by consensus of such partial replicas using tree automata. Specifically, from the partial replicas updates, we build a tree automaton that accepts exactly the consensus documents. These documents are the maximum prefixes containing no conflict of partial replicas merged.展开更多
Based upon the deep seismic sounding profile conducted in the Tengchong volcanic-geothermal area, a two-dimensional crustal P velocity structure is obtained by use of the finite-difference inversion and the forward tr...Based upon the deep seismic sounding profile conducted in the Tengchong volcanic-geothermal area, a two-dimensional crustal P velocity structure is obtained by use of the finite-difference inversion and the forward travel-time fitting method. The crustal model shows that there is a low velocity zone in upper crust in the Tengchong area, which may be related to the volcanic-geothermal activities, and two intracrustal faults (the LonglingRuili fault and Tengchong fault) exist on the profile, where the Tengchong fault may extend to the Moho discontinuity. Meanwhile, based on teleseismic data recorded by a temporary seismic network, we obtained the S-wave velocity structures beneath the RehaiRetian region in the Tengchong area, which show the low S-wave velocity anomaly in upper crust. The authors discuss the causes of Tengchong volcanic eruption based on the deep crustal structure. The crustal structure in the Tengchong volcanic-geothermal area is characterized by low P-wave and S-wave velocity, low resistivity, high heat-flow value and low Q value. The P-wave velocity in the upper mantle is also low. For this information, it can be induced that the magma in the crust is derived from the upper mantle, and the low velocity anomaly in upper crust in the Tengchong area may be related to the differentiation of magma. The Tengchong volcanoes are close to an active plate boundary and belong to plate boundary volcanoes.展开更多
A new organic semiconductor tartaric acid doped salt of emeraldine polyaniline (PANI-C 4H 6O 6) has been obtained by the method of oxidative polymerization of monomeric aniline with ammonium persulfate in acidic solut...A new organic semiconductor tartaric acid doped salt of emeraldine polyaniline (PANI-C 4H 6O 6) has been obtained by the method of oxidative polymerization of monomeric aniline with ammonium persulfate in acidic solution. The structure was characterized by Fourier Transform Infrared technique (FTIR) and X-ray diffraction (XRD). The temperature dependence dc conductivity δ dc ( T ) shows a semiconductor behavior and follows the quasi one dimensional variable range hopping (Q1D-VRH) model. Data on δ dc ( T ) are also discussed.展开更多
Based on analyses of the share of documents of structural geology and tectonics in the GeoRef system over 100 years in the last century, and the historical change of international (31 years) and domestic (16 years...Based on analyses of the share of documents of structural geology and tectonics in the GeoRef system over 100 years in the last century, and the historical change of international (31 years) and domestic (16 years) document counts of various topics in structural geology and tectonics, the position of structural geology and tectonics in the geosciences is evaluated and the major advaces in fields of plate tectonics, continental dynamics and global dynamics are reviewed. Our attention mainly focuses on the advances in studies of structural analysis, deformation mechanisms and rheology of rocks, contractional tectonics and late- and post-orogenic extensional collapse in orogens, large-scale strikeslip faults and indentation-extrusion tectonics, active tectonics and natural hazards. The relationships of structural geology and tectonics with petrology and geochronology are also discussed in terms of intersection of scientific disciplines. Finally, some suggestions are proposed for the further development of structural geology and tectonics in China.展开更多
Complex structured documents can be intentionally represented as a tree structure decorated with attributes. Ignoring attributes (these are related to semantic aspects that can be treated separately from purely struct...Complex structured documents can be intentionally represented as a tree structure decorated with attributes. Ignoring attributes (these are related to semantic aspects that can be treated separately from purely structural aspects which interest us here), in the context of a cooperative edition, legal structures are characterized by a document model (an abstract grammar) and each intentional representation can be manipulated independently and eventually asynchronously by several co-authors through various editing tools that operate on its “partial replicas”. For unsynchronized edition of a partial replica, considered co-author must have a syntactic document local model that constraints him to ensure minimum consistency of local representation that handles with respect to the global model. This consistency is synonymous with the existence of one or more (global) intentional representations towards the global model, assuming the current local representation as her/their partial replica. The purpose of this paper is to present the grammatical structures which are grammars that permit not only to specify a (global) model for documents published in a cooperative manner, but also to derive automatically via a so call projection operation, consistent (local) models for each co-authors involved in the cooperative edition. We also show some properties that meet these grammatical structures.展开更多
This paper focuses on document clustering by clustering algorithm based on a DEnsityTree (CABDET) to improve the accuracy of clustering. The CABDET method constructs a density-based treestructure for every potential c...This paper focuses on document clustering by clustering algorithm based on a DEnsityTree (CABDET) to improve the accuracy of clustering. The CABDET method constructs a density-based treestructure for every potential cluster by dynamically adjusting the radius of neighborhood according to local density. It avoids density-based spatial clustering of applications with noise (DBSCAN) ′s global density parameters and reduces input parameters to one. The results of experiment on real document show that CABDET achieves better accuracy of clustering than DBSCAN method. The CABDET algorithm obtains the max F-measure value 0.347 with the root node's radius of neighborhood 0.80, which is higher than 0.332 of DBSCAN with the radius of neighborhood 0.65 and the minimum number of objects 6.展开更多
In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly comple...In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly complex model, the lack or the ignorance of the explicit document model (DTD—Document Type Definition, Schema, etc.) increases the risk of obtaining an empty result set when the query is too specific, or, too large result set when it is too vague (e.g. it contains wildcards such as “*”). The reason is that in both cases, users write queries according to the document model they have in mind;this can be very far from the one that can actually be extracted from the document. Opposed to exact queries, preference queries are more flexible and can be relaxed to expand the search space during their evaluations. Indeed, during their evaluation, certain constraints (the preferences they contain) can be relaxed if necessary to avoid precisely empty results;moreover, the returned answers can be filtered to retain only the best ones. This paper presents an algorithm for evaluating such queries inspired by the TreeMatch algorithm proposed by Yao et al. for exact queries. In the proposed algorithm, the best answers are obtained by using an adaptation of the Skyline operator (defined in relational databases) in the context of documents (trees) to incrementally filter into the partial solutions set, those which satisfy the maximum of preferential constraints. The only restriction imposed on documents is No-Self-Containment.展开更多
自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作...自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作机制,并通过出台一系列的指南、指导性文件和推荐性技术文件,规范救援能力和队伍建设,强化国际救援协调和现场救援的效率。该文系统介绍了INSARAG标准和技术文件组成体系架构,并阐述了各标准及技术文件的出台背景、主要内容及对中国的搜救队伍建设的推动作用,并讨论其对我国灾害救援工作的启示与借鉴意义。展开更多
This article proposes a document-level prompt learning approach using LLMs to extract the timeline-based storyline. Through verification tests on datasets such as ESCv1.2 and Timeline17, the results show that the prom...This article proposes a document-level prompt learning approach using LLMs to extract the timeline-based storyline. Through verification tests on datasets such as ESCv1.2 and Timeline17, the results show that the prompt + one-shot learning proposed in this article works well. Meanwhile, our research findings indicate that although timeline-based storyline extraction has shown promising prospects in the practical applications of LLMs, it is still a complex natural language processing task that requires further research.展开更多
With the increasing popularity of mobile devices and the wide adoption of mobile Apps,an increasing concern of privacy issues is raised.Privacy policy is identified as a proper medium to indicate the legal terms,such ...With the increasing popularity of mobile devices and the wide adoption of mobile Apps,an increasing concern of privacy issues is raised.Privacy policy is identified as a proper medium to indicate the legal terms,such as the general data protection regulation(GDPR),and to bind legal agreement between service providers and users.However,privacy policies are usually long and vague for end users to read and understand.It is thus important to be able to automatically analyze the document structures of privacy policies to assist user understanding.In this work we create a manually labelled corpus containing 231 privacy policies(of more than 566,000 words and 7,748 annotated paragraphs).We benchmark our data corpus with 3 document classification models and achieve more than 82%on F1-score.展开更多
Survey generation aims to generate a summary from a scientific topic based on related papers.The structure of papers deeply influences the generative process of survey,especially the relationships between sentence and...Survey generation aims to generate a summary from a scientific topic based on related papers.The structure of papers deeply influences the generative process of survey,especially the relationships between sentence and sentence,paragraph and paragraph.In principle,the structure of paper can influence the quality of the summary.Therefore,we employ the structure of paper to leverage contextual information among sentences in paragraphs to generate a survey for documents.In particular,we present a neural document structure model for survey generation.We take paragraphs as units,and model sentences in paragraphs,we then employ a hierarchical model to learn structure among sentences,which can be used to select important and informative sentences to generate survey.We evaluate our model on scientific document data set.The experimental results show that our model is effective,and the generated survey is informative and readable.展开更多
文摘In an asynchronous cooperative editing workflow of a structured document, each of the co-authors receives in the different phases of the editing process, a copy of the document to insert its contribution. For confidentiality reasons, this copy may be only a partial replica containing only parts of the (global) document which are of demonstrated interest for the considered co-author. Note that some parts may be a demonstrated interest over a co-author;they will therefore be accessible concurrently. When it’s synchronization time (e.g. at the end of an asynchronous editing phase of the process), we want to merge all contributions of all authors in a single document. Due to the asynchronism of edition and to the potential existence of the document parts offering concurrent access, conflicts may arise and make partial replicas unmergeable in their entirety: they are inconsistent, meaning that they contain conflictual parts. The purpose of this paper is to propose a merging approach said by consensus of such partial replicas using tree automata. Specifically, from the partial replicas updates, we build a tree automaton that accepts exactly the consensus documents. These documents are the maximum prefixes containing no conflict of partial replicas merged.
基金The Climb Program (95-S-05-01) of National Scientific and Technological Ministry of China, the Key Project (95-11-02) from China Seismological Bureau, the State Natural Sciences Foundation of China (49974020) and the Joint Earthquake S
文摘Based upon the deep seismic sounding profile conducted in the Tengchong volcanic-geothermal area, a two-dimensional crustal P velocity structure is obtained by use of the finite-difference inversion and the forward travel-time fitting method. The crustal model shows that there is a low velocity zone in upper crust in the Tengchong area, which may be related to the volcanic-geothermal activities, and two intracrustal faults (the LonglingRuili fault and Tengchong fault) exist on the profile, where the Tengchong fault may extend to the Moho discontinuity. Meanwhile, based on teleseismic data recorded by a temporary seismic network, we obtained the S-wave velocity structures beneath the RehaiRetian region in the Tengchong area, which show the low S-wave velocity anomaly in upper crust. The authors discuss the causes of Tengchong volcanic eruption based on the deep crustal structure. The crustal structure in the Tengchong volcanic-geothermal area is characterized by low P-wave and S-wave velocity, low resistivity, high heat-flow value and low Q value. The P-wave velocity in the upper mantle is also low. For this information, it can be induced that the magma in the crust is derived from the upper mantle, and the low velocity anomaly in upper crust in the Tengchong area may be related to the differentiation of magma. The Tengchong volcanoes are close to an active plate boundary and belong to plate boundary volcanoes.
文摘A new organic semiconductor tartaric acid doped salt of emeraldine polyaniline (PANI-C 4H 6O 6) has been obtained by the method of oxidative polymerization of monomeric aniline with ammonium persulfate in acidic solution. The structure was characterized by Fourier Transform Infrared technique (FTIR) and X-ray diffraction (XRD). The temperature dependence dc conductivity δ dc ( T ) shows a semiconductor behavior and follows the quasi one dimensional variable range hopping (Q1D-VRH) model. Data on δ dc ( T ) are also discussed.
文摘Based on analyses of the share of documents of structural geology and tectonics in the GeoRef system over 100 years in the last century, and the historical change of international (31 years) and domestic (16 years) document counts of various topics in structural geology and tectonics, the position of structural geology and tectonics in the geosciences is evaluated and the major advaces in fields of plate tectonics, continental dynamics and global dynamics are reviewed. Our attention mainly focuses on the advances in studies of structural analysis, deformation mechanisms and rheology of rocks, contractional tectonics and late- and post-orogenic extensional collapse in orogens, large-scale strikeslip faults and indentation-extrusion tectonics, active tectonics and natural hazards. The relationships of structural geology and tectonics with petrology and geochronology are also discussed in terms of intersection of scientific disciplines. Finally, some suggestions are proposed for the further development of structural geology and tectonics in China.
文摘Complex structured documents can be intentionally represented as a tree structure decorated with attributes. Ignoring attributes (these are related to semantic aspects that can be treated separately from purely structural aspects which interest us here), in the context of a cooperative edition, legal structures are characterized by a document model (an abstract grammar) and each intentional representation can be manipulated independently and eventually asynchronously by several co-authors through various editing tools that operate on its “partial replicas”. For unsynchronized edition of a partial replica, considered co-author must have a syntactic document local model that constraints him to ensure minimum consistency of local representation that handles with respect to the global model. This consistency is synonymous with the existence of one or more (global) intentional representations towards the global model, assuming the current local representation as her/their partial replica. The purpose of this paper is to present the grammatical structures which are grammars that permit not only to specify a (global) model for documents published in a cooperative manner, but also to derive automatically via a so call projection operation, consistent (local) models for each co-authors involved in the cooperative edition. We also show some properties that meet these grammatical structures.
基金Science and Technology Development Project of Tianjin(No. 06FZRJGX02400)National Natural Science Foundation of China (No.60603027)
文摘This paper focuses on document clustering by clustering algorithm based on a DEnsityTree (CABDET) to improve the accuracy of clustering. The CABDET method constructs a density-based treestructure for every potential cluster by dynamically adjusting the radius of neighborhood according to local density. It avoids density-based spatial clustering of applications with noise (DBSCAN) ′s global density parameters and reduces input parameters to one. The results of experiment on real document show that CABDET achieves better accuracy of clustering than DBSCAN method. The CABDET algorithm obtains the max F-measure value 0.347 with the root node's radius of neighborhood 0.80, which is higher than 0.332 of DBSCAN with the radius of neighborhood 0.65 and the minimum number of objects 6.
文摘In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly complex model, the lack or the ignorance of the explicit document model (DTD—Document Type Definition, Schema, etc.) increases the risk of obtaining an empty result set when the query is too specific, or, too large result set when it is too vague (e.g. it contains wildcards such as “*”). The reason is that in both cases, users write queries according to the document model they have in mind;this can be very far from the one that can actually be extracted from the document. Opposed to exact queries, preference queries are more flexible and can be relaxed to expand the search space during their evaluations. Indeed, during their evaluation, certain constraints (the preferences they contain) can be relaxed if necessary to avoid precisely empty results;moreover, the returned answers can be filtered to retain only the best ones. This paper presents an algorithm for evaluating such queries inspired by the TreeMatch algorithm proposed by Yao et al. for exact queries. In the proposed algorithm, the best answers are obtained by using an adaptation of the Skyline operator (defined in relational databases) in the context of documents (trees) to incrementally filter into the partial solutions set, those which satisfy the maximum of preferential constraints. The only restriction imposed on documents is No-Self-Containment.
文摘自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作机制,并通过出台一系列的指南、指导性文件和推荐性技术文件,规范救援能力和队伍建设,强化国际救援协调和现场救援的效率。该文系统介绍了INSARAG标准和技术文件组成体系架构,并阐述了各标准及技术文件的出台背景、主要内容及对中国的搜救队伍建设的推动作用,并讨论其对我国灾害救援工作的启示与借鉴意义。
文摘This article proposes a document-level prompt learning approach using LLMs to extract the timeline-based storyline. Through verification tests on datasets such as ESCv1.2 and Timeline17, the results show that the prompt + one-shot learning proposed in this article works well. Meanwhile, our research findings indicate that although timeline-based storyline extraction has shown promising prospects in the practical applications of LLMs, it is still a complex natural language processing task that requires further research.
基金This work was supported by the National Natural Science Foundation of China(Grant Nos.61802275 and U1836214)the Innovation fund of Tianjin University(2020XRG-0022).
文摘With the increasing popularity of mobile devices and the wide adoption of mobile Apps,an increasing concern of privacy issues is raised.Privacy policy is identified as a proper medium to indicate the legal terms,such as the general data protection regulation(GDPR),and to bind legal agreement between service providers and users.However,privacy policies are usually long and vague for end users to read and understand.It is thus important to be able to automatically analyze the document structures of privacy policies to assist user understanding.In this work we create a manually labelled corpus containing 231 privacy policies(of more than 566,000 words and 7,748 annotated paragraphs).We benchmark our data corpus with 3 document classification models and achieve more than 82%on F1-score.
基金This work was supported by the Fundamental Research Funds for the Central Universities(2018B678X14 and 2016B44414)Postgraduate Research Practice Innovation Program of Jiangsu Province of China(KYCX18_0553 and KYLX16_0722)+1 种基金the National Natural Science Foundation of China(Grant Nos.61806137 and 61976146)Project of Natural Science Research of the Universities of Jiangsu Province(18KJB520043).
文摘Survey generation aims to generate a summary from a scientific topic based on related papers.The structure of papers deeply influences the generative process of survey,especially the relationships between sentence and sentence,paragraph and paragraph.In principle,the structure of paper can influence the quality of the summary.Therefore,we employ the structure of paper to leverage contextual information among sentences in paragraphs to generate a survey for documents.In particular,we present a neural document structure model for survey generation.We take paragraphs as units,and model sentences in paragraphs,we then employ a hierarchical model to learn structure among sentences,which can be used to select important and informative sentences to generate survey.We evaluate our model on scientific document data set.The experimental results show that our model is effective,and the generated survey is informative and readable.