期刊文献+
共找到419篇文章
< 1 2 21 >
每页显示 20 50 100
Tree Automata for Extracting Consensus from Partial Replicas of a Structured Document
1
作者 Maurice Tchoupé Tchendji Milliam M. Zekeng Ndadji 《Journal of Software Engineering and Applications》 2017年第5期432-456,共25页
In an asynchronous cooperative editing workflow of a structured document, each of the co-authors receives in the different phases of the editing process, a copy of the document to insert its contribution. For confiden... In an asynchronous cooperative editing workflow of a structured document, each of the co-authors receives in the different phases of the editing process, a copy of the document to insert its contribution. For confidentiality reasons, this copy may be only a partial replica containing only parts of the (global) document which are of demonstrated interest for the considered co-author. Note that some parts may be a demonstrated interest over a co-author;they will therefore be accessible concurrently. When it’s synchronization time (e.g. at the end of an asynchronous editing phase of the process), we want to merge all contributions of all authors in a single document. Due to the asynchronism of edition and to the potential existence of the document parts offering concurrent access, conflicts may arise and make partial replicas unmergeable in their entirety: they are inconsistent, meaning that they contain conflictual parts. The purpose of this paper is to propose a merging approach said by consensus of such partial replicas using tree automata. Specifically, from the partial replicas updates, we build a tree automaton that accepts exactly the consensus documents. These documents are the maximum prefixes containing no conflict of partial replicas merged. 展开更多
关键词 structured documents WORKFLOW of Cooperative Edition MERGING PARTIAL Replicas CONFLICT CONSENSUS Tree AUTOMATA AUTOMATA Product Lazy Evaluation
暂未订购
Seismological study on the crustal structure of Tengchong volcanic-geothermal area 被引量:4
2
作者 王椿镛 楼海 +3 位作者 吴建平 白志明 皇甫岗 秦嘉政 《Acta Seismologica Sinica(English Edition)》 CSCD 2002年第3期247-259,共13页
Based upon the deep seismic sounding profile conducted in the Tengchong volcanic-geothermal area, a two-dimensional crustal P velocity structure is obtained by use of the finite-difference inversion and the forward tr... Based upon the deep seismic sounding profile conducted in the Tengchong volcanic-geothermal area, a two-dimensional crustal P velocity structure is obtained by use of the finite-difference inversion and the forward travel-time fitting method. The crustal model shows that there is a low velocity zone in upper crust in the Tengchong area, which may be related to the volcanic-geothermal activities, and two intracrustal faults (the LonglingRuili fault and Tengchong fault) exist on the profile, where the Tengchong fault may extend to the Moho discontinuity. Meanwhile, based on teleseismic data recorded by a temporary seismic network, we obtained the S-wave velocity structures beneath the RehaiRetian region in the Tengchong area, which show the low S-wave velocity anomaly in upper crust. The authors discuss the causes of Tengchong volcanic eruption based on the deep crustal structure. The crustal structure in the Tengchong volcanic-geothermal area is characterized by low P-wave and S-wave velocity, low resistivity, high heat-flow value and low Q value. The P-wave velocity in the upper mantle is also low. For this information, it can be induced that the magma in the crust is derived from the upper mantle, and the low velocity anomaly in upper crust in the Tengchong area may be related to the differentiation of magma. The Tengchong volcanoes are close to an active plate boundary and belong to plate boundary volcanoes. 展开更多
关键词 Tengchong volcanic area crustal structure deep seismic sounding travel-time fitting teleseismic waveform CLC number: P315.63 document code: A
在线阅读 下载PDF
Preparation , Structure and DC Conductivity of Organic Semiconductor PA NI-C_4H_6O_6
3
作者 AdulkaderM.Elamin YAOKai-lun 《Semiconductor Photonics and Technology》 CAS 1999年第3期166-170,共5页
A new organic semiconductor tartaric acid doped salt of emeraldine polyaniline (PANI-C 4H 6O 6) has been obtained by the method of oxidative polymerization of monomeric aniline with ammonium persulfate in acidic solut... A new organic semiconductor tartaric acid doped salt of emeraldine polyaniline (PANI-C 4H 6O 6) has been obtained by the method of oxidative polymerization of monomeric aniline with ammonium persulfate in acidic solution. The structure was characterized by Fourier Transform Infrared technique (FTIR) and X-ray diffraction (XRD). The temperature dependence dc conductivity δ dc ( T ) shows a semiconductor behavior and follows the quasi one dimensional variable range hopping (Q1D-VRH) model. Data on δ dc ( T ) are also discussed. 展开更多
关键词 DC Conductivity Organic Semiconductor Polyaniline structure Tartaric Acid CLC number:O631.23 O632.7 TN304.52 document code:A
在线阅读 下载PDF
面向工艺规范的文档树结构检索增强生成方法
4
作者 姜禹辰 王裴岩 +1 位作者 余卓 李中武 《计算机集成制造系统》 北大核心 2026年第1期131-144,共14页
现有基于大语言模型的检索增强生成忽视了工艺规范这类技术文档所呈现的复杂段落结构与隐含知识关联,致使效果不佳难以满足应用需求。针对该问题设计了面向工艺规范的文档树结构检索方法(DTSR),利用段落间章节关系将工艺规范文档转化为... 现有基于大语言模型的检索增强生成忽视了工艺规范这类技术文档所呈现的复杂段落结构与隐含知识关联,致使效果不佳难以满足应用需求。针对该问题设计了面向工艺规范的文档树结构检索方法(DTSR),利用段落间章节关系将工艺规范文档转化为树形结构对象,设计了树形结构对象遍历算法在检索增强过程中动态获取相关段落,使得大语言模型获得更多对于问题回答有益的相关知识。在3345条工艺规范问答数据集上进行实验,结果表明,与朴素检索增强生成相比,DTSR在概念准确性上平均提升13.01%,在ROUGE-L和BLEU-4指标上分别提升4.78%和2.91%。为提高大语言模型在工艺规范等工程技术性文档中的应用效果提供了新的思路。 展开更多
关键词 检索增强生成 大语言模型 工艺规范 文档树结构
在线阅读 下载PDF
Advances in Structural Geology and Tectonics in the Late 20th Century: A Review 被引量:3
5
作者 DONG Shuwen ZHENG Yadong +1 位作者 CHEN Xuanhua SHI Jing 《Acta Geologica Sinica(English Edition)》 SCIE CAS CSCD 2006年第3期349-375,共27页
Based on analyses of the share of documents of structural geology and tectonics in the GeoRef system over 100 years in the last century, and the historical change of international (31 years) and domestic (16 years... Based on analyses of the share of documents of structural geology and tectonics in the GeoRef system over 100 years in the last century, and the historical change of international (31 years) and domestic (16 years) document counts of various topics in structural geology and tectonics, the position of structural geology and tectonics in the geosciences is evaluated and the major advaces in fields of plate tectonics, continental dynamics and global dynamics are reviewed. Our attention mainly focuses on the advances in studies of structural analysis, deformation mechanisms and rheology of rocks, contractional tectonics and late- and post-orogenic extensional collapse in orogens, large-scale strikeslip faults and indentation-extrusion tectonics, active tectonics and natural hazards. The relationships of structural geology and tectonics with petrology and geochronology are also discussed in terms of intersection of scientific disciplines. Finally, some suggestions are proposed for the further development of structural geology and tectonics in China. 展开更多
关键词 plate tectonics RHEOLOGY structural geology continental dynamics document statistics
在线阅读 下载PDF
A Stable and Consistent Document Model Suitable for Asynchronous Cooperative Edition
6
作者 Maurice Tchoupé Tchendji Rodrigue D. Djeumen Marcellin T. Atemkeng 《Journal of Computer and Communications》 2017年第8期69-82,共14页
Complex structured documents can be intentionally represented as a tree structure decorated with attributes. Ignoring attributes (these are related to semantic aspects that can be treated separately from purely struct... Complex structured documents can be intentionally represented as a tree structure decorated with attributes. Ignoring attributes (these are related to semantic aspects that can be treated separately from purely structural aspects which interest us here), in the context of a cooperative edition, legal structures are characterized by a document model (an abstract grammar) and each intentional representation can be manipulated independently and eventually asynchronously by several co-authors through various editing tools that operate on its “partial replicas”. For unsynchronized edition of a partial replica, considered co-author must have a syntactic document local model that constraints him to ensure minimum consistency of local representation that handles with respect to the global model. This consistency is synonymous with the existence of one or more (global) intentional representations towards the global model, assuming the current local representation as her/their partial replica. The purpose of this paper is to present the grammatical structures which are grammars that permit not only to specify a (global) model for documents published in a cooperative manner, but also to derive automatically via a so call projection operation, consistent (local) models for each co-authors involved in the cooperative edition. We also show some properties that meet these grammatical structures. 展开更多
关键词 structureD documentS documentS Models GRAMMARS Cooperative EDITION structureD EDITION Projections VIEWS Partial Replicas
在线阅读 下载PDF
Document Clustering Based on Constructing Density Tree
7
作者 戴维迪 王文俊 +2 位作者 侯越先 王英 张璐 《Transactions of Tianjin University》 EI CAS 2008年第1期21-26,共6页
This paper focuses on document clustering by clustering algorithm based on a DEnsityTree (CABDET) to improve the accuracy of clustering. The CABDET method constructs a density-based treestructure for every potential c... This paper focuses on document clustering by clustering algorithm based on a DEnsityTree (CABDET) to improve the accuracy of clustering. The CABDET method constructs a density-based treestructure for every potential cluster by dynamically adjusting the radius of neighborhood according to local density. It avoids density-based spatial clustering of applications with noise (DBSCAN) ′s global density parameters and reduces input parameters to one. The results of experiment on real document show that CABDET achieves better accuracy of clustering than DBSCAN method. The CABDET algorithm obtains the max F-measure value 0.347 with the root node's radius of neighborhood 0.80, which is higher than 0.332 of DBSCAN with the radius of neighborhood 0.65 and the minimum number of objects 6. 展开更多
关键词 document handling clustering tree structure vector space model
在线阅读 下载PDF
A Tree Pattern Matching Algorithm for XML Queries with Structural Preferences
8
作者 Maurice Tchoupé Tchendji Lionel Tadonfouet Thomas Tébougang Tchendji 《Journal of Computer and Communications》 2019年第1期61-83,共23页
In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly comple... In the XML community, exact queries allow users to specify exactly what they want to check and/or retrieve in an XML document. When they are applied to a semi-structured document or to a document with an overly complex model, the lack or the ignorance of the explicit document model (DTD—Document Type Definition, Schema, etc.) increases the risk of obtaining an empty result set when the query is too specific, or, too large result set when it is too vague (e.g. it contains wildcards such as “*”). The reason is that in both cases, users write queries according to the document model they have in mind;this can be very far from the one that can actually be extracted from the document. Opposed to exact queries, preference queries are more flexible and can be relaxed to expand the search space during their evaluations. Indeed, during their evaluation, certain constraints (the preferences they contain) can be relaxed if necessary to avoid precisely empty results;moreover, the returned answers can be filtered to retain only the best ones. This paper presents an algorithm for evaluating such queries inspired by the TreeMatch algorithm proposed by Yao et al. for exact queries. In the proposed algorithm, the best answers are obtained by using an adaptation of the Skyline operator (defined in relational databases) in the context of documents (trees) to incrementally filter into the partial solutions set, those which satisfy the maximum of preferential constraints. The only restriction imposed on documents is No-Self-Containment. 展开更多
关键词 SEMI-structureD documents Preference QUERIES TREE Pattern Matching TreeMatch Algorithm XML The SKYLINE Operator
在线阅读 下载PDF
联合国国际城市搜索与救援标准及指导性文件综述 被引量:1
9
作者 曲旻皓 李立 +4 位作者 王建平 程永 高杨 王学瀚 黄新杰 《灾害学》 北大核心 2025年第2期153-156,204,共5页
自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作... 自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作机制,并通过出台一系列的指南、指导性文件和推荐性技术文件,规范救援能力和队伍建设,强化国际救援协调和现场救援的效率。该文系统介绍了INSARAG标准和技术文件组成体系架构,并阐述了各标准及技术文件的出台背景、主要内容及对中国的搜救队伍建设的推动作用,并讨论其对我国灾害救援工作的启示与借鉴意义。 展开更多
关键词 国际城市搜救咨询团INSARAG INSARAG标准架构 INSARAG指南 INSARAG指导性文件 INSARAG推荐性技术文件
在线阅读 下载PDF
Storyline Extraction of Document-Level Events Using Large Language Models
10
作者 Ziyang Hu Yaxiong Li 《Journal of Computer and Communications》 2024年第11期162-172,共11页
This article proposes a document-level prompt learning approach using LLMs to extract the timeline-based storyline. Through verification tests on datasets such as ESCv1.2 and Timeline17, the results show that the prom... This article proposes a document-level prompt learning approach using LLMs to extract the timeline-based storyline. Through verification tests on datasets such as ESCv1.2 and Timeline17, the results show that the prompt + one-shot learning proposed in this article works well. Meanwhile, our research findings indicate that although timeline-based storyline extraction has shown promising prospects in the practical applications of LLMs, it is still a complex natural language processing task that requires further research. 展开更多
关键词 document-Level Storyline Extraction TIMELINE Large Language Models Topological structure of Storyline Prompt Learning
在线阅读 下载PDF
我国文献修复职业稳定性因素分析及发展策略研究
11
作者 张美芳 臧丹阳 +1 位作者 李萌 宋欣 《北京档案》 北大核心 2025年第5期22-28,共7页
文献修复职业的职能及职责有别于其他行业或档案馆与图书馆其他岗位,对专业及技术要求高。论文利用31个省(区、市)的调查数据,分析了当前我国文献修复人员从业现状,并基于Logit模型对影响修复职业稳定性的主要因素进行了分析,同时运用SP... 文献修复职业的职能及职责有别于其他行业或档案馆与图书馆其他岗位,对专业及技术要求高。论文利用31个省(区、市)的调查数据,分析了当前我国文献修复人员从业现状,并基于Logit模型对影响修复职业稳定性的主要因素进行了分析,同时运用SPSS 26.0对调研结果进行回归分析,结果得出,修复人员学历、职业兴趣、成就感、社会评价、升迁机会等与选择并坚守修复职业直接相关。论文揭示了在社会及大众对文献修复职业认可度不断提高的背景下自身职业发展变化的根源,以便调整修复人才培养方式,促进文献保护与修复队伍不断壮大。 展开更多
关键词 文献修复 人才结构 LOGIT模型 人才培养
在线阅读 下载PDF
基于树形结构的JSON文档相似性度量模型
12
作者 王六平 陆玟冰 +2 位作者 胡子达 孙程 张锦 《计算机与数字工程》 2025年第8期2133-2139,共7页
JSON文档的相似性度量是JSON文档数据挖掘、文本聚类、信息检索的关键,现有的研究方法用于JSON文档的结构提取存在缺陷,导致相似度计算精确度不高,效果不理想。论文在现有的树编辑距离算法的基础上,提出了一种适用于JSON文档的相似性度... JSON文档的相似性度量是JSON文档数据挖掘、文本聚类、信息检索的关键,现有的研究方法用于JSON文档的结构提取存在缺陷,导致相似度计算精确度不高,效果不理想。论文在现有的树编辑距离算法的基础上,提出了一种适用于JSON文档的相似性度量模型,并针对无序树的编辑距离计算进行优化。实验结果表明,该模型在一定程度上提高了JSON文档相似度计算的效率。 展开更多
关键词 编辑距离 JSON文档 结构相似性 信息抽取 半结构化数据
在线阅读 下载PDF
篇章模式视角下公文正文的宏观结构探究
13
作者 杨霞 《秘书》 2025年第4期81-94,共14页
在长期的公务管理话语实践中,公文正文逐渐形成了独特且稳定的篇章模式。然而,目前学界提出的公文篇章模式大多是对公文表层结构的概括性描述,缺乏理论深度。本文运用篇章语言学和篇章分析等方法,深入剖析公文的深层结构,并创新性地构... 在长期的公务管理话语实践中,公文正文逐渐形成了独特且稳定的篇章模式。然而,目前学界提出的公文篇章模式大多是对公文表层结构的概括性描述,缺乏理论深度。本文运用篇章语言学和篇章分析等方法,深入剖析公文的深层结构,并创新性地构建公文四维逻辑链模式,即“背景缘由-事项行为-条件保障-控制约束”模式。该模式与社会组织的公务管理思维高度契合,能够充分反映公务场域的管理情境及其语境特征,不仅为公文构思提供宏观的思维框架,也为公文篇章的建构和话语信息内容的组织提供宏观的功能结构和语义结构,从而为公文写作与阅读奠定坚实的理论基础。 展开更多
关键词 公文篇章模式 深层结构分析 公文四维逻辑链模式 宏观结构框架 公务管理思维
原文传递
基于模态注意力与知识库比对的招标文件编制方法 被引量:1
14
作者 侯继辉 吴小忠 +3 位作者 郑浩 夏卓群 唐海东 王欣 《计算机技术与发展》 2025年第5期36-44,共9页
多模态自适应权重注意力命名实体识别与知识库比对的招标编制方法随着人工智能的发展,招投标业务中引入了大量的智能化手段。招标文件编制作为招投标业务中的重要流程,编制结果的准确性和编制速率的高效性需要更好的保障。传统招标文件... 多模态自适应权重注意力命名实体识别与知识库比对的招标编制方法随着人工智能的发展,招投标业务中引入了大量的智能化手段。招标文件编制作为招投标业务中的重要流程,编制结果的准确性和编制速率的高效性需要更好的保障。传统招标文件编制中存在的文件编制过程复杂、耗时较长和人为编制的偶然错误问题,降低了文件编制的效率甚至影响了招标业务的流畅性。为此提出了基于多模态自适应权重注意力机制的命名实体识别(MMAWA-NER)提取与知识库比对的招标文件编制方法。首先以人工分析的形式将历史招标文件按照文章条目分为各个模块,每个模块使用包含了固定的条款信息并存入结构化标准库中;在文件编制阶段,根据标准库当中的模块内容生成标准化文件,编制人员随后填写关键信息;在编制完成后,将需校验的招标文件通过MMAWA-NER抽取关键信息与领域知识库进行对比,并返回评估结果给编制人员;最后使用了历史招标文件处理后的知识库数据生成新的招标文件数据集进行评估,并对比了其他模型的方法,实验证明该模型实现了80%的编制效率和20%的准确率提升。 展开更多
关键词 招标文件 智能编制 知识库 命名实体识别 结构化
在线阅读 下载PDF
结合大语言模型与动态提示的裁判文书摘要方法
15
作者 张滨滨 秦永彬 +1 位作者 黄瑞章 陈艳平 《计算机应用》 北大核心 2025年第9期2783-2789,共7页
针对裁判文书案件结构复杂、涉案事实冗余且案情分布广泛的问题,现有的大语言模型(LLM)难以有效关注结构信息并可能会产生事实错误关联,从而导致结构信息缺失和事实不一致。因此,提出一种结合LLM与动态提示的裁判文书摘要方法DPCM(Dynam... 针对裁判文书案件结构复杂、涉案事实冗余且案情分布广泛的问题,现有的大语言模型(LLM)难以有效关注结构信息并可能会产生事实错误关联,从而导致结构信息缺失和事实不一致。因此,提出一种结合LLM与动态提示的裁判文书摘要方法DPCM(Dynamic Prompt Correction Method)。首先,利用LLM进行单样本学习,以生成裁判文书摘要。其次,计算原文与摘要之间的高维相似性,以检测摘要中可能存在的结构缺失或事实不一致的问题:如果发现问题,将错误摘要与原文拼接,并加入提示词,随后再次进行单样本学习,以修正并生成新的摘要,且再次进行相似性检测,如果问题仍然存在,则重复此生成与检测过程。最后,通过这种反复迭代的方式动态调整提示词,以逐步优化生成的摘要。在CAIL2020公共司法摘要数据集上的实验结果表明,相较于Least-To-Most-Prompting、Zero-Shot-Reasoners和Self_Consistency_Cot等方法,所提方法在Rouge-1、Rouge-2、Rouge-L、BERTscore、FactCC(Factual Consistency)指标上均有所提高。 展开更多
关键词 大语言模型 动态提示 裁判文书摘要 结构缺失 事实不一致
在线阅读 下载PDF
面向复杂刑事案件的涉案金额识别方法
16
作者 田如君 林川 +3 位作者 黄瑞章 陈艳平 杨志 秦永彬 《计算机工程与设计》 北大核心 2025年第6期1556-1563,共8页
针对现有涉案金额识别方法在复杂案件(一案多人)上面临金额的所属关系易混淆及意图多样性问题,提出一种面向复杂刑事案件的涉案金额识别推理方法。通过分析裁判文书的逻辑结构,抽取文书中的金额相关要素并结合文书的特征构建金额共现图... 针对现有涉案金额识别方法在复杂案件(一案多人)上面临金额的所属关系易混淆及意图多样性问题,提出一种面向复杂刑事案件的涉案金额识别推理方法。通过分析裁判文书的逻辑结构,抽取文书中的金额相关要素并结合文书的特征构建金额共现图,用图的形式对金额的归属关系进行表示,使用图神经网络(graph neural network, GNN)在金额共现图中学习要素节点之间的语义依赖信息和结构信息,获取其深层的节点特征,实现对涉案金额的识别和推理。在公共比赛数据集LAIC2021(Legal AI Challenge 2021)上的准确率(Accuracy, Acc)值达到94.75%,比当前最优模型提升了3.7%,在某省人民法院裁判文书复杂案件数据集上的Acc值达到74.16%。 展开更多
关键词 刑事案件 涉案金额识别 裁判文书逻辑结构 金额共现图 图神经网络 司法智能 特征融合
在线阅读 下载PDF
APPCorp:a corpus for Android privacy policy document structure analysis 被引量:2
17
作者 Shuang LIU Fan ZHANG +3 位作者 Baiyang ZHAO Renjie GUO Tao CHEN Meishan ZHANG 《Frontiers of Computer Science》 SCIE EI CSCD 2023年第3期1-10,共10页
With the increasing popularity of mobile devices and the wide adoption of mobile Apps,an increasing concern of privacy issues is raised.Privacy policy is identified as a proper medium to indicate the legal terms,such ... With the increasing popularity of mobile devices and the wide adoption of mobile Apps,an increasing concern of privacy issues is raised.Privacy policy is identified as a proper medium to indicate the legal terms,such as the general data protection regulation(GDPR),and to bind legal agreement between service providers and users.However,privacy policies are usually long and vague for end users to read and understand.It is thus important to be able to automatically analyze the document structures of privacy policies to assist user understanding.In this work we create a manually labelled corpus containing 231 privacy policies(of more than 566,000 words and 7,748 annotated paragraphs).We benchmark our data corpus with 3 document classification models and achieve more than 82%on F1-score. 展开更多
关键词 privacy policy GDPR document structure analysis representation learning graph neural network
原文传递
基于修辞结构的篇章级神经机器翻译
18
作者 姜云卓 贡正仙 《计算机工程与科学》 北大核心 2025年第1期180-190,共11页
虽然篇章级神经机器翻译发展多年,并取得了长足的进步,但是其大部分工作都是从模型的角度出发,利用上下文字词信息来构建有效的网络结构,忽视了使用跨句子的篇章结构和修辞信息对模型进行指导。针对这一问题,在修辞结构理论的指导下,提... 虽然篇章级神经机器翻译发展多年,并取得了长足的进步,但是其大部分工作都是从模型的角度出发,利用上下文字词信息来构建有效的网络结构,忽视了使用跨句子的篇章结构和修辞信息对模型进行指导。针对这一问题,在修辞结构理论的指导下,提出了对篇章单元和修辞结构树特征分别进行编码的方法。实验结果表明,所提方法加强了编码器对篇章结构和修辞上的表征能力,使用该方法对模型进行改进后,其翻译结果在多个数据集上都获得了明显提升,性能超过了多个优质的基线模型,并且在提出的定量评估方法和人工分析中译文质量上也表现出了明显改善。 展开更多
关键词 神经机器翻译 篇章分析 篇章翻译 修辞结构理论
在线阅读 下载PDF
基于实体类别信息的数据分析与关系抽取模型构建
19
作者 杨航 张啸成 张永刚 《吉林大学学报(理学版)》 北大核心 2025年第2期428-436,共9页
针对文档级关系抽取任务中的实体多提及问题和实体对噪音问题,使用实体的类别信息,提出一个基于实体类别信息的关系抽取模型(EUT模型),该模型通过实体类别判断和类别对产生的关系类别先验两个子任务提高关系抽取结果.实体类别判断任务... 针对文档级关系抽取任务中的实体多提及问题和实体对噪音问题,使用实体的类别信息,提出一个基于实体类别信息的关系抽取模型(EUT模型),该模型通过实体类别判断和类别对产生的关系类别先验两个子任务提高关系抽取结果.实体类别判断任务对实体进行类型标记后,再对实体所有提及进行类型分类,使实体的多个提及产生更丰富且相近的特征表示.关系类别先验任务使模型获得实体对的头尾类型所产生的关系分布先验,通过实体对的类别降低错误实体对噪音.为验证EUT模型的效果,在两个文档级数据集DocRED和Re-DocRED上进行实验,实验结果表明,该模型有效利用了实体的类型信息,与基础模型相比取得了更好的关系抽取效果,表明实体的类别信息对文档级关系抽取有重要影响. 展开更多
关键词 文档级关系抽取 知识图谱 结构化先验 自然语言处理
在线阅读 下载PDF
Document structure model for survey generation using neural network
20
作者 Huiyan XU Zhongqing WANG +3 位作者 Yifei ZHANG Xiaolan WENG ZhijianWANG Guodong ZHOU 《Frontiers of Computer Science》 SCIE EI CSCD 2021年第4期73-82,共10页
Survey generation aims to generate a summary from a scientific topic based on related papers.The structure of papers deeply influences the generative process of survey,especially the relationships between sentence and... Survey generation aims to generate a summary from a scientific topic based on related papers.The structure of papers deeply influences the generative process of survey,especially the relationships between sentence and sentence,paragraph and paragraph.In principle,the structure of paper can influence the quality of the summary.Therefore,we employ the structure of paper to leverage contextual information among sentences in paragraphs to generate a survey for documents.In particular,we present a neural document structure model for survey generation.We take paragraphs as units,and model sentences in paragraphs,we then employ a hierarchical model to learn structure among sentences,which can be used to select important and informative sentences to generate survey.We evaluate our model on scientific document data set.The experimental results show that our model is effective,and the generated survey is informative and readable. 展开更多
关键词 survey generation contextual information document structure
原文传递
上一页 1 2 21 下一页 到第
使用帮助 返回顶部