期刊文献+
共找到34篇文章
< 1 2 >
每页显示 20 50 100
Using AdaBoost Meta-Learning Algorithm for Medical News Multi-Document Summarization 被引量:1
1
作者 Mahdi Gholami Mehr 《Intelligent Information Management》 2013年第6期182-190,共9页
Automatic text summarization involves reducing a text document or a larger corpus of multiple documents to a short set of sentences or paragraphs that convey the main meaning of the text. In this paper, we discuss abo... Automatic text summarization involves reducing a text document or a larger corpus of multiple documents to a short set of sentences or paragraphs that convey the main meaning of the text. In this paper, we discuss about multi-document summarization that differs from the single one in which the issues of compression, speed, redundancy and passage selection are critical in the formation of useful summaries. Since the number and variety of online medical news make them difficult for experts in the medical field to read all of the medical news, an automatic multi-document summarization can be useful for easy study of information on the web. Hence we propose a new approach based on machine learning meta-learner algorithm called AdaBoost that is used for summarization. We treat a document as a set of sentences, and the learning algorithm must learn to classify as positive or negative examples of sentences based on the score of the sentences. For this learning task, we apply AdaBoost meta-learning algorithm where a C4.5 decision tree has been chosen as the base learner. In our experiment, we use 450 pieces of news that are downloaded from different medical websites. Then we compare our results with some existing approaches. 展开更多
关键词 multi-document summarization Machine Learning Decision Trees ADABOOST C4.5 MEDICAL Document summarization
在线阅读 下载PDF
Density peaks clustering based integrate framework for multi-document summarization 被引量:3
2
作者 BaoyanWang Jian Zhang +1 位作者 Yi Liu Yuexian Zou 《CAAI Transactions on Intelligence Technology》 2017年第1期26-30,共5页
We present a novel unsupervised integrated score framework to generate generic extractive multi- document summaries by ranking sentences based on dynamic programming (DP) strategy. Considering that cluster-based met... We present a novel unsupervised integrated score framework to generate generic extractive multi- document summaries by ranking sentences based on dynamic programming (DP) strategy. Considering that cluster-based methods proposed by other researchers tend to ignore informativeness of words when they generate summaries, our proposed framework takes relevance, diversity, informativeness and length constraint of sentences into consideration comprehensively. We apply Density Peaks Clustering (DPC) to get relevance scores and diversity scores of sentences simultaneously. Our framework produces the best performance on DUC2004, 0.396 of ROUGE-1 score, 0.094 of ROUGE-2 score and 0.143 of ROUGE-SU4 which outperforms a series of popular baselines, such as DUC Best, FGB [7], and BSTM [10]. 展开更多
关键词 multi-document summarization Integrated score framework Density peaks clustering Sentences rank
在线阅读 下载PDF
Constructing a taxonomy to support multi-document summarization of dissertation abstracts
3
作者 KHOO Christopher S.G. GOH Dion H. 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2005年第11期1258-1267,共10页
This paper reports part of a study to develop a method for automatic multi-document summarization. The current focus is on dissertation abstracts in the field of sociology. The summarization method uses macro-level an... This paper reports part of a study to develop a method for automatic multi-document summarization. The current focus is on dissertation abstracts in the field of sociology. The summarization method uses macro-level and micro-level discourse structure to identify important information that can be extracted from dissertation abstracts, and then uses a variable-based framework to integrate and organize extracted information across dissertation abstracts. This framework focuses more on research concepts and their research relationships found in sociology dissertation abstracts and has a hierarchical structure. A taxonomy is constructed to support the summarization process in two ways: (1) helping to identify important concepts and relations expressed in the text, and (2) providing a structure for linking similar concepts in different abstracts. This paper describes the variable-based framework and the summarization process, and then reports the construction of the taxonomy for supporting the summarization process. An example is provided to show how to use the constructed taxonomy to identify important concepts and integrate the concepts extracted from different abstracts. 展开更多
关键词 Text summarization Automatic multi-document summarization Variable-based framework Digital library
在线阅读 下载PDF
Unsupervised Graph-Based Tibetan Multi-Document Summarization
4
作者 Xiaodong Yan Yiqin Wang +3 位作者 Wei Song Xiaobing Zhao A.Run Yang Yanxing 《Computers, Materials & Continua》 SCIE EI 2022年第10期1769-1781,共13页
Text summarization creates subset that represents the most important or relevant information in the original content,which effectively reduce information redundancy.Recently neural network method has achieved good res... Text summarization creates subset that represents the most important or relevant information in the original content,which effectively reduce information redundancy.Recently neural network method has achieved good results in the task of text summarization both in Chinese and English,but the research of text summarization in low-resource languages is still in the exploratory stage,especially in Tibetan.What’s more,there is no large-scale annotated corpus for text summarization.The lack of dataset severely limits the development of low-resource text summarization.In this case,unsupervised learning approaches are more appealing in low-resource languages as they do not require labeled data.In this paper,we propose an unsupervised graph-based Tibetan multi-document summarization method,which divides a large number of Tibetan news documents into topics and extracts the summarization of each topic.Summarization obtained by using traditional graph-based methods have high redundancy and the division of documents topics are not detailed enough.In terms of topic division,we adopt two level clustering methods converting original document into document-level and sentence-level graph,next we take both linguistic and deep representation into account and integrate external corpus into graph to obtain the sentence semantic clustering.Improve the shortcomings of the traditional K-Means clustering method and perform more detailed clustering of documents.Then model sentence clusters into graphs,finally remeasure sentence nodes based on the topic semantic information and the impact of topic features on sentences,higher topic relevance summary is extracted.In order to promote the development of Tibetan text summarization,and to meet the needs of relevant researchers for high-quality Tibetan text summarization datasets,this paper manually constructs a Tibetan summarization dataset and carries out relevant experiments.The experiment results show that our method can effectively improve the quality of summarization and our method is competitive to previous unsupervised methods. 展开更多
关键词 multi-document summarization text clustering topic feature fusion graphic model
在线阅读 下载PDF
Research on multi-document summarization based on latent semantic indexing
5
作者 秦兵 刘挺 +1 位作者 张宇 李生 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2005年第1期91-94,共4页
A multi-document summarization method based on Latent Semantic Indexing (LSI) is proposed. The method combines several reports on the same issue into a matrix of terms and sentences, and uses a Singular Value Decompos... A multi-document summarization method based on Latent Semantic Indexing (LSI) is proposed. The method combines several reports on the same issue into a matrix of terms and sentences, and uses a Singular Value Decomposition (SVD) to reduce the dimension of the matrix and extract features, and then the sentence similarity is computed. The sentences are clustered according to similarity of sentences. The centroid sentences are selected from each class. Finally, the selected sentences are ordered to generate the summarization. The evaluation and results are presented, which prove that the proposed methods are efficient. 展开更多
关键词 multi-document summarization LSI (latent semantic indexing) CLUSTERING
在线阅读 下载PDF
TWO-STAGE SENTENCE SELECTION APPROACH FOR MULTI-DOCUMENT SUMMARIZATION
6
作者 Zhang Shu Zhao Tiejun Zheng Dequan Zhao Hua 《Journal of Electronics(China)》 2008年第4期562-567,共6页
Compared with the traditional method of adding sentences to get summary in multi-document summarization,a two-stage sentence selection approach based on deleting sentences in acandidate sentence set to generate summar... Compared with the traditional method of adding sentences to get summary in multi-document summarization,a two-stage sentence selection approach based on deleting sentences in acandidate sentence set to generate summary is proposed,which has two stages,the acquisition of acandidate sentence set and the optimum selection of sentence.At the first stage,the candidate sentenceset is obtained by redundancy-based sentence selection approach.At the second stage,optimum se-lection of sentences is proposed to delete sentences in the candidate sentence set according to itscontribution to the whole set until getting the appointed summary length.With a test corpus,theROUGE value of summaries gotten by the proposed approach proves its validity,compared with thetraditional method of sentence selection.The influence of the token chosen in the two-stage sentenceselection approach on the quality of the generated summaries is analyzed. 展开更多
关键词 TWO-STAGE Sentence selection approach multi-document summarization
在线阅读 下载PDF
Multi-Document Summarization Model Based on Integer Linear Programming
7
作者 Rasim Alguliev Ramiz Aliguliyev Makrufa Hajirahimova 《Intelligent Control and Automation》 2010年第2期105-111,共7页
This paper proposes an extractive generic text summarization model that generates summaries by selecting sentences according to their scores. Sentence scores are calculated using their extensive coverage of the main c... This paper proposes an extractive generic text summarization model that generates summaries by selecting sentences according to their scores. Sentence scores are calculated using their extensive coverage of the main content of the text, and summaries are created by extracting the highest scored sentences from the original document. The model formalized as a multiobjective integer programming problem. An advantage of this model is that it can cover the main content of source (s) and provide less redundancy in the generated sum- maries. To extract sentences which form a summary with an extensive coverage of the main content of the text and less redundancy, have been used the similarity of sentences to the original document and the similarity between sentences. Performance evaluation is conducted by comparing summarization outputs with manual summaries of DUC2004 dataset. Experiments showed that the proposed approach outperforms the related methods. 展开更多
关键词 multi-document summarization Content COVERAGE LESS REDUNDANCY INTEGER Linear Programming
在线阅读 下载PDF
基于多图神经网络和图对比学习的科学文献摘要模型
8
作者 赵红燕 郭力华 +1 位作者 刘春霞 王日云 《计算机应用》 北大核心 2025年第12期3820-3828,共9页
生成面临句间关系的捕捉、长距离依赖及文档信息的高效编码与提取等难题,一直是自然语言处理领域的一个难点任务。同时,科学文献通常包含多个章节和段落,具有复杂的层次结构,使科学文献的摘要生成任务更具挑战性。针对以上问题,提出一... 生成面临句间关系的捕捉、长距离依赖及文档信息的高效编码与提取等难题,一直是自然语言处理领域的一个难点任务。同时,科学文献通常包含多个章节和段落,具有复杂的层次结构,使科学文献的摘要生成任务更具挑战性。针对以上问题,提出一种基于多图神经网络(GNN)和图对比学习(GCL)的科学文献摘要模型(MGCSum)。首先,对于输入的文档,通过同构GNN和异构GNN分别建模句内与句间关系,以生成初始句子表示;其次,将这些句子表示馈送到一个多头超图注意网络(HGAT),并在其中利用自注意机制充分捕捉节点和边之间的关系,从而进一步更新和学习句间的表示;再次,引入GCL模块增强全局主题感知,从而提升句子表示的语义一致性和区分度;最后,采用多层感知器(MLP)和归一化层计算一个得分,用于判断句子是否应被选为摘要。在PubMed和ArXiv数据集上的实验结果表明,MGCSum模型的表现优于多数基线模型。具体地,在PubMed数据集上,MGCSum模型的ROUGE-1、ROUGE-2和ROUGE-L分别达到了48.97%、23.15%和44.09%,相比现有的先进模型HAESum(Hierarchical Attention graph for Extractive document Summarization)分别提高了0.20、0.71和0.26个百分点。可见,通过结合多GNN和GCL,MGCSum模型能够更有效地捕捉文献的层次结构信息和句间关系,提升了摘要生成的准确性和语义一致性,展现了它在科学文献摘要生成任务中的优势。 展开更多
关键词 科学文献摘要 抽取式摘要 图神经网络 超图注意网络 图对比学习
在线阅读 下载PDF
中国共产党总结和运用改革开放经验的历史发展、内涵本质及其当代意义
9
作者 张艳国 《岭南学刊》 2025年第1期41-55,共15页
在每个重要的历史阶段和关键的时间节点上,中国共产党都高度重视总结和运用改革开放的宝贵历史经验。党坚持把改革开放的宏大社会实践与经验总结的深刻认识有机结合起来、科学整合起来,形成为一个完整的实践体系和认识过程。依循中国改... 在每个重要的历史阶段和关键的时间节点上,中国共产党都高度重视总结和运用改革开放的宝贵历史经验。党坚持把改革开放的宏大社会实践与经验总结的深刻认识有机结合起来、科学整合起来,形成为一个完整的实践体系和认识过程。依循中国改革开放的历史逻辑、实践逻辑和认识逻辑的依次展开、有序推进和阶段性特点,可以将改革开放经验总结和运用的历史发展,概括为“一个过程、三个阶段”。它是一个完整的、有机联系的认识整体和思想整体,具有鲜明的时代特征、实践特性和认识特点。党对改革开放宝贵经验的总结和运用,是一个完整的思想体系和独立的话语表达体系,其内涵具有多方面的本质特征。科学认识和准确掌握党对改革开放宝贵经验的总结运用,对于推进新时代全面深化改革,具有十分重要的当代意义和价值。 展开更多
关键词 中国共产党 改革开放 宝贵经验总结和运用 科学内涵 当代意义
在线阅读 下载PDF
面向可溯源文本生成的科技文献伪反馈训练数据合成研究
10
作者 马永强 刘家伟 高影繁 《情报学报》 北大核心 2025年第7期830-845,共16页
在学术文本中插入恰当的引文标识是学术写作的基本规范,可以帮助读者验证文本内容的真实性。引文标识符可以用于实现内容溯源、保证内容可验证性。在学术场景中,现有大语言模型普遍缺乏内置的内容溯源机制,导致所生成学术文本的可验证... 在学术文本中插入恰当的引文标识是学术写作的基本规范,可以帮助读者验证文本内容的真实性。引文标识符可以用于实现内容溯源、保证内容可验证性。在学术场景中,现有大语言模型普遍缺乏内置的内容溯源机制,导致所生成学术文本的可验证性不足。当前,借助领域数据集来优化大模型是主流的研究思路。然而,在优化模型可溯源性方面,基于人类撰写的学术文本所构建的训练集存在内在一致性不足、引文标注行为差异性大等问题,基于大模型的数据合成方法在数据多样性方面也存在局限性。为此,本文提出了一种面向可溯源学术文本的引文标识符体系与评测方法,用于分析大模型所生成学术文本的可溯源性。然后,从训练数据的角度,针对可溯源的学术文本生成,本文提出了一种两阶段伪反馈训练数据合成方法,兼顾大模型标注文本和人类标注文本的特性,构建高质量、多样化的训练数据。研究结果表明,采用本文构建的合成数据训练的小模型,能够生成更具可溯源性的学术文本;通过第二阶段的伪反馈进一步优化数据分布和任务多样性,有助于增强模型的泛化能力。 展开更多
关键词 大语言模型 数据合成 学术多文档摘要 文本可溯源性
在线阅读 下载PDF
Reflective thinking meets artificial intelligence:Synthesizing sustainability transition knowledge in left-behind mountain regions
11
作者 Andrej Ficko Simo Sarkki +2 位作者 Yasar Selman Gultekin Antonia Egli Juha Hiedanpää 《Geography and Sustainability》 2025年第1期159-169,共11页
We demonstrate a multi-method approach towards discovering and structuring sustainability transition knowl edge in marginalized mountain regions.By employing reflective thinking,artificial intelligence(AI)-powered tex... We demonstrate a multi-method approach towards discovering and structuring sustainability transition knowl edge in marginalized mountain regions.By employing reflective thinking,artificial intelligence(AI)-powered text summarization and text mining,we synthesize experts’narratives on sustainable development challenges and solutions in Kardüz Upland,Türkiye.We then analyze their alignment with the UN Sustainable Development Goals(SDGs)using document embedding.Investment in infrastructure,education,and resilient socio-ecological systems emerged as priority sectors to combat poor infrastructure,geographic isolation,climate change,poverty,depopulation,unemployment,low education levels,and inadequate social services.The narratives were closest in substance to SDG 1,3,and 11.Social dimensions of sustainability were more pronounced than environmental dimensions.The presented approach supports policymakers in organizing loosely structured sustainability tran sition knowledge and fragmented data corpora,while also advancing AI applications for designing and planning sustainable development policies at the regional level. 展开更多
关键词 Artificial intelligence INNOVATION Reflective thinking scientific imagination Text mining Text summarization
在线阅读 下载PDF
关键词和被引次数对科技论文自动摘要效果影响研究 被引量:6
12
作者 刘家益 李鲡瑶 +1 位作者 张智雄 邹益民 《情报学报》 CSSCI CSCD 北大核心 2017年第11期1165-1174,共10页
科技论文的关键词和被引次数与论文主题具有高相关性,是发现论文重要内容的有力线索。但这两个特征尚未应用于当前面向科技论文的多文档自动摘要方法中,其对科技论文自动摘要效果的影响还有待探索。本文通过设计对比算法和实验,定量分... 科技论文的关键词和被引次数与论文主题具有高相关性,是发现论文重要内容的有力线索。但这两个特征尚未应用于当前面向科技论文的多文档自动摘要方法中,其对科技论文自动摘要效果的影响还有待探索。本文通过设计对比算法和实验,定量分析研究了科技论文的关键词、被引次数两个特征对科技论文自动摘要效果的影响。结果表明:引入关键词因子和被引次数因子能显著提高摘要的效果。其中,同时使用两个因子,对摘要效果的积极影响最为显著;单独使用被引次数因子对摘要效果的积极影响也较为显著,但弱于同时使用两个因子;单独使用关键词因子对摘要效果影响不显著,甚至差于基准组;此外两个因子对摘要规模的变化也较为敏感。 展开更多
关键词 关键词 被引次数 科技论文 多文档自动摘要
在线阅读 下载PDF
以科研为关键加快向研究型大学转变的步伐——庆祝南京航空航天大学建校50周年 被引量:2
13
作者 梁德旺 《南京航空航天大学学报》 EI CAS CSCD 北大核心 2002年第5期403-408,共6页
首先回顾了建校 5 0年来的科研工作 ,着重介绍了“七五”~“九五”期间我校承担国防型号任务 ,加强基础研究 ,跟踪高新技术等方面所取得的成绩 ;总结了取得科研业绩的经验和体会 ;队伍是根本 ,基地是关键 ,团队是基础 ;并提出为实现向... 首先回顾了建校 5 0年来的科研工作 ,着重介绍了“七五”~“九五”期间我校承担国防型号任务 ,加强基础研究 ,跟踪高新技术等方面所取得的成绩 ;总结了取得科研业绩的经验和体会 ;队伍是根本 ,基地是关键 ,团队是基础 ;并提出为实现向研究型大学转变的战略目标 ,必须进一步巩固完善科研管理体制和机制 。 展开更多
关键词 研究型大学 南京航空航天大学 科研工作 研究成果
在线阅读 下载PDF
音乐科研能力评价体系研究现状综论述 被引量:1
14
作者 李虻 陈达波 《黄钟(武汉音乐学院学报)》 CSSCI 北大核心 2010年第3期24-26,共3页
文章从显性科研能力和隐性科研能力的界定入手,分析了音乐科研能力形成与评价机制的现状,在国家文科类评价标准的前提下,提出了兼顾音乐特色的科研能力评价研究新思路,并在"研究思路探讨"、"研究难点与创新的讨论"... 文章从显性科研能力和隐性科研能力的界定入手,分析了音乐科研能力形成与评价机制的现状,在国家文科类评价标准的前提下,提出了兼顾音乐特色的科研能力评价研究新思路,并在"研究思路探讨"、"研究难点与创新的讨论"等小标题下,提出了该领域一些亟待解决的问题。 展开更多
关键词 音乐科研 科研能力 评价 综述
在线阅读 下载PDF
论影响科技成果转化的非技术因素 被引量:2
15
作者 沈林 何婷婷 《合肥工业大学学报(社会科学版)》 2007年第5期94-98,共5页
科技已成为推进社会、经济发展的关键因素,因此,科技成果转化的意义十分突出,技术本身是科技成果转化中最为根本的影响因素。但随着经济、社会的发展,其他要素如风险投资、管理、知识产权、资金等对科技成果转化都会产生不同程度的影响... 科技已成为推进社会、经济发展的关键因素,因此,科技成果转化的意义十分突出,技术本身是科技成果转化中最为根本的影响因素。但随着经济、社会的发展,其他要素如风险投资、管理、知识产权、资金等对科技成果转化都会产生不同程度的影响。文章主要分析影响科技成果转化的非技术因素。 展开更多
关键词 科技成果转化 非技术因素 综述
在线阅读 下载PDF
媒介素养科学内涵研究述评 被引量:8
16
作者 米丽娟 《重庆文理学院学报(社会科学版)》 2010年第2期138-142,共5页
20世纪30年代以来,国内外传播学界对媒介素养的研究日趋重视,结出了累累硕果,其中关于媒介素养的定义就有数十个。文章以文献综述的方式,对众多的媒介素养定义进行分类、比较和分析,并从中总结出适应当今我国特殊社会状况和传播形势的... 20世纪30年代以来,国内外传播学界对媒介素养的研究日趋重视,结出了累累硕果,其中关于媒介素养的定义就有数十个。文章以文献综述的方式,对众多的媒介素养定义进行分类、比较和分析,并从中总结出适应当今我国特殊社会状况和传播形势的媒介素养定义,力图在众说纷纭中滤清媒介素养的科学内涵,夯实媒介素养的研究基础,以利于相关研究的拓展和深化。 展开更多
关键词 媒介素养研究 定义 内涵 综述
在线阅读 下载PDF
基于深度学习文本摘要的科技名词释义生成方法 被引量:5
17
作者 杜振雷 陈若愚 姜雨杉 《中国科技术语》 2024年第2期29-36,共8页
科技名词是科学技术形成、积累、交流和传播的前提和基础。为科技名词生成百科性释义,对于普通民众和中文学习者掌握科技名词内涵、正确使用术语具有很强的现实意义。文章提出了一种基于深度学习的科技名词百科释义生成方法。通过网络... 科技名词是科学技术形成、积累、交流和传播的前提和基础。为科技名词生成百科性释义,对于普通民众和中文学习者掌握科技名词内涵、正确使用术语具有很强的现实意义。文章提出了一种基于深度学习的科技名词百科释义生成方法。通过网络采集百科文本和专家撰写的术语释义文章,构建了科技名词百科释义数据集。基于T5 PEGASUS预训练模型并对模型进行微调,构建了生成式文本摘要模型和建立了科技名词释义生成系统。实验结果显示,本项研究所提出的模型在生成质量、语义连贯性和通用性等方面具有较高的性能。 展开更多
关键词 深度学习 文本摘要 科技名词 术语释义 释义生成 数据集
在线阅读 下载PDF
煤及有机岩研究的新进展——第44届国际煤及有机岩会议 被引量:1
18
作者 王洁 《煤田地质与勘探》 CAS CSCD 北大核心 1993年第2期20-25,共6页
通过对44届国际煤及有机岩会议及会后访问所讨论和研究的主要问题的分析,介绍了煤的分类、新的煤显微组分、煤岩学与有机岩石学、镜质组反射率抑制、成熟度、盆地分析与评价、以及工艺转化中的活性惰性组组分等方面研究的新进展。
关键词 有机岩石 岩石学 科学研究 含量
在线阅读 下载PDF
农业科研档案规范管理探讨 被引量:3
19
作者 马鑫 《农业科技与装备》 2019年第4期76-77,共2页
介绍农业科研档案的特点,总结在农业科研档案管理中存在的主要问题,根据农业科研档案的主要作用,提出规范农业科研档案管理的具体措施,为充分利用农业科研档案的信息资源提供借鉴。
关键词 科研档案 农业 综述 管理制度
在线阅读 下载PDF
“三个代表”重要思想:中华民族百年复兴史的科学总结
20
作者 杨乃良 丁祥艳 《南都学坛(南阳师范学院人文社会科学学报)》 2005年第2期88-91,共4页
"三个代表"重要思想是在对中华民族复兴历史进程的总体把握中对我们党的理论和实践的科学总结。"三个代表"重要思想不仅是我们党的执政之基、立党之本、力量之源,也是中华民族的复兴之源、复兴之魂和复兴之本。
关键词 科学总结 “三个代表”重要思想 执政之基 中华民族复兴 复兴 党的理论 力量之源 百年 总体 历史进程
在线阅读 下载PDF
上一页 1 2 下一页 到第
使用帮助 返回顶部