期刊文献+
共找到1,770篇文章
< 1 2 89 >
每页显示 20 50 100
Learning Multi Labels from Single Label——An Extreme Weak Label Learning Algorithm 被引量:1
1
作者 DUAN Junhong LI Xiaoyu MU Dejun 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2019年第2期161-168,共8页
This paper presents a novel algorithm for an extreme form of weak label learning, in which only one of all relevant labels is given for each training sample. Using genetic algorithm, all of the labels in the training ... This paper presents a novel algorithm for an extreme form of weak label learning, in which only one of all relevant labels is given for each training sample. Using genetic algorithm, all of the labels in the training set are optimally divided into several non-overlapping groups to maximize the label distinguishability in every group. Multiple classifiers are trained separately and ensembled for label predictions. Experimental results show significant improvement over previous weak label learning algorithms. 展开更多
关键词 weak-supervised LEARNING genetic algorithm multi-label classification
原文传递
ML-CLUBAS: A Multi Label Bug Classification Algorithm
2
作者 Naresh Kumar Nagwani Shrish Verma 《Journal of Software Engineering and Applications》 2012年第12期983-990,共8页
In this paper, a multi label variant of CLUBAS [1] algorithm, ML-CLUBAS (Multi Label-Classification of software Bugs Using Bug Attribute Similarity) is presented. CLUBAS is a hybrid algorithm, and is designed by using... In this paper, a multi label variant of CLUBAS [1] algorithm, ML-CLUBAS (Multi Label-Classification of software Bugs Using Bug Attribute Similarity) is presented. CLUBAS is a hybrid algorithm, and is designed by using text clustering, frequent term calculations and taxonomic terms mapping techniques, and is an example of classification using clustering technique. CLUBAS is a single label algorithm, where one bug cluster is exactly mapped to a single bug category. However a bug cluster can be mapped into the more than one bug category in case of cluster label matches with the more than one category term, for this purpose ML-CLUBAS a multi label variant of CLUBAS is presented in this work. The designed algorithm is evaluated using the performance parameters F-measures and accuracy, number of clusters and purity. These parameters are compared with the CLUBAS and other multi label text clustering algorithms. 展开更多
关键词 SOFTWARE BUG Mining SOFTWARE BUG CLASSIFICATION BUG CLUSTERING CLASSIFICATION Using CLUSTERING BUG Attribute Similarity multi label CLASSIFICATION
在线阅读 下载PDF
Multi-label dimensionality reduction and classification with extreme learning machines 被引量:9
3
作者 Lin Feng Jing Wang +1 位作者 Shenglan Liu Yao Xiao 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2014年第3期502-513,共12页
In the need of some real applications, such as text categorization and image classification, the multi-label learning gradually becomes a hot research point in recent years. Much attention has been paid to the researc... In the need of some real applications, such as text categorization and image classification, the multi-label learning gradually becomes a hot research point in recent years. Much attention has been paid to the research of multi-label classification algorithms. Considering the fact that the high dimensionality of the multi-label datasets may cause the curse of dimensionality and wil hamper the classification process, a dimensionality reduction algorithm, named multi-label kernel discriminant analysis (MLKDA), is proposed to reduce the dimensionality of multi-label datasets. MLKDA, with the kernel trick, processes the multi-label integrally and realizes the nonlinear dimensionality reduction with the idea similar with linear discriminant analysis (LDA). In the classification process of multi-label data, the extreme learning machine (ELM) is an efficient algorithm in the premise of good accuracy. MLKDA, combined with ELM, shows a good performance in multi-label learning experiments with several datasets. The experiments on both static data and data stream show that MLKDA outperforms multi-label dimensionality reduction via dependence maximization (MDDM) and multi-label linear discriminant analysis (MLDA) in cases of balanced datasets and stronger correlation between tags, and ELM is also a good choice for multi-label classification. 展开更多
关键词 multi-label dimensionality reduction kernel trick classification.
在线阅读 下载PDF
Feature Selection for Multi-label Classification Using Neighborhood Preservation 被引量:12
4
作者 Zhiling Cai William Zhu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2018年第1期320-330,共11页
Multi-label learning deals with data associated with a set of labels simultaneously. Dimensionality reduction is an important but challenging task in multi-label learning. Feature selection is an efficient technique f... Multi-label learning deals with data associated with a set of labels simultaneously. Dimensionality reduction is an important but challenging task in multi-label learning. Feature selection is an efficient technique for dimensionality reduction to search an optimal feature subset preserving the most relevant information. In this paper, we propose an effective feature evaluation criterion for multi-label feature selection, called neighborhood relationship preserving score. This criterion is inspired by similarity preservation, which is widely used in single-label feature selection. It evaluates each feature subset by measuring its capability in preserving neighborhood relationship among samples. Unlike similarity preservation, we address the order of sample similarities which can well express the neighborhood relationship among samples, not just the pairwise sample similarity. With this criterion, we also design one ranking algorithm and one greedy algorithm for feature selection problem. The proposed algorithms are validated in six publicly available data sets from machine learning repository. Experimental results demonstrate their superiorities over the compared state-of-the-art methods. 展开更多
关键词 Feature selection multi-label learning neighborhood relationship preserving sample similarity
在线阅读 下载PDF
A Multi-Label Classification Algorithm Based on Label-Specific Features 被引量:2
5
作者 QU Huaqiao ZHANG Shichao +1 位作者 LIU Huawen ZHAO Jianmin 《Wuhan University Journal of Natural Sciences》 CAS 2011年第6期520-524,共5页
Aiming at the problem of multi-label classification, a multi-label classification algorithm based on label-specific features is proposed in this paper. In this algorithm, we compute feature density on the positive and... Aiming at the problem of multi-label classification, a multi-label classification algorithm based on label-specific features is proposed in this paper. In this algorithm, we compute feature density on the positive and negative instances set of each class firstly and then select mk features of high density from the positive and negative instances set of each class, respectively; the intersec- tion is taken as the label-specific features of the corresponding class. Finally, multi-label data are classified on the basis of la- bel-specific features. The algorithm can show the label-specific features of each class. Experiments show that our proposed method, the MLSF algorithm, performs significantly better than the other state-of-the-art multi-label learning approaches. 展开更多
关键词 multi-label classification label-specific features feature's value DENSITY
原文传递
Novel Apriori-Based Multi-Label Learning Algorithm by Exploiting Coupled Label Relationship 被引量:1
6
作者 Zhenwu Wang Longbing Cao 《Journal of Beijing Institute of Technology》 EI CAS 2017年第2期206-214,共9页
It is a key challenge to exploit the label coupling relationship in multi-label classification(MLC)problems.Most previous work focused on label pairwise relations,in which generally only global statistical informati... It is a key challenge to exploit the label coupling relationship in multi-label classification(MLC)problems.Most previous work focused on label pairwise relations,in which generally only global statistical information is used to analyze the coupled label relationship.In this work,firstly Bayesian and hypothesis testing methods are applied to predict the label set size of testing samples within their k nearest neighbor samples,which combines global and local statistical information,and then apriori algorithm is used to mine the label coupling relationship among multiple labels rather than pairwise labels,which can exploit the label coupling relations more accurately and comprehensively.The experimental results on text,biology and audio datasets shown that,compared with the state-of-the-art algorithm,the proposed algorithm can obtain better performance on 5 common criteria. 展开更多
关键词 multi-label classification hypothesis testing k nearest neighbor apriori algorithm label coupling
在线阅读 下载PDF
iATC_Deep-mISF: A Multi-Label Classifier for Predicting the Classes of Anatomical Therapeutic Chemicals by Deep Learning 被引量:1
7
作者 Zhe Lu Kuo-Chen Chou 《Advances in Bioscience and Biotechnology》 2020年第5期153-159,共7页
The recent worldwide spreading of pneumonia-causing virus, such as Coronavirus, COVID-19, and H1N1, has been endangering the life of human beings all around the world. To provide useful clues for developing antiviral ... The recent worldwide spreading of pneumonia-causing virus, such as Coronavirus, COVID-19, and H1N1, has been endangering the life of human beings all around the world. To provide useful clues for developing antiviral drugs, information of anatomical therapeutic chemicals is vitally important. In view of this, a CNN based predictor called “iATC_Deep-mISF” has been developed. The predictor is particularly useful in dealing with the multi-label systems in which some chemicals may occur in two or more different classes. To maximize the convenience for most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/iATC_Deep-mISF/, which will become a very powerful tool for developing effective drugs to fight pandemic coronavirus and save the mankind of this planet. 展开更多
关键词 PANDEMIC CORONAVIRUS multi-label System ANATOMICAL THERAPEUTIC CHEMICALS Learning at Deeper Level Five-Steps Rule
暂未订购
A SIMULTANEOUS MULTI-PROBE DETECTION LABEL-FREE OPTICAL-RESOLUTION PHOTOACOUSTIC MICROSCOPY TECHNIQUE BASED ON MICROCAVITY TRANSDUCER 被引量:1
8
作者 YONGBO WU ZHILIE TANG +1 位作者 YAN CHI LIRU WU 《Journal of Innovative Optical Health Sciences》 SCIE EI CAS 2013年第3期108-113,共6页
We demonstrate the feasibility of simultancous multi-probe detection for an optcal-resolution photoacoustic microscopy(OR-PAM)system.OR-P AM has elicited the attention of biomedical imaging researchers because of its ... We demonstrate the feasibility of simultancous multi-probe detection for an optcal-resolution photoacoustic microscopy(OR-PAM)system.OR-P AM has elicited the attention of biomedical imaging researchers because of its optical absorption contrast and high spatial resolution with great imaging depth.OR-PAM allows label-free and noninvasive imaging by maximizing the optical absorption of endogenous biomolecules.However,given the inadequate absoption of some biomolcules,detection sensitivity at the same incident intensity requires improvement.In this study,a modulated continuous wave with power density less than 3mW/cm^(2)(1/4 of the ANSI safety limit)excited the weak photoacoustic(PA)signals of biological cells.A microcavity traneducer is developed based on the bulk modulus of gas five orders of magnitude lower than that of solid;air pressure variation is inversely proportional to cavity volume at the same temperature increase.Considering that a PA wave expands in various directions,detecting PA signals from different positions and adding them together can increase detection sensitivity and signal-to-noise ratio.Therefore,we employ four detectors to acquire tiny PA signals simul-taneously.Experimental results show that the developed OR-PAM system allows the label-free imaging of cells with weak optical absorption. 展开更多
关键词 multi-probe label free optical-resolution photoacoustic microscopy
原文传递
Coupled Attribute Similarity Learning on Categorical Data for Multi-Label Classification
9
作者 Zhenwu Wang Longbing Cao 《Journal of Beijing Institute of Technology》 EI CAS 2017年第3期404-410,共7页
In this paper a novel coupled attribute similarity learning method is proposed with the basis on the multi-label categorical data(CASonMLCD).The CASonMLCD method not only computes the correlations between different ... In this paper a novel coupled attribute similarity learning method is proposed with the basis on the multi-label categorical data(CASonMLCD).The CASonMLCD method not only computes the correlations between different attributes and multi-label sets using information gain,which can be regarded as the important degree of each attribute in the attribute learning method,but also further analyzes the intra-coupled and inter-coupled interactions between an attribute value pair for different attributes and multiple labels.The paper compared the CASonMLCD method with the OF distance and Jaccard similarity,which is based on the MLKNN algorithm according to 5common evaluation criteria.The experiment results demonstrated that the CASonMLCD method can mine the similarity relationship more accurately and comprehensively,it can obtain better performance than compared methods. 展开更多
关键词 COUPLED SIMILARITY multi-label categorical data CORRELATIONS
在线阅读 下载PDF
Multi-label local discriminative embedding
10
作者 Jujie Zhang Min Fang Huimin Chai 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2017年第5期1009-1018,共10页
Multi-label classification problems arise frequently in text categorization, and many other related applications. Like conventional categorization problems, multi-label categorization tasks suffer from the curse of hi... Multi-label classification problems arise frequently in text categorization, and many other related applications. Like conventional categorization problems, multi-label categorization tasks suffer from the curse of high dimensionality. Existing multi-label dimensionality reduction methods mainly suffer from two limitations. First, latent nonlinear structures are not utilized in the input space. Second, the label information is not fully exploited. This paper proposes a new method, multi-label local discriminative embedding (MLDE), which exploits latent structures to minimize intraclass distances and maximize interclass distances on the basis of label correlations. The latent structures are extracted by constructing two sets of adjacency graphs to make use of nonlinear information. Non-symmetric label correlations, which are the case in real applications, are adopted. The problem is formulated into a global objective function and a linear mapping is achieved to solve out-of-sample problems. Empirical studies across 11 Yahoo sub-tasks, Enron and Bibtex are conducted to validate the superiority of MLDE to state-of-art multi-label dimensionality reduction methods. 展开更多
关键词 multi-label classification dimensionality reduction latent structure label correlation
在线阅读 下载PDF
Multi-label learning of face demographic classification for correlation analysis
11
作者 方昱春 程功 罗婕 《Journal of Shanghai University(English Edition)》 CAS 2011年第5期352-356,共5页
In this paper, we utilize the framework of multi-label learning for face demographic classification. We also attempt t;o explore the suitable classifiers and features for face demographic classification. Three most po... In this paper, we utilize the framework of multi-label learning for face demographic classification. We also attempt t;o explore the suitable classifiers and features for face demographic classification. Three most popular demographic information, gender, ethnicity and age are considered in experiments. Based on the results from demographic classification, we utilize statistic analysis to explore the correlation among various face demographic information. Through the analysis, we draw several conclusions on the correlation and interaction among these high-level face semantic, and the obtained results can be helpful in automatic face semantic annotation and other face analysis tasks. 展开更多
关键词 denlographic classification multi-label learning face analysis
在线阅读 下载PDF
Pedestrian attribute classification with multi-scale and multi-label convolutional neural networks
12
作者 朱建清 Zeng Huanqiang +2 位作者 Zhang Yuzhao Zheng Lixin Cai Canhui 《High Technology Letters》 EI CAS 2018年第1期53-61,共9页
Pedestrian attribute classification from a pedestrian image captured in surveillance scenarios is challenging due to diverse clothing appearances,varied poses and different camera views. A multiscale and multi-label c... Pedestrian attribute classification from a pedestrian image captured in surveillance scenarios is challenging due to diverse clothing appearances,varied poses and different camera views. A multiscale and multi-label convolutional neural network( MSMLCNN) is proposed to predict multiple pedestrian attributes simultaneously. The pedestrian attribute classification problem is firstly transformed into a multi-label problem including multiple binary attributes needed to be classified. Then,the multi-label problem is solved by fully connecting all binary attributes to multi-scale features with logistic regression functions. Moreover,the multi-scale features are obtained by concatenating those featured maps produced from multiple pooling layers of the MSMLCNN at different scales. Extensive experiment results show that the proposed MSMLCNN outperforms state-of-the-art pedestrian attribute classification methods with a large margin. 展开更多
关键词 PEDESTRIAN ATTRIBUTE CLASSIFICATION multi-SCALE features multi-label CLASSIFICATION convolutional NEURAL network (CNN)
在线阅读 下载PDF
Genetic algorithm for multi-protocol label switching
13
作者 孟德宇 梁栋 凌永发 《Journal of Pharmaceutical Analysis》 SCIE CAS 2007年第2期121-123,共3页
A new method for multi-protocol label switching is presented in this study, whose core idea is to construct model for simulating process of accommodating network online loads and then adopt genetic algorithm to optimi... A new method for multi-protocol label switching is presented in this study, whose core idea is to construct model for simulating process of accommodating network online loads and then adopt genetic algorithm to optimize the model. Due to the heuristic property of evolutional method, the new method is efficient and effective, which is verified by the experiments. 展开更多
关键词 multi-protocol label switching network load genetic algorithm
在线阅读 下载PDF
Optimization Model and Algorithm for Multi-Label Learning
14
作者 Zhengyang Li 《Journal of Applied Mathematics and Physics》 2021年第5期969-975,共7页
<div style="text-align:justify;"> This paper studies a kind of urban security risk assessment model based on multi-label learning, which is transformed into the solution of linear equations through a s... <div style="text-align:justify;"> This paper studies a kind of urban security risk assessment model based on multi-label learning, which is transformed into the solution of linear equations through a series of transformations, and then the solution of linear equations is transformed into an optimization problem. Finally, this paper uses some classical optimization algorithms to solve these optimization problems, the convergence of the algorithm is proved, and the advantages and disadvantages of several optimization methods are compared. </div> 展开更多
关键词 Operations Research multi-label Learning Linear Equations Solving Optimization Algorithm
在线阅读 下载PDF
Variational Bayesian labeled multi-Bernoulli filter with unknown sensor noise statistics 被引量:5
15
作者 Qiu Hao Huang Gaoming Gao Jun 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2016年第5期1378-1384,共7页
It is difficult to build accurate model for measurement noise covariance in complex backgrounds. For the scenarios of unknown sensor noise variances, an adaptive multi-target tracking algorithm based on labeled random... It is difficult to build accurate model for measurement noise covariance in complex backgrounds. For the scenarios of unknown sensor noise variances, an adaptive multi-target tracking algorithm based on labeled random finite set and variational Bayesian (VB) approximation is proposed. The variational approximation technique is introduced to the labeled multi-Bernoulli (LMB) filter to jointly estimate the states of targets and sensor noise variances. Simulation results show that the proposed method can give unbiased estimation of cardinality and has better performance than the VB probability hypothesis density (VB-PHD) filter and the VB cardinality balanced multi-target multi-Bernoulli (VB-CBMeMBer) filter in harsh situations. The simulations also confirm the robustness of the proposed method against the time-varying noise variances. The computational complexity of proposed method is higher than the VB-PHD and VB-CBMeMBer in extreme cases, while the mean execution times of the three methods are close when targets are well separated. 展开更多
关键词 labeled random finite set multi-Bernoulli filter multi-target tracking Parameter estimation Variational Bayesian approximation
原文传递
Semantic Similarity over Gene Ontology for Multi-Label Protein Subcellular Localization
16
作者 Shibiao Wan Man-Wai Mak Sun-Yuan Kung 《Engineering(科研)》 2013年第10期68-72,共5页
As one of the essential topics in proteomics and molecular biology, protein subcellular localization has been extensively studied in previous decades. However, most of the methods are limited to the prediction of sing... As one of the essential topics in proteomics and molecular biology, protein subcellular localization has been extensively studied in previous decades. However, most of the methods are limited to the prediction of single-location proteins. In many studies, multi-location proteins are either not considered or assumed not existing. This paper proposes a novel multi-label subcellular-localization predictor based on the semantic similarity between Gene Ontology (GO) terms. Given a protein, the accession numbers of its homologs are obtained via BLAST search. Then, the homologous accession numbers of the protein are used as keys to search against the gene ontology annotation database to obtain a set of GO terms. The semantic similarity between GO terms is used to formulate semantic similarity vectors for classification. A support vector machine (SVM) classifier with a new decision scheme is proposed to classify the multi-label GO semantic similarity vectors. Experimental results show that the proposed multi-label predictor significantly outperforms the state-of-the-art predictors such as iLoc-Plant and Plant-mPLoc. 展开更多
关键词 Protein SUBCELLULAR Localization SEMANTIC SIMILARITY GO TERMS multi-label Classification
暂未订购
基于共享背景主题的Labeled LDA模型 被引量:17
17
作者 江雨燕 李平 王清 《电子学报》 EI CAS CSCD 北大核心 2013年第9期1794-1799,共6页
隐藏狄利克雷分配(Latent Dirichlet Allocation,LDA)模型被广泛应用于文本分析、图像识别等领域.但由于LDA及其扩展模型多为无监督学习模型,无法将其应用于分类任务中.本文通过研究文档标记与LDA模型中主题的映射关系,提出一种新的Labe... 隐藏狄利克雷分配(Latent Dirichlet Allocation,LDA)模型被广泛应用于文本分析、图像识别等领域.但由于LDA及其扩展模型多为无监督学习模型,无法将其应用于分类任务中.本文通过研究文档标记与LDA模型中主题的映射关系,提出一种新的Labeled LDA模型(Shared Background Topics Labeled LDA,SBTL-LDA).在SBTL-LDA模型中每个标记除了存在若干个独享的局部主题外,还存在若干个共享的背景(Background)主题,这样可以有效分析不同标记所含主题之间的依赖关系,而文档标记被映射为局部主题和共享主题的组合,因此SBTL-LDA模型可以有效提升文档标记判别的准确性.同时SBTL-LDA模型还可以看成是一种半监督聚类模型,在对文档进行聚类分析的过程中模型可以有效的利用文档的标记信息提升文档聚类效果.实验证明SBTL-LDA模型能够有效解决PLDA模型中主题之间的相似性和依赖关系,具有良好的多标记判别能力,并且具有优于LDA、PLDA模型的文档聚类效果. 展开更多
关键词 隐藏狄利克雷分配 文本分析 多标记学习 半监督聚类
在线阅读 下载PDF
用于多标签分类的改进Labeled LDA模型 被引量:12
18
作者 江雨燕 李平 王清 《南京大学学报(自然科学版)》 CAS CSCD 北大核心 2013年第4期425-432,共8页
概率主题模型由于其优良的文档分析能力,被广泛应用于各种文本分析任务中.然而,网络中的文档数据除了含有基本的内容信息外,同时还可能存在文档类别、作者等信息.如何通过主题模型对这些信息进行有效的分析,已经成为机器学习、自然语言... 概率主题模型由于其优良的文档分析能力,被广泛应用于各种文本分析任务中.然而,网络中的文档数据除了含有基本的内容信息外,同时还可能存在文档类别、作者等信息.如何通过主题模型对这些信息进行有效的分析,已经成为机器学习、自然语言处理等领域的重点研究方向.本文通过对隐含狄利克雷分配(Latent Dirichlet Allocation,LDA)及其扩展模型的研究,提出一种适用于文档多标签判定的改进Labeled LDA模型.模型中的标记被映射为多个主题的组合,其中包含若干个独享的主题和共享主题.在文档类别判定过程中通过联合独享主题和共享主题来对类别进行预测.为了验证算法的有效性本文将提出的模型分别与PLDA模型及其他非主题模型进行了对比.实验结果表明,改进LabeledLDA模型能够有效解决PLDA模型无法有效分析类别标记之间共享主题的问题,具有明显优于PLDA和其他非主题模型的多标签判定能力. 展开更多
关键词 主题模型 隐含狄利克雷分配 多标签分类 共享主题
在线阅读 下载PDF
多示例多标记学习综述
19
作者 王梅霞 王慧颖 +1 位作者 黄苗苗 赵宇海 《计算机研究与发展》 北大核心 2026年第1期66-89,共24页
多示例多标记学习(multi-instance multi-label learning,MIML)是一种重要的机器学习框架,用于处理每个对象被表示为多个示例构成的多示例包,并且是一种能够同时与多个标记相关联的复杂学习任务,被广泛应用于药物活性预测、医学图像分... 多示例多标记学习(multi-instance multi-label learning,MIML)是一种重要的机器学习框架,用于处理每个对象被表示为多个示例构成的多示例包,并且是一种能够同时与多个标记相关联的复杂学习任务,被广泛应用于药物活性预测、医学图像分析等领域。与传统的多示例学习和多标记学习不同,MIML同时考虑了输入空间(多示例)和输出空间(多标记)的多样性,使得模型能更全面地描述和理解现实世界的复杂对象,但同样也面临着计算复杂性高、模型难以优化和泛化能力受限等挑战。因此,MIML问题受到了研究者们的广泛关注。然而,目前尚缺乏对MIML研究的完整综述。首先给出与MIML相关的问题和符号定义;其次从数据复杂性的角度将MIML划分为标准MIML、多元MIML和非精确标记的MIML三类,并且分别从问题求解策略、示例来源和标记完整性的角度,将3类MIML细化为9个小类进行介绍和分析;然后给出了MIML方法的常用数据集和实验对比结果;最后介绍了5种常见的MIML实际应用场景,展望了MIML领域的4个未来研究方向并对全文进行总结。 展开更多
关键词 多示例多标记学习 多视图 非精确标记 退化方法 直接求解方法 主动学习
在线阅读 下载PDF
一种改进的实体关系抽取算法——OptMultiR 被引量:2
20
作者 延浩然 靳小龙 +1 位作者 贾岩涛 程学旗 《中文信息学报》 CSCD 北大核心 2018年第9期66-74,共9页
作为知识图谱构建过程中的关键步骤,关系抽取这一从海量自然语言文本中抽取实体间关系的任务近年来得到了越来越广泛的关注。如今,远程监督(distant supervision)方法通过与已有知识库(knowledge base)中的实体和关系进行对齐,可以直接... 作为知识图谱构建过程中的关键步骤,关系抽取这一从海量自然语言文本中抽取实体间关系的任务近年来得到了越来越广泛的关注。如今,远程监督(distant supervision)方法通过与已有知识库(knowledge base)中的实体和关系进行对齐,可以直接使用源文本进行训练从而省去了人工标记数据的过程。其中,使用了多实例多标签(multi-instance multi-label)模型的MultiR算法取得了很好的抽取效果。但该算法存在两个问题:抽取过程中未考虑实体对之间可能已存在的关系,以及概率图匹配计算中使用的贪心算法无法获得最优解。该文针对上述问题进行了改进。首先,在关系抽取的打分过程中,考虑到同一实体对可能存在的多个关系之间具有一定关联性,该文引入了关系权重矩阵,使其在抽取过程中将实体对已知的关系转换为权重向量对打分进行干预,以此减少个别文本特征的干扰,提高抽取准确率。其次,在概率图匹配过程中,为了将图的匹配从局部最优值提升为全局最优值,该文将原有的贪心算法替换为基于状态压缩的动态规划算法。实验结果证明,优化后的MultiR模型,称之为OptMultiR,其关系抽取性能得到了显著提升。 展开更多
关键词 关系抽取 远程监督 多实例多标签学习
在线阅读 下载PDF
上一页 1 2 89 下一页 到第
使用帮助 返回顶部