期刊文献+

基于BLSTM的命名实体识别方法 被引量:53

Named Entity Recognition Method Based on BLSTM
在线阅读 下载PDF
导出
摘要 传统的命名实体识别方法直接依靠大量的人工特征和专门的领域知识,解决了监督学习语料不足的问题,但设计人工特征和获取领域知识的代价昂贵。针对该问题,提出一种基于BLSTM(Bidirectional Long Short-Term Memory)的神经网络结构的命名实体识别方法。该方法不再直接依赖于人工特征和领域知识,而是利用基于上下文的词向量和基于字的词向量,前者表达命名实体的上下文信息,后者表达构成命名实体的前缀、后缀和领域信息;同时,利用标注序列中标签之间的相关性对BLSTM的代价函数进行约束,并将领域知识嵌入模型的代价函数中,进一步增强模型的识别能力。实验表明,所提方法的识别效果优于传统方法。 Traditional named entity recognition methods directly rely on plenty of hand-crafted features and special domain knowledge,and have resolved the problem that there are few supervised learning corpora which are available.But the costs of developing hand-crafted features and obtaining domain knowledge are expensive.To solve this problem,a neural network model based on BLSTM(Bidirectional Long Short-Term Memory)was proposed.This method does not directly use hand-crafted features and domain knowledge any more,but utilizes the word embedding based on context and word embedding based on characters.The former expresses the information about context of named entities,and the latter expresses the information about prefix,postfix and domain knowledge which make up the named entities.Simultaneously,it constrains the cost function of BLSTM by using the dependency between the labels in tagged sequence,and integrates the domain knowledge into the cost function,furtherly improving the recognition ability of the model.The experiments show that the recognition effect of the method in this paper is superior to traditional methods.
出处 《计算机科学》 CSCD 北大核心 2018年第2期261-268,共8页 Computer Science
基金 大连市科技计划项目海洋渔业大数据管理与集成关键技术研究(2015A11GX022)资助
关键词 BLSTM 命名实体 词向量 代价函数 BLSTM Named entity Word embedding Cost function
  • 相关文献

参考文献5

二级参考文献59

  • 1张晓艳,王挺,陈火旺.命名实体识别研究[J].计算机科学,2005,32(4):44-48. 被引量:69
  • 2俞鸿魁,张华平,刘群,吕学强,施水才.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-94. 被引量:168
  • 3周俊生,戴新宇,尹存燕,陈家骏.基于层叠条件随机场模型的中文机构名自动识别[J].电子学报,2006,34(5):804-809. 被引量:115
  • 4李丽双,黄德根,陈春荣,杨元生.基于支持向量机的中文文本中地名识别[J].大连理工大学学报,2007,47(3):433-438. 被引量:16
  • 5ISOZAKI Hideki. Japanese named entity recognition based on a simple rule generator and deeision tree learning [C] // Proceedings of the 39th Annual Meeting Association for Computational Linguistics, San Francisco : Morgan Kaufmann, 2001 : 314-321.
  • 6ZHOU Guo-dong, SU Jian. Named entity recognition using an HMM-based Chunk Tagger [C] // Proceedings of the 40th Annual Meeting Association for Computational Linguistics. San Francisco : Morgan Kaufmann, 2002:473-480.
  • 7TAKEUCHI Koichi, COLLIER N. Use of support vector machines in extended named entity recognition [C] // Proceedings of the 6th Conference on Natural Language Learning. Morristown:Association for Computational Linguistics, 2002 : 167-170.
  • 8ZHANG Su-xiang, ZHANG Su-xian, Xiao-jie. Automatic recognition of WANG Chinese organization name based on conditional random fields [-C] // Natural Language Processing and Knowledge Engineering. Washington D C : IEEE Signal Processing Society, 2007:229-233.
  • 9YU Hong-kui, ZHANG Hua-ping, LIU Qun. Recognition of Chinese organization name based on role tagging [C] // 20th International Conferenee on Computer Processing of Oriental Languages. Beijing: Tsinghua University Press, 2003 : 79-87.
  • 10WU You-zheng, ZHAO Jun, XU Bo. Chinese named entity recognition combining statistical model with human knowledge [C] // Proceedings of the ACL Workshop on Multilingual and Mixed-language Named Entity Recognition. Morristown:Association for Computational Linguistics, 2003 : 65-72.

共引文献98

同被引文献379

引证文献53

二级引证文献626

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部