期刊文献+

话题句识别中候选话题句评估函数的优化 被引量:3

Optimization of Candidate Topic Clause Evaluation Function in Topic Clause Identification
在线阅读 下载PDF
导出
摘要 为了更好地解决汉语标点句句首话题缺失的问题,需要在话题句识别过程中优化用于评估候选话题句优劣的评估函数.为此,提出了话题句生成的上下文相似性特征、话题串与评述相邻接的局部相似性特征,并设计了相关的评估函数.实验结果表明:综合运用这2个评估函数,话题句识别的准确率提高了5.72个百分点. Topics were often omitted in the beginning of Chinese punctuation clause (abbreviated as PC). In order to better recover topics more accurately, an improved candidate topic clause (abbreviated as CTC) evaluation function was proposed in the topic clause (abbreviated as TC) identification task. Both the context similarity and the local similarity of CTC were taken into account in the evaluation function. Result shows that the performance of TC identification measured by accuracy is increased by 5.72 percent.
作者 蒋玉茹 宋柔
出处 《北京工业大学学报》 CAS CSCD 北大核心 2014年第1期43-48,共6页 Journal of Beijing University of Technology
基金 国家自然科学基金资助项目(61171129) 北京市属高等学校创新团队提升计划资助项目(IDHT20130519)
关键词 广义话题 话题句 相似度 上下文相似性 局部相似性 generalized topic topic clause similarity context similarity local similarity
  • 相关文献

参考文献1

二级参考文献2

共引文献13

同被引文献15

  • 1黄健传,宋柔.标点句标注研究[C]//第九届全国计算语言学学术会议论文集.北京:清华大学出版社,2007:350-355.
  • 2SONG R,JIANG Y,WANG J.On generalized-topic-based Chinese discourse structure[C]//S1GHAN 2010:Proceedings of CIPS-SIGHAN Joint Conference on Chinese Language Processing.Beijing:Tsinghua University Press,2010:23-33.
  • 3宋柔.汉语篇章广义话题结构研究[R].北京:北京语言大学,2012.
  • 4GILLELAND M.Levenshtein distance,in three flavors[EB/OL].[2013-02-04].http://people.cs.pitt.edu/-kirk/csl501/Pruhs/Spring2006/assiguments/editdistance/Levenshtein%20Distance.htm.
  • 5胡乔木.中国大百科全书:图文数据光盘[M/CD].北京:中国大百科全书出版社,1999.
  • 6KOHAVI R.A study of cross-validation and bootstrap for accuracy estimation and model selection[C]//IJCAI'95:Proceedings of the 14th International Joint Conference on Artificial Intelligence.San Francisco:Morgan Kaufmann,1995,2:1137-1143.
  • 7JIANG Y,SONG R.Topic structure identification of PClause sequence based on generalized topic theory[C]//Proceedings of the 2012 1st CCF Conference on Natural Language Processing and Chinese Computing.Berlin:Springer-Verlag,2012:85-96.
  • 8宋柔.现代汉语跨标点句句法关系的性质研究[J].世界汉语教学,2008,22(2):26-44. 被引量:27
  • 9蒋玉茹,宋柔.基于广义话题理论的话题句识别[J].中文信息学报,2012,26(5):114-119. 被引量:14
  • 10宋柔.汉语篇章广义话题结构的流水模型[J].中国语文,2013(6):483-494. 被引量:48

引证文献3

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部