期刊文献+

正则表达式在汉语语料检索中的应用 被引量:10

Using Regular Expressions in Chinese Corpus Search
在线阅读 下载PDF
导出
摘要 正则表达式能充分描述检索的条件,实现语言数据的精确提取。本文探讨正则表达式用于汉语语料检索的问题,分析了汉语元字符使用的特殊性,提出了反映汉语词汇和语法特点的表达式编写方法,并以实例展示了正则表达式在汉语语料检索时的使用效果。 Regular expression (regex for short) can specify retrieval conditions with precision for concise matching of strings in a given corpus. Much research has contributed to the use of regex to search for English corpus, however, little is done to use regex in Chinese data retrieval. This paper focuses on the use of metacharacters particular to Mandarin Chinese, and attempts to work out query patterns operable for the matching of strings in Chinese texts.
出处 《外国语文》 北大核心 2013年第6期74-79,共6页 Foreign Languages and Literature
基金 国家社科基金重大项目"大规模英汉平行语料库的建立与加工"(10&ZD127) 国家社科基金课题"历时语料类比中的翻译与现代汉语互动研究"(10BYY008)的阶段性基础研究成果
关键词 语料库 正则表达式 元字符 数据检索 corpus regular expression naetacharacter data retrieval
  • 相关文献

参考文献4

  • 1Dipper,Stefanie.Theory-driven and corpus-driven computational linguistics,and the use of corpora[C] // Anke Lüdeling & Merja Kytt(o).Corpus Linguistic:An International Handbook.Berlin:Walter de Gruyter GmbH & Co.,2007:68-96.
  • 2Eckle-Kohler,Judith.Linguistic Knowledge for Automatic Lexicon Acquisition from German Text Corpora[M].Berlin:Logos Verlag,2007.
  • 3Ushioda,A.,D.A.Evans,T.Gibson & A.Waibel.The Automatic Acquisition of Frequencies of Verb Subcategorization Frames from Tagged Corpora[C] //Boguraev,B.& J.Pustejovsky.Proceedings of the Workshop on the Acquisition of Lexical Knowledge from Text.Columbus,OH,1993:95-106.
  • 4秦洪武.英译汉翻译语言的结构容量:基于多译本语料库的研究[J].外国语,2010,33(4):73-80. 被引量:52

二级参考文献13

  • 1Laviosa ,S.Corpus-based Translation Studies:Theory,Findings,Applications[M].Amsterdam:Rodopi,2002.
  • 2Olohan, M. Introducing Corpora in Translation Studies[ M ]. London/New York: Routledge, 2004.
  • 3Goldberg, A. E. Constructions: A Construction Grammar Approach to Argument Structure [ M ].Chicago: The University of Chicago Press, 1994.
  • 4Baker, M. In Other Words: A Coursebook on Translation[ M ]. London: Routledge, 1992.
  • 5Chen, H.-H. The Contextual Analysis of Chinese Sentences with Punctuation Marks[ J]. Literary and Linguistic Computing, 1994, (4):281 -289.
  • 6苏福忠.前言[A].人民文学出版社.名著名译英汉对照读本丛书[C].北京:人民文学出版社,2002.
  • 7Sinclair, J. Corpus Creation[ A]. G. Sampson & D. McCarthy. Corpus Linguistics: Readings in a Widening Discipline [ C ]. London: Continuum, 2004.
  • 8Baker, M. Towards a methodology for investigating the style of a literary translator [ J ]. Target, 2000 (12) :241 - 266.
  • 9Santos, D. On grammatical translationese [ A ]. Kimmo Koskenniemi. Short Papers Presented at the Tenth Scandinavian Conference on Computational Linguistics [ C ]. Helsinki, May 29 - 30, 1995.59 - 66.
  • 10高植.翻译在语文方面的任务[A].罗新璋.翻译论集[C].北京:商务印书馆,1984.532-535.

共引文献51

同被引文献60

引证文献10

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部