摘要
设计了一种中韩科技信息综合平台中的翻译辅助系统.首先,依据关键词确定的组词特证获取候选术语,并使用互信息评估候选术语以实现术语自动提取.其次,将已有术语、抽取到的新术语、术语译文和历史翻译记录等信息存储到系统数据库中建立术语库.最后,设计翻译工作者的用户接口,使其通过该接口获取已有术语的译文信息、新术语的相似译文信息和译文记忆库为基础的历史翻译数据.测试结果表明,本文设计的术语自动抽取功能和辅助译文生成功能达到了预定的设计目标,术语自动抽取算法召回率达到61.8%,结合优化方法进行优化后达到66.9%;辅助译文生成平均延时为0.031s,MRR为0.951,测试结果满足用户需求.
This paper describes the design method of Chinese and Korean science and technology information aided translation system.Firstly,extracting candidate terms based on word formation characteristics of keywords,and using mutual information to evaluate candidate terms for automatic term extraction.Secondly,existing terminology,extraction of the new terminology,terminology translation and history of translation records and so on are stored in the system database and the established terminology database.Finally,design the user interface for translators,so the translators can obtain the translation of exiting terminology,the similar translation of additional terms,and the history translation data based on translation memory through this interface.The results of the system test show that automatic extraction of term and the auxiliary translation function reach the desired goals.The recall rate or term automatic extraction algorithmis61.8%,and after optimization the rate is improved by optimizationmethodtoreach66.9%.Thegeneration of auxiliary translation averagely delays0.031 seconds,andtheMRRis0.951,so the test results fulfil the users' needs.
出处
《延边大学学报(自然科学版)》
CAS
2017年第3期259-263,共5页
Journal of Yanbian University(Natural Science Edition)
基金
吉林省自然科学基金资助项目(20140101186JC)
延边大学-延边州科技信息服务中心合作项目(延大科合字[2016]1号)
关键词
术语自动识别
术语抽取
翻译辅助系统
automatic term recognition
term extraction
computer aided translation