摘要
提出一种基于多重启发式规则的英文特征值提取算法。该算法从概率论和英语语义两个层面引入了词频启发式规则、集中度启发式规则、同义启发式规则、同源启发式规则等特征值提取手段,阐述各种启发规则下的数据处理方法以及特征值提取算法具体流程,并将所提出的基于多重启发式规则的英文特征值提取算法与基于词频的常规算法进行对比,取得了较为理想的特征值提取效果。
This paper proposes an English eigenvalue extraction algorithm based on multiple restart rules.The algorithm introduces eigenval-ue extraction methods such as word frequency heuristic rules,concentration heuristic rules,synonymous heuristic rules and ho-mologous heuristic rules from the two levels of probability theory and English semantics.The data processing methods under vari-ous heuristic rules and the specific flow of eigenvalue extraction algorithm are described in detail.The proposed English eigenval-ue extraction algorithm based on multi restart rules is compared with the conventional algorithm based on word frequency,and a more ideal eigenvalue extraction effect is achieved.
作者
郑海燕
ZHENG Hai-yan(Xianyang Vocational Technical College,Xianyang 712000 China)
出处
《自动化技术与应用》
2023年第11期95-97,共3页
Techniques of Automation and Applications
基金
2021年度陕西省教育科学“十四五”规划课题(SGH21Y0597)。
关键词
英文文本
特征值提取
概率启发
语义启发
English text
eigenvalue extraction
probabilistic heuristic
semantic inspiration