期刊文献+

基于大语言模型的审计领域命名实体识别算法研究 被引量:2

Study on Named Entity Recognition Algorithms in Audit Domain Based on Large Language Models
在线阅读 下载PDF
导出
摘要 随着ChatGPT的出现,从通用领域到专业领域,大语言模型开始在各行各业发挥着重要作用。审计领域与人工智能结合的方法不断涌现,但是传统人工智能方法的准确率远低于现有大语言模型,因此大语言模型在审计领域中的应用仍需进一步研究。在审计领域中,通过人工智能方法智能识别出文本中的有用实体可以极大提升工作效率,减少错误情况。传统的审计文本实体识别算法主要是基于机器学习结合特征工程,这种方法准确率普遍较低。鉴于此,研究几种常见的开源模型(如Llama等)和闭源模型(如ChatGPT等)在审计文本实体识别中的应用,同时结合上下文学习技术提升模型识别效果,有效提升了识别准确率。其中,上下文学习技术结合了基于相似度选择的样例组织方式,实体识别准确率最高提升至98.3%,取得了较好的效果。 With the emergence of ChatGPT,large language models have begun to play a significant role across various industries,from general fields to specialized domains.Although there have been methods combining artificial intelligence with auditing,the application of large language models in auditing still needs further research due to the fact that the accuracy of traditional artificial intelligence methods is much lower than that of existing large language models.The use of AI methods to intelligently identify useful entities within text in auditing can greatly enhance work efficiency and reduce errors.Conventional auditing text entity recognition algorithms primarily rely on machine learning combined with feature engineering,which generally results in lower accuracy.In light of this,this study investigates the applications of several common open-source models(such as Llama)and closed-source models(such as ChatGPT)in auditing text entity recognition,while integrating contextual learning techniques to improve model recognition performance.The results demonstrate that by employing a sample organization method based on similarity selection,the accuracy of entity recognition can be improved to 98.3%,achieving notable improvements.
作者 户才顺 HU Caishun(Naval University of Engineering,Wuhan 430000,China)
机构地区 海军工程大学
出处 《计算机科学》 北大核心 2025年第S1期60-63,共4页 Computer Science
关键词 审计 大语言模型 ChatGPT 命名实体识别 上下文学习 Audit Large language models ChatGPT Named entity recognition In-context learning
  • 相关文献

参考文献1

二级参考文献12

  • 1Volk Martin, Clematide Simon. Learn-filter-apply-forget mixed approaches to named entity recognition [C]. In: Proc of the 6th Int'l Workshop on Applications of Natural Language for Information Systems. Berlin: Springer, 2001. 153-163.
  • 2Y Z Wu, J Zhao, B Xu. Chinese named entity based on multiple features [C]. Human Language Technology Conference and Conf on Empirical Methods in Natural Language Processing (EMNLP-2005), Vancouver, Canada, 2005.
  • 3H P Zhang, Q Liu, H Zhang, et al. Automatic recognition of Chinese unknown words based on roles tagging [C]. SigHan2002 Workshop Attached with the 19th Int'l Conf on Computational Linguistics, Taipei, 2002.
  • 4O Bender, F J Och, H Ney. Maximum entropy models for named entity recognition [C]. The 7th Conf on Computational Natural Language Learning (CoNLL 2003), Edmonton, Canada, 2003.
  • 5H L Chieu, H T Ng. Named entity recognition with a maximum entropy approach [C]. The 7th Conf on Computational Natural Language Learning (CoNLL 2003), Edmonton, Canada, 2003.
  • 6A Berger, V J Della Pietra, S A Della Pietra. A maximum entropy approach to natural language processing [J]. Computational Linguistics, 1996, 22(1): 39-71.
  • 7Ramaparkhi Adwait. A simple introduction to maximum entropy models for natural language processing [R]. Institute for Research in Cognitive Science Report,.
  • 8J N Darroch, D Ratcliff. Generalized iterative scaling for loglinear models [J]. The Annals of Mathematical Statistics, 1972, 43(5): 1470-1480.
  • 9Y Z Wu, J Zhao, B Xu. Chinese named entity recognition combining a statistical model with human knowledge [C]. The 41st Annual Meeting of the Association for Computational Linguistics (ACL-2003), Sapporo, 2003.
  • 10T H Tsai, S H Wu, C W Lee, etal. Mencius: a Chinese named entity recognizer using maximum entropy-based hybrid model [J]. Computational Linguistics & Chinese Language Processing, 2004, 9(1): 65-82.

共引文献36

同被引文献10

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部