期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Evaluating Large Language Models’Potential in Field Epidemiology Investigation Based on Chinese Context—Zhejiang Province,China,2025
1
作者 Tao Zhang Qifeng Zhao +6 位作者 Yaxin Dai Mengna Wu Yujia Zhai Le Xu Xue Gu Junfen Lin Chen Wu 《China CDC weekly》 2025年第41期1296-1301,I0001,I0002,共8页
Introduction Large language models(LLMs)have demonstrated potential applications across diverse fields,yet their effectiveness in supporting field epidemiology investigations remains uncertain.Methods We assessed six ... Introduction Large language models(LLMs)have demonstrated potential applications across diverse fields,yet their effectiveness in supporting field epidemiology investigations remains uncertain.Methods We assessed six prominent LLMs(ChatGPT-o4-mini-high,ChatGPT-4o,DeepSeek-R1,DeepSeek-V3,Qwen3-235B-A22B,and Qwen2.5-max)using multiple-choice and case-based questions from the 2025 Zhejiang Field Epidemiology Training Program entrance examination.Model responses were evaluated against standard answers and benchmarked against performance scores from junior epidemiologists.Results For multiple-choice questions,only DeepSeek-V3(75%)exceeded the 75th percentile performance level of junior epidemiologists(67.5%).In case-based assessments,most LLMs achieved or surpassed the 75th percentile of junior epidemiologists,demonstrating particular strength in data analysis tasks.Conclusion Although LLMs demonstrate promise as supportive tools in field epidemiology investigations,they cannot yet replace human expertise.Significant challenges persist regarding the accuracy and timeliness of model outputs,alongside critical concerns about data security and privacy protection that must be addressed before widespread implementation. 展开更多
关键词 Zhejiang province standard answers large language models multiple choice questions large language models llms China field epidemiology evaluation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部