Objective To develop QingNangTCM,a specialized large language model(LLM)tailored for expert-level traditional Chinese medicine(TCM)question-answering and clinical reasoning,addressing the scarcity of domain-specific c...Objective To develop QingNangTCM,a specialized large language model(LLM)tailored for expert-level traditional Chinese medicine(TCM)question-answering and clinical reasoning,addressing the scarcity of domain-specific corpora and specialized alignment.Methods We constructed QnTCM_Dataset,a corpus of 100000 entries,by integrating data from ShenNong_TCM_Dataset and SymMap v2.0,and synthesizing additional samples via retrieval-augmented generation(RAG)and persona-driven generation.The dataset comprehensively covers diagnostic inquiries,prescriptions,and herbal knowledge.Utilizing P-Tuning v2,we fine-tuned the GLM-4-9B-Chat backbone to develop QingNangTCM.A multidimensional evaluation framework,assessing accuracy,coverage,consistency,safety,professionalism,and fluency,was established using metrics such as bilingual evaluation understudy(BLEU),recall-oriented understudy for gisting evaluation(ROUGE),metric for evaluation of translation with explicit ordering(METEOR),and LLM-as-a-Judge with expert review.Qualitative analysis was conducted across four simulated clinical scenarios:symptom analysis,disease treatment,herb inquiry,and failure cases.Baseline models included GLM-4-9BChat,DeepSeek-V2,HuatuoGPT-II(7B),and GLM-4-9B-Chat(freeze-tuning).Results QingNangTCM achieved the highest scores in BLEU-1/2/3/4(0.425/0.298/0.137/0.064),ROUGE-1/2(0.368/0.157),and METEOR(0.218),demonstrating a balanced and superior normalized performance profile of 0.900 across the dimensions of accuracy,coverage,and consistency.Although its ROUGE-L score(0.299)was lower than that of HuatuoGPT-II(7B)(0.351),it significantly outperformed domain-specific models in expert-validated win rates for professionalism(86%)and safety(73%).Qualitative analysis confirmed that the model strictly adheres to the“symptom-syndrome-pathogenesis-treatment”reasoning chain,though occasional misclassifications and hallucinations persisted when dealing with rare medicinal materials and uncommon syndromes.展开更多
This study explores the potential of Artificial Intelligence(AI)in early screening and prognosis of Dry Eye Disease(DED),aiming to enhance the accuracy of therapeutic approaches for eye-care practitioners.Despite the ...This study explores the potential of Artificial Intelligence(AI)in early screening and prognosis of Dry Eye Disease(DED),aiming to enhance the accuracy of therapeutic approaches for eye-care practitioners.Despite the promising opportunities,challenges such as diverse diagnostic evidence,complex etiology,and interdisciplinary knowledge integration impede the interpretability,reliability,and applicability of AI-based DED detection methods.The research conducts a comprehensive review of datasets,diagnostic evidence,and standards,as well as advanced algorithms in AI-based DED detection over the past five years.The DED diagnostic methods are categorized into three groups based on their relationship with AI techniques:(1)those with ground truth and/or comparable standards,(2)potential AI-based methods with significant advantages,and(3)supplementary methods for AI-based DED detection.The study proposes suggested DED detection standards,the combination of multiple diagnostic evidence,and future research directions to guide further investigations.Ultimately,the research contributes to the advancement of ophthalmic disease detection by providing insights into knowledge foundations,advanced methods,challenges,and potential future perspectives,emphasizing the significant role of AI in both academic and practical aspects of ophthalmology.展开更多
基金Hebei Province Higher Education Scientific Research Project(QN2025367)Zhangjiakou City 2022 Municipal Science and Technology Plan Self-raised Fund Project(221105D)Hebei Province Education Science“14th Five-Year Plan”Project(2404224).
文摘Objective To develop QingNangTCM,a specialized large language model(LLM)tailored for expert-level traditional Chinese medicine(TCM)question-answering and clinical reasoning,addressing the scarcity of domain-specific corpora and specialized alignment.Methods We constructed QnTCM_Dataset,a corpus of 100000 entries,by integrating data from ShenNong_TCM_Dataset and SymMap v2.0,and synthesizing additional samples via retrieval-augmented generation(RAG)and persona-driven generation.The dataset comprehensively covers diagnostic inquiries,prescriptions,and herbal knowledge.Utilizing P-Tuning v2,we fine-tuned the GLM-4-9B-Chat backbone to develop QingNangTCM.A multidimensional evaluation framework,assessing accuracy,coverage,consistency,safety,professionalism,and fluency,was established using metrics such as bilingual evaluation understudy(BLEU),recall-oriented understudy for gisting evaluation(ROUGE),metric for evaluation of translation with explicit ordering(METEOR),and LLM-as-a-Judge with expert review.Qualitative analysis was conducted across four simulated clinical scenarios:symptom analysis,disease treatment,herb inquiry,and failure cases.Baseline models included GLM-4-9BChat,DeepSeek-V2,HuatuoGPT-II(7B),and GLM-4-9B-Chat(freeze-tuning).Results QingNangTCM achieved the highest scores in BLEU-1/2/3/4(0.425/0.298/0.137/0.064),ROUGE-1/2(0.368/0.157),and METEOR(0.218),demonstrating a balanced and superior normalized performance profile of 0.900 across the dimensions of accuracy,coverage,and consistency.Although its ROUGE-L score(0.299)was lower than that of HuatuoGPT-II(7B)(0.351),it significantly outperformed domain-specific models in expert-validated win rates for professionalism(86%)and safety(73%).Qualitative analysis confirmed that the model strictly adheres to the“symptom-syndrome-pathogenesis-treatment”reasoning chain,though occasional misclassifications and hallucinations persisted when dealing with rare medicinal materials and uncommon syndromes.
基金funded by the National Natural Science Foundation of China Natural(Nos.U22A2041,82071915,and 62372047)the Shenzhen Key Laboratory of Intelligent Bioinformatics(No.ZDSYS20220422103800001)+5 种基金the Shenzhen Science and Technology Program(No.KQTD20200820113106007)the Guangdong Basic and Applied Basic Research Foundation(No.2022A1515220015)the Zhuhai Technology and Research Foundation(Nos.ZH22036201210034PWC,2220004000131,and 2220004002412)the Project of Humanities and Social Science of MOE(Ministry of Education in China)(No.22YJCZH213)the Science and Technology Research Program of Chongqing Municipal Education Commission(Nos.KJZD-K202203601,KJQN0202203605,and KJQN202203607)the Natural Science Foundation of Chongqing China(No.cstc2021jcyj-msxmX1108).
文摘This study explores the potential of Artificial Intelligence(AI)in early screening and prognosis of Dry Eye Disease(DED),aiming to enhance the accuracy of therapeutic approaches for eye-care practitioners.Despite the promising opportunities,challenges such as diverse diagnostic evidence,complex etiology,and interdisciplinary knowledge integration impede the interpretability,reliability,and applicability of AI-based DED detection methods.The research conducts a comprehensive review of datasets,diagnostic evidence,and standards,as well as advanced algorithms in AI-based DED detection over the past five years.The DED diagnostic methods are categorized into three groups based on their relationship with AI techniques:(1)those with ground truth and/or comparable standards,(2)potential AI-based methods with significant advantages,and(3)supplementary methods for AI-based DED detection.The study proposes suggested DED detection standards,the combination of multiple diagnostic evidence,and future research directions to guide further investigations.Ultimately,the research contributes to the advancement of ophthalmic disease detection by providing insights into knowledge foundations,advanced methods,challenges,and potential future perspectives,emphasizing the significant role of AI in both academic and practical aspects of ophthalmology.