Primary light chain amyloidosis is a rare hematologic disease with multi-organ involvement.Nearly one-third of patients with amyloidosis experience five or more consultations before diagnosis,which may lead to a poor ...Primary light chain amyloidosis is a rare hematologic disease with multi-organ involvement.Nearly one-third of patients with amyloidosis experience five or more consultations before diagnosis,which may lead to a poor prognosis due to delayed diagnosis.Early risk prediction based on artificial intelligence is valuable for clinical diagnosis and treatment of amyloidosis.For this disease,we propose an Evolutionary Neural Architecture Searching(ENAS)based risk prediction model,which achieves high-precision early risk prediction using physical examination data as a reference factor.To further enhance the value of clinic application,we designed a natural language-based interpretable system around the NAS-assisted risk prediction model for amyloidosis,which utilizes a large language model and Retrieval-Augmented Generation(RAG)to achieve further interpretation of the predicted conclusions.We also propose a document-based global semantic slicing approach in RAG to achievemore accurate slicing and improve the professionalism of the generated interpretations.Tests and implementation show that the proposed risk prediction model can be effectively used for early screening of amyloidosis and that the interpretation method based on the large language model and RAG can effectively provide professional interpretation of predicted results,which provides an effective method and means for the clinical applications of AI.展开更多
电子病历数据经常存在缺失,严重影响分析结果.基于MIMIC数据库中的重症监护单元(intensive care unit,ICU)患者数据研究缺失值插补,数据集由23组临床常用生理变量以及不存在缺失的5260例样本构成.提出了一种基于深度嵌入聚类的K近邻插...电子病历数据经常存在缺失,严重影响分析结果.基于MIMIC数据库中的重症监护单元(intensive care unit,ICU)患者数据研究缺失值插补,数据集由23组临床常用生理变量以及不存在缺失的5260例样本构成.提出了一种基于深度嵌入聚类的K近邻插值方法.该方法以深度嵌入聚类为核心,通过多次聚类构造样本邻近度矩阵,再选择缺失样本的K个近邻样本,以这些近邻样本的平均值填补缺失.与均值插补、中值插补、后验分布估算插补和条件均值插补相比,该方法插补后的结果与原数据相似度更高,且更好地保留了样本间的差异性.展开更多
基金supported by Liaoning Province Key R&D Program Project(Grant Nos.2019JH2/10100027)in part by Grants from Shenyang Science and Technology Plan Project(Grant No.RC210469).
文摘Primary light chain amyloidosis is a rare hematologic disease with multi-organ involvement.Nearly one-third of patients with amyloidosis experience five or more consultations before diagnosis,which may lead to a poor prognosis due to delayed diagnosis.Early risk prediction based on artificial intelligence is valuable for clinical diagnosis and treatment of amyloidosis.For this disease,we propose an Evolutionary Neural Architecture Searching(ENAS)based risk prediction model,which achieves high-precision early risk prediction using physical examination data as a reference factor.To further enhance the value of clinic application,we designed a natural language-based interpretable system around the NAS-assisted risk prediction model for amyloidosis,which utilizes a large language model and Retrieval-Augmented Generation(RAG)to achieve further interpretation of the predicted conclusions.We also propose a document-based global semantic slicing approach in RAG to achievemore accurate slicing and improve the professionalism of the generated interpretations.Tests and implementation show that the proposed risk prediction model can be effectively used for early screening of amyloidosis and that the interpretation method based on the large language model and RAG can effectively provide professional interpretation of predicted results,which provides an effective method and means for the clinical applications of AI.
文摘电子病历数据经常存在缺失,严重影响分析结果.基于MIMIC数据库中的重症监护单元(intensive care unit,ICU)患者数据研究缺失值插补,数据集由23组临床常用生理变量以及不存在缺失的5260例样本构成.提出了一种基于深度嵌入聚类的K近邻插值方法.该方法以深度嵌入聚类为核心,通过多次聚类构造样本邻近度矩阵,再选择缺失样本的K个近邻样本,以这些近邻样本的平均值填补缺失.与均值插补、中值插补、后验分布估算插补和条件均值插补相比,该方法插补后的结果与原数据相似度更高,且更好地保留了样本间的差异性.