Technical architecture, application progress, and future challenges of nutrition foundation models


Abstract: Nutrition informatics is moving from traditional rule-based and conventional machine learning paradigms into a new stage centered on large language models (LLMs) and multimodal large language models (MLLMs). This paper systematically reviews research progress on nutrition foundation models from 2019 to 2025 and summarizes key architectural and training techniques, including vision-language alignment, domain knowledge injection, retrieval-augmented generation (RAG), and interpretable reasoning. On this basis, it surveys the current state of model applications in typical scenarios such as personalized dietary recommendation, nutritional status assessment, disease-related nutrition management, and automated dietary logging. It also summarizes the evolution of core datasets and evaluation benchmarks such as Nutrition5k and NutriBench. Finally, in response to challenges including model trustworthiness, data privacy, cross-cultural generalization, and clinical evidence support, it proposes that future research should deeply integrate clinical evidence, build high-quality multimodal data systems, and advance the deployment of human-AI collaborative precision nutrition services to increase clinical translational value.

Nutrition informatics has undergone a significant paradigm shift in recent years. Approaches historically grounded in rule-based decision support and classical task-specific machine learning pipelines are increasingly being superseded by an ecosystem centered on large language models (LLMs) and multimodal vision-language foundation models. This review synthesizes research published between 2019 and 2025, with the objectives of clarifying architectural patterns that enable nutrition-oriented perception and reasoning, summarizing advances and identifying gaps across major application scenarios, and outlining strategic directions for reliable translational research in clinical and public health practice. Based on a systematic analysis of 92 representative studies, we organize the current landscape into three interrelated research trajectories: (1) Vision and multimodal modeling for dietary perception, focusing on food recognition, ingredient parsing, portion estimation, and nutrient prediction from meal images and videos. Recent methodologies increasingly adopt Transformer-based encoders and explicit vision-language alignment, leveraging depth cues and scale calibration to improve robustness under complex real-world conditions. (2) LLM-based nutrition agents for interactive guidance, supporting dietary counseling, meal planning, and health coaching. To mitigate challenges such as hallucinations and numerical inconsistency, current research emphasizes domain adaptation, tool-augmented computation, and retrieval-augmented generation (RAG) to ground model responses in reliable nutrition databases and clinical guidelines. (3) Personalization-oriented hybrid systems, which combine foundation models with structured components, such as knowledge graphs and causal inference frameworks, while integrating individual-level multi-omics signals, biomarkers, and lifestyle data. These systems aim to generate and optimize meal plans under strict constraints of safety, clinical feasibility, and patient adherence. Across these trajectories, interpretability has transitioned from an optional feature to a core system requirement, driven by the needs of clinical accountability and risk auditing. Concurrently, evaluation protocols are expanding from image-centric datasets (e.g., Nutrition5k) to comprehensive benchmarking suites designed for multimodal reasoning. Despite rapid progress, limitations persist regarding model factuality, privacy preservation, and external validity across diverse cuisines and socioeconomic settings. We advocate for evidence-grounded pipelines, standardized multimodal datasets with clinical endpoints, and unified evaluation frameworks spanning accuracy, safety, and bias. Human-in-the-loop deployment remains essential to quantify benefit-risk profiles and facilitate the regulatory adoption of AI-driven nutrition services.

Authors: ZHANG Cheng-Dong (张成东); KONG Hao-Nan (孔浩楠); YANG Yuan (杨元); YAN Yuan-Yuan (闫媛媛); TONG Tian-Lang (童天朗); WANG Hui (王慧) (School of Public Health, Shanghai Jiao Tong University, Shanghai 200025, China; Institute of Digital and Intelligent Medicine, Hainan International Medical Center, Shanghai Jiao Tong University School of Medicine, Qionghai 571400, China)

Affiliations: School of Public Health, Shanghai Jiao Tong University; Shanghai Jiao Tong University School of Medicine

Source: Chinese Bulletin of Life Sciences (《生命科学》), 2026, Issue 1, pp. 1-17 (17 pages)

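Funding: Key Program of the National Natural Science Foundation of China (82030099); National Key Research and Development Program of China (2022YFD2101500).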

Keywords: nutrition large language model; multimodal learning; large language model; personalized nutrition; retrieval-augmented generation; evaluation benchmark

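Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]; R151 [Medicine and Health: Nutrition and Food Hygiene]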
