期刊文献+

Agent-based multimodal information extraction for nanomaterials

原文传递
导出
摘要 Automating structured data extraction from scientific literature is a critical challenge with broad implications across domains.We introduce nanoMINER,a multi-agent system combining large language models and multimodal analysis to extract essential information from scientific research articles on nanomaterials.This system processes documents end-to-end,utilizing tools such as YOLO for visual data extraction and GPT-4o for linking textual and visual information.At its core,the ReAct agent orchestrates specialized agents to ensure comprehensive data extraction.We demonstrate the efficacy of the system by automating the assembly of nanomaterial and nanozyme datasets previously manually curated by domain experts.NanoMINER achieves high precision in extracting nanomaterial properties like chemical formulas,crystal systems,and surface characteristics.For nanozymes,we obtain near-perfect precision(0.98)for kinetic parameters and essential features such as Cmin and Cmax.To benchmark the systemperformance,we also compare nanoMINER to several baseline LLMs,including the most recent multimodal GPT-4.1,and show consistently higher extraction precision and recall.Our approach is extensible to other domains of materials science and fields like biomedicine,advancing data-driven research methodologies and automated knowledge extraction.
机构地区 AI Talent Hub
出处 《npj Computational Materials》 2025年第1期2067-2077,共11页 计算材料学(英文)
基金 supported by the Priority 2030 Federal Academic Leadership Program.Wewould like to thank S.Danilova,M.Reykina,and S.Shevtsova for their valuable contribution to the annotation of the dataset used in this study.
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部