Agent-based multimodal information extraction for nanomaterials

导出

摘要 Automating structured data extraction from scientific literature is a critical challenge with broad implications across domains.We introduce nanoMINER,a multi-agent system combining large language models and multimodal analysis to extract essential information from scientific research articles on nanomaterials.This system processes documents end-to-end,utilizing tools such as YOLO for visual data extraction and GPT-4o for linking textual and visual information.At its core,the ReAct agent orchestrates specialized agents to ensure comprehensive data extraction.We demonstrate the efficacy of the system by automating the assembly of nanomaterial and nanozyme datasets previously manually curated by domain experts.NanoMINER achieves high precision in extracting nanomaterial properties like chemical formulas,crystal systems,and surface characteristics.For nanozymes,we obtain near-perfect precision(0.98)for kinetic parameters and essential features such as Cmin and Cmax.To benchmark the systemperformance,we also compare nanoMINER to several baseline LLMs,including the most recent multimodal GPT-4.1,and show consistently higher extraction precision and recall.Our approach is extensible to other domains of materials science and fields like biomedicine,advancing data-driven research methodologies and automated knowledge extraction.

作者 R.Odobesku K.Romanova S.Mirzaeva O.Zagorulko R.Sim R.Khakimullin J.Razlivina A.Dmitrenko V.Vinogradov

机构地区 AI Talent Hub

出处《npj Computational Materials》 2025年第1期2067-2077,共11页 计算材料学(英文)

基金 supported by the Priority 2030 Federal Academic Leadership Program.Wewould like to thank S.Danilova,M.Reykina,and S.Shevtsova for their valuable contribution to the annotation of the dataset used in this study.

关键词 react agent orchestrates speciali scientific research multimodal information extraction NANOMATERIALS multimodal analysis large language models agent based scientific literature

分类号 TP391.1 [自动化与计算机技术—计算机应用技术] TP242 [自动化与计算机技术—检测技术与自动化装置] TB383 [一般工业技术—材料科学与工程]

引文网络
相关文献

1刘书凝,郭爱萍.A Corpus-based Study on Chinese and American Scholars' Use of Nominalizations in English Scientific Research Articles[J].海外英语,2018(12):210-211.
2《英汉科技语篇中语言评价系统对比研究(英文版)》[J].外国语,2022,45(4):88-88.
3Dennis Possart,Leonid Mill,Florian Vollnhals,Tor Hildebrand,Peter Suter,Mathis Hoffmann,Jonas Utz,Daniel Augsburger,Mareike Thies,Mingxuan Gu,Fabian Wagner,George Sarau,Silke Christiansen,Katharina Breininger.Addressing data scarcity in nanomaterial segmentation networks with differentiable rendering and generative modeling[J].npj Computational Materials,2025(1):2157-2167.
4Mohamed Mifdal,Marilyn Lewis.Revisiting the use of hedges and boosters in scientific research articles in Morocco:Caution that does not exclude conviction[J].Cultures of Science,2023(1):113-130.
5刘彩虹,郑维康.方面级情感分析视域下MOOC课程质量评估体系构建[J].沈阳大学学报(社会科学版),2026,28(1):58-69.
6Zhiyong Tang.CPL-enabled spatial displaying for immersive human-machine interaction[J].Science China Materials,2025,68(12):4586-4587.
7Tengfei Liu,Yanfeng Bai,Jianxia Chen,Jintao Zhai,Siqing Xiang,Xianwei Huang,Xiquan Fu.Image-free single-pixel semantic segmentation for complex scene based on multi-scale U-Net[J].Chinese Physics B,2026,35(1):440-447.
8Ying Ren,Ai‐Guo Wu,Yu Shi,Yi‐Fang Ping,Jing‐Hai Li,Xiu‐Wu Bian.Challenges in the Immune System:Mesoscale and Mesoregime Complexity[J].Cancer Innovation,2025,4(5):60-69.
9Cui Qijie,Huang Junjie,Liu Bo.Third-order consensus of discrete-time multi-agent systems[J].The Journal of China Universities of Posts and Telecommunications,2025,32(4):92-100.
10Zhuo Li,Weiran Wu,Yunlong Guo,Jian Sun,Qing-Long Han.Embodied Multi-Agent Systems:A Review[J].IEEE/CAA Journal of Automatica Sinica,2025,12(6):1095-1116. 被引量：1

npj Computational Materials

2025年第1期

浏览历史

内容加载中请稍等...

Agent-based multimodal information extraction for nanomaterials

相关作者

相关机构

相关主题

浏览历史