期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Theory of Mind Inspired Large Reasoning Language Model Improved Multi-agent Reinforcement Learning Algorithm for Robust and Adaptive Partner Modelling
1
作者 Xiyun Li Tielin Zhang +2 位作者 Chenghao Liu Shuang Xu Bo Xu 《Machine Intelligence Research》 2025年第6期1088-1101,共14页
The cooperative multi-agent reinforcement learning(MARL)field has experienced remarkable progress.However,these advanced methods still face substantial challenges in real-world applications.A significant direction for... The cooperative multi-agent reinforcement learning(MARL)field has experienced remarkable progress.However,these advanced methods still face substantial challenges in real-world applications.A significant direction for improving cooperative MARL techniques and addressing existing challenges is robust and adaptive partner modelling.Reasoning about the beliefs of partners,such as their intentions and behaviors,is crucial for partner modelling,which is known as the theory of mind(ToM)in cognitive science.In animals,biological ToM reasoning in the prefrontal cortex(PFC)plays an important role in complex environment survival before decision-making.However,the biological PFC is too complex to be directly incorporated into conventional artificial neural networks(ANNs)in either functional or structural manners.Large reasoning language models(LRMs)have recently demonstrated significant human-like reasoning abilities and impressive performance.Therefore,we propose an improved LRM framework to simulate the PFC for robust and adaptive partner modelling.Despite the excellent performance of LRMs in various fields,their ToM reasoning capabilities remain limited in complex MARL scenarios.Therefore,we further propose a ToM reasoner to enhance the ToM reasoning abilities of LRMs.Our framework exhibits robustness and adaptability across various LRM sizes,improving the ToM reasoning ability of agents and facilitating more effective partner modelling,thereby achieving higher performance scores in cooperative benchmarks. 展开更多
关键词 Theory of mind multi-agent reinforcement learning partner modelling large reasoning language model biological decision-making model
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部