期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Challenges and Optimization of Multimodal Large Language Models for Tree Falling Scenarios
1
作者 Lei Feng Yicheng Huang +5 位作者 Chunjie Sheng Yuxing Shi Jianhong Jin Yun Xu Yuzhou Du Sihao Miao 《国际计算机前沿大会会议论文集》 2025年第1期533-543,共11页
This study proposed an optimization method for multimodal large language models(MLLMs)reasoning based on structured chain of thought,aiming to enhance the visual decision-making capability in tree falling scenarios.Th... This study proposed an optimization method for multimodal large language models(MLLMs)reasoning based on structured chain of thought,aiming to enhance the visual decision-making capability in tree falling scenarios.The research first analyzed challenges faced by existing MLLMs when processing complicated visual scenes,including insufficient reasoning performance and low integration efficiency with other systems.To address these issues,an innovative structured chain of thought approach was introduced,which significantly improved the reasoning accuracy of the model in handling complex visual scenarios.To validate the proposed method,a specialized dataset focusing on tree falling scenarios in social governance was constructed,and a practical agent workflow was designed based on this dataset.Experimental results demonstrated that the proposed approach achieved better performance in real-world applications.The findings provide a reliable and efficient technical solution to visual decision-making in social governance. 展开更多
关键词 multimodal large language model MLLM social governance AI agent
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部