Bridging the Domain Gap in Grounded Situation Recognition via Unifying Event Extraction across Modalities (Cited by: 1)
Authors: Qingwen Liu, Zejun Li, Zhihao Fan, Cunxiang Yin, Yancheng He, Jing Cai, Jinhua Fu, Zhongyu Wei. Data Intelligence, 2025, Issue 1, pp. 143-162 (20 pages)
Event extraction extracts event frames from text, while grounded situation recognition detects events in images. Because real-world applications frequently encounter unforeseen events, some researchers have studied event extraction in both cross-domain and in-domain settings, whereas grounded situation recognition has primarily been explored in in-domain scenarios. Therefore, in this paper, we propose cross-domain grounded situation recognition and establish a new benchmark, SWiG-XD. In this more challenging setting, we deepen the connection between the two tasks based on their underlying unity across two modalities and explore how to transfer generalization ability from text to images. First, we use ChatGPT to automatically generate textual data, which falls into two categories. One category is matched directly with images, establishing a direct connection to them. The other covers all event types and generalizes more broadly. We then employ a unified model framework to associate textual concepts with local image features, and we achieve cross-domain generalization transfer across modalities through modality-shared prompts and a self-attention mechanism. Furthermore, we incorporate the more general textual data to further improve generalization on images. Experimental results on the newly constructed benchmark demonstrate the effectiveness of our method.
Keywords: Event argument extraction; Cross-domain generalization; Unified cross-modal framework; Modality-shared prompt; Grounded situation recognition