期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
TV-SAM:Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation
1
作者 Zekun Jiang Dongjie Cheng +10 位作者 Ziyuan Qin Jun Gao Qicheng Lao Abdullaev Bakhrom Ismoilovich Urazboev Gayrat Yuldashov Elyorbek Bekchanov Habibullo defu tang Linjing Wei Kang Li Le Zhang 《Big Data Mining and Analytics》 CSCD 2024年第4期1199-1211,共13页
This study presents a novel multimodal medical image zero-shot segmentation algorithm named the text-visual-prompt segment anything model(TV-SAM)without any manual annotations.The TV-SAM incorporates and integrates th... This study presents a novel multimodal medical image zero-shot segmentation algorithm named the text-visual-prompt segment anything model(TV-SAM)without any manual annotations.The TV-SAM incorporates and integrates the large language model GPT-4,the vision language model GLIP,and the SAM to autonomously generate descriptive text prompts and visual bounding box prompts from medical images,thereby enhancing the SAM’s capability for zero-shot segmentation.Comprehensive evaluations are implemented on seven public datasets encompassing eight imaging modalities to demonstrate that TV-SAM can effectively segment unseen targets across various modalities without additional training.TV-SAM significantly outperforms SAM AUTO(p<0.01)and GSAM(p<0.05),closely matching the performance of SAM BBOX with gold standard bounding box prompts(p=0.07),and surpasses the state-of-the-art methods on specific datasets such as ISIC(0.853 versus 0.802)and WBC(0.968 versus 0.883).The study indicates that TV-SAM serves as an effective multimodal medical image zero-shot segmentation algorithm,highlighting the significant contribution of GPT-4 to zero-shot segmentation.By integrating foundational models such as GPT-4,GLIP,and SAM,the ability to address complex problems in specialized domains can be enhanced. 展开更多
关键词 large language model vision language model segment anything model medical image segmentation zero-shot segmentation GPT-4
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部