Abstract
With rapid urbanization, fires pose significant challenges to urban governance. Traditional fire detection methods often struggle to detect smoke in complex urban scenes due to environmental interference and variations in viewing angle. This study proposes a novel multimodal smoke detection method that fuses infrared and visible imagery using a transformer-based deep learning model. By capturing both thermal and visual cues, our approach significantly enhances the accuracy and robustness of smoke detection in business-park scenes. We first established a dual-view dataset comprising infrared and visible-light videos, implemented an innovative image feature fusion strategy, and designed a deep learning model based on the transformer architecture and attention mechanism for smoke classification. Experimental results demonstrate that our method outperforms existing methods: with multi-view input, it achieves an accuracy of 90.88%, a precision of 98.38%, a recall of 92.41%, and false positive and false negative rates both below 5%, underlining the effectiveness of the proposed multimodal and multi-view fusion approach. The attention mechanism plays a crucial role in improving detection performance, particularly in identifying subtle smoke features.
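The infrared–visible feature fusion described above could, for instance, be realized as scaled dot-product cross-attention between the two modalities, in which tokens from one view attend to the other before classification. The following is a minimal NumPy sketch of that idea; the function names, feature shapes, and the choice of visible features as queries are illustrative assumptions, not the paper's actual implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_modal_attention(vis_feat, ir_feat):
    """Fuse visible and infrared patch features with scaled dot-product
    attention: visible tokens act as queries over infrared tokens
    (an illustrative choice, not necessarily the paper's design)."""
    d_k = vis_feat.shape[-1]
    scores = vis_feat @ ir_feat.T / np.sqrt(d_k)   # (N_vis, N_ir) similarity
    weights = softmax(scores, axis=-1)             # rows sum to 1
    attended = weights @ ir_feat                   # IR context per visible token
    # concatenate the original visible features with the attended IR context
    return np.concatenate([vis_feat, attended], axis=-1)

# toy example: 4 visible tokens and 4 infrared tokens, 8-dim features each
rng = np.random.default_rng(0)
vis = rng.standard_normal((4, 8))
ir = rng.standard_normal((4, 8))
fused = cross_modal_attention(vis, ir)
print(fused.shape)  # (4, 16)
```

In a full model, the fused tokens would then pass through transformer encoder layers and a classification head that outputs a smoke / no-smoke decision.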
Funding
Supported by the National Natural Science Foundation of China (32171797) and the Chunhui Project Foundation of the Education Department of China (HZKY20220026).