Super-resolution(SR)reconstruction addresses the challenge of enhancing image resolution,which is critical in domains such as medical imaging,remote sensing,and computational photography.High-quality image reconstruct...Super-resolution(SR)reconstruction addresses the challenge of enhancing image resolution,which is critical in domains such as medical imaging,remote sensing,and computational photography.High-quality image reconstruction is essential for enhancing visual details and improving the accuracy of subsequent tasks.Traditional methods,including interpolation techniques and basic CNNs,often fail to recover fine textures and detailed structures,particularly in complex or high-frequency regions.In this paper,we present Deep Supervised Swin Transformer U-Net(DSSTU-Net),a novel architecture designed to improve image SR by integrating Residual Swin Transformer Blocks(RSTB)and Deep Supervision(DS)mechanisms into the U-Net framework.DSSTU-Net leverages the Swin Transformer’s multi-scale attention capabilities for robust feature extraction,while DS at various stages of the network ensures better gradient propagation and refined feature learning.The ST block introduces a hierarchical self-attention mechanism,allowing the model to capture both local and global context,which is crucial for high-quality SR tasks.Moreover,DS applied at multiple layers in the decoder enables direct supervision on intermediate feature maps,accelerating convergence and improving performance.The DSSTU-Net architecture was rigorously evaluated on the DIV2K,LSDIR,SET5,and SET14 datasets,demonstrating its superior ability to generate high-quality images.Furthermore,the potential applications of this model extend beyond image enhancement,with promising use cases in medical imaging,satellite imagery,and industrial inspection,where high-quality image reconstruction plays a crucial role in accurate diagnostics and operational efficiency.This work provides a reference method for future research on advanced image restoration techniques.展开更多
基金supported in part by the National Natural Science Foundation of China(62263006)2021 Director’s Fund of the Guangxi Key Laboratory for Automatic Detection Technology and Instruments(YQ21107)+2 种基金Guilin University of Electronic Technology Scientific Research Fund Project(UF24014Y)Innovation Project of Guangxi Graduate Education(YCSW2024336)Middle-aged and Young Teachers’Basic Ability Promotion Project of Guangxi(2021KY0802).
文摘Super-resolution(SR)reconstruction addresses the challenge of enhancing image resolution,which is critical in domains such as medical imaging,remote sensing,and computational photography.High-quality image reconstruction is essential for enhancing visual details and improving the accuracy of subsequent tasks.Traditional methods,including interpolation techniques and basic CNNs,often fail to recover fine textures and detailed structures,particularly in complex or high-frequency regions.In this paper,we present Deep Supervised Swin Transformer U-Net(DSSTU-Net),a novel architecture designed to improve image SR by integrating Residual Swin Transformer Blocks(RSTB)and Deep Supervision(DS)mechanisms into the U-Net framework.DSSTU-Net leverages the Swin Transformer’s multi-scale attention capabilities for robust feature extraction,while DS at various stages of the network ensures better gradient propagation and refined feature learning.The ST block introduces a hierarchical self-attention mechanism,allowing the model to capture both local and global context,which is crucial for high-quality SR tasks.Moreover,DS applied at multiple layers in the decoder enables direct supervision on intermediate feature maps,accelerating convergence and improving performance.The DSSTU-Net architecture was rigorously evaluated on the DIV2K,LSDIR,SET5,and SET14 datasets,demonstrating its superior ability to generate high-quality images.Furthermore,the potential applications of this model extend beyond image enhancement,with promising use cases in medical imaging,satellite imagery,and industrial inspection,where high-quality image reconstruction plays a crucial role in accurate diagnostics and operational efficiency.This work provides a reference method for future research on advanced image restoration techniques.