352,787 articles found
Robust key point descriptor for multi-spectral image matching (cited by: 3)
1
Authors: Yueming Qin, Zhiguo Cao, Wen Zhuo, Zhenghong Yu. Source: Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2014, No. 4, pp. 681-687 (7 pages)
Histogram of collinear gradient-enhanced coding (HCGEC), a robust key point descriptor for multi-spectral image matching, is proposed. The HCGEC mainly encodes rough structures within an image and suppresses detailed textural information, which is desirable in multi-spectral image matching. Experiments on two multi-spectral data sets demonstrate that the proposed descriptor can yield significantly better results than some state-of-the-art descriptors.
Keywords: collinear gradient-enhanced coding (CGEC); key point descriptor; multi-spectral image matching
Investigation of Image Fusion Between High-Resolution Image and Multi-spectral Image (cited by: 1)
2
Authors: LI Pingxiang, WANG Zhijun (LI Pingxiang, professor, State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 129 Luoyu Road, Wuhan 430079, China). Source: Geo-Spatial Information Science, 2003, No. 2, pp. 31-34 (4 pages)
On the basis of a thorough understanding of the physical characteristics of remote sensing images, this paper employs the theories of wavelet transform and signal sampling to develop a new image fusion algorithm. The algorithm has been successfully applied to the image fusion of SPOT PAN and TM of Guangdong province, China. The experimental results show that a perfect image fusion can be built up by using the image analytical solution and reconstruction in the image frequency domain based on the physical characteristics of the image formation. The method has demonstrated that the results of the image fusion do not change the spectral characteristics of the original image.
Keywords: image fusion; remote sensing; wavelet transform; signal sampling
Fusion of multi-spectral image and panchromatic image based on support vector regression
3
Authors: 胡根生, 梁栋. Source: Journal of Beijing Institute of Technology (EI, CAS), 2012, No. 2, pp. 269-277 (9 pages)
In our study, a support vector value contourlet transform is constructed by using a support vector regression model and directional filter banks. The transform is then used to decompose source images at multi-scale, multi-direction and multi-resolution. After that, the super-resolved multi-spectral image is reconstructed by utilizing the strong learning ability of support vector regression and the correlation between the multi-spectral image and the panchromatic image. Finally, the super-resolved multi-spectral image and the panchromatic image are fused based on regions at different levels. Our experiments show that the learning method based on support vector regression can improve the super-resolution of the multi-spectral image. The fused image preserves both the high spatial resolution and the spectral information of the multi-spectral image.
Keywords: image processing; image fusion; support vector regression; super-resolution
Multi-spectral image fusion method based on two channels non-separable wavelets (cited by: 9)
4
Authors: LIU Bin, PENG JiaXiong. Source: Science in China (Series F), 2008, No. 12, pp. 2022-2032 (11 pages)
A construction method for a two-channel non-separable wavelet filter bank whose dilation matrix is [1,1;1,-1], and its application in the fusion of multi-spectral images, are presented. Many 4×4 filter banks are designed. A multi-spectral image fusion algorithm based on this kind of wavelet is proposed. Using this filter bank, multi-resolution wavelet decomposition of the intensity of the multi-spectral image and of the panchromatic image is performed, and the two low-frequency components of the intensity and the panchromatic image are merged by using a tradeoff parameter. The experimental results show that this method is good at preserving spectral quality and high spatial resolution information. Its performance in preserving spectral quality and high spatial information is better than that of the fusion methods based on DWFT and IHS. When the parameter t is close to 1, the fused image can obtain rich spectral information from the original MS image. The amount of computation is reduced to only half of that of the fusion method based on the four-channel wavelet transform.
Keywords: image fusion; non-separable wavelets; multi-spectral image; panchromatic image
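Effect of the artificial intelligence-based Precise Image reconstruction algorithm on image quality and radiation dose in head CT
5
Authors: 廖甜, 刘晓静, 宁先英, 桂绅, 孔祥闯, 雷子乔, 余建明, 吴红英. Source: 《放射学实践》 (PKU Core), 2026, No. 1, pp. 66-71 (6 pages)
Objective: To evaluate the effect of the Precise Image artificial intelligence reconstruction algorithm on image quality and radiation dose in head CT. Methods: Eighty patients who underwent non-contrast head CT were retrospectively collected. In group A (40 patients), images were acquired at 120 kV and 150 mAs and reconstructed with both the Precise Image algorithm (sharp/standard/smooth/smoother) and the iDose level 4 algorithm. In group B (40 patients), images were acquired with a conventional axial scan protocol (120 kV, 250 mAs) and reconstructed with the iDose level 4 algorithm. Image quality and radiation dose of head CT were compared across doses and reconstruction methods. Results: Compared with group B, CTDIvol, DLP, and SSDE in group A decreased by about 55.02%, 42.68%, and 59.22%, respectively (P<0.05). In group A, as the reconstruction level increased (sharp, standard, smooth, smoother), the noise SD of the cerebellum, dorsal thalamus, and gray and white matter decreased, while the signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) increased and were both higher than with the iDose level 4 algorithm under the same scan conditions; the differences were statistically significant except for the sharp setting (P<0.05). The subjective scores for the standard and smooth settings in group A were 4.63±0.49 and 4.27±0.38, and both groups met diagnostic requirements; the subjective score in group B was 4.52±0.41. Conclusion: The Precise Image artificial intelligence reconstruction algorithm can greatly reduce the radiation dose of head CT while maintaining image quality.
Keywords: tomography, X-ray computed; artificial intelligence; Precise Image; image quality; radiation dose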
Acquiring multi-spectral images by digital still cameras based on XYZLMS interim connection space (cited by: 1)
6
Authors: 张显斗, 王强, 杨根福, 王萌萌. Source: Chinese Optics Letters (SCIE, EI, CAS, CSCD), 2014, No. 11, pp. 129-132 (4 pages)
A method based on the XYZLMS interim connection space is proposed to accurately acquire multi-spectral images with digital still cameras. The XYZLMS values are first predicted from RGB values by a polynomial model with local training samples, and the spectral reflectance is then constructed from the XYZLMS values by the pseudo-inverse method. An experiment on multi-spectral image acquisition is carried out with a commercial digital still camera. The results indicate that multi-spectral images can be accurately acquired except for very dark colors.
Keywords: RGB; Acquiring multi-spectral images by digital still cameras based on XYZLMS interim connection space; LUT
Precision organoid segmentation technique (POST): accurate organoid segmentation in challenging bright-field images (cited by: 1)
7
Authors: Xuan Du, Yuchen Li, Jiaping Song, Zilin Zhang, Jing Zhang, Yanhui Li, Zaozao Chen, Zhongze Gu. Source: Bio-Design and Manufacturing, 2026, No. 1, pp. 80-93, I0013-I0016 (18 pages)
Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment. Their applications span from high-throughput drug screening to the modeling of complex diseases, with some even achieving clinical translation. Changes in the overall size, shape, boundary, and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity. However, the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference, including overlapping organoids, bubbles, dust particles, and cell fragments. This paper introduces the precision organoid segmentation technique (POST), which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions. Unlike existing methods, POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging. Furthermore, it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments. POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process.
Keywords: Organoid; Drug screening; Deep learning; Image segmentation
FDEFusion: End-to-End Infrared and Visible Image Fusion Method Based on Frequency Decomposition and Enhancement
8
Authors: Ming Chen, Guoqiang Ma, Ping Qi, Fucheng Wang, Lin Shen, Xiaoya Pi. Source: Computers, Materials & Continua, 2026, No. 4, pp. 817-839 (23 pages)
In the image fusion field, fusing infrared images (IRIs) and visible images (VIs) is a key area. The differences between IRIs and VIs make it challenging to fuse both types into a high-quality image. Accordingly, efficiently combining the advantages of both images while overcoming their shortcomings is necessary. To handle this challenge, we developed an end-to-end IRI and VI fusion method based on frequency decomposition and enhancement. By applying concepts from frequency domain analysis, we used the layering mechanism to better capture the salient thermal targets from the IRIs and the rich textural information from the VIs, respectively, significantly boosting the image fusion quality and effectiveness. In addition, the backbone network combined Restormer Blocks and Dense Blocks; Restormer Blocks utilize global attention to extract shallow features. Meanwhile, Dense Blocks ensure the integration between shallow and deep features, thereby avoiding the loss of shallow attributes. Extensive experiments on the TNO and MSRS datasets demonstrated that the suggested method achieved state-of-the-art (SOTA) performance in various metrics: Entropy (EN), Mutual Information (MI), Standard Deviation (SD), the Structural Similarity Index Measure (SSIM), Fusion quality (Qabf), MI of the pixel (FMI_pixel), and modified Visual Information Fidelity (VIF_m).
Keywords: Infrared images; visible images; frequency decomposition; Restormer blocks; global attention
Advances in deep learning for bacterial image segmentation in optical microscopy
9
Authors: Zhijun Tan, Yang Ding, Huibin Ma, Jintao Li, Danrou Zheng, Hua Bai, Weini Xin, Lin Li, Bo Peng. Source: Journal of Innovative Optical Health Sciences, 2026, No. 1, pp. 30-44 (15 pages)
Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics, offering critical insights into bacterial physiology and pathogenicity. Image segmentation techniques enable quantitative analysis of bacterial structures, facilitating precise measurement of morphological variations and population behaviors at single-cell resolution. This paper reviews advancements in bacterial image segmentation, emphasizing the shift from traditional thresholding and watershed methods to deep learning-driven approaches. Convolutional neural networks (CNNs), U-Net architectures, and three-dimensional (3D) frameworks excel at segmenting dense biofilms and resolving antibiotic-induced morphological changes. These methods combine automated feature extraction with physics-informed postprocessing. Despite progress, challenges persist in computational efficiency, cross-species generalizability, and integration with multimodal experimental workflows. Future progress will depend on improving model robustness across species and imaging modalities, integrating multimodal data for phenotype-function mapping, and developing standard pipelines that link computational tools with clinical diagnostics. These innovations will expand microbial phenotyping beyond structural analysis, enabling deeper insights into bacterial physiology and ecological interactions.
Keywords: Bacterial image; deep learning; optical microscopy; image segmentation; artificial intelligence
Clinical information prompt-driven retinal fundus image for brain health evaluation
10
Authors: Nuo Tong, Ying Hui, Shui-Ping Gou, Ling-Xi Chen, Xiang-Hong Wang, Shuo-Hua Chen, Jing Li, Xiao-Shuai Li, Yun-Tao Wu, Shou-Ling Wu, Zhen-Chang Wang, Jing Sun, Han Lv. Source: Military Medical Research, 2026, No. 1, pp. 43-57 (15 pages)
Background: Brain volume measurement serves as a critical approach for assessing brain health status. Considering the close biological connection between the eyes and brain, this study aims to investigate the feasibility of estimating brain volume through retinal fundus imaging integrated with clinical metadata, and to offer a cost-effective approach for assessing brain health. Methods: Based on clinical information, retinal fundus images, and neuroimaging data derived from a multicenter, population-based cohort study, the Kai Luan Study, we proposed a cross-modal correlation representation (CMCR) network to elucidate the intricate co-degenerative relationships between the eyes and brain for 755 subjects. Specifically, individual clinical information, which has been followed up for as long as 12 years, was encoded as a prompt to enhance the accuracy of brain volume estimation. Independent internal validation and external validation were performed to assess the robustness of the proposed model. Root mean square error (RMSE), peak signal-to-noise ratio (PSNR), and structural similarity index measure (SSIM) metrics were employed to quantitatively evaluate the quality of synthetic brain images derived from retinal imaging data. Results: The proposed framework yielded average RMSE, PSNR, and SSIM values of 98.23, 35.78 dB, and 0.64, respectively, which significantly outperformed 5 other methods: multi-channel Variational Autoencoder (mcVAE), Pixel-to-Pixel (Pixel2pixel), transformer-based U-Net (TransUNet), multi-scale transformer network (MT-Net), and residual vision transformer (ResViT). The two-dimensional (2D) and three-dimensional (3D) visualization results showed that the shape and texture of the synthetic brain images generated by the proposed method most closely resembled those of actual brain images. Thus, the CMCR framework accurately captured the latent structural correlations between the fundus and the brain. The average difference between predicted and actual brain volumes was 61.36 cm³, with a relative error of 4.54%. When all of the clinical information (including age and sex, daily habits, cardiovascular factors, metabolic factors, and inflammatory factors) was encoded, the difference decreased to 53.89 cm³, with a relative error of 3.98%. Based on the brain magnetic resonance images synthesized from retinal fundus images, the volumes of brain tissues could be estimated with high accuracy. Conclusion: This study provides an innovative, accurate, and cost-effective approach to characterize brain health status through readily accessible retinal fundus images.
Keywords: Retinal fundus image; Brain volume; Brain health; Magnetic resonance imaging; Deep learning; Eye and brain connection
RE-UKAN: A Medical Image Segmentation Network Based on Residual Network and Efficient Local Attention
11
Authors: Bo Li, Jie Jia, Peiwen Tan, Xinyan Chen, Dongjin Li. Source: Computers, Materials & Continua, 2026, No. 3, pp. 2184-2200 (17 pages)
Medical image segmentation is of critical importance in the domain of contemporary medical imaging. However, U-Net and its variants exhibit limitations in capturing complex nonlinear patterns and global contextual information. Although the subsequent U-KAN model enhances nonlinear representation capabilities, it still faces challenges such as gradient vanishing during deep network training and spatial detail loss during feature downsampling, resulting in insufficient segmentation accuracy for edge structures and minute lesions. To address these challenges, this paper proposes the RE-UKAN model, which innovatively improves upon U-KAN. Firstly, a residual network is introduced into the encoder to effectively mitigate gradient vanishing through cross-layer identity mappings, thus enhancing modelling capabilities for complex pathological structures. Secondly, Efficient Local Attention (ELA) is integrated to suppress spatial detail loss during downsampling, thereby improving the perception of edge structures and minute lesions. Experimental results on four public datasets demonstrate that RE-UKAN outperforms existing medical image segmentation methods across multiple evaluation metrics, with particularly outstanding performance on the TN-SCUI 2020 dataset, achieving an IoU of 88.18% and a Dice of 93.57%. Compared to the baseline model, it achieves improvements of 3.05% and 1.72%, respectively. These results fully demonstrate RE-UKAN's superior detail retention capability and boundary recognition accuracy in complex medical image segmentation tasks, providing a reliable solution for clinical precision segmentation.
Keywords: image segmentation; U-KAN; residual network; ELA
The Construction of Ouyang Xiu's Posthumous Image in the Song Dynasty
12
Authors: Yang Xiangkui, Wang Minhan. Source: Contemporary Social Sciences, 2026, No. 1, pp. 18-34 (17 pages)
The historical image of Ouyang Xiu constructed during the Song Dynasty evolved from a multifaceted portrayal that balanced his political and literary achievements into a singular cultural symbol. In the Northern Song Dynasty, writings by Ouyang Xiu's family and epitaphs by his colleagues crafted a balanced narrative emphasizing both his official duties and literary merits, thus constructing a dual image of him as a principled remonstrator and a literary master. In the Southern Song Dynasty, official historiography gradually eroded his complex persona as a political reformer by selectively trimming political disputes and emphasizing his literary lineage, ultimately establishing him as a cultural exemplar beyond factional strife. Throughout this evolution of historical writing, Ouyang Xiu's sharpness as a remonstrator was gradually obscured in historical texts, while his image as a literary master, revered by all, became firmly established. The reshaping of Ouyang Xiu's image in historical writings across the Northern and Southern Song dynasties not only reflects the logic of selecting scholar-official role models under the influence of official ideology but also reveals the inherent pattern whereby individual distinctiveness fades into symbolic construction in historical writing.
Keywords: Ouyang Xiu; image construction; biographical writing; canonization
Multi-Feature Fragile Image Watermarking Algorithm for Tampering Blind-Detection and Content Self-Recovery
13
Authors: Qiuling Wu, Hao Li, Mingjian Li, Ming Wang. Source: Computers, Materials & Continua, 2026, No. 1, pp. 759-778 (20 pages)
Digital watermarking technology plays an important role in detecting malicious tampering and protecting image copyright. However, in practical applications, this technology faces various problems such as severe image distortion, inaccurate localization of the tampered regions, and difficulty in recovering content. Given these shortcomings, a fragile image watermarking algorithm for tampering blind-detection and content self-recovery is proposed. The multi-feature watermarking authentication code (AC) is constructed using the texture feature of local binary patterns (LBP), the direct coefficient of the discrete cosine transform (DCT), and the contrast feature of the gray level co-occurrence matrix (GLCM) for detecting the tampered region, and the recovery code (RC) is designed according to the average grayscale value of pixels in image blocks for recovering the tampered content. Optimal pixel adjustment process (OPAP) and least significant bit (LSB) algorithms are used to embed the recovery code and authentication code into the image in a staggered manner. When detecting the integrity of the image, the authentication code comparison method and threshold judgment method are used to perform two rounds of tampering detection on the image and blindly recover the tampered content. Experimental results show that this algorithm has good transparency, strong and blind detection, and self-recovery performance against four types of malicious attacks and some conventional signal processing operations. When resisting copy-paste, text addition, cropping, and vector quantization under a tampering rate (TR) of 10%, the average tampering detection rate is up to 94.09%, and the peak signal-to-noise ratios (PSNR) of the watermarked image and the recovered image are greater than 41.47 dB and 40.31 dB, respectively, which demonstrates its excellent advantages compared with other related algorithms in recent years.
Keywords: Fragile image watermark; tampering blind-detection; self-recovery; multi-feature
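The LSB embedding and PSNR figures mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' algorithm: it covers only plain LSB substitution on a flat list of 8-bit pixel values, leaves out OPAP and the staggered AC/RC layout, and the function names `embed_lsb`, `extract_lsb`, and `psnr` are hypothetical.

```python
import math


def embed_lsb(pixels, bits):
    """Write one payload bit into the least significant bit of each pixel."""
    if len(bits) > len(pixels):
        raise ValueError("payload is longer than the cover")
    marked = list(pixels)
    for i, bit in enumerate(bits):
        # Clear the lowest bit, then set it to the payload bit.
        marked[i] = (marked[i] & ~1) | (bit & 1)
    return marked


def extract_lsb(pixels, n_bits):
    """Read the payload back from the first n_bits pixels."""
    return [p & 1 for p in pixels[:n_bits]]


def psnr(original, modified, peak=255):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    mse = sum((a - b) ** 2 for a, b in zip(original, modified)) / len(original)
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)


cover = [120, 121, 200, 255]
payload = [1, 0, 1, 0]
marked = embed_lsb(cover, payload)
recovered = extract_lsb(marked, len(payload))
quality_db = psnr(cover, marked)
```

Because LSB substitution changes each pixel by at most 1, the mean squared error is bounded by 1, which is why schemes of this family tend to report high PSNR for the watermarked image.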
Design of a compact wide-field-of-view infrared imager based on wavefront coding
14
Authors: Chonghui Zhu, Jiaqian Yu, Jingang Cui. Source: Chinese Physics B, 2026, No. 2, pp. 383-388 (6 pages)
Compact size, high brightness, and wide field of view (FOV) are key requirements for long-wave infrared imagers used in military surveillance or night navigation. However, to meet the imaging requirements of high resolution and wide FOV, infrared optical systems often adopt complex optical lens groups, which increase the size and weight of the optical system. In this paper, a strategy based on wavefront coding (WFC) is proposed to design a compact wide-FOV infrared imager. A cubic phase mask is inserted into the pupil plane of the infrared imager to correct the aberration. The simulated results show that the WFC infrared imager has good imaging quality in a wide FOV of ±16°. In addition, the WFC infrared imager achieves compactness with its 40 mm×40 mm×40 mm size. A fast focal ratio of 1 combined with an entrance pupil diameter of 25 mm ensures brightness. This work is of significance for designing a compact wide-FOV infrared imager.
Keywords: optical design; infrared imager; wavefront coding
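As a rough check on the figures quoted in the abstract above, the focal ratio and entrance pupil diameter fix the focal length, and the half field of view then gives an approximate image half-height. The second relation assumes an ideal distortion-free (rectilinear) lens, which the abstract does not state, so it is only an order-of-magnitude estimate.

```latex
f = N \cdot D = 1 \times 25\ \mathrm{mm} = 25\ \mathrm{mm},
\qquad
h \approx f \tan\theta = 25\ \mathrm{mm} \times \tan 16^{\circ} \approx 7.2\ \mathrm{mm}
```

Here N is the focal ratio, D the entrance pupil diameter, θ the half field of view, and h the image half-height under the stated assumption.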
Image-free single-pixel semantic segmentation for complex scene based on multi-scale U-Net
15
Authors: Tengfei Liu, Yanfeng Bai, Jianxia Chen, Jintao Zhai, Siqing Xiang, Xianwei Huang, Xiquan Fu. Source: Chinese Physics B, 2026, No. 1, pp. 440-447 (8 pages)
Single-pixel imaging (SPI) receives widespread attention due to its superior anti-interference capabilities, and image segmentation technology can effectively facilitate its recognition and information extraction. However, the complexity of the target scene and the long imaging time in SPI make it challenging to achieve high-quality and concise segmentation. In this paper, we investigate image-free semantic segmentation of intricate scenes in SPI. Using "learned" illumination patterns allows for the full extraction of the object's spatial information, thereby enabling pixel-level segmentation results through the decoding of the received measurements. Simulation and experimentation show that, in the absence of image reconstruction, the mean intersection over union (MIoU) of the segmented image can exceed 85% and the Dice coefficient (DICE) can approach 90%, even at a sampling ratio of 5%. Our approach may be favorable to applications in medical image segmentation and autonomous driving.
Keywords: single-pixel imaging; image-free; deep learning; complex scene
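The two segmentation scores reported in the abstract above, mean intersection over union and the Dice coefficient, can be computed from per-class pixel counts. The following is a minimal sketch on flat label lists, with hypothetical function names; it shows only the standard definitions and says nothing about how the paper evaluates its results.

```python
def iou_and_dice(pred, target, cls):
    """Return (IoU, Dice) for one class, or None if the class never appears."""
    tp = sum(1 for p, t in zip(pred, target) if p == cls and t == cls)
    fp = sum(1 for p, t in zip(pred, target) if p == cls and t != cls)
    fn = sum(1 for p, t in zip(pred, target) if p != cls and t == cls)
    union = tp + fp + fn
    if union == 0:
        return None
    return tp / union, 2 * tp / (2 * tp + fp + fn)


def mean_scores(pred, target, classes):
    """Average IoU and Dice over the classes present in pred or target."""
    scores = [s for s in (iou_and_dice(pred, target, c) for c in classes) if s]
    if not scores:
        raise ValueError("none of the classes appear in the labels")
    miou = sum(s[0] for s in scores) / len(scores)
    mdice = sum(s[1] for s in scores) / len(scores)
    return miou, mdice


pred = [0, 0, 1, 1, 1, 0]
target = [0, 1, 1, 1, 0, 0]
miou, mdice = mean_scores(pred, target, classes=[0, 1])
```

For a single class the two scores are related by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU, which is consistent with the abstract reporting a Dice value above its MIoU value.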
Tracing equatorward and poleward boundaries of the magnetospheric cusp from a simulated X-ray image
16
Authors: Xue Wang, TianRan Sun, C. Philippe Escoubet, Andy Read, YiHong Guo, Steve Sembay, Chi Wang. Source: Earth and Planetary Physics, 2026, No. 1, pp. 144-155 (12 pages)
A large-scale view of the magnetospheric cusp is expected to be obtained by the Soft X-ray Imager (SXI) onboard the Solar wind Magnetosphere Ionosphere Link Explorer (SMILE). However, it is challenging to trace the three-dimensional cusp boundary from a two-dimensional X-ray image because the detected X-ray signals will be integrated along the line of sight. In this work, a global magnetohydrodynamic code was used to simulate the X-ray images and photon count images, assuming an interplanetary magnetic field with a pure Bz component. The assumption of an elliptic cusp boundary at a given altitude was used to trace the equatorward and poleward boundaries of the cusp from a simulated X-ray image. The average discrepancy was less than 0.1 R_E. To reduce the influence of instrument effects and cosmic X-ray backgrounds, image denoising was considered before applying the method above to SXI photon count images. The cusp boundaries were reasonably reconstructed from the noisy X-ray image.
Keywords: SMILE mission; X-ray image; cusp boundary
M2ATNet: Multi-Scale Multi-Attention Denoising and Feature Fusion Transformer for Low-Light Image Enhancement
17
Authors: Zhongliang Wei, Jianlong An, Chang Su. Source: Computers, Materials & Continua, 2026, No. 1, pp. 1819-1838 (20 pages)
Images taken in dim environments frequently exhibit issues like insufficient brightness, noise, color shifts, and loss of detail. These problems pose significant challenges to dark image enhancement tasks. Current approaches, while effective in global illumination modeling, often struggle to simultaneously suppress noise and preserve structural details, especially under heterogeneous lighting. Furthermore, misalignment between luminance and color channels introduces additional challenges to accurate enhancement. In response to the aforementioned difficulties, we introduce a single-stage framework, M2ATNet, using multi-scale multi-attention and the Transformer architecture. First, to address the problems of texture blurring and residual noise, we design a multi-scale multi-attention denoising module (MMAD), which is applied separately to the luminance and color channels to enhance the structural and texture modeling capabilities. Secondly, to solve the non-alignment problem of the luminance and color channels, we introduce the multi-channel feature fusion Transformer (CFFT) module, which effectively recovers the dark details and corrects the color shifts through cross-channel alignment and deep feature interaction. To guide the model to learn more stably and efficiently, we also fuse multiple types of loss functions to form a hybrid loss term. We extensively evaluate the proposed method on various standard datasets, including LOL-v1, LOL-v2, DICM, LIME, and NPE. Evaluation in terms of numerical metrics and visual quality demonstrates that M2ATNet consistently outperforms existing advanced approaches. Ablation studies further confirm the critical roles played by the MMAD and CFFT modules in detail preservation and visual fidelity under challenging illumination-deficient environments.
Keywords: Low-light image enhancement; multi-scale multi-attention; Transformer
A Survey of Generative Adversarial Networks for Medical Images
18
Authors: Sameera V. Mohd Sagheer, U. Nimitha, P. M. Ameer, Muneer Parayangat, Mohamed Abbas, Krishna Prakash Arunachalam. Source: Computer Modeling in Engineering & Sciences, 2026, No. 2, pp. 130-185 (56 pages)
Over the years, Generative Adversarial Networks (GANs) have revolutionized the medical imaging industry for applications such as image synthesis, denoising, super resolution, data augmentation, and cross-modality translation. The objective of this review is to evaluate the advances, relevance, and limitations of GANs in medical imaging. An organised literature review was conducted following the guidelines of PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses). The literature considered included peer-reviewed papers published between 2020 and 2025 across databases including PubMed, IEEE Xplore, and Scopus. Studies on applications of GAN architectures in medical imaging that reported experimental outcomes and were published in English in reputable journals and conferences were considered for the review. Theses, white papers, communication letters, and non-English articles were excluded. CLAIM-based quality assessment criteria were applied to the included studies to assess their quality. The study classifies diverse GAN architectures, summarizing their clinical applications, technical performance, and implementation difficulties. Key findings reveal the increasing application of GANs for enhancing diagnostic accuracy, reducing data scarcity through synthetic data generation, and supporting modality translation. However, concerns such as limited generalizability, lack of clinical validation, and regulatory constraints persist. This review provides a comprehensive study of the prevailing scenario of GANs in medical imaging and highlights crucial research gaps and future directions. Though GANs hold transformative capability for medical imaging, their integration into clinical use demands further validation, interpretability, and regulatory alignment.
Keywords: Generative adversarial networks; medical images; denoising; segmentation; translation
Semi-Fragile Image Watermarking Using Quantization-Based DCT for Tamper Localization
19
Authors: Agit Amrullah, Ferda Ernawan. Source: Computers, Materials & Continua, 2026, No. 2, pp. 1967-1982 (16 pages)
This paper proposes a tamper detection technique for semi-fragile watermarking using the Quantization-based Discrete Cosine Transform (DCT) for tamper localization. In this study, the proposed embedding strategy is investigated by experimental tests over the diagonal order of the DCT coefficients. The cover image is divided into non-overlapping blocks of size 8×8 pixels. The DCT is applied to each block, and the coefficients are arranged using a zig-zag pattern within the block. In this study, the low-frequency coefficients are selected to examine the impact on the imperceptibility score and tamper detection accuracy. High accuracy of tamper detection can be achieved by checking the surrounding blocks to determine whether the corresponding block has been tampered with. The proposed tamper detection is tested under various malicious, incidental, and hybrid attacks (both incidental and malicious attacks). The experimental results demonstrate that the proposed technique achieves a Peak Signal-to-Noise Ratio (PSNR) value of 41.2318 dB and an average Structural Similarity Index Measure (SSIM) value of 0.9768. The proposed scheme is also evaluated against malicious attacks such as copy-move, object deletion, object manipulation, and collage attacks. The proposed scheme can localize malicious attacks under various tampering rates. In addition, the proposed scheme can still detect tampered pixels under a hybrid attack, such as a combination of malicious and incidental attacks, with an average accuracy of 96.44%.
Keywords: image watermarking; semi-fragile; DCT; tamper localization; hybrid attack
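The zig-zag ordering of an 8×8 block and the idea of quantization-based embedding described in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the paper's scheme: the zig-zag order is the conventional JPEG-style scan, the embedding rule is a generic parity-of-quantization-index rule, the step size of 16 is arbitrary, and the function names are hypothetical. The DCT itself is omitted; `coeff` stands for one already computed low-frequency coefficient.

```python
def zigzag_indices(n=8):
    """Return (row, col) pairs of an n x n block in zig-zag scan order."""
    order = []
    for s in range(2 * n - 1):
        # All positions on the anti-diagonal row + col == s, top-right first.
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even diagonals are walked from bottom-left up to top-right.
        order.extend(reversed(diag) if s % 2 == 0 else diag)
    return order


def embed_bit(coeff, bit, step=16):
    """Snap coeff to a multiple of step whose index parity equals bit."""
    q = round(coeff / step)
    if q % 2 != bit:
        # Move to the neighbouring index on the side where coeff lies.
        q += 1 if coeff >= q * step else -1
    return q * step


def extract_bit(coeff, step=16):
    """Recover the bit as the parity of the nearest quantization index."""
    return round(coeff / step) % 2


scan = zigzag_indices(8)
low_freq_positions = scan[1:6]   # first few AC positions after the DC term
marked = embed_bit(37.0, 1)      # hypothetical coefficient value
```

A quantization rule of this kind tolerates perturbations smaller than half the step size, which is one way a semi-fragile scheme can survive mild incidental processing while still flagging larger malicious changes.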
Importance-Aware Image Segmentation-Based Semantic Communication for Autonomous Driving
20
Authors: Lyu Jie, Tong Haonan, Pan Qiang, Zhang Zhilong, He Xinxin, Luo Tao, Yin Changchuan. Source: China Communications, 2026, No. 2, pp. 228-243 (16 pages)
This article studies the problem of image segmentation-based semantic communication in autonomous driving. In real traffic scenes, the detection of objects (e.g., vehicles and pedestrians) is more important for guaranteeing driving safety, which is always ignored in existing works. Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, focusing on transmitting and recovering image semantic features of highly important objects to reduce transmission redundancy. First, we develop a semantic codec based on the Swin Transformer architecture, which expands the perceptual field and thus improves the segmentation accuracy. To highlight the accuracy of important objects, we propose a multi-scale semantic extraction method by assigning the number of Swin Transformer blocks for semantic features of diverse resolutions. Also, an importance-aware loss incorporating importance levels is devised, and an online hard example mining (OHEM) strategy is proposed to handle small sample issues in the dataset. Finally, experimental results demonstrate that the proposed VIS-SemCom can achieve significant mean intersection over union (mIoU) performance in the SNR regions, reduce the transmitted data volume by about 60% at 60% mIoU, and improve the segmentation accuracy of important objects, compared to baseline image communication.
Keywords: autonomous driving; image segmentation; semantic communication; Swin Transformer