期刊文献+
共找到18,759篇文章
< 1 2 250 >
每页显示 20 50 100
Multi-Scale Transformer for Image Restoration
1
作者 Wuzhen Shi Youwei Pan +4 位作者 Chun Zhao Yuqing Liu Shaobo Zhang Heng Zhang Yang Wen 《CAAI Transactions on Intelligence Technology》 2026年第1期41-54,共14页
Although Transformer-based image restoration methods have demonstrated impressive performance,existing Transformers still insufficiently exploit multiscale information.Previous non-Transformer-based studies have shown... Although Transformer-based image restoration methods have demonstrated impressive performance,existing Transformers still insufficiently exploit multiscale information.Previous non-Transformer-based studies have shown that incorporating multiscale features is crucial for improving restoration results.In this paper,we propose a multiscale Transformer(MST)that captures cross-scale attention among tokens,thereby effectively leveraging the multiscale patch recurrence prior of natural images.Furthermore,we introduce a channel-gate feed-forward network(CGFN)to enhance inter-channel information aggregation and reduce channel redundancy.To simultaneously utilise global,local and multiscale features,we design a multitype feature integration block(MFIB).Extensive experiments on both image super-resolution and HEVC compressed video artefact reduction demonstrate that the proposed MST achieves state-of-the-art performance.Ablation studies further verify the effectiveness of each proposed module. 展开更多
关键词 computer vision image enhancement image processing image reconstruction image resolution
在线阅读 下载PDF
Advances in deep learning for bacterial image segmentation in optical microscopy
2
作者 Zhijun Tan Yang Ding +6 位作者 Huibin Ma Jintao Li Danrou Zheng Hua Bai Weini Xin Lin Li Bo Peng 《Journal of Innovative Optical Health Sciences》 2026年第1期30-44,共15页
Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics,offering critical insights into bacterial physiology and pathogenicity.Image segmentation techniques enable quantitative analysis of bac... Microscopy imaging is fundamental in analyzing bacterial morphology and dynamics,offering critical insights into bacterial physiology and pathogenicity.Image segmentation techniques enable quantitative analysis of bacterial structures,facilitating precise measurement of morphological variations and population behaviors at single-cell resolution.This paper reviews advancements in bacterial image segmentation,emphasizing the shift from traditional thresholding and watershed methods to deep learning-driven approaches.Convolutional neural networks(CNNs),U-Net architectures,and three-dimensional(3D)frameworks excel at segmenting dense biofilms and resolving antibiotic-induced morphological changes.These methods combine automated feature extraction with physics-informed postprocessing.Despite progress,challenges persist in computational efficiency,cross-species generalizability,and integration with multimodal experimental workflows.Future progress will depend on improving model robustness across species and imaging modalities,integrating multimodal data for phenotype-function mapping,and developing standard pipelines that link computational tools with clinical diagnostics.These innovations will expand microbial phenotyping beyond structural analysis,enabling deeper insights into bacterial physiology and ecological interactions. 展开更多
关键词 Bacterial image deep learning optical microscopy image segmentation artificial intelligence
原文传递
Intra-hour PV Power Forecasting Technique Based on Total-sky Images
3
作者 Songjie Zhang Zhekang Dong +5 位作者 Donglian Qi Minghao Wang Zhao Xu Yifeng Han Yunfeng Yan Zhenming Li 《CSEE Journal of Power and Energy Systems》 2026年第1期210-219,共10页
Clouds are one of the leading causes of sun shading,which reduces the direct horizontal irradiance and curtails the photovoltaic(PV)power.It is critical to estimate cloud cover to accurately predict PV generation with... Clouds are one of the leading causes of sun shading,which reduces the direct horizontal irradiance and curtails the photovoltaic(PV)power.It is critical to estimate cloud cover to accurately predict PV generation within a very short horizon(second/minute).To achieve the precise forecasting of cloud cover,an image preprocessing method based on total-sky images is proposed to remove the interference and address the image edge distortion issue.An optimal threshold estimation method is further designed to achieve higher cloud identification precision.Considering the cloud's meteorological properties,a random hypersurface model(RHM)based on the Gaussian mixture probability hypothesis density(GM-PHD)filter is applied to track the cloud.The GM-PHD can track the rotation and diffusion of clouds,which helps to estimate sun-cloud collision.Furthermore,a hybrid autoregressive integrated moving average(ARIMA)and backpropagation(BP)neural network-based model is applied for intra-hour PV power forecasting.The experiment results demonstrate that the proposed cloud-tracking-based PV power forecasting model can capture the ramp behavior of PV power,improving forecasting precision. 展开更多
关键词 Cloud tracking image processing intra-hour PV forecasting solar energy total-sky image
原文传递
FDEFusion:End-to-End Infrared and Visible Image Fusion Method Based on Frequency Decomposition and Enhancement
4
作者 Ming Chen Guoqiang Ma +3 位作者 Ping Qi Fucheng Wang Lin Shen Xiaoya Pi 《Computers, Materials & Continua》 2026年第4期817-839,共23页
In the image fusion field,fusing infrared images(IRIs)and visible images(VIs)excelled is a key area.The differences between IRIs and VIs make it challenging to fuse both types into a high-quality image.Accordingly,eff... In the image fusion field,fusing infrared images(IRIs)and visible images(VIs)excelled is a key area.The differences between IRIs and VIs make it challenging to fuse both types into a high-quality image.Accordingly,efficiently combining the advantages of both images while overcoming their shortcomings is necessary.To handle this challenge,we developed an end-to-end IRI andVI fusionmethod based on frequency decomposition and enhancement.By applying concepts from frequency domain analysis,we used the layering mechanism to better capture the salient thermal targets from the IRIs and the rich textural information from the VIs,respectively,significantly boosting the image fusion quality and effectiveness.In addition,the backbone network combined Restormer Blocks and Dense Blocks;Restormer blocks utilize global attention to extract shallow features.Meanwhile,Dense Blocks ensure the integration between shallow and deep features,thereby avoiding the loss of shallow attributes.Extensive experiments on TNO and MSRS datasets demonstrated that the suggested method achieved state-of-the-art(SOTA)performance in various metrics:Entropy(EN),Mutual Information(MI),Standard Deviation(SD),The Structural Similarity Index Measure(SSIM),Fusion quality(Qabf),MI of the pixel(FMI_(pixel)),and modified Visual Information Fidelity(VIF_(m)). 展开更多
关键词 Infrared images visible images frequency decomposition restormer blocks global attention
在线阅读 下载PDF
Enhanced sparse RCNN for transmission line bolt defect detection via text-to-image data augmentation and quality filtering
5
作者 Chen Zhenyu Yan Huaguang +2 位作者 Du Jianguang Xue Meng Zhao Shuai 《High Technology Letters》 2026年第1期11-20,共10页
To address the issue of inconsistent image quality and data scarcity in bolt defect detection for transmission lines,this paper proposes an improved sparse region-based convolutional neural network(RCNN) based detecti... To address the issue of inconsistent image quality and data scarcity in bolt defect detection for transmission lines,this paper proposes an improved sparse region-based convolutional neural network(RCNN) based detection framework integrating image quality evaluation and text-to-image data augmentation.First,a HyperNetwork-based image quality assessment module is introduced to filter low-quality inspection images in terms of clarity and structural integrity,resulting in a high-quality training dataset.Second,a text-to-image diffusion model is utilized for sample augmentation.By designing text prompts that describe various bolt defect types under diverse lighting and viewing conditions,the model automatically generates realistic synthetic samples.The generated images are further filtered using a combination of quality and perceptual similarity metrics to ensure consistency with the real data distribution.Building upon the sparse RCNN baseline,a dynamic label assignment mechanism and a random decision path detection head are incorporated to enhance bounding box matching and prediction accuracy.Experimental results demonstrate that the proposed method significantly improves detection accuracy(mAP@0.5) over the original sparse RCNN while maintaining low computational cost,enabling more efficient and intelligent inspection of transmission line components. 展开更多
关键词 sparse region-based convolutional neural network HyperNetwork image quality assessment text-to-image generation data augmentation bolt defect detection transmission line inspection
在线阅读 下载PDF
基于人工智能Precise Image重建算法对头颅CT图像质量及辐射剂量的影响
6
作者 廖甜 刘晓静 +5 位作者 宁先英 桂绅 孔祥闯 雷子乔 余建明 吴红英 《放射学实践》 北大核心 2026年第1期66-71,共6页
目的:评估Precise Image人工智能重建算法对头颅CT图像质量及辐射剂量的影响。方法:回顾性搜集行头颅CT平扫的80例患者,A组(40例)采用120 kV、150 mAs采集图像,同时采用Precise Image(sharp/standard/smooth/smoother)算法、iDose 4等... 目的:评估Precise Image人工智能重建算法对头颅CT图像质量及辐射剂量的影响。方法:回顾性搜集行头颅CT平扫的80例患者,A组(40例)采用120 kV、150 mAs采集图像,同时采用Precise Image(sharp/standard/smooth/smoother)算法、iDose 4等级算法进行图像重建;B组(40例)采用传统轴扫方案采集图像(120 kV、250 mAs扫描条件),采用iDose 4等级算法进行图像重建。对比不同剂量、不同重建方式下头颅CT检查图像质量及辐射剂量。结果:A组较B组CTDIvol、DLP、SSDE分别降低约55.02%、42.68%、59.22%(P<0.05)。A组随着重建算法等级的升高(sharp、standard、smooth、smoother),小脑、背侧丘脑及灰白质噪声SD值下降,信号噪声比(SNR)、对比噪声比(CNR)升高,且均高于同扫描条件下iDose 4算法,除sharp算法外差异均有统计学意义(P<0.05)。A组standard、smooth算法主观评分为(4.63±0.49)分、(4.27±0.38)分,两组均满足诊断需求;B组主观评分为(4.52±0.41)分。结论:Precise Image人工智能重建算法在保证图像质量的前提下可大大降低头颅CT辐射剂量。 展开更多
关键词 体层摄影术 X线计算机 人工智能 Precise image 图像质量 辐射剂量
在线阅读 下载PDF
Precision organoid segmentation technique(POST):accurate organoid segmentation in challenging bright-field images 被引量:1
7
作者 Xuan Du Yuchen Li +5 位作者 Jiaping Song Zilin Zhang Jing Zhang Yanhui Li Zaozao Chen Zhongze Gu 《Bio-Design and Manufacturing》 2026年第1期80-93,I0013-I0016,共18页
Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of... Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of complex diseases,with some even achieving clinical translation.Changes in the overall size,shape,boundary,and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity.However,the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference,including overlapping organoids,bubbles,dust particles,and cell fragments.This paper introduces the precision organoid segmentation technique(POST),which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions.Unlike existing methods,POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging.Furthermore,it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments.POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process. 展开更多
关键词 Organoid Drug screening Deep learning image segmentation
暂未订购
Experiments on image data augmentation techniques for geological rock type classification with convolutional neural networks 被引量:2
8
作者 Afshin Tatar Manouchehr Haghighi Abbas Zeinijahromi 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第1期106-125,共20页
The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and hist... The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and historical context,DL offers a powerful complement by enhancing the speed,objectivity,and precision of the classification process.This research explores the significance of image data augmentation techniques in optimizing the performance of convolutional neural networks(CNNs)for geological image analysis,particularly in the classification of igneous,metamorphic,and sedimentary rock types from rock thin section(RTS)images.This study primarily focuses on classic image augmentation techniques and evaluates their impact on model accuracy and precision.Results demonstrate that augmentation techniques like Equalize significantly enhance the model's classification capabilities,achieving an F1-Score of 0.9869 for igneous rocks,0.9884 for metamorphic rocks,and 0.9929 for sedimentary rocks,representing improvements compared to the baseline original results.Moreover,the weighted average F1-Score across all classes and techniques is 0.9886,indicating an enhancement.Conversely,methods like Distort lead to decreased accuracy and F1-Score,with an F1-Score of 0.949 for igneous rocks,0.954 for metamorphic rocks,and 0.9416 for sedimentary rocks,exacerbating the performance compared to the baseline.The study underscores the practicality of image data augmentation in geological image classification and advocates for the adoption of DL methods in this domain for automation and improved results.The findings of this study can benefit various fields,including remote sensing,mineral exploration,and environmental monitoring,by enhancing the accuracy of geological image analysis both for scientific research and industrial applications. 展开更多
关键词 Deep learning(DL) image analysis image data augmentation Convolutional neural networks(CNNs) Geological image analysis Rock classification Rock thin section(RTS)images
在线阅读 下载PDF
A multi-attention mechanism U-Net neural network for image correction of PbS quantum dot focal plane detectors
9
作者 WANG Han-Ting DI Yun-Xiang +10 位作者 QI Xing-Yu SHA Ying-Zhe WANG Ya-Hui YE Ling-Feng TANG Wei-Yi BA Kun WANG Xu-Dong HUANG Zhang-Cheng CHU Jun-Hao SHEN Hong WANG Jian-Lu 《红外与毫米波学报》 北大核心 2026年第1期148-156,共9页
Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon... Near-infrared image sensors are widely used in fields such as material identification,machine vision,and autonomous driving.Lead sulfide colloidal quantum dot-based infrared photodiodes can be integrated with sil⁃icon-based readout circuits in a single step.Based on this,we propose a photodiode based on an n-i-p structure,which removes the buffer layer and further simplifies the manufacturing process of quantum dot image sensors,thus reducing manufacturing costs.Additionally,for the noise complexity in quantum dot image sensors when capturing images,traditional denoising and non-uniformity methods often do not achieve optimal denoising re⁃sults.For the noise and stripe-type non-uniformity commonly encountered in infrared quantum dot detector imag⁃es,a network architecture has been developed that incorporates multiple key modules.This network combines channel attention and spatial attention mechanisms,dynamically adjusting the importance of feature maps to en⁃hance the ability to distinguish between noise and details.Meanwhile,the residual dense feature fusion module further improves the network's ability to process complex image structures through hierarchical feature extraction and fusion.Furthermore,the pyramid pooling module effectively captures information at different scales,improv⁃ing the network's multi-scale feature representation ability.Through the collaborative effect of these modules,the network can better handle various mixed noise and image non-uniformity issues.Experimental results show that it outperforms the traditional U-Net network in denoising and image correction tasks. 展开更多
关键词 PbS quantum dot focal plane detector convolutional neural networks image denoising U-Net
在线阅读 下载PDF
Underwater Image Enhancement Based on Depthwise Separable Convolution-Based Generative Adversarial Network
10
作者 ZENG Jun-yang SI Zhan-jun 《印刷与数字媒体技术研究》 北大核心 2026年第1期60-66,共7页
The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adver... The existence of absorption and reflection of light underwater leads to problems such as color distortion and blue-green bias in underwater images.In this study,a depthwise separable convolution-based generative adversarial network(GAN)algorithm was proposed.Taking GAN as the basic framework,it combined a depthwise separable convolution module,attention mechanism,and reconstructed convolution module to realize the enhancement of underwater degraded images.Multi-scale features were captured by the depthwise separable convolution module,and the attention mechanism was utilized to enhance attention to important features.The reconstructed convolution module further extracts and fuses local and global features.Experimental results showed that the algorithm performs well in improving the color bias and blurring of underwater images,with PSNR reaching 27.835,SSIM reaching 0.883,UIQM reaching 3.205,and UCIQE reaching 0.713.The enhanced image outperforms the comparison algorithm in both subjective and objective metrics. 展开更多
关键词 Underwater image enhancement Generating adversarial network Depthwise separable convolution
在线阅读 下载PDF
Clinical information prompt-driven retinal fundus image for brain health evaluation
11
作者 Nuo Tong Ying Hui +10 位作者 Shui-Ping Gou Ling-Xi Chen Xiang-Hong Wang Shuo-Hua Chen Jing Li Xiao-Shuai Li Yun-Tao Wu Shou-Ling Wu Zhen-Chang Wang Jing Sun Han Lv 《Military Medical Research》 2026年第1期43-57,共15页
Background:Brain volume measurement serves as a critical approach for assessing brain health status.Considering the close biological connection between the eyes and brain,this study aims to investigate the feasibility... Background:Brain volume measurement serves as a critical approach for assessing brain health status.Considering the close biological connection between the eyes and brain,this study aims to investigate the feasibility of estimating brain volume through retinal fundus imaging integrated with clinical metadata,and to offer a cost-effective approach for assessing brain health.Methods:Based on clinical information,retinal fundus images,and neuroimaging data derived from a multicenter,population-based cohort study,the Kai Luan Study,we proposed a cross-modal correlation representation(CMCR)network to elucidate the intricate co-degenerative relationships between the eyes and brain for 755 subjects.Specifically,individual clinical information,which has been followed up for as long as 12 years,was encoded as a prompt to enhance the accuracy of brain volume estimation.Independent internal validation and external validation were performed to assess the robustness of the proposed model.Root mean square error(RMSE),peak signal-tonoise ratio(PSNR),and structural similarity index measure(SSIM)metrics were employed to quantitatively evaluate the quality of synthetic brain images derived from retinal imaging data.Results:The proposed framework yielded average RMSE,PSNR,and SSIM values of 98.23,35.78 d B,and 0.64,respectively,which significantly outperformed 5 other methods:multi-channel Variational Autoencoder(mcVAE),Pixelto-Pixel(Pixel2pixel),transformer-based U-Net(Trans UNet),multi-scale transformer network(MT-Net),and residual vision transformer(ResViT).The two-(2D)and three-dimensional(3D)visualization results showed that the shape and texture of the synthetic brain images generated by the proposed method most closely resembled those of actual brain images.Thus,the CMCR framework accurately captured the latent structural correlations between the fundus and the brain.The average difference between predicted and actual brain volumes was 61.36 cm~3,with a relative error of 4.54%.When all of the clinical information(including age and sex,daily habits,cardiovascular factors,metabolic factors,and inflammatory factors)was encoded,the difference was decreased to 53.89 cm~3,with a relative error of 3.98%.Based on the synthesized brain magnetic resonance images from retinal fundus images,the volumes of brain tissues could be estimated with high accuracy.Conclusion:This study provides an innovative,accurate,and cost-effective approach to characterize brain health status through readily accessible retinal fundus images. 展开更多
关键词 Retinal fundus image Brain volume Brain health Magnetic resonance imaging Deep learning Eye and brain connection
原文传递
Framework for Secure Substitution Box Construction and Its Application in Image Encryption
12
作者 Umar Hayat Ikram Ullah Muhammad Bilal 《Computers, Materials & Continua》 2026年第4期1428-1462,共35页
Elliptic curve(EC)based cryptosystems gained more attention due to enhanced security than the existing public key cryptosystems.A substitution box(S-box)plays a vital role in securing modern symmetric key cryptosystem... Elliptic curve(EC)based cryptosystems gained more attention due to enhanced security than the existing public key cryptosystems.A substitution box(S-box)plays a vital role in securing modern symmetric key cryptosystems.However,the recently developed EC based algorithms usually trade off between computational efficiency and security,necessitating the design of a new algorithm with the desired cryptographic strength.To address these shortcomings,this paper proposes a new scheme based onMordell elliptic curve(MEC)over the complex field for generating distinct,dynamic,and highly uncorrelated S-boxes.Furthermore,we count the exact number of the obtained S-boxes,and demonstrate that the permuted version of the presented S-box is statistically optimal.The nonsingularity of the presented algorithm and the injectivity of the resultant output are explored.Rigorous theoretical analysis and experimental results demonstrate that the proposedmethod is highly effective in generating a large number of dynamic S-boxes with adequate cryptographic properties,surpassing current state-of-the-art S-box generation algorithms in terms of security.Apart fromthis,the generated S-box is benchmarked using side-channel attacks,and its performance is compared with highly nonlinear S-boxes,demonstrating comparable results.In addition,we present an application of our proposed S-box generator by incorporating it into an image encryption technique.The encrypted and decrypted images are tested by employing extensive standard security metrics,including the Number of Pixel Change Rate,the Unified Average Changing Intensity,information entropy,correlation coefficient,and histogram analysis.Moreover,the analysis is extended beyond conventional metrics to validate the new method using advanced tests,such as the NIST statistical test suite,robustness analysis,and noise and cropping attacks.Experimental outcomes show that the presented algorithm strengthens the existing encryption scheme against various well-known cryptographic attacks. 展开更多
关键词 Substitution box Mordell elliptic curve Mobius transformation NONLINEARITY image encryption CRYPTANALYSIS data security
在线阅读 下载PDF
Semi-Fragile Image Watermarking Using Quantization-Based DCT for Tamper Localization
13
作者 Agit Amrullah Ferda Ernawan 《Computers, Materials & Continua》 2026年第2期1967-1982,共16页
This paper proposes a tamper detection technique for semi-fragile watermarking using Quantizationbased Discrete Cosine Transform(DCT)for tamper localization.In this study,the proposed embedding strategy is investigate... This paper proposes a tamper detection technique for semi-fragile watermarking using Quantizationbased Discrete Cosine Transform(DCT)for tamper localization.In this study,the proposed embedding strategy is investigated by experimental tests over the diagonal order of the DCT coefficients.The cover image is divided into non-overlapping blocks of size 8×8 pixels.The DCT is applied to each block,and the coefficients are arranged using a zig-zag pattern within the block.In this study,the low-frequency coefficients are selected to examine the impact of the imperceptibility score and tamper detection accuracy.High accuracy of tamper detection can be achieved by checking the surrounding blocks to determine whether the corresponding block has been tampered with.The proposed tamper detection is tested under various malicious,incidental,and hybrid attacks(both incidental and malicious attacks).The experimental results demonstrate that the proposed technique achieves a Peak-Signal-to-Noise Ratio(PSNR)value of 41.2318 dB,an average Structural Similarity Index Measure(SSIM)value of 0.9768.The proposed scheme is also evaluated against malicious attacks such as copy-move,object deletion,object manipulation,and collage attacks.The proposed scheme can detect the malicious attack localization under various tampering rates.In addition,the proposed scheme can still detect tampered pixels under a hybrid attack,such as a combination ofmalicious and incidental attacks,with an average accuracy of 96.44%. 展开更多
关键词 image watermarking SEMI-FRAGILE DCT tamper localization hybrid attack
在线阅读 下载PDF
Harnessing speckle images:efficient extraction of hidden information
14
作者 Weiru Fan Xiaobin Tang +5 位作者 Xingqi Xu Huizhu Hu Vladislav V.Yakovlev Shi-Yao Zhu Da-Wei Wang Delong Zhang 《Advanced Photonics Nexus》 2026年第1期211-223,共13页
Scattering obscures information carried by waves by producing speckle patterns,posing a fundamental challenge across diverse fields,from microscopy to astronomy.Although machine learning has recently shown promise in ... Scattering obscures information carried by waves by producing speckle patterns,posing a fundamental challenge across diverse fields,from microscopy to astronomy.Although machine learning has recently shown promise in speckle analysis,existing approaches are hindered by their dependence on large,labeled datasets—a significant bottleneck in many real-world applications.Here,we introduce speckle unsupervised recognition and evaluation(SURE),a groundbreaking unsupervised learning strategy for speckle recognition that eliminates the need for labeled training data.SURE's distinctive feature lies in its ability to extract invariant features through advanced clustering algorithms to enable direct classification of high-level information from speckle patterns without prior knowledge.We demonstrate the transformative potential of this approach in two key applications:(1)a noninvasive glucose monitoring system that accurately tracks glucose concentrations over time without extensive calibration and(2)a high-throughput communication system using multimode fibers,achieving improved performance in dynamic environments.In addition,we showcase SURE's unprecedented capability to classify objects hidden behind obstacles using scattered light,further broadening its scope.This versatile approach opens new frontiers in biomedical diagnostics,quantum network decoupling,and remote sensing,unlocking a transformative new paradigm for extracting information from seemingly random optical patterns. 展开更多
关键词 SCATTERING unsupervised learning speckle interpretation pattern recognition image sensing
在线阅读 下载PDF
Enhanced CT-CBCT image registration for orthopedic surgery:Integrating rigid-elastic motion models
15
作者 Zhiqi HUANG Deqiang XIAO +7 位作者 Hongxun LIU Long SHAO Danni AI Jingfan FAN Tianyu FU Yucong LIN Hong SONG Jian YANG 《虚拟现实与智能硬件(中英文)》 2026年第1期87-100,共14页
Background Computed tomography(CT) and cone-beam computed tomography(CBCT) image registration play pivotal roles in computer-assisted navigation for orthopedic surgery. Traditional methods often apply uniform deformat... Background Computed tomography(CT) and cone-beam computed tomography(CBCT) image registration play pivotal roles in computer-assisted navigation for orthopedic surgery. Traditional methods often apply uniform deformation models, neglecting the biomechanical differences between rigid structures and soft tissues, which compromises registration accuracy, especially during significant bone displacements. Method To address this issue, we introduce RE-Reg, a rigid-elastic CT-CBCT image registration framework that jointly learns rigid bone motion and soft tissue deformation. RE-Reg incorporates a rigid alignment(RA) module to estimate global bone motion and an elastic deformation(ED) module to model soft tissue deformation, preserving bony structures through bone shape preservation(BSP) loss. Result Our comprehensive evaluation on publicly available datasets demonstrates that RE-Reg significantly outperforms existing methods in terms of registration accuracy and rigid bone structure preservation, achieving a 1.3% improvement in Dice similarity coefficient(DSC) and a 23% reduction in rigid bone deformation(%Δvol) compared with the best baseline. Conclusion This framework not only enhances anatomical fidelity but also ensures biomechanical plausibility and provides a valuable tool for image-guided orthopedic surgery. This code is available athttps://github.com/Zq-Huang/RE-Reg. 展开更多
关键词 Orthopedic surgery image registration CT-CBCT Rigid motion Elastic deformation
在线阅读 下载PDF
RE-UKAN:A Medical Image Segmentation Network Based on Residual Network and Efficient Local Attention
16
作者 Bo Li Jie Jia +2 位作者 Peiwen Tan Xinyan Chen Dongjin Li 《Computers, Materials & Continua》 2026年第3期2184-2200,共17页
Medical image segmentation is of critical importance in the domain of contemporary medical imaging.However,U-Net and its variants exhibit limitations in capturing complex nonlinear patterns and global contextual infor... Medical image segmentation is of critical importance in the domain of contemporary medical imaging.However,U-Net and its variants exhibit limitations in capturing complex nonlinear patterns and global contextual information.Although the subsequent U-KAN model enhances nonlinear representation capabilities,it still faces challenges such as gradient vanishing during deep network training and spatial detail loss during feature downsampling,resulting in insufficient segmentation accuracy for edge structures and minute lesions.To address these challenges,this paper proposes the RE-UKAN model,which innovatively improves upon U-KAN.Firstly,a residual network is introduced into the encoder to effectively mitigate gradient vanishing through cross-layer identity mappings,thus enhancing modelling capabilities for complex pathological structures.Secondly,Efficient Local Attention(ELA)is integrated to suppress spatial detail loss during downsampling,thereby improving the perception of edge structures and minute lesions.Experimental results on four public datasets demonstrate that RE-UKAN outperforms existing medical image segmentation methods across multiple evaluation metrics,with particularly outstanding performance on the TN-SCUI 2020 dataset,achieving IoU of 88.18%and Dice of 93.57%.Compared to the baseline model,it achieves improvements of 3.05%and 1.72%,respectively.These results fully demonstrate RE-UKAN’s superior detail retention capability and boundary recognition accuracy in complex medical image segmentation tasks,providing a reliable solution for clinical precision segmentation. 展开更多
关键词 image segmentation U-KAN residual network ELA
在线阅读 下载PDF
Automatic Recognition Algorithm of Pavement Defects Based on S3M and SDI Modules Using UAV-Collected Road Images
17
作者 Hongcheng Zhao Tong Yang +1 位作者 Yihui Hu Fengxiang Guo 《Structural Durability & Health Monitoring》 2026年第1期121-137,共17页
With the rapid development of transportation infrastructure,ensuring road safety through timely and accurate highway inspection has become increasingly critical.Traditional manual inspection methods are not only time-... With the rapid development of transportation infrastructure,ensuring road safety through timely and accurate highway inspection has become increasingly critical.Traditional manual inspection methods are not only time-consuming and labor-intensive,but they also struggle to provide consistent,high-precision detection and realtime monitoring of pavement surface defects.To overcome these limitations,we propose an Automatic Recognition of PavementDefect(ARPD)algorithm,which leverages unmanned aerial vehicle(UAV)-based aerial imagery to automate the inspection process.The ARPD framework incorporates a backbone network based on the Selective State Space Model(S3M),which is designed to capture long-range temporal dependencies.This enables effective modeling of dynamic correlations among redundant and often repetitive structures commonly found in road imagery.Furthermore,a neck structure based on Semantics and Detail Infusion(SDI)is introduced to guide cross-scale feature fusion.The SDI module enhances the integration of low-level spatial details with high-level semantic cues,thereby improving feature expressiveness and defect localization accuracy.Experimental evaluations demonstrate that theARPDalgorithm achieves a mean average precision(mAP)of 86.1%on a custom-labeled pavement defect dataset,outperforming the state-of-the-art YOLOv11 segmentation model.The algorithm also maintains strong generalization ability on public datasets.These results confirm that ARPD is well-suited for diverse real-world applications in intelligent,large-scale highway defect monitoring and maintenance planning. 展开更多
关键词 Pavement defects state space model UAV detection algorithm image processing
在线阅读 下载PDF
The Construction of Ouyang Xiu's Posthumous Image in the Song Dynasty
18
作者 Yang Xiangkui Wang Minhan 《Contemporary Social Sciences》 2026年第1期18-34,共17页
The historical image of Ouyang Xiu constructed during the Song Dynasty evolved from a multifaceted portrayal that balanced his political and literary achievements into a singular cultural symbol.In the Northern Song D... The historical image of Ouyang Xiu constructed during the Song Dynasty evolved from a multifaceted portrayal that balanced his political and literary achievements into a singular cultural symbol.In the Northern Song Dynasty,writings by Ouyang Xiu's family and epitaphs by his colleagues crafted a balanced narrative emphasizing both his official duties and literary merits,thus constructing a dual image of him as a principled remonstrator and a literary master.In the Southern Song Dynasty,official historiography gradually eroded his complex persona as a political reformer by selectively trimming political disputes and emphasizing his literary lineage,ultimately establishing him as a cultural exemplar beyond factional strife.Throughout this evolution of historical writing,Ouyang Xiu's sharpness as a remonstrator was gradually obscured in historical texts,while his image as a literary master,revered by all,became firmly established.The reshaping of Ouyang Xiu's image in historical writings across the Northern and Southern Song dynasties not only reflects the logic of selecting scholar-official role models under the influence of official ideology but also reveals the inherent pattern whereby individual distinctiveness fades into symbolic construction in historical writing. 展开更多
关键词 Ouyang Xiu image construction biographical writing canonization
在线阅读 下载PDF
Multi-Feature Fragile Image Watermarking Algorithm for Tampering Blind-Detection and Content Self-Recovery
19
作者 Qiuling Wu Hao Li +1 位作者 Mingjian Li Ming Wang 《Computers, Materials & Continua》 2026年第1期759-778,共20页
Digital watermarking technology plays an important role in detecting malicious tampering and protecting image copyright.However,in practical applications,this technology faces various problems such as severe image dis... Digital watermarking technology plays an important role in detecting malicious tampering and protecting image copyright.However,in practical applications,this technology faces various problems such as severe image distortion,inaccurate localization of the tampered regions,and difficulty in recovering content.Given these shortcomings,a fragile image watermarking algorithm for tampering blind-detection and content self-recovery is proposed.The multi-feature watermarking authentication code(AC)is constructed using texture feature of local binary patterns(LBP),direct coefficient of discrete cosine transform(DCT)and contrast feature of gray level co-occurrence matrix(GLCM)for detecting the tampered region,and the recovery code(RC)is designed according to the average grayscale value of pixels in image blocks for recovering the tampered content.Optimal pixel adjustment process(OPAP)and least significant bit(LSB)algorithms are used to embed the recovery code and authentication code into the image in a staggered manner.When detecting the integrity of the image,the authentication code comparison method and threshold judgment method are used to perform two rounds of tampering detection on the image and blindly recover the tampered content.Experimental results show that this algorithm has good transparency,strong and blind detection,and self-recovery performance against four types of malicious attacks and some conventional signal processing operations.When resisting copy-paste,text addition,cropping and vector quantization under the tampering rate(TR)10%,the average tampering detection rate is up to 94.09%,and the peak signal-to-noise ratio(PSNR)of the watermarked image and the recovered image are both greater than 41.47 and 40.31 dB,which demonstrates its excellent advantages compared with other related algorithms in recent years. 展开更多
关键词 Fragile image watermark tampering blind-detection SELF-RECOVERY multi-feature
在线阅读 下载PDF
Pixel to Parcel:Transformative Applications of Image Segmentation in Geospatial and Crop Research
20
作者 Hui Zeng 《Journal of Environmental & Earth Sciences》 2026年第3期112-125,共14页
The rising need for precision farming and sustainable land management has catalyzed the requirement for sophisticated means of deriving practical data from remote sensing images.Image segmentation,or the process of di... The rising need for precision farming and sustainable land management has catalyzed the requirement for sophisticated means of deriving practical data from remote sensing images.Image segmentation,or the process of dividing the image into semantically relevant parts,has become a groundbreaking technology that allows resolving the problem of transitioning the pixel-level data to a parcel-level analysis.This review is a synthesis of the segmentation methods and their use in crop research and geospatial science.The architectures of pixel-based,object-based,and deep learning(convolutional neural networks,U-Net,Mask R-CNN,and Transformer models)are considered in terms of principles,capabilities,and limitations.Multi-spectral,hyperspectral,LiDAR,and SAR data are integrated to improve the efficiency of segmentation,allowing the possible delineation of fields,the classification of crops,health monitoring,monitoring of yields,and stress identification.In addition to agriculture,segmentation helps in land use and land cover mapping,identification of temporal change,monitoring of the environment,and is used in combination with GIS-based spatial modeling.Nevertheless,issues related to data heterogeneity,mixed pixels,computational requirements,and inadequate availability of labelled data still exist despite the major progress.The future directions involve multi-source data fusion,pixel-to-parcel pipeline automation,and predictive models based on AI,which are used to enhance its scalability,robustness,and the ability to monitor in real-time.This review makes it clear that the use of image segmentation as a tool in generating precision agriculture,sustainable land use,and informed geospatial. 展开更多
关键词 image Segmentation Precision Agriculture Geospatial Analysis Crop Monitoring Remote Sensing
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部