期刊文献+
共找到18,762篇文章
< 1 2 250 >
每页显示 20 50 100
A Survey of Generative Adversarial Networks for Medical Images
1
作者 Sameera V.Mohd Sagheer U.Nimitha +3 位作者 P.M.Ameer Muneer Parayangat MohamedAbbas Krishna Prakash Arunachalam 《Computer Modeling in Engineering & Sciences》 2026年第2期130-185,共56页
Over the years,Generative Adversarial Networks(GANs)have revolutionized the medical imaging industry for applications such as image synthesis,denoising,super resolution,data augmentation,and cross-modality translation... Over the years,Generative Adversarial Networks(GANs)have revolutionized the medical imaging industry for applications such as image synthesis,denoising,super resolution,data augmentation,and cross-modality translation.The objective of this review is to evaluate the advances,relevances,and limitations of GANs in medical imaging.An organised literature review was conducted following the guidelines of PRISMA(Preferred Reporting Items for Systematic Reviews and Meta-Analyses).The literature considered included peer-reviewed papers published between 2020 and 2025 across databases including PubMed,IEEE Xplore,and Scopus.The studies related to applications of GAN architectures in medical imaging with reported experimental outcomes and published in English in reputable journals and conferences were considered for the review.Thesis,white papers,communication letters,and non-English articles were not included for the same.CLAIM based quality assessment criteria were applied to the included studies to assess the quality.The study classifies diverse GAN architectures,summarizing their clinical applications,technical performances,and their implementation hardships.Key findings reveal the increasing applications of GANs for enhancing diagnostic accuracy,reducing data scarcity through synthetic data generation,and supporting modality translation.However,concerns such as limited generalizability,lack of clinical validation,and regulatory constraints persist.This review provides a comprehensive study of the prevailing scenario of GANs in medical imaging and highlights crucial research gaps and future directions.Though GANs hold transformative capability for medical imaging,their integration into clinical use demands further validation,interpretability,and regulatory alignment. 展开更多
关键词 Generative adversarial networks medical images DENOISING SEGMENTATION TRANSLATION
在线阅读 下载PDF
An Exploratory Study on Prognostic Prediction and Interpretability Analysis in Early-stage Triple-negative Breast Cancer Using Pathological Images
2
作者 Zixuan Yang Yaping Lyu +4 位作者 Liuliu Quan Shuyue Chen Yuying Sun Jie Ju Peng Yuan 《Biomedical and Environmental Sciences》 2026年第3期310-326,共17页
Objective To develop a prognostic prediction model for early-stage triple-negative breast cancer(TNBC)using H&E-stained pathological images and to investigate its underlying biological interpretability.Methods A d... Objective To develop a prognostic prediction model for early-stage triple-negative breast cancer(TNBC)using H&E-stained pathological images and to investigate its underlying biological interpretability.Methods A deep learning model was trained on 340 WSIs and externally validated using 81 TCGA cases.Image-derived features extracted through convolutional neural networks were integrated with clinicopathological variables.Model performance was assessed using ROC curve analysis,and interpretability was evaluated by correlating image features with mRNA-seq data and characteristics of the immune microenvironment.Results The model achieved AUCs of 0.86 and 0.75 in the training and validation cohorts,respectively.Analysis using HoVer-Net indicated that lymphocyte abundance was associated with recurrence risk.Texture-related features showed significant correlations with immune cell infiltration and prognostic gene expression profiles.Conclusion This study demonstrates that deep learning can enable accurate prognostic prediction in early-stage TNBC,with interpretable image features that reflect the tumor immune microenvironment and gene expression profiles. 展开更多
关键词 Triple-negative breast cancer Prognostic prediction model Deep learning H&E-stained pathological images Model interpretability
暂未订购
Enhanced Capacity Reversible Data Hiding Based on Pixel Value Ordering in Triple Stego Images
3
作者 Kim Sao Nguyen Ngoc Dung Bui 《Computers, Materials & Continua》 2026年第1期1571-1586,共16页
Reversible data hiding(RDH)enables secret data embedding while preserving complete cover image recovery,making it crucial for applications requiring image integrity.The pixel value ordering(PVO)technique used in multi... Reversible data hiding(RDH)enables secret data embedding while preserving complete cover image recovery,making it crucial for applications requiring image integrity.The pixel value ordering(PVO)technique used in multi-stego images provides good image quality but often results in low embedding capability.To address these challenges,this paper proposes a high-capacity RDH scheme based on PVO that generates three stego images from a single cover image.The cover image is partitioned into non-overlapping blocks with pixels sorted in ascending order.Four secret bits are embedded into each block’s maximum pixel value,while three additional bits are embedded into the second-largest value when the pixel difference exceeds a predefined threshold.A similar embedding strategy is also applied to the minimum side of the block,including the second-smallest pixel value.This design enables each block to embed up to 14 bits of secret data.Experimental results demonstrate that the proposed method achieves significantly higher embedding capacity and improved visual quality compared to existing triple-stego RDH approaches,advancing the field of reversible steganography. 展开更多
关键词 RDH reversible data hiding PVO RDH base three stego images
在线阅读 下载PDF
Lithology identification using borehole images by contrast-limited adaptive histogram equalization and machine learning models
4
作者 Enming Li Pablo Segarra +4 位作者 JoséA.Sanchidrián Zahir Ahmed Ignacio Catalán Alberto Fernández Santiago Gómez 《Journal of Rock Mechanics and Geotechnical Engineering》 2026年第3期1698-1718,共21页
Agile lithology identification can assist mining by providing important information in the exploration and production of mineral resources.This study proposes a new lithology recognition procedure using video-logging ... Agile lithology identification can assist mining by providing important information in the exploration and production of mineral resources.This study proposes a new lithology recognition procedure using video-logging of boreholes with an endoscope,applied to six production blocks in a limestone quarry.Images are automatically extracted from the videos and the lithology is classified into three classes based on clay content,i.e.massive limestone,brecciated limestone,and high amount of clay.The image quality is evaluated with a gray pixel intensity threshold and three no-reference image quality metrics,i.e.perception-based image quality evaluator,natural image quality evaluator,and blind/referenceless image spatial quality evaluator.After removing low-quality images,7583 images are retained and used for developing lithology classification models using six optimized classification techniques.The contrast-limited adaptive histogram equalization(CLAHE)technique is used to improve image quality.Ten color characteristics involving three percentiles of red,green and blue pixel intensities,together with color counting and five texture characteristics-correlation,entropy,homogeneity,contrast and energy-are used as inputs.Bayesian optimized light gradient boosting machine model performs best,with an overall accuracy of 88.04%,and a precision on the classes of massive limestone,brecciated limestone and high amount of clay of 90.72%,83.52%and 85.29%,respectively,for the testing set.The feature importance scores show that the color counting is the most significant parameter for the development of the classification model.Compared with previous image-based methodologies,this study provides a more flexible and cheaper procedure to identify lithology. 展开更多
关键词 Lithology identification Borehole images ENDOSCOPE Light gradient boosting machine Contrast-limited adaptive histogram equalization(CLAHE)
在线阅读 下载PDF
A Hybrid Deep Learning Multi-Class Classification Model for Alzheimer’s Disease Using Enhanced MRI Images
5
作者 Ghadah Naif Alwakid 《Computers, Materials & Continua》 2026年第1期797-821,共25页
Alzheimer’s Disease(AD)is a progressive neurodegenerative disorder that significantly affects cognitive function,making early and accurate diagnosis essential.Traditional Deep Learning(DL)-based approaches often stru... Alzheimer’s Disease(AD)is a progressive neurodegenerative disorder that significantly affects cognitive function,making early and accurate diagnosis essential.Traditional Deep Learning(DL)-based approaches often struggle with low-contrast MRI images,class imbalance,and suboptimal feature extraction.This paper develops a Hybrid DL system that unites MobileNetV2 with adaptive classification methods to boost Alzheimer’s diagnosis by processing MRI scans.Image enhancement is done using Contrast-Limited Adaptive Histogram Equalization(CLAHE)and Enhanced Super-Resolution Generative Adversarial Networks(ESRGAN).A classification robustness enhancement system integrates class weighting techniques and a Matthews Correlation Coefficient(MCC)-based evaluation method into the design.The trained and validated model gives a 98.88%accuracy rate and 0.9614 MCC score.We also performed a 10-fold cross-validation experiment with an average accuracy of 96.52%(±1.51),a loss of 0.1671,and an MCC score of 0.9429 across folds.The proposed framework outperforms the state-of-the-art models with a 98%weighted F1-score while decreasing misdiagnosis results for every AD stage.The model demonstrates apparent separation abilities between AD progression stages according to the results of the confusion matrix analysis.These results validate the effectiveness of hybrid DL models with adaptive preprocessing for early and reliable Alzheimer’s diagnosis,contributing to improved computer-aided diagnosis(CAD)systems in clinical practice. 展开更多
关键词 Alzheimer’s disease deep learning MRI images MobileNetV2 contrast-limited adaptive histogram equalization(CLAHE) enhanced super-resolution generative adversarial networks(ESRGAN) multi-class classification
在线阅读 下载PDF
Precision organoid segmentation technique(POST):accurate organoid segmentation in challenging bright-field images 被引量:1
6
作者 Xuan Du Yuchen Li +5 位作者 Jiaping Song Zilin Zhang Jing Zhang Yanhui Li Zaozao Chen Zhongze Gu 《Bio-Design and Manufacturing》 2026年第1期80-93,I0013-I0016,共18页
Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of... Organoids possess immense potential for unraveling the intricate functions of human tissues and facilitating preclinical disease treatment.Their applications span from high-throughput drug screening to the modeling of complex diseases,with some even achieving clinical translation.Changes in the overall size,shape,boundary,and other morphological features of organoids provide a noninvasive method for assessing organoid drug sensitivity.However,the precise segmentation of organoids in bright-field microscopy images is made difficult by the complexity of the organoid morphology and interference,including overlapping organoids,bubbles,dust particles,and cell fragments.This paper introduces the precision organoid segmentation technique(POST),which is a deep-learning algorithm for segmenting challenging organoids under simple bright-field imaging conditions.Unlike existing methods,POST accurately segments each organoid and eliminates various artifacts encountered during organoid culturing and imaging.Furthermore,it is sensitive to and aligns with measurements of organoid activity in drug sensitivity experiments.POST is expected to be a valuable tool for drug screening using organoids owing to its capability of automatically and rapidly eliminating interfering substances and thereby streamlining the organoid analysis and drug screening process. 展开更多
关键词 Organoid Drug screening Deep learning Image segmentation
暂未订购
Intra-hour PV Power Forecasting Technique Based on Total-sky Images
7
作者 Songjie Zhang Zhekang Dong +5 位作者 Donglian Qi Minghao Wang Zhao Xu Yifeng Han Yunfeng Yan Zhenming Li 《CSEE Journal of Power and Energy Systems》 2026年第1期210-219,共10页
Clouds are one of the leading causes of sun shading,which reduces the direct horizontal irradiance and curtails the photovoltaic(PV)power.It is critical to estimate cloud cover to accurately predict PV generation with... Clouds are one of the leading causes of sun shading,which reduces the direct horizontal irradiance and curtails the photovoltaic(PV)power.It is critical to estimate cloud cover to accurately predict PV generation within a very short horizon(second/minute).To achieve the precise forecasting of cloud cover,an image preprocessing method based on total-sky images is proposed to remove the interference and address the image edge distortion issue.An optimal threshold estimation method is further designed to achieve higher cloud identification precision.Considering the cloud's meteorological properties,a random hypersurface model(RHM)based on the Gaussian mixture probability hypothesis density(GM-PHD)filter is applied to track the cloud.The GM-PHD can track the rotation and diffusion of clouds,which helps to estimate sun-cloud collision.Furthermore,a hybrid autoregressive integrated moving average(ARIMA)and backpropagation(BP)neural network-based model is applied for intra-hour PV power forecasting.The experiment results demonstrate that the proposed cloud-tracking-based PV power forecasting model can capture the ramp behavior of PV power,improving forecasting precision. 展开更多
关键词 Cloud tracking image processing intra-hour PV forecasting solar energy total-sky image
原文传递
Harnessing speckle images:efficient extraction of hidden information
8
作者 Weiru Fan Xiaobin Tang +5 位作者 Xingqi Xu Huizhu Hu Vladislav V.Yakovlev Shi-Yao Zhu Da-Wei Wang Delong Zhang 《Advanced Photonics Nexus》 2026年第1期211-223,共13页
Scattering obscures information carried by waves by producing speckle patterns,posing a fundamental challenge across diverse fields,from microscopy to astronomy.Although machine learning has recently shown promise in ... Scattering obscures information carried by waves by producing speckle patterns,posing a fundamental challenge across diverse fields,from microscopy to astronomy.Although machine learning has recently shown promise in speckle analysis,existing approaches are hindered by their dependence on large,labeled datasets—a significant bottleneck in many real-world applications.Here,we introduce speckle unsupervised recognition and evaluation(SURE),a groundbreaking unsupervised learning strategy for speckle recognition that eliminates the need for labeled training data.SURE's distinctive feature lies in its ability to extract invariant features through advanced clustering algorithms to enable direct classification of high-level information from speckle patterns without prior knowledge.We demonstrate the transformative potential of this approach in two key applications:(1)a noninvasive glucose monitoring system that accurately tracks glucose concentrations over time without extensive calibration and(2)a high-throughput communication system using multimode fibers,achieving improved performance in dynamic environments.In addition,we showcase SURE's unprecedented capability to classify objects hidden behind obstacles using scattered light,further broadening its scope.This versatile approach opens new frontiers in biomedical diagnostics,quantum network decoupling,and remote sensing,unlocking a transformative new paradigm for extracting information from seemingly random optical patterns. 展开更多
关键词 SCATTERING unsupervised learning speckle interpretation pattern recognition image sensing
在线阅读 下载PDF
Automatic Recognition Algorithm of Pavement Defects Based on S3M and SDI Modules Using UAV-Collected Road Images
9
作者 Hongcheng Zhao Tong Yang +1 位作者 Yihui Hu Fengxiang Guo 《Structural Durability & Health Monitoring》 2026年第1期121-137,共17页
With the rapid development of transportation infrastructure,ensuring road safety through timely and accurate highway inspection has become increasingly critical.Traditional manual inspection methods are not only time-... With the rapid development of transportation infrastructure,ensuring road safety through timely and accurate highway inspection has become increasingly critical.Traditional manual inspection methods are not only time-consuming and labor-intensive,but they also struggle to provide consistent,high-precision detection and realtime monitoring of pavement surface defects.To overcome these limitations,we propose an Automatic Recognition of PavementDefect(ARPD)algorithm,which leverages unmanned aerial vehicle(UAV)-based aerial imagery to automate the inspection process.The ARPD framework incorporates a backbone network based on the Selective State Space Model(S3M),which is designed to capture long-range temporal dependencies.This enables effective modeling of dynamic correlations among redundant and often repetitive structures commonly found in road imagery.Furthermore,a neck structure based on Semantics and Detail Infusion(SDI)is introduced to guide cross-scale feature fusion.The SDI module enhances the integration of low-level spatial details with high-level semantic cues,thereby improving feature expressiveness and defect localization accuracy.Experimental evaluations demonstrate that theARPDalgorithm achieves a mean average precision(mAP)of 86.1%on a custom-labeled pavement defect dataset,outperforming the state-of-the-art YOLOv11 segmentation model.The algorithm also maintains strong generalization ability on public datasets.These results confirm that ARPD is well-suited for diverse real-world applications in intelligent,large-scale highway defect monitoring and maintenance planning. 展开更多
关键词 Pavement defects state space model UAV detection algorithm image processing
在线阅读 下载PDF
Super-resolution reconstruction of UAV-borne gamma-ray spectrum images based on Real-ESRGAN algorithm
10
作者 Xin Wang Yuan Yuan +4 位作者 Xuan Zhao Guang-Hao Luo Qi-Qiao Wei He-Xi Wu Chao Xiong 《Nuclear Science and Techniques》 2026年第2期42-54,共13页
Unmanned aerial vehicle(UAV)-borne gamma-ray spectrum survey plays a crucial role in geological mapping,radioactive mineral exploration,and environmental monitoring.However,raw data are often compromised by flight and... Unmanned aerial vehicle(UAV)-borne gamma-ray spectrum survey plays a crucial role in geological mapping,radioactive mineral exploration,and environmental monitoring.However,raw data are often compromised by flight and instrument background noise,as well as detector resolution limitations,which affect the accuracy of geological interpretations.This study aims to explore the application of the Real-ESRGAN algorithm in the super-resolution reconstruction of UAV-borne gamma-ray spectrum images to enhance spatial resolution and the quality of geological feature visualization.We conducted super-resolution reconstruction experiments with 2×,4×and 6×magnification using the Real-ESRGAN algorithm,comparing the results with three other mainstream algorithms(SRCNN,SRGAN,FSRCNN)to verify the superiority in image quality.The experimental results indicate that Real-ESRGAN achieved a structural similarity index(SSIM)value of 0.950 at 2×magnification,significantly higher than the other algorithms,demonstrating its advantage in detail preservation.Furthermore,Real-ESRGAN effectively reduced ringing and overshoot artifacts,enhancing the clarity of geological structures and mineral deposit sites,thus providing high-quality visual information for geological exploration. 展开更多
关键词 UAV-borne gamma-ray spectrum Super-resolution reconstruction Real-ESRGAN Image processing
在线阅读 下载PDF
A Deep Learning Approach for Three-Dimensional Thyroid Nodule Detection from Ultrasound Images
11
作者 Huda F.Al-Shahad Razali Yaakob +3 位作者 Nurfadhlina Mohd Sharef Hazlina Hamdan Hasyma Abu Hassan Xiaoyi Jiang 《Computer Modeling in Engineering & Sciences》 2026年第3期997-1016,共20页
Currently,thyroid diseases are prevalent worldwide;therefore,it is necessary to develop techniques that help doctors improve their diagnostic skills for such diseases.In previous studies,2-dimensional convolutional ne... Currently,thyroid diseases are prevalent worldwide;therefore,it is necessary to develop techniques that help doctors improve their diagnostic skills for such diseases.In previous studies,2-dimensional convolutional neural network(2D CNN)techniques were employed to classify thyroid nodules as benign and malignant without detecting the presence of thyroid nodules in the obtained ultrasound images.To address this issue,we propose a 3-dimensional convolutional neural network(3D CNN)for thyroid nodule detection.The proposed CNN exploits the 3D information and spatial features contained in ultrasound images and generates distinctive features during its training using multiple samples,even for small nodules.In contrast,a 2D CNN only depends on spatial features.In this study,we used two datasets of 2210 ultrasound images obtained from the Sultan Abdul Aziz Shah Hospital in Malaysia,and a public open dataset,Digital Database Thyroid Image(DDTI).We created folders containing three images each,processed the images and extracted volumetric features suitable for the 3-dimensional convolutional neural network(3D CNN).The proposed model was assessed using four metrics:accuracy,recall,precision and F1 score.The results showed that the accuracy of the model in predicting the presence of thyroid nodules in ultrasound images was 96%.In conclusion,this study could help radiologists in hospitals and medical centres in classifying ultrasound images and detecting thyroid nodules. 展开更多
关键词 Thyroid nodules ultrasound image 3D CNN feature extraction deep learning
在线阅读 下载PDF
A nonrigid registration deep-learning model for solar photosphere images using a hybrid cross-attention mechanism
12
作者 Mengwei Ban Rui Wang +2 位作者 Zhi Xu Zhongyan Liu Xudong Nan 《Astronomical Techniques and Instruments》 2026年第2期112-120,共9页
Image registration within a solar photosphere sequence is crucial for observational solar physics studies requiring high spatial and temporal resolutions.Previously,we identified residual large-scale nonrigid distorti... Image registration within a solar photosphere sequence is crucial for observational solar physics studies requiring high spatial and temporal resolutions.Previously,we identified residual large-scale nonrigid distortions in high-resolution solar photosphere images from ground-based telescopes after high-resolution reconstruction.Because these distortions are not eliminated by conventional sequence correlation alignment,they can affect the analysis of small-scale activity in the solar photosphere.Here,we implemented an image registration model using deep learning(HCAM-Net)to solve the problem.Within an encoder-decoder framework,we introduced a hybrid attention mechanism to improve context information capture and extract accurate deformation fields.Analyzing solar photosphere images acquired by the New Vacuum Solar Telescope,we demonstrated that the proposed model effectively achieved highly accurate nonrigid image registration.Evaluation metrics and visualization results indicated that our model outperformed current state-of-the-art models,such as VoxelMorph and TransMorph,for nonrigid registration of solar photosphere images,with a structural similarity index measure of 0.965 and a coefficient of determination of 0.976. 展开更多
关键词 Solar image registration Deep learning Unsupervised learning Hybrid attention mechanism
在线阅读 下载PDF
C-SegNet:a practical approach for automated diabetic macular edema segmentation in optical coherence tomography images
13
作者 Zhi-Yuan Guan Ge Deng +6 位作者 Shi-Long Shi Zhen Tang Xian-Kun Dong Qiu-Yi Li Shu-Jing Shen Yong-Ling He Xue-Jun Qiu 《Biomedical Engineering Communications》 2026年第2期15-22,共8页
Background:Diabetic macular edema is a prevalent retinal condition and a leading cause of visual impairment among diabetic patients’Early detection of affected areas is beneficial for effective diagnosis and treatmen... Background:Diabetic macular edema is a prevalent retinal condition and a leading cause of visual impairment among diabetic patients’Early detection of affected areas is beneficial for effective diagnosis and treatment.Traditionally,diagnosis relies on optical coherence tomography imaging technology interpreted by ophthalmologists.However,this manual image interpretation is often slow and subjective.Therefore,developing automated segmentation for macular edema images is essential to enhance to improve the diagnosis efficiency and accuracy.Methods:In order to improve clinical diagnostic efficiency and accuracy,we proposed a SegNet network structure integrated with a convolutional block attention module(CBAM).This network introduces a multi-scale input module,the CBAM attention mechanism,and jump connection.The multi-scale input module enhances the network’s perceptual capabilities,while the lightweight CBAM effectively fuses relevant features across channels and spatial dimensions,allowing for better learning of varying information levels.Results:Experimental results demonstrate that the proposed network achieves an IoU of 80.127%and an accuracy of 99.162%.Compared to the traditional segmentation network,this model has fewer parameters,faster training and testing speed,and superior performance on semantic segmentation tasks,indicating its highly practical applicability.Conclusion:The C-SegNet proposed in this study enables accurate segmentation of Diabetic macular edema lesion images,which facilitates quicker diagnosis for healthcare professionals. 展开更多
关键词 multi-scale input diabetic macular edema image segmentation optical coherence tomography
在线阅读 下载PDF
Multi-Constraint Generative Adversarial Network-Driven Optimization Method for Super-Resolution Reconstruction of Remote Sensing Images
14
作者 Binghong Zhang Jialing Zhou +3 位作者 Xinye Zhou Jia Zhao Jinchun Zhu Guangpeng Fan 《Computers, Materials & Continua》 2026年第1期779-796,共18页
Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods ex... Remote sensing image super-resolution technology is pivotal for enhancing image quality in critical applications including environmental monitoring,urban planning,and disaster assessment.However,traditional methods exhibit deficiencies in detail recovery and noise suppression,particularly when processing complex landscapes(e.g.,forests,farmlands),leading to artifacts and spectral distortions that limit practical utility.To address this,we propose an enhanced Super-Resolution Generative Adversarial Network(SRGAN)framework featuring three key innovations:(1)Replacement of L1/L2 loss with a robust Charbonnier loss to suppress noise while preserving edge details via adaptive gradient balancing;(2)A multi-loss joint optimization strategy dynamically weighting Charbonnier loss(β=0.5),Visual Geometry Group(VGG)perceptual loss(α=1),and adversarial loss(γ=0.1)to synergize pixel-level accuracy and perceptual quality;(3)A multi-scale residual network(MSRN)capturing cross-scale texture features(e.g.,forest canopies,mountain contours).Validated on Sentinel-2(10 m)and SPOT-6/7(2.5 m)datasets covering 904 km2 in Motuo County,Xizang,our method outperforms the SRGAN baseline(SR4RS)with Peak Signal-to-Noise Ratio(PSNR)gains of 0.29 dB and Structural Similarity Index(SSIM)improvements of 3.08%on forest imagery.Visual comparisons confirm enhanced texture continuity despite marginal Learned Perceptual Image Patch Similarity(LPIPS)increases.The method significantly improves noise robustness and edge retention in complex geomorphology,demonstrating 18%faster response in forest fire early warning and providing high-resolution support for agricultural/urban monitoring.Future work will integrate spectral constraints and lightweight architectures. 展开更多
关键词 Charbonnier loss function deep learning generative adversarial network perceptual loss remote sensing image super-resolution
在线阅读 下载PDF
Effective Deep Learning Models for the Semantic Segmentation of 3D Human MRI Kidney Images
15
作者 Roshni Khedgaonkar Pravinkumar Sonsare +5 位作者 Kavita Singh Ayman Altameem Hameed R.Farhan Salil Bharany Ateeq Ur Rehman Ahmad Almogren 《Computers, Materials & Continua》 2026年第4期667-684,共18页
Recent studies indicate that millions of individuals suffer from renal diseases,with renal carcinoma,a type of kidney cancer,emerging as both a chronic illness and a significant cause of mortality.Magnetic Resonance I... Recent studies indicate that millions of individuals suffer from renal diseases,with renal carcinoma,a type of kidney cancer,emerging as both a chronic illness and a significant cause of mortality.Magnetic Resonance Imaging(MRI)and Computed Tomography(CT)have become essential tools for diagnosing and assessing kidney disorders.However,accurate analysis of thesemedical images is critical for detecting and evaluating tumor severity.This study introduces an integrated hybrid framework that combines three complementary deep learning models for kidney tumor segmentation from MRI images.The proposed framework fuses a customized U-Net and Mask R-CNN using a weighted scheme to achieve semantic and instance-level segmentation.The fused outputs are further refined through edge detection using Stochastic FeatureMapping Neural Networks(SFMNN),while volumetric consistency is ensured through Improved Mini-Batch K-Means(IMBKM)clustering integrated with an Encoder-Decoder Convolutional Neural Network(EDCNN).The outputs of these three stages are combined through a weighted fusion mechanism,with optimal weights determined empirically.Experiments on MRI scans from the TCGA-KIRC dataset demonstrate that the proposed hybrid framework significantly outperforms standalone models,achieving a Dice Score of 92.5%,an IoU of 87.8%,a Precision of 93.1%,a Recall of 90.8%,and a Hausdorff Distance of 2.8 mm.These findings validate that the weighted integration of complementary architectures effectively overcomes key limitations in kidney tumor segmentation,leading to improved diagnostic accuracy and robustness in medical image analysis. 展开更多
关键词 Kidney tumor(Blob)segmentation customU-Net andmask R-CNN stochastic featuremapping neural networks medical image analysis deep learning
在线阅读 下载PDF
Symbolic Images of Colors, Animals and Machines in The Red Badge of Courage
16
作者 王晓俊 《英语广场(学术研究)》 2014年第1期23-27,30,共6页
Abstract:Stephen Crane was an outstanding American novelist,poet,and journalist.He achieved great success in his literary works during his brief career.Crane’s most well-known work,The Red Badge of Courage,is commonl... Abstract:Stephen Crane was an outstanding American novelist,poet,and journalist.He achieved great success in his literary works during his brief career.Crane’s most well-known work,The Red Badge of Courage,is commonly believed to be the first great novel of the American Civil War,largely because of its vivid and detailed description of the experience of warfare.This paper analyzes the images of color,animal and machine,which convey Crane’s thoughts of war:war is full of chaos,brutality,and confusion,without any romantic elements or heroism. 展开更多
关键词 The Red Badge of Courage symbolic images color images animal images machine images
在线阅读 下载PDF
YOLO-S3DT:A Small Target Detection Model for UAV Images Based on YOLOv8 被引量:2
17
作者 Pengcheng Gao Zhenjiang Li 《Computers, Materials & Continua》 2025年第3期4555-4572,共18页
The application of deep learning for target detection in aerial images captured by Unmanned Aerial Vehicles(UAV)has emerged as a prominent research focus.Due to the considerable distance between UAVs and the photograp... The application of deep learning for target detection in aerial images captured by Unmanned Aerial Vehicles(UAV)has emerged as a prominent research focus.Due to the considerable distance between UAVs and the photographed objects,coupled with complex shooting environments,existing models often struggle to achieve accurate real-time target detection.In this paper,a You Only Look Once v8(YOLOv8)model is modified from four aspects:the detection head,the up-sampling module,the feature extraction module,and the parameter optimization of positive sample screening,and the YOLO-S3DT model is proposed to improve the performance of the model for detecting small targets in aerial images.Experimental results show that all detection indexes of the proposed model are significantly improved without increasing the number of model parameters and with the limited growth of computation.Moreover,this model also has the best performance compared to other detecting models,demonstrating its advancement within this category of tasks. 展开更多
关键词 Target detection UAV images detection small target detection YOLO
在线阅读 下载PDF
An EfficientNet integrated ResNet deep network and explainable AI for breast lesion classification from ultrasound images 被引量:1
18
作者 Kiran Jabeen Muhammad Attique Khan +4 位作者 Ameer Hamza Hussain Mobarak Albarakati Shrooq Alsenan Usman Tariq Isaac Ofori 《CAAI Transactions on Intelligence Technology》 2025年第3期842-857,共16页
Breast cancer is one of the major causes of deaths in women.However,the early diagnosis is important for screening and control the mortality rate.Thus for the diagnosis of breast cancer at the early stage,a computer-a... Breast cancer is one of the major causes of deaths in women.However,the early diagnosis is important for screening and control the mortality rate.Thus for the diagnosis of breast cancer at the early stage,a computer-aided diagnosis system is highly required.Ultrasound is an important examination technique for breast cancer diagnosis due to its low cost.Recently,many learning-based techniques have been introduced to classify breast cancer using breast ultrasound imaging dataset(BUSI)datasets;however,the manual handling is not an easy process and time consuming.The authors propose an EfficientNet-integrated ResNet deep network and XAI-based framework for accurately classifying breast cancer(malignant and benign).In the initial step,data augmentation is performed to increase the number of training samples.For this purpose,three-pixel flip mathematical equations are introduced:horizontal,vertical,and 90°.Later,two pretrained deep learning models were employed,skipped some layers,and fine-tuned.Both fine-tuned models are later trained using a deep transfer learning process and extracted features from the deeper layer.Explainable artificial intelligence-based analysed the performance of trained models.After that,a new feature selection technique is proposed based on the cuckoo search algorithm called cuckoo search controlled standard error mean.This technique selects the best features and fuses using a new parallel zeropadding maximum correlated coefficient features.In the end,the selection algorithm is applied again to the fused feature vector and classified using machine learning algorithms.The experimental process of the proposed framework is conducted on a publicly available BUSI and obtained 98.4%and 98%accuracy in two different experiments.Comparing the proposed framework is also conducted with recent techniques and shows improved accuracy.In addition,the proposed framework was executed less than the original deep learning models. 展开更多
关键词 augmentation breast cancer CLASSIFICATION deep learning OPTIMIZATION ultrasound images
在线阅读 下载PDF
Enhanced Kinship Verification through Ear Images:A Comparative Study of CNNs,Attention Mechanisms,and MLP Mixer Models 被引量:1
19
作者 Thien-Tan Cao Huu-Thanh Duong +3 位作者 Viet-Tuan Le Hau Nguyen Trung Vinh Truong Hoang Kiet Tran-Trung 《Computers, Materials & Continua》 2025年第6期4373-4391,共19页
Kinship verification is a key biometric recognition task that determines biological relationships based on physical features.Traditional methods predominantly use facial recognition,leveraging established techniques a... Kinship verification is a key biometric recognition task that determines biological relationships based on physical features.Traditional methods predominantly use facial recognition,leveraging established techniques and extensive datasets.However,recent research has highlighted ear recognition as a promising alternative,offering advantages in robustness against variations in facial expressions,aging,and occlusions.Despite its potential,a significant challenge in ear-based kinship verification is the lack of large-scale datasets necessary for training deep learning models effectively.To address this challenge,we introduce the EarKinshipVN dataset,a novel and extensive collection of ear images designed specifically for kinship verification.This dataset consists of 4876 high-resolution color images from 157 multiracial families across different regions,forming 73,220 kinship pairs.EarKinshipVN,a diverse and large-scale dataset,advances kinship verification research using ear features.Furthermore,we propose the Mixer Attention Inception(MAI)model,an improved architecture that enhances feature extraction and classification accuracy.The MAI model fuses Inceptionv4 and MLP Mixer,integrating four attention mechanisms to enhance spatial and channel-wise feature representation.Experimental results demonstrate that MAI significantly outperforms traditional backbone architectures.It achieves an accuracy of 98.71%,surpassing Vision Transformer models while reducing computational complexity by up to 95%in parameter usage.These findings suggest that ear-based kinship verification,combined with an optimized deep learning model and a comprehensive dataset,holds significant promise for biometric applications. 展开更多
关键词 Biometric analytics ear kin Inceptionv4 kinship verification KIN ear images
在线阅读 下载PDF
Deep learning-based multi-task prediction of response to neoadjuvant chemotherapy using multiscale whole slide images in breast cancer:A multicenter study 被引量:1
20
作者 Qin Wang Feng Zhao +19 位作者 Haicheng Zhang Tongpeng Chu Qi Wang Xipeng Pan Yuqian Chen Heng Zhou Tiantian Zheng Ziyin Li Fan Lin Haizhu Xie Heng Ma Lan Liu Lina Zhang Qin Li Weiwei Wang Yi Dai Ruijun Tang Jigang Wang Ping Yang Ning Mao 《Chinese Journal of Cancer Research》 2025年第1期28-47,共20页
Objective:Early predicting response before neoadjuvant chemotherapy(NAC)is crucial for personalized treatment plans for locally advanced breast cancer patients.We aim to develop a multi-task model using multiscale who... Objective:Early predicting response before neoadjuvant chemotherapy(NAC)is crucial for personalized treatment plans for locally advanced breast cancer patients.We aim to develop a multi-task model using multiscale whole slide images(WSIs)features to predict the response to breast cancer NAC more finely.Methods:This work collected 1,670 whole slide images for training and validation sets,internal testing sets,external testing sets,and prospective testing sets of the weakly-supervised deep learning-based multi-task model(DLMM)in predicting treatment response and pCR to NAC.Our approach models two-by-two feature interactions across scales by employing concatenate fusion of single-scale feature representations,and controls the expressiveness of each representation via a gating-based attention mechanism.Results:In the retrospective analysis,DLMM exhibited excellent predictive performance for the prediction of treatment response,with area under the receiver operating characteristic curves(AUCs)of 0.869[95%confidence interval(95%CI):0.806−0.933]in the internal testing set and 0.841(95%CI:0.814−0.867)in the external testing sets.For the pCR prediction task,DLMM reached AUCs of 0.865(95%CI:0.763−0.964)in the internal testing and 0.821(95%CI:0.763−0.878)in the pooled external testing set.In the prospective testing study,DLMM also demonstrated favorable predictive performance,with AUCs of 0.829(95%CI:0.754−0.903)and 0.821(95%CI:0.692−0.949)in treatment response and pCR prediction,respectively.DLMM significantly outperformed the baseline models in all testing sets(P<0.05).Heatmaps were employed to interpret the decision-making basis of the model.Furthermore,it was discovered that high DLMM scores were associated with immune-related pathways and cells in the microenvironment during biological basis exploration.Conclusions:The DLMM represents a valuable tool that aids clinicians in selecting personalized treatment strategies for breast cancer patients. 展开更多
关键词 Artificial intelligence breast cancer digital pathology whole slide images
暂未订购
上一页 1 2 250 下一页 到第
使用帮助 返回顶部