Lung cancer(LC)is a major cancer which accounts for higher mortality rates worldwide.Doctors utilise many imaging modalities for identifying lung tumours and their severity in earlier stages.Nowadays,machine learning(...Lung cancer(LC)is a major cancer which accounts for higher mortality rates worldwide.Doctors utilise many imaging modalities for identifying lung tumours and their severity in earlier stages.Nowadays,machine learning(ML)and deep learning(DL)methodologies are utilised for the robust detection and prediction of lung tumours.Recently,multi modal imaging emerged as a robust technique for lung tumour detection by combining various imaging features.To cope with that,we propose a novel multi modal imaging technique named versatile scale malleable image integration and patch wise attention network(VSMI2−PANet)which adopts three imaging modalities named computed tomography(CT),magnetic resonance imaging(MRI)and single photon emission computed tomography(SPECT).The designed model accepts input from CT and MRI images and passes it to the VSMI2 module that is composed of three sub-modules named image cropping module,scale malleable convolution layer(SMCL)and PANet module.CT and MRI images are subjected to image cropping module in a parallel manner to crop the meaningful image patches and provide them to the SMCL module.The SMCL module is composed of adaptive convolutional layers that investigate those patches in a parallel manner by preserving the spatial information.The output from the SMCL is then fused and provided to the PANet module.The PANet module examines the fused patches by analysing its height,width and channels of the image patch.As a result,it provides an output as high-resolution spatial attention maps indicating the location of suspicious tumours.The high-resolution spatial attention maps are then provided as an input to the backbone module which uses light wave transformer(LWT)for segmenting the lung tumours into three classes,such as normal,benign and malignant.In addition,the LWT also accepts SPECT image as input for capturing the variations precisely to segment the lung tumours.The performance of the proposed model is validated using several performance metrics,such as accuracy,precision,recall,F1-score and AUC curve,and the results show that the proposed work outperforms the existing approaches.展开更多
Acute Bilirubin Encephalopathy(ABE)is a significant threat to neonates and it leads to disability and high mortality rates.Detecting and treating ABE promptly is important to prevent further complications and long-ter...Acute Bilirubin Encephalopathy(ABE)is a significant threat to neonates and it leads to disability and high mortality rates.Detecting and treating ABE promptly is important to prevent further complications and long-term issues.Recent studies have explored ABE diagnosis.However,they often face limitations in classification due to reliance on a single modality of Magnetic Resonance Imaging(MRI).To tackle this problem,the authors propose a Tri-M2MT model for precise ABE detection by using tri-modality MRI scans.The scans include T1-weighted imaging(T1WI),T2-weighted imaging(T2WI),and apparent diffusion coefficient maps to get indepth information.Initially,the tri-modality MRI scans are collected and preprocessesed by using an Advanced Gaussian Filter for noise reduction and Z-score normalisation for data standardisation.An Advanced Capsule Network was utilised to extract relevant features by using Snake Optimization Algorithm to select optimal features based on feature correlation with the aim of minimising complexity and enhancing detection accuracy.Furthermore,a multi-transformer approach was used for feature fusion and identify feature correlations effectively.Finally,accurate ABE diagnosis is achieved through the utilisation of a SoftMax layer.The performance of the proposed Tri-M2MT model is evaluated across various metrics,including accuracy,specificity,sensitivity,F1-score,and ROC curve analysis,and the proposed methodology provides better performance compared to existing methodologies.展开更多
The multi-modal characteristics of mineral particles play a pivotal role in enhancing the classification accuracy,which is critical for obtaining a profound understanding of the Earth's composition and ensuring ef...The multi-modal characteristics of mineral particles play a pivotal role in enhancing the classification accuracy,which is critical for obtaining a profound understanding of the Earth's composition and ensuring effective exploitation utilization of its resources.However,the existing methods for classifying mineral particles do not fully utilize these multi-modal features,thereby limiting the classification accuracy.Furthermore,when conventional multi-modal image classification methods are applied to planepolarized and cross-polarized sequence images of mineral particles,they encounter issues such as information loss,misaligned features,and challenges in spatiotemporal feature extraction.To address these challenges,we propose a multi-modal mineral particle polarization image classification network(MMGC-Net)for precise mineral particle classification.Initially,MMGC-Net employs a two-dimensional(2D)backbone network with shared parameters to extract features from two types of polarized images to ensure feature alignment.Subsequently,a cross-polarized intra-modal feature fusion module is designed to refine the spatiotemporal features from the extracted features of the cross-polarized sequence images.Ultimately,the inter-modal feature fusion module integrates the two types of modal features to enhance the classification precision.Quantitative and qualitative experimental results indicate that when compared with the current state-of-the-art multi-modal image classification methods,MMGC-Net demonstrates marked superiority in terms of mineral particle multi-modal feature learning and four classification evaluation metrics.It also demonstrates better stability than the existing models.展开更多
Raman imaging,as a molecular spectroscopy technique,has been widely studied and applied in research fields such as life sciences and food safety due to its excellent specificity and high resolution.However,its develop...Raman imaging,as a molecular spectroscopy technique,has been widely studied and applied in research fields such as life sciences and food safety due to its excellent specificity and high resolution.However,its development still faces challenges such as weak signals,slow acquisition speed,and insufficient penetration depth.In recent years,the rapid development of aggregate science has provided new insights for addressing these limitations.Aggregation-induced emission(AIE)materials exhibit enhanced signals in the aggregated state,which may compensate for the inherent weak Raman signals.This article reviews the cutting-edge progress of Raman imaging technology and its current status in cross-disciplinary research with aggregate science,emphasizing the strategy of constructing AIE-Raman dual-responsive probes through molecular engineering to achieve functional complementarity between fluorescence localization and Raman quantification,thereby significantly improving detection sensitivity and specificity.These probes have demonstrated single-cell resolution and high spatiotemporal accuracy in applications such as tumor surgical navigation,diagnosis and treatment of drug-resistant bacteria,and dynamic monitoring of organelles.We also analyze the bottlenecks in this field,such as biological safety and the complexity of molecular design,and outline the future development directions,including intelligent responsive probes,artificial intelligence-assisted analysis,and multimodal fusion platforms.The integration of Raman imaging and AIE sheds new light in the field of medical imaging.展开更多
Metasurfaces are artificial structures that can finely control the characteristics of electromagnetic waves at subwavelength scales,and they are widely used to manipulate the propagation,phase,amplitude,and polariza⁃t...Metasurfaces are artificial structures that can finely control the characteristics of electromagnetic waves at subwavelength scales,and they are widely used to manipulate the propagation,phase,amplitude,and polariza⁃tion of light.In this work,a bound state in the continuum(BIC)structure based on a metallic metasurface is pro⁃posed.By adjusting the metallic structure using CST and COMSOL software,a significant quasi-BIC peak can be achieved at a frequency of 0.8217 terahertz(THz).Through multi-level expansion analysis,it is found that the electric dipole(ED)is the main factor contributing to the resonant characteristics of the structure.By leveraging the characteristics of BIC,an imaging system was created and operated.According to the simulation results,the imaging system demonstrated excellent sensitivity and resolution,revealing the great potential of terahertz imag⁃ing.This research not only provides new ideas for the creation of BIC structures but also offers an effective refer⁃ence for the development of high-performance terahertz imaging technology.展开更多
Two novel aggregation-induced emission(AIE)-active probes,TPA-H and TPA-2 F,were designed and synthesized based on a triphenylamine(TPA)core.Systematic characterization demonstrated that both probes exhibit excellent ...Two novel aggregation-induced emission(AIE)-active probes,TPA-H and TPA-2 F,were designed and synthesized based on a triphenylamine(TPA)core.Systematic characterization demonstrated that both probes exhibit excellent biocompatibility(cell viability>90%at concentrations up to 50μmol/L)and outstanding LD-targeting speci⁃ficity with minimal colocalization with other organelles such as mitochondria and lysosomes.During early differentia⁃tion of 3 T 3-L 1 adipocytes,both TPA-2 F and TPA-H clearly visualized small and nascent LDs that were difficult to be detected with BODIPY,indicating superior imaging sensitivity compared to the existing fluorescent probes for LDs.Moreover,TPA-2 F demonstrated exceptional photostability,retaining over 90%of its initial fluorescence intensity after 100 continuous laser scanning cycles,significantly outperforming TPA-H.This work not only provides two high-performance LD imaging tools but also highlights the potential of AIE luminogens(AIEgens)in organelle-specific bioimaging,offering promising avenues for early diagnosis and mechanistic research of lipid-related metabolic diseases.展开更多
Neuromorphic cameras,or dynamic vision sensors,are bio-inspired event cameras that measure changes in the image brightness asynchronously and independently at the pixel level.Recently,they garnered increasing interest...Neuromorphic cameras,or dynamic vision sensors,are bio-inspired event cameras that measure changes in the image brightness asynchronously and independently at the pixel level.Recently,they garnered increasing interest due to their extremely high temporal resolution,wide dynamic range,low power consumption,and high pixel bandwidth.Despite their advantages,most existing three-dimensional (3D) event imaging solutions rely on multicamera configurations,which are costly,complex,and challenging to synchronize.In this study,we introduce a new framework for four-dimensional (4D) event imaging using a single static neuromorphic camera.We take advantage of the inherent sparsity of event data to combine optically encoded stereo channels into a single event camera.By utilizing optical channel multiplexing,we maintain sensor resolution while retaining the key advantages of event cameras.展开更多
Gait recognition is a key biometric for long-distance identification,yet its performance is severely degraded by real-world challenges such as varying clothing,carrying conditions,and changing viewpoints.While combini...Gait recognition is a key biometric for long-distance identification,yet its performance is severely degraded by real-world challenges such as varying clothing,carrying conditions,and changing viewpoints.While combining silhouette and skeleton data is a promising direction,effectively fusing these heterogeneous modalities and adaptively weighting their contributions in response to diverse conditions remains a central problem.This paper introduces GaitMAFF,a novelMulti-modal Adaptive Feature Fusion Network,to address this challenge.Our approach first transforms discrete skeleton joints into a dense SkeletonMap representation to align with silhouettes,then employs an attention-based module to dynamically learn the fusion weights between the two modalities.These fused features are processed by a powerful spatio-temporal backbone withWeighted Global-Local Feature FusionModules(WFFM)to learn a discriminative representation.Extensive experiments on the challenging CCPG and Gait3D datasets show that GaitMAFF achieves state-of-the-art performance,with an average Rank-1 accuracy of 84.6%on CCPG and 58.7%on Gait3D.These results demonstrate that our adaptive fusion strategy effectively integrates complementary multimodal information,significantly enhancing gait recognition robustness and accuracy in complex scenes and providing a practical solution for real-world applications.展开更多
Autism spectrum disorder(AsD)is a highly heterogeneous neurodevelopmental disorder.Early diagnosis and intervention are crucial for improving outcomes.Traditional single-modality diagnostic methods are subjective,limi...Autism spectrum disorder(AsD)is a highly heterogeneous neurodevelopmental disorder.Early diagnosis and intervention are crucial for improving outcomes.Traditional single-modality diagnostic methods are subjective,limited,and struggle to reveal the underlying pathological mechanisms.In contrast,multimodal data analysis integrates behavioral,physiological,and neuroimaging information with advanced machine-learning and deeplearning algorithms to overcome these limitations.In this review,we surveyed the recent pediatric AsD literature,highlighting artificial intelligence-driven diagnostic techniques,multimodal data fusion strategies,and emerging trends in ASD assessment.We surveyed studies that integrated two or more modalities and summarized the fusion levels,learning paradigms,tasks,datasets,and metrics.Multimodal approaches outperform singlemodality baselines in classification,severity estimation,and subtyping by leveraging complementary information and reducing modality-specific biases.Multimodal approaches significantly enhance diagnostic accuracy and comprehensiveness,enabling early screening of AsD,symptom subtyping,severity assessment,and personalized interventions.Advances in multimodal fusion techniques have promoted progress in precision medicine for the treatment of ASD.展开更多
Background:Sarcomatoid carcinoma of the ureter(SCU)is a highly aggressive and relatively uncommon malignant tumor of the urinary tract.Its frequency is quite low,and its prognosis is very bad when compared to other ca...Background:Sarcomatoid carcinoma of the ureter(SCU)is a highly aggressive and relatively uncommon malignant tumor of the urinary tract.Its frequency is quite low,and its prognosis is very bad when compared to other cancers of the urinary system.SCU clinical reports are still hard to come by.MRI and PEI/CT imaging of ureteral sarcomatoid cancer is presented in this case to promote diagnostic awareness and comprehension of the imaging characteristics of this uncommon illness.Method:The patient had ureteral sarcomatoid cancer,which was verified by pathological investigation after ureteroscopic biopsy.The patient’s clinical information,imaging results,surgical outcomes,and pathological findings were gathered.A retrospective study was carried out in combinationwith pertinent national and international literature.Results:An 84-year-old female patient was admitted for“left flank discomfort lasting over one month.”MRI revealed an irregular soft tissue mass in the middle-lower segment of the left ureter.T2-weighted imaging showed an unevenly slightly hyperintense signal.Diffusion-weighted imaging demonstrated restricted diffusion.Contrastenhanced imaging exhibited heterogeneous enhancement.PET/CT demonstrated significantly increased fluorodeoxyglucose metabolism in the mass with secondary left upper urinary tract obstruction.Concurrent findings included a solitary metastatic lesion in hepatic segment S6 and multiple lymph node metastases along the left common iliac and external iliac arteries.Preoperative diagnosis suggested a malignant tumor of the ureter.The patient underwent left nephroureteroscopy with biopsy,and the postoperative pathological diagnosis was ureteral sarcomatoid carcinoma.Conclusion:Ureteral sarcomatoid carcinoma is a rare,highly malignant,and aggressive tumor with nonspecific imaging features,typically presenting as an invasively growing mass.Diagnosis relies on postoperative pathology and immunohistochemical examination.MRI and PET/CT scans are valuable for preoperative localization and characterization,tumor staging,treatment planning,and postoperative follow-up.The prognosis is extremely negative.The main treatment option is radical surgery,although constant monitoring is necessary since early recurrence and metastases are frequent after surgery.展开更多
The level of glutathione(GSH)is significantly associated with numerous pathological processes,thus,real-time detection of the GSH level is of significance for early diagnosis of GSH-related diseases.Herein,we develope...The level of glutathione(GSH)is significantly associated with numerous pathological processes,thus,real-time detection of the GSH level is of significance for early diagnosis of GSH-related diseases.Herein,we developed in vivo second near-infrared(NIR-II)window fluorescence(FL)and ratiometric photoacoustic(RPA)dual-modality imaging of GSH using a GSH-activatable probe(LET-14).LET-14 was synthesized based on a rhodamine hybrid xanthene skeleton with a FL shielding 2,4-dinitrobenzene sulfonyl group that can be specifically cleaved by GSH,thus resulting in a markedly bathochromic-shift absorption,a 6.5-fold increase in NIR-II FL intensity(FL_(920))and a 13-fold increase in RPA signal(PA_(880)/PA_(705))in vitro.Intriguingly,LET-14 exhibits good selectivity and sensitivity for NIR-II FL and RPA dual-modality imaging of GSH in 4T1 tumor-bearing mouse model.Our findings develop an in vivo detection tool of GSH,which has great potential in the field of cancer diagnosis.展开更多
In multi-modal emotion recognition,excessive reliance on historical context often impedes the detection of emotional shifts,while modality heterogeneity and unimodal noise limit recognition performance.Existing method...In multi-modal emotion recognition,excessive reliance on historical context often impedes the detection of emotional shifts,while modality heterogeneity and unimodal noise limit recognition performance.Existing methods struggle to dynamically adjust cross-modal complementary strength to optimize fusion quality and lack effective mechanisms to model the dynamic evolution of emotions.To address these issues,we propose a multi-level dynamic gating and emotion transfer framework for multi-modal emotion recognition.A dynamic gating mechanism is applied across unimodal encoding,cross-modal alignment,and emotion transfer modeling,substantially improving noise robustness and feature alignment.First,we construct a unimodal encoder based on gated recurrent units and feature-selection gating to suppress intra-modal noise and enhance contextual representation.Second,we design a gated-attention crossmodal encoder that dynamically calibrates the complementary contributions of visual and audio modalities to the dominant textual features and eliminates redundant information.Finally,we introduce a gated enhanced emotion transfer module that explicitly models the temporal dependence of emotional evolution in dialogues via transfer gating and optimizes continuity modeling with a comparative learning loss.Experimental results demonstrate that the proposed method outperforms state-of-the-art models on the public MELD and IEMOCAP datasets.展开更多
The photoacoustic imaging of lipid is intrinsically constrained by the feeble nature of endogenous lipid signals,posing a persistent sensitivity challenge that demands innovative solutions.Although adopting high-effic...The photoacoustic imaging of lipid is intrinsically constrained by the feeble nature of endogenous lipid signals,posing a persistent sensitivity challenge that demands innovative solutions.Although adopting high-efficiency excitation and detection elements may improve the imaging sensitivity to a certain extent,the application of the elements is inevitably subject to various limitations in practical applications,particularly during in vivo imaging and endoscopic imaging.In this study,we propose a multi-combinatorial approach to enhance the sensitivity of lipid photoacoustic imaging.The approach involves wavelet transform processing of one-dimensional A-line signals,gradient-based denoising of two-dimensional B-scan images,and finally,threedimensional spatial weighted averaging of the data processed by the previous two steps.This method not only significantly improves the signal-to-noise ratio(SNR)in distinguished feature regions of the image by around 10 dB,but also efficiently extracts weak signals with no distinct features in the original image.After processing with this method,the images acquired under single scanning were compared with those obtained under multiple scanning.The results showed highly consistent image features,with the structural similarity index increasing from 0.2 to 0.8,confirming the accuracy and reliability of the multi-combinatorial approach.展开更多
Medical imaging is essential in modern health care,allowing accurate diagnosis and effective treatment planning.These images,however,often demonstrate low contrast,noise,and brightness distortion that reduce their dia...Medical imaging is essential in modern health care,allowing accurate diagnosis and effective treatment planning.These images,however,often demonstrate low contrast,noise,and brightness distortion that reduce their diagnostic reliability.This review presents a structured and comprehensive analysis of advanced histogram equalization(HE)-based techniques for medical image enhancement.Our review methodology encompasses:(1)classical HE approaches and related limitations in medical domains;(2)adaptive schemes like Adaptive Histogram Equalization(AHE)and Contrast Limited Adaptive Histogrma Equalization(CLAHE)and their advance variants;(3)brightnesspreserving schemes like BBHE and MMBEBHE and related algorithms;(4)dynamic and recursive histogram equalization methods incorporating DHE and RMSHE;(5)fuzzy logic-based enhancement methodologies addressing uncertainty and noise in medical images;and(6)hybrid optimization methodologies through the application of metaheuristic algorithms(World Cup Optimization,Particle Swarm Optimization,Genetic Algorithms,along with histogram-based methodologies.)There is also a comparative discussion given based on contrast improvement,image brightness preservation,noise management,and computational efficiency.Such advancements have better capabilities of improving image quality,which is more important for improved diagnosis and image analysis.展开更多
Electrons in the energy range of 10–100 keV are important energetic particle components in the magnetosphere,and they play a key role in many physical processes in the magnetosphere.However,many scientific questions ...Electrons in the energy range of 10–100 keV are important energetic particle components in the magnetosphere,and they play a key role in many physical processes in the magnetosphere.However,many scientific questions about these processes are still unanswered.High spatiotemporal and energy-resolution imaging detection of 10–100 keV electrons is of great significance for solving these scientific problems.The traditional space particle detection technology cannot effectively detect the medium-energy electrons in this energy range.In this project,we propose combining low-noise particle detection technology with pinhole imaging technology to achieve high-resolution imaging detection of 10–100 keV medium-energy electrons in the magnetosphere,and at the same time achieve miniaturization(≤3.2 kg and size of 150×150×170 mm)and low power consumption(≤5 W)of the instrument,which can be used for space physics research and space weather applications in the future.展开更多
Integrating multiple medical imaging techniques,including Magnetic Resonance Imaging(MRI),Computed Tomography,Positron Emission Tomography(PET),and ultrasound,provides a comprehensive view of the patient health status...Integrating multiple medical imaging techniques,including Magnetic Resonance Imaging(MRI),Computed Tomography,Positron Emission Tomography(PET),and ultrasound,provides a comprehensive view of the patient health status.Each of these methods contributes unique diagnostic insights,enhancing the overall assessment of patient condition.Nevertheless,the amalgamation of data from multiple modalities presents difficulties due to disparities in resolution,data collection methods,and noise levels.While traditional models like Convolutional Neural Networks(CNNs)excel in single-modality tasks,they struggle to handle multi-modal complexities,lacking the capacity to model global relationships.This research presents a novel approach for examining multi-modal medical imagery using a transformer-based system.The framework employs self-attention and cross-attention mechanisms to synchronize and integrate features across various modalities.Additionally,it shows resilience to variations in noise and image quality,making it adaptable for real-time clinical use.To address the computational hurdles linked to transformer models,particularly in real-time clinical applications in resource-constrained environments,several optimization techniques have been integrated to boost scalability and efficiency.Initially,a streamlined transformer architecture was adopted to minimize the computational load while maintaining model effectiveness.Methods such as model pruning,quantization,and knowledge distillation have been applied to reduce the parameter count and enhance the inference speed.Furthermore,efficient attention mechanisms such as linear or sparse attention were employed to alleviate the substantial memory and processing requirements of traditional self-attention operations.For further deployment optimization,researchers have implemented hardware-aware acceleration strategies,including the use of TensorRT and ONNX-based model compression,to ensure efficient execution on edge devices.These optimizations allow the approach to function effectively in real-time clinical settings,ensuring viability even in environments with limited resources.Future research directions include integrating non-imaging data to facilitate personalized treatment and enhancing computational efficiency for implementation in resource-limited environments.This study highlights the transformative potential of transformer models in multi-modal medical imaging,offering improvements in diagnostic accuracy and patient care outcomes.展开更多
The unique advantage of x-ray ghost imaging(XGI)is its potential in low dose radiology.One of the practical ways to reduce the radiation exposure is to reduce the measurements while remaining sufficient image quality....The unique advantage of x-ray ghost imaging(XGI)is its potential in low dose radiology.One of the practical ways to reduce the radiation exposure is to reduce the measurements while remaining sufficient image quality.Synthetic aperture x-ray ghost imaging(SAXGI)is invented to achieve megapixel XGI with limited measurements,which is expected to implement XGI simultaneously with large field of view and low radiation exposure.In this paper,we experimentally investigate the effect of measurements reduction on the spatial resolution and image quality of SAXGI with standard sample and biomedical specimen.The results with a resolution chart demonstrated that at 360 measurements,SAXGI successfully retrieved the sample image of 1960×1960 pixels with spatial resolution of 4μm.With measurement reduction,the spatial resolution deteriorates but the sparser structures are still discernable.Even with measurements reduced to 10,a spatial resolution of 10μm can still be achieved by SAXGI.A biomedical sample of a fish specimen is employed to evaluate the method and the fish image of 2000×1000 pixels with an SSIM of 0.962 is reconstructed by SAXGI with 770measurements,corresponding to an accumulative exposure reduction of more than 2 times.With the measurements reduced to 10 which corresponds to 1/160 of the accumulative radiation exposure for conventional radiology,bulky structure like the fish skeleton can still be definitely discerned and the SSIM for the reconstructed image still retained 0.9179.Results of this paper demonstrate that measurements reduction is practicable for the radiation exposure reduction of the sample,which implicates that SAXGI with limited measurements is an efficient solution for low dose radiology.展开更多
Generative Adversarial Networks(GANs)have become valuable tools in medical imaging,enabling realistic image synthesis for enhancement,augmentation,and restoration.However,their integration into clinical workflows rais...Generative Adversarial Networks(GANs)have become valuable tools in medical imaging,enabling realistic image synthesis for enhancement,augmentation,and restoration.However,their integration into clinical workflows raises concerns,particularly the risk of subtle distortions or hallucinations that may undermine diagnostic accuracy and weaken trust in AI-assisted decision-making.To address this challenge,we propose a hybrid deep learning framework designed to detect GAN-induced artifacts in medical images,thereby reinforcing the reliability of AI-driven diagnostics.The framework integrates low-level statistical descriptors,including high-frequency residuals and Gray-Level Co-occurrence Matrix(GLCM)texture features,with high-level semantic representations extracted from a pre-trained ResNet18.This dual-stream approach enables detection of both pixel-level anomalies and structural inconsistencies introduced by GAN-based manipulation.We validated the framework on a curated dataset of 10,000 medical images,evenly split between authentic and GAN-generated samples across four modalities:MRI,CT,X-ray,and fundus photography.To improve generalizability to real-world clinical settings,we incorporated domain adaptation strategies such as adversarial training and style transfer,reducing domain shift by 15%.Experimental results demonstrate robust performance,achieving 92.6%accuracy and an F1-score of 0.91 on synthetic test data,and maintaining strong performance on real-world GAN-modified images with 87.3%accuracy and an F1-score of 0.85.Additionally,the model attained an AUC of 0.96 and an average precision of 0.92,outperforming conventional GAN detection pipelines and baseline Convolutional Neural Network(CNN)architectures.These findings establish the proposed framework as an effective and reliable solution for detecting GAN-induced hallucinations in medical imaging,representing an important step toward building trustworthy and clinically deployable AI systems.展开更多
Prostate cancer(PCa)is characterized by high incidence and propensity for easy metastasis,presenting significant challenges in clinical diagnosis and treatment.Tumor microenvironment(TME)-responsive nanomaterials prov...Prostate cancer(PCa)is characterized by high incidence and propensity for easy metastasis,presenting significant challenges in clinical diagnosis and treatment.Tumor microenvironment(TME)-responsive nanomaterials provide a promising prospect for imaging-guided precision therapy.Considering that tumor-derived alkaline phosphatase(ALP)is over-expressed in metastatic PCa,it makes a great chance to develop a theranostics system with ALP responsive in the TME.Herein,an ALP-responsive aggregationinduced emission luminogens(AIEgens)nanoprobe AMNF self-assembly was designed for enhancing the diagnosis and treatment of metastatic PCa.The nanoprobe exhibited self-aggregation in the presence of ALP resulted in aggregation-induced fluorescence,and enhanced accumulation and prolonged retention period at the tumor site.In terms of detection,the fluorescence(FL)/computed tomography(CT)/magnetic resonance(MR)multi-mode imaging effect of nanoprobe was significantly improved post-aggregation,enabling precise diagnosis through the amalgamation of multiple imaging modes.Enhanced CT/MR imaging can achieve assist preoperative tumor diagnosis,and enhanced FL imaging technology can achieve“intraoperative visual navigation”,showing its potential application value in clinical tumor detection and surgical guidance.In terms of treatment,AMNF showed strong absorption in the near infrared region after aggregation,which improved the photothermal treatment effect.Overall,our work developed an effective aggregation-enhanced theranostic strategy for ALP-related cancers.展开更多
Laser wakefield accelerators(LWFAs)offer acceleration gradients up to 1000 times higher than those of conventional radio-frequency accelerators,offering a pathway to significantly more compact and cost-effective accel...Laser wakefield accelerators(LWFAs)offer acceleration gradients up to 1000 times higher than those of conventional radio-frequency accelerators,offering a pathway to significantly more compact and cost-effective accelerator systems.This breakthrough opens up new possibilities for laboratory-scale light sources.All-optical inverse Compton scattering(AOCS)sources driven by LWFAs produce high-brightness,quasimonochromatic X rays with micrometer-scale source sizes,delivering the spatial coherence and resolution required for X-ray phase-contrast imaging(XPCI).These features position AOCS X-ray sources as promising tools for applications in biology,medicine,physics,and materials science.However,previous AOCS-based imaging studies have primarily focused on X-ray absorption imaging.In this work,we report successful experimental demonstrations of edge-enhanced in-line XPCI using energy-tunable,quasi-monochromatic AOCS X rays.With a spatial resolution of~20μm,our results clearly show the potential of high-resolution,AOCS-based XPCI applications.展开更多
基金supported by the VTT Technical Research Centre of Finland and the work of Nayef Alqahtani is supported by the Deanship of Scientific Research,Vice Presidency for Graduate Studies and Scientific Research,King Faisal University,Saudi Arabia(Grant KFU251882).
文摘Lung cancer(LC)is a major cancer which accounts for higher mortality rates worldwide.Doctors utilise many imaging modalities for identifying lung tumours and their severity in earlier stages.Nowadays,machine learning(ML)and deep learning(DL)methodologies are utilised for the robust detection and prediction of lung tumours.Recently,multi modal imaging emerged as a robust technique for lung tumour detection by combining various imaging features.To cope with that,we propose a novel multi modal imaging technique named versatile scale malleable image integration and patch wise attention network(VSMI2−PANet)which adopts three imaging modalities named computed tomography(CT),magnetic resonance imaging(MRI)and single photon emission computed tomography(SPECT).The designed model accepts input from CT and MRI images and passes it to the VSMI2 module that is composed of three sub-modules named image cropping module,scale malleable convolution layer(SMCL)and PANet module.CT and MRI images are subjected to image cropping module in a parallel manner to crop the meaningful image patches and provide them to the SMCL module.The SMCL module is composed of adaptive convolutional layers that investigate those patches in a parallel manner by preserving the spatial information.The output from the SMCL is then fused and provided to the PANet module.The PANet module examines the fused patches by analysing its height,width and channels of the image patch.As a result,it provides an output as high-resolution spatial attention maps indicating the location of suspicious tumours.The high-resolution spatial attention maps are then provided as an input to the backbone module which uses light wave transformer(LWT)for segmenting the lung tumours into three classes,such as normal,benign and malignant.In addition,the LWT also accepts SPECT image as input for capturing the variations precisely to segment the lung tumours.The performance of the proposed model is validated using several performance metrics,such as accuracy,precision,recall,F1-score and AUC curve,and the results show that the proposed work outperforms the existing approaches.
文摘Acute Bilirubin Encephalopathy(ABE)is a significant threat to neonates and it leads to disability and high mortality rates.Detecting and treating ABE promptly is important to prevent further complications and long-term issues.Recent studies have explored ABE diagnosis.However,they often face limitations in classification due to reliance on a single modality of Magnetic Resonance Imaging(MRI).To tackle this problem,the authors propose a Tri-M2MT model for precise ABE detection by using tri-modality MRI scans.The scans include T1-weighted imaging(T1WI),T2-weighted imaging(T2WI),and apparent diffusion coefficient maps to get indepth information.Initially,the tri-modality MRI scans are collected and preprocessesed by using an Advanced Gaussian Filter for noise reduction and Z-score normalisation for data standardisation.An Advanced Capsule Network was utilised to extract relevant features by using Snake Optimization Algorithm to select optimal features based on feature correlation with the aim of minimising complexity and enhancing detection accuracy.Furthermore,a multi-transformer approach was used for feature fusion and identify feature correlations effectively.Finally,accurate ABE diagnosis is achieved through the utilisation of a SoftMax layer.The performance of the proposed Tri-M2MT model is evaluated across various metrics,including accuracy,specificity,sensitivity,F1-score,and ROC curve analysis,and the proposed methodology provides better performance compared to existing methodologies.
基金supported by the National Natural Science Foundation of China(Grant Nos.62071315 and 62271336).
文摘The multi-modal characteristics of mineral particles play a pivotal role in enhancing the classification accuracy,which is critical for obtaining a profound understanding of the Earth's composition and ensuring effective exploitation utilization of its resources.However,the existing methods for classifying mineral particles do not fully utilize these multi-modal features,thereby limiting the classification accuracy.Furthermore,when conventional multi-modal image classification methods are applied to planepolarized and cross-polarized sequence images of mineral particles,they encounter issues such as information loss,misaligned features,and challenges in spatiotemporal feature extraction.To address these challenges,we propose a multi-modal mineral particle polarization image classification network(MMGC-Net)for precise mineral particle classification.Initially,MMGC-Net employs a two-dimensional(2D)backbone network with shared parameters to extract features from two types of polarized images to ensure feature alignment.Subsequently,a cross-polarized intra-modal feature fusion module is designed to refine the spatiotemporal features from the extracted features of the cross-polarized sequence images.Ultimately,the inter-modal feature fusion module integrates the two types of modal features to enhance the classification precision.Quantitative and qualitative experimental results indicate that when compared with the current state-of-the-art multi-modal image classification methods,MMGC-Net demonstrates marked superiority in terms of mineral particle multi-modal feature learning and four classification evaluation metrics.It also demonstrates better stability than the existing models.
文摘Raman imaging,as a molecular spectroscopy technique,has been widely studied and applied in research fields such as life sciences and food safety due to its excellent specificity and high resolution.However,its development still faces challenges such as weak signals,slow acquisition speed,and insufficient penetration depth.In recent years,the rapid development of aggregate science has provided new insights for addressing these limitations.Aggregation-induced emission(AIE)materials exhibit enhanced signals in the aggregated state,which may compensate for the inherent weak Raman signals.This article reviews the cutting-edge progress of Raman imaging technology and its current status in cross-disciplinary research with aggregate science,emphasizing the strategy of constructing AIE-Raman dual-responsive probes through molecular engineering to achieve functional complementarity between fluorescence localization and Raman quantification,thereby significantly improving detection sensitivity and specificity.These probes have demonstrated single-cell resolution and high spatiotemporal accuracy in applications such as tumor surgical navigation,diagnosis and treatment of drug-resistant bacteria,and dynamic monitoring of organelles.We also analyze the bottlenecks in this field,such as biological safety and the complexity of molecular design,and outline the future development directions,including intelligent responsive probes,artificial intelligence-assisted analysis,and multimodal fusion platforms.The integration of Raman imaging and AIE sheds new light in the field of medical imaging.
基金supported by the National Natural Science Foundation of China(61927813,61865009,12104203)Jiangxi Provincial Natu-ral Science Foundation(20212ACB201007)。
文摘Metasurfaces are artificial structures that can finely control the characteristics of electromagnetic waves at subwavelength scales,and they are widely used to manipulate the propagation,phase,amplitude,and polariza⁃tion of light.In this work,a bound state in the continuum(BIC)structure based on a metallic metasurface is pro⁃posed.By adjusting the metallic structure using CST and COMSOL software,a significant quasi-BIC peak can be achieved at a frequency of 0.8217 terahertz(THz).Through multi-level expansion analysis,it is found that the electric dipole(ED)is the main factor contributing to the resonant characteristics of the structure.By leveraging the characteristics of BIC,an imaging system was created and operated.According to the simulation results,the imaging system demonstrated excellent sensitivity and resolution,revealing the great potential of terahertz imag⁃ing.This research not only provides new ideas for the creation of BIC structures but also offers an effective refer⁃ence for the development of high-performance terahertz imaging technology.
文摘Two novel aggregation-induced emission(AIE)-active probes,TPA-H and TPA-2 F,were designed and synthesized based on a triphenylamine(TPA)core.Systematic characterization demonstrated that both probes exhibit excellent biocompatibility(cell viability>90%at concentrations up to 50μmol/L)and outstanding LD-targeting speci⁃ficity with minimal colocalization with other organelles such as mitochondria and lysosomes.During early differentia⁃tion of 3 T 3-L 1 adipocytes,both TPA-2 F and TPA-H clearly visualized small and nascent LDs that were difficult to be detected with BODIPY,indicating superior imaging sensitivity compared to the existing fluorescent probes for LDs.Moreover,TPA-2 F demonstrated exceptional photostability,retaining over 90%of its initial fluorescence intensity after 100 continuous laser scanning cycles,significantly outperforming TPA-H.This work not only provides two high-performance LD imaging tools but also highlights the potential of AIE luminogens(AIEgens)in organelle-specific bioimaging,offering promising avenues for early diagnosis and mechanistic research of lipid-related metabolic diseases.
基金support from the Kreitman School of Advanced Graduate Studies, Ben-Gurion University of the Negev。
文摘Neuromorphic cameras,or dynamic vision sensors,are bio-inspired event cameras that measure changes in the image brightness asynchronously and independently at the pixel level.Recently,they garnered increasing interest due to their extremely high temporal resolution,wide dynamic range,low power consumption,and high pixel bandwidth.Despite their advantages,most existing three-dimensional (3D) event imaging solutions rely on multicamera configurations,which are costly,complex,and challenging to synchronize.In this study,we introduce a new framework for four-dimensional (4D) event imaging using a single static neuromorphic camera.We take advantage of the inherent sparsity of event data to combine optically encoded stereo channels into a single event camera.By utilizing optical channel multiplexing,we maintain sensor resolution while retaining the key advantages of event cameras.
基金funded by the Natural Science Foundation of Chongqing Municipality,grant number CSTB2022NSCQ-MSX0503.
文摘Gait recognition is a key biometric for long-distance identification,yet its performance is severely degraded by real-world challenges such as varying clothing,carrying conditions,and changing viewpoints.While combining silhouette and skeleton data is a promising direction,effectively fusing these heterogeneous modalities and adaptively weighting their contributions in response to diverse conditions remains a central problem.This paper introduces GaitMAFF,a novelMulti-modal Adaptive Feature Fusion Network,to address this challenge.Our approach first transforms discrete skeleton joints into a dense SkeletonMap representation to align with silhouettes,then employs an attention-based module to dynamically learn the fusion weights between the two modalities.These fused features are processed by a powerful spatio-temporal backbone withWeighted Global-Local Feature FusionModules(WFFM)to learn a discriminative representation.Extensive experiments on the challenging CCPG and Gait3D datasets show that GaitMAFF achieves state-of-the-art performance,with an average Rank-1 accuracy of 84.6%on CCPG and 58.7%on Gait3D.These results demonstrate that our adaptive fusion strategy effectively integrates complementary multimodal information,significantly enhancing gait recognition robustness and accuracy in complex scenes and providing a practical solution for real-world applications.
基金supported by the National Key Research and Development Program of China(Research Grant Number:2023YFC3603600).
文摘Autism spectrum disorder(AsD)is a highly heterogeneous neurodevelopmental disorder.Early diagnosis and intervention are crucial for improving outcomes.Traditional single-modality diagnostic methods are subjective,limited,and struggle to reveal the underlying pathological mechanisms.In contrast,multimodal data analysis integrates behavioral,physiological,and neuroimaging information with advanced machine-learning and deeplearning algorithms to overcome these limitations.In this review,we surveyed the recent pediatric AsD literature,highlighting artificial intelligence-driven diagnostic techniques,multimodal data fusion strategies,and emerging trends in ASD assessment.We surveyed studies that integrated two or more modalities and summarized the fusion levels,learning paradigms,tasks,datasets,and metrics.Multimodal approaches outperform singlemodality baselines in classification,severity estimation,and subtyping by leveraging complementary information and reducing modality-specific biases.Multimodal approaches significantly enhance diagnostic accuracy and comprehensiveness,enabling early screening of AsD,symptom subtyping,severity assessment,and personalized interventions.Advances in multimodal fusion techniques have promoted progress in precision medicine for the treatment of ASD.
文摘Background:Sarcomatoid carcinoma of the ureter(SCU)is a highly aggressive and relatively uncommon malignant tumor of the urinary tract.Its frequency is quite low,and its prognosis is very bad when compared to other cancers of the urinary system.SCU clinical reports are still hard to come by.MRI and PEI/CT imaging of ureteral sarcomatoid cancer is presented in this case to promote diagnostic awareness and comprehension of the imaging characteristics of this uncommon illness.Method:The patient had ureteral sarcomatoid cancer,which was verified by pathological investigation after ureteroscopic biopsy.The patient’s clinical information,imaging results,surgical outcomes,and pathological findings were gathered.A retrospective study was carried out in combinationwith pertinent national and international literature.Results:An 84-year-old female patient was admitted for“left flank discomfort lasting over one month.”MRI revealed an irregular soft tissue mass in the middle-lower segment of the left ureter.T2-weighted imaging showed an unevenly slightly hyperintense signal.Diffusion-weighted imaging demonstrated restricted diffusion.Contrastenhanced imaging exhibited heterogeneous enhancement.PET/CT demonstrated significantly increased fluorodeoxyglucose metabolism in the mass with secondary left upper urinary tract obstruction.Concurrent findings included a solitary metastatic lesion in hepatic segment S6 and multiple lymph node metastases along the left common iliac and external iliac arteries.Preoperative diagnosis suggested a malignant tumor of the ureter.The patient underwent left nephroureteroscopy with biopsy,and the postoperative pathological diagnosis was ureteral sarcomatoid carcinoma.Conclusion:Ureteral sarcomatoid carcinoma is a rare,highly malignant,and aggressive tumor with nonspecific imaging features,typically presenting as an invasively growing mass.Diagnosis relies on postoperative pathology and immunohistochemical examination.MRI and PET/CT scans are valuable for preoperative localization and characterization,tumor staging,treatment planning,and postoperative follow-up.The prognosis is extremely negative.The main treatment option is radical surgery,although constant monitoring is necessary since early recurrence and metastases are frequent after surgery.
基金supported by the National Natural Science Foundation of China(Nos.82372116,U23A2097)Guangdong Basic and Applied Basic Research Foundation(No.2022A1515010620)+2 种基金Shenzhen Medical Research Fund(Nos.B2302047,A2302047)Shenzhen Science and Technology Program(No.JCYJ20220818095806014)Research Team Cultivation Program of Shenzhen University(No.2023QNT019).
文摘The level of glutathione(GSH)is significantly associated with numerous pathological processes,thus,real-time detection of the GSH level is of significance for early diagnosis of GSH-related diseases.Herein,we developed in vivo second near-infrared(NIR-II)window fluorescence(FL)and ratiometric photoacoustic(RPA)dual-modality imaging of GSH using a GSH-activatable probe(LET-14).LET-14 was synthesized based on a rhodamine hybrid xanthene skeleton with a FL shielding 2,4-dinitrobenzene sulfonyl group that can be specifically cleaved by GSH,thus resulting in a markedly bathochromic-shift absorption,a 6.5-fold increase in NIR-II FL intensity(FL_(920))and a 13-fold increase in RPA signal(PA_(880)/PA_(705))in vitro.Intriguingly,LET-14 exhibits good selectivity and sensitivity for NIR-II FL and RPA dual-modality imaging of GSH in 4T1 tumor-bearing mouse model.Our findings develop an in vivo detection tool of GSH,which has great potential in the field of cancer diagnosis.
基金funded by“the Fanying Special Program of the National Natural Science Foundation of China,grant number 62341307”“the Scientific research project of Jiangxi Provincial Department of Education,grant number GJJ200839”“theDoctoral startup fund of JiangxiUniversity of Technology,grant number 205200100402”.
文摘In multi-modal emotion recognition,excessive reliance on historical context often impedes the detection of emotional shifts,while modality heterogeneity and unimodal noise limit recognition performance.Existing methods struggle to dynamically adjust cross-modal complementary strength to optimize fusion quality and lack effective mechanisms to model the dynamic evolution of emotions.To address these issues,we propose a multi-level dynamic gating and emotion transfer framework for multi-modal emotion recognition.A dynamic gating mechanism is applied across unimodal encoding,cross-modal alignment,and emotion transfer modeling,substantially improving noise robustness and feature alignment.First,we construct a unimodal encoder based on gated recurrent units and feature-selection gating to suppress intra-modal noise and enhance contextual representation.Second,we design a gated-attention crossmodal encoder that dynamically calibrates the complementary contributions of visual and audio modalities to the dominant textual features and eliminates redundant information.Finally,we introduce a gated enhanced emotion transfer module that explicitly models the temporal dependence of emotional evolution in dialogues via transfer gating and optimizes continuity modeling with a comparative learning loss.Experimental results demonstrate that the proposed method outperforms state-of-the-art models on the public MELD and IEMOCAP datasets.
基金supported by the National Key Research and Development Program of China(2022YFC2402400)the National Natural Science Foundation of China(82027803,62275062)+7 种基金the Guangdong Provincial Key Laboratory of Biomedical Optical Imaging Technology(2020B121201010)the Shenzhen Science and Technology Innovation Committee under Grant(JCYJ20220818101417039)the Shenzhen Key Laboratory for Molecular lmaging(ZDSY20130401165820357)the Shenzhen Medical Research Fund(D2404002)the Project of Shandong Innovation and Startup Community of High-end Medical Apparatus and Instruments(2023-SGTTXM-002 and 2024-SGTTXM-005)the Shandong Province Technology Innovation Guidance Plan(Central Leading Local Science and Technology Development Fund)(YDZX2023115)the Taishan Scholar Special Funding Project of Shandong Provinceand the Shandong Laboratory of Advanced Biomaterials and Medical Devices in Weihai(ZL202402).
文摘The photoacoustic imaging of lipid is intrinsically constrained by the feeble nature of endogenous lipid signals,posing a persistent sensitivity challenge that demands innovative solutions.Although adopting high-efficiency excitation and detection elements may improve the imaging sensitivity to a certain extent,the application of the elements is inevitably subject to various limitations in practical applications,particularly during in vivo imaging and endoscopic imaging.In this study,we propose a multi-combinatorial approach to enhance the sensitivity of lipid photoacoustic imaging.The approach involves wavelet transform processing of one-dimensional A-line signals,gradient-based denoising of two-dimensional B-scan images,and finally,threedimensional spatial weighted averaging of the data processed by the previous two steps.This method not only significantly improves the signal-to-noise ratio(SNR)in distinguished feature regions of the image by around 10 dB,but also efficiently extracts weak signals with no distinct features in the original image.After processing with this method,the images acquired under single scanning were compared with those obtained under multiple scanning.The results showed highly consistent image features,with the structural similarity index increasing from 0.2 to 0.8,confirming the accuracy and reliability of the multi-combinatorial approach.
基金funded by the Deanship of Scientific Research(DSR)at King Abdulaziz University,Jeddah,under grant No.(IFPDP-261-22).
文摘Medical imaging is essential in modern health care,allowing accurate diagnosis and effective treatment planning.These images,however,often demonstrate low contrast,noise,and brightness distortion that reduce their diagnostic reliability.This review presents a structured and comprehensive analysis of advanced histogram equalization(HE)-based techniques for medical image enhancement.Our review methodology encompasses:(1)classical HE approaches and related limitations in medical domains;(2)adaptive schemes like Adaptive Histogram Equalization(AHE)and Contrast Limited Adaptive Histogrma Equalization(CLAHE)and their advance variants;(3)brightnesspreserving schemes like BBHE and MMBEBHE and related algorithms;(4)dynamic and recursive histogram equalization methods incorporating DHE and RMSHE;(5)fuzzy logic-based enhancement methodologies addressing uncertainty and noise in medical images;and(6)hybrid optimization methodologies through the application of metaheuristic algorithms(World Cup Optimization,Particle Swarm Optimization,Genetic Algorithms,along with histogram-based methodologies.)There is also a comparative discussion given based on contrast improvement,image brightness preservation,noise management,and computational efficiency.Such advancements have better capabilities of improving image quality,which is more important for improved diagnosis and image analysis.
基金supported by the National Natural Science Foundation of China(Grant No.42274225)the International Science and Technology Cooperation Project of the Ningbo Key R&D Program(Grant No.2024H031).
文摘Electrons in the energy range of 10–100 keV are important energetic particle components in the magnetosphere,and they play a key role in many physical processes in the magnetosphere.However,many scientific questions about these processes are still unanswered.High spatiotemporal and energy-resolution imaging detection of 10–100 keV electrons is of great significance for solving these scientific problems.The traditional space particle detection technology cannot effectively detect the medium-energy electrons in this energy range.In this project,we propose combining low-noise particle detection technology with pinhole imaging technology to achieve high-resolution imaging detection of 10–100 keV medium-energy electrons in the magnetosphere,and at the same time achieve miniaturization(≤3.2 kg and size of 150×150×170 mm)and low power consumption(≤5 W)of the instrument,which can be used for space physics research and space weather applications in the future.
基金supported by the Deanship of Research and Graduate Studies at King Khalid University under Small Research Project grant number RGP1/139/45.
文摘Integrating multiple medical imaging techniques,including Magnetic Resonance Imaging(MRI),Computed Tomography,Positron Emission Tomography(PET),and ultrasound,provides a comprehensive view of the patient health status.Each of these methods contributes unique diagnostic insights,enhancing the overall assessment of patient condition.Nevertheless,the amalgamation of data from multiple modalities presents difficulties due to disparities in resolution,data collection methods,and noise levels.While traditional models like Convolutional Neural Networks(CNNs)excel in single-modality tasks,they struggle to handle multi-modal complexities,lacking the capacity to model global relationships.This research presents a novel approach for examining multi-modal medical imagery using a transformer-based system.The framework employs self-attention and cross-attention mechanisms to synchronize and integrate features across various modalities.Additionally,it shows resilience to variations in noise and image quality,making it adaptable for real-time clinical use.To address the computational hurdles linked to transformer models,particularly in real-time clinical applications in resource-constrained environments,several optimization techniques have been integrated to boost scalability and efficiency.Initially,a streamlined transformer architecture was adopted to minimize the computational load while maintaining model effectiveness.Methods such as model pruning,quantization,and knowledge distillation have been applied to reduce the parameter count and enhance the inference speed.Furthermore,efficient attention mechanisms such as linear or sparse attention were employed to alleviate the substantial memory and processing requirements of traditional self-attention operations.For further deployment optimization,researchers have implemented hardware-aware acceleration strategies,including the use of TensorRT and ONNX-based model compression,to ensure efficient execution on edge devices.These optimizations allow the approach to function effectively in real-time clinical settings,ensuring viability even in environments with limited resources.Future research directions include integrating non-imaging data to facilitate personalized treatment and enhancing computational efficiency for implementation in resource-limited environments.This study highlights the transformative potential of transformer models in multi-modal medical imaging,offering improvements in diagnostic accuracy and patient care outcomes.
基金Project supported by the National Key Research and Development Program of China(Grant Nos.2022YFA1603601,2021YFF0601203,and 2021YFA1600703)。
文摘The unique advantage of x-ray ghost imaging(XGI)is its potential in low dose radiology.One of the practical ways to reduce the radiation exposure is to reduce the measurements while remaining sufficient image quality.Synthetic aperture x-ray ghost imaging(SAXGI)is invented to achieve megapixel XGI with limited measurements,which is expected to implement XGI simultaneously with large field of view and low radiation exposure.In this paper,we experimentally investigate the effect of measurements reduction on the spatial resolution and image quality of SAXGI with standard sample and biomedical specimen.The results with a resolution chart demonstrated that at 360 measurements,SAXGI successfully retrieved the sample image of 1960×1960 pixels with spatial resolution of 4μm.With measurement reduction,the spatial resolution deteriorates but the sparser structures are still discernable.Even with measurements reduced to 10,a spatial resolution of 10μm can still be achieved by SAXGI.A biomedical sample of a fish specimen is employed to evaluate the method and the fish image of 2000×1000 pixels with an SSIM of 0.962 is reconstructed by SAXGI with 770measurements,corresponding to an accumulative exposure reduction of more than 2 times.With the measurements reduced to 10 which corresponds to 1/160 of the accumulative radiation exposure for conventional radiology,bulky structure like the fish skeleton can still be definitely discerned and the SSIM for the reconstructed image still retained 0.9179.Results of this paper demonstrate that measurements reduction is practicable for the radiation exposure reduction of the sample,which implicates that SAXGI with limited measurements is an efficient solution for low dose radiology.
基金supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University(IMSIU)(grant number IMSIU-DDRSP2601).
文摘Generative Adversarial Networks(GANs)have become valuable tools in medical imaging,enabling realistic image synthesis for enhancement,augmentation,and restoration.However,their integration into clinical workflows raises concerns,particularly the risk of subtle distortions or hallucinations that may undermine diagnostic accuracy and weaken trust in AI-assisted decision-making.To address this challenge,we propose a hybrid deep learning framework designed to detect GAN-induced artifacts in medical images,thereby reinforcing the reliability of AI-driven diagnostics.The framework integrates low-level statistical descriptors,including high-frequency residuals and Gray-Level Co-occurrence Matrix(GLCM)texture features,with high-level semantic representations extracted from a pre-trained ResNet18.This dual-stream approach enables detection of both pixel-level anomalies and structural inconsistencies introduced by GAN-based manipulation.We validated the framework on a curated dataset of 10,000 medical images,evenly split between authentic and GAN-generated samples across four modalities:MRI,CT,X-ray,and fundus photography.To improve generalizability to real-world clinical settings,we incorporated domain adaptation strategies such as adversarial training and style transfer,reducing domain shift by 15%.Experimental results demonstrate robust performance,achieving 92.6%accuracy and an F1-score of 0.91 on synthetic test data,and maintaining strong performance on real-world GAN-modified images with 87.3%accuracy and an F1-score of 0.85.Additionally,the model attained an AUC of 0.96 and an average precision of 0.92,outperforming conventional GAN detection pipelines and baseline Convolutional Neural Network(CNN)architectures.These findings establish the proposed framework as an effective and reliable solution for detecting GAN-induced hallucinations in medical imaging,representing an important step toward building trustworthy and clinically deployable AI systems.
基金supported by Natural Science Foundation of Jilin Province(No.SKL202302002)Key Research and Development project of Jilin Provincial Science and Technology Department(No.20210204142YY)+2 种基金The Science and Technology Development Program of Jilin Province(No.2020122256JC)Beijing Kechuang Medical Development Foundation Fund of China(No.KC2023-JX-0186BQ079)Talent Reserve Program(TRP),the First Hospital of Jilin University(No.JDYY-TRP-2024007)。
文摘Prostate cancer(PCa)is characterized by high incidence and propensity for easy metastasis,presenting significant challenges in clinical diagnosis and treatment.Tumor microenvironment(TME)-responsive nanomaterials provide a promising prospect for imaging-guided precision therapy.Considering that tumor-derived alkaline phosphatase(ALP)is over-expressed in metastatic PCa,it makes a great chance to develop a theranostics system with ALP responsive in the TME.Herein,an ALP-responsive aggregationinduced emission luminogens(AIEgens)nanoprobe AMNF self-assembly was designed for enhancing the diagnosis and treatment of metastatic PCa.The nanoprobe exhibited self-aggregation in the presence of ALP resulted in aggregation-induced fluorescence,and enhanced accumulation and prolonged retention period at the tumor site.In terms of detection,the fluorescence(FL)/computed tomography(CT)/magnetic resonance(MR)multi-mode imaging effect of nanoprobe was significantly improved post-aggregation,enabling precise diagnosis through the amalgamation of multiple imaging modes.Enhanced CT/MR imaging can achieve assist preoperative tumor diagnosis,and enhanced FL imaging technology can achieve“intraoperative visual navigation”,showing its potential application value in clinical tumor detection and surgical guidance.In terms of treatment,AMNF showed strong absorption in the near infrared region after aggregation,which improved the photothermal treatment effect.Overall,our work developed an effective aggregation-enhanced theranostic strategy for ALP-related cancers.
基金supported by the Strategic Priority Research Program of the Chinese Academy of Sciences(Grant No.XDB0530000)the Discipline Construction Foundation of“Double World-class Project”.
文摘Laser wakefield accelerators(LWFAs)offer acceleration gradients up to 1000 times higher than those of conventional radio-frequency accelerators,offering a pathway to significantly more compact and cost-effective accelerator systems.This breakthrough opens up new possibilities for laboratory-scale light sources.All-optical inverse Compton scattering(AOCS)sources driven by LWFAs produce high-brightness,quasimonochromatic X rays with micrometer-scale source sizes,delivering the spatial coherence and resolution required for X-ray phase-contrast imaging(XPCI).These features position AOCS X-ray sources as promising tools for applications in biology,medicine,physics,and materials science.However,previous AOCS-based imaging studies have primarily focused on X-ray absorption imaging.In this work,we report successful experimental demonstrations of edge-enhanced in-line XPCI using energy-tunable,quasi-monochromatic AOCS X rays.With a spatial resolution of~20μm,our results clearly show the potential of high-resolution,AOCS-based XPCI applications.