Journal literature
987,119 articles found
SDNet: A self-supervised bird recognition method based on large language models and diffusion models for improving long-term bird monitoring
1
Authors: Zhongde Zhang, Nan Su, Chenxun Deng, Yandong Zhao, Weiping Liu, Qiaoling Han. Avian Research, 2026, Issue 1, pp. 200-215 (16 pages)
The collection and annotation of large-scale bird datasets are resource-intensive and time-consuming processes that significantly limit the scalability and accuracy of biodiversity monitoring systems. While self-supervised learning (SSL) has emerged as a promising approach for leveraging unannotated data, current SSL methods face two critical challenges in bird species recognition: (1) long-tailed data distributions that result in poor performance on underrepresented species; and (2) domain shift issues caused by data augmentation strategies designed to mitigate class imbalance. Here we present SDNet, a novel SSL-based bird recognition framework that integrates diffusion models with large language models (LLMs) to overcome these limitations. SDNet employs LLMs to generate semantically rich textual descriptions for tail-class species by prompting the models with species taxonomy, morphological attributes, and habitat information, producing detailed natural language priors that capture fine-grained visual characteristics (e.g., plumage patterns, body proportions, and distinctive markings). These textual descriptions are subsequently used by a conditional diffusion model to synthesize new bird image samples through cross-attention mechanisms that fuse textual embeddings with intermediate visual feature representations during the denoising process, ensuring generated images preserve species-specific morphological details while maintaining photorealistic quality. Additionally, we incorporate a Swin Transformer as the feature extraction backbone whose hierarchical window-based attention mechanism and shifted windowing scheme enable multi-scale local feature extraction that proves particularly effective at capturing fine-grained discriminative patterns (such as beak shape and feather texture) while mitigating domain shift between synthetic and original images through consistent feature representations across both data sources. SDNet is validated on both a self-constructed dataset (Bird_BXS) and a publicly available benchmark (Birds_25), demonstrating substantial improvements over conventional SSL approaches. Our results indicate that the synergistic integration of LLMs, diffusion models, and the Swin Transformer architecture contributes significantly to recognition accuracy, particularly for rare and morphologically similar species. These findings highlight the potential of SDNet for addressing fundamental limitations of existing SSL methods in avian recognition tasks and establishing a new paradigm for efficient self-supervised learning in large-scale ornithological vision applications.
Keywords: Biodiversity conservation; Bird intelligent monitoring; Diffusion models; Large-scale language models; Long-tailed learning; Self-supervised learning
Information Diffusion Models and Fuzzing Algorithms for a Privacy-Aware Data Transmission Scheduling in 6G Heterogeneous ad hoc Networks
2
Authors: Borja Bordel Sánchez, Ramón Alcarria, Tomás Robles. Computer Modeling in Engineering & Sciences, 2026, Issue 2, pp. 1214-1234 (21 pages)
In this paper, we propose a new privacy-aware transmission scheduling algorithm for 6G ad hoc networks. This system enables end nodes to select the optimum time and scheme to transmit private data safely. In 6G dynamic heterogeneous infrastructures, unstable links and non-uniform hardware capabilities create critical issues regarding security and privacy. Traditional protocols are often too computationally heavy to allow 6G services to achieve their expected Quality-of-Service (QoS). As the transport network is built of ad hoc nodes, there is no guarantee about their trustworthiness or behavior, and transversal functionalities are delegated to the end nodes. However, while security can be guaranteed in end-to-end solutions, privacy cannot, as all intermediate nodes still have to handle the data packets they are transporting. Besides, traditional schemes for private anonymous ad hoc communications are vulnerable to modern intelligent attacks based on learning models. The proposed scheme fills this gap. Findings show that, when the proposed technology is used, the probability of a successful intelligent attack is reduced by up to 65% compared to ad hoc networks with no privacy protection strategy, while congestion probability can remain below 0.001%, as required in 6G services.
Keywords: 6G networks; ad hoc networks; privacy; scheduling algorithms; diffusion models; fuzzing algorithms
Detection of white matter microstructural changes in patients with systemic lupus erythematosus based on multiple diffusion models and related diffusion metrics
3
Authors: Zhenxing Li, Huanhuan Li, Bailing Tian, Huiyang Liu, Yueluan Jiang, Pingting Yang, Guoguang Fan, Hu Liu. Neural Regeneration Research, 2026, Issue 6, pp. 2467-2474 (8 pages)
Some patients with systemic lupus erythematosus experience neuropsychiatric symptoms. Although magnetic resonance imaging can detect abnormal signals in the white matter of the brain, conventional methods often struggle to accurately capture microstructural changes. Various diffusion models have been used to study white matter in systemic lupus erythematosus; however, comparative analyses of their sensitivity and specificity for detecting microstructural changes remain insufficient. To address this, our team designed a diagnostic trial that used multimodal diffusion imaging techniques to observe white matter microstructural changes in patients with systemic lupus erythematosus who had neuropsychiatric symptoms, with the aim of identifying key diagnostic biomarkers for these patients. Patients with active lupus who received treatment at the Department of Rheumatology and Immunology, The First Affiliated Hospital of China Medical University, from September 2023 to March 2024 were recruited. According to the standards of the American College of Rheumatology, patients with systemic lupus erythematosus who had neuropsychiatric symptoms were assigned to the neuropsychiatric systemic lupus erythematosus group, whereas those without neuropsychiatric symptoms were assigned to the non-neuropsychiatric systemic lupus erythematosus group. Additionally, healthy volunteers matched by region, sex, and age were recruited as controls. All three groups underwent the same diffusion magnetic resonance imaging examination protocol to compare differences in diffusion parameters. Advanced diffusion imaging models were able to sensitively detect microstructural changes in the white matter fibers of patients with systemic lupus erythematosus who had neuropsychiatric symptoms, with specific diffusion parameters showing significant abnormalities in key brain regions. In the left superior longitudinal fasciculus subregion and the right thalamic radiations of patients with systemic lupus erythematosus who had neuropsychiatric symptoms, we also identified abnormal diffusion characteristics that were clearly correlated with disease activity, suggesting that microstructural changes in these areas may reflect the dynamic process of neuroinflammatory damage. The present study addresses critical challenges in the diagnosis of systemic lupus erythematosus by identifying specific white matter imaging biomarkers and elucidating the association between microstructural damage and clinical manifestations. The main contributions of our study include: 1) establishing return-to-axis probability parameters from mean apparent propagator magnetic resonance imaging as sensitive biomarkers for systemic lupus erythematosus, particularly in the third subregion of the left superior longitudinal fasciculus; 2) demonstrating that multimodal diffusion imaging may be superior to conventional diffusion tensor imaging for detecting white matter microstructural abnormalities in patients with systemic lupus erythematosus; and 3) integrating tract-based spatial statistics with clinically relevant analyses to link imaging findings to pathological mechanisms.
Keywords: diffusion kurtosis imaging; diffusion tensor imaging; mean apparent propagator; neurite orientation dispersion and density imaging; neuropsychiatric systemic lupus erythematosus; return to axis probability; return to origin probability; superior longitudinal fasciculus-3; superior thalamic radiation; tract-based spatial statistics; white matter microstructure
BEDiff: denoising diffusion probabilistic models for building extraction (cited by: 1)
4
Authors: LEI Yanjing, WANG Yuan, CHAN Sixian, HU Jie, ZHOU Xiaolong, ZHANG Hongkai. Optoelectronics Letters, 2025, Issue 5, pp. 298-305 (8 pages)
Accurately identifying building distribution from remote sensing images with complex background information is challenging. The emergence of diffusion models has prompted the innovative idea of employing the reverse denoising process to distill building distribution from these complex backgrounds. Building on this concept, we propose a novel framework, building extraction diffusion model (BEDiff), which meticulously refines the extraction of building footprints from remote sensing images in a stepwise fashion. Our approach begins with the design of booster guidance, a mechanism that extracts structural and semantic features from remote sensing images to serve as priors, thereby providing targeted guidance for the diffusion process. Additionally, we introduce a cross-feature fusion module (CFM) that bridges the semantic gap between different types of features, so that the attributes extracted by booster guidance are integrated into the diffusion process more effectively. Our proposed BEDiff marks the first application of diffusion models to the task of building extraction. Empirical evidence from extensive experiments on the Beijing building dataset demonstrates the superior performance of BEDiff, affirming its effectiveness and potential for enhancing the accuracy of building extraction in complex urban landscapes.
Keywords: booster guidance; building extraction; reverse denoising process; diffusion model; BEDiff; remote sensing images; complex background; diffusion models
Air target intent recognition method combining graphing time series and diffusion models (cited by: 1)
5
Authors: Chenghai LI, Ke WANG, Yafei SONG, Peng WANG, Lemin LI. Chinese Journal of Aeronautics, 2025, Issue 1, pp. 507-519 (13 pages)
Air target intent recognition holds significant importance in aiding commanders to assess battlefield situations and secure a competitive edge in decision-making. Progress in this domain has been hindered by challenges posed by imbalanced battlefield data and the limited robustness of traditional recognition models. Inspired by the success of diffusion models in addressing visual domain sample imbalances, this paper introduces a new approach that utilizes the Markov Transfer Field (MTF) method for time series data visualization. This visualization, when combined with the Denoising Diffusion Probabilistic Model (DDPM), effectively enhances sample data and mitigates noise within the original dataset. Additionally, a transformer-based model tailored for time series visualization and air target intent recognition is developed. Comprehensive experimental results, encompassing comparative, ablation, and denoising validations, reveal that the proposed method achieves a notable 98.86% accuracy in air target intent recognition while demonstrating exceptional robustness and generalization capabilities. This approach represents a promising avenue for advancing air target intent recognition.
Keywords: Intent Recognition; Markov Transfer Field; Denoising Diffusion Probability Model; Transformer Neural Network
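The time series visualization step named in the abstract above can be illustrated with a small sketch. It follows the commonly described Markov transition field construction (the abstract calls it Markov Transfer Field) and is not taken from the paper: the quantile binning, the bin count `n_bins`, and the function name `markov_transition_field` are all assumptions chosen for illustration.

```python
import numpy as np

def markov_transition_field(series, n_bins=4):
    """Encode a 1-D series of length n as an n x n image (illustrative sketch)."""
    x = np.asarray(series, dtype=float)
    # Interior quantile edges, so each bin holds roughly the same number of samples.
    edges = np.quantile(x, np.linspace(0, 1, n_bins + 1)[1:-1])
    states = np.digitize(x, edges)  # integer state in [0, n_bins - 1]
    # Count first-order transitions between consecutive time steps.
    counts = np.zeros((n_bins, n_bins))
    np.add.at(counts, (states[:-1], states[1:]), 1)
    # Row-normalise into probabilities; rows without outgoing transitions stay zero.
    row_sums = counts.sum(axis=1, keepdims=True)
    probs = np.divide(counts, row_sums, out=np.zeros_like(counts), where=row_sums > 0)
    # Pixel (i, j) is the transition probability from the state at time i to the state at time j.
    return probs[states[:, None], states[None, :]]

# Hypothetical input: a sine wave standing in for one channel of track data.
t = np.linspace(0, 4 * np.pi, 64)
image = markov_transition_field(np.sin(t), n_bins=4)
print(image.shape)
```

The resulting square array can be treated as a single-channel image, which is what makes image-domain tools such as a DDPM applicable to time series.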
Anime Generation through Diffusion and Language Models: A Comprehensive Survey of Techniques and Trends
6
Authors: Yujie Wu, Xing Deng, Haijian Shao, Ke Cheng, Ming Zhang, Yingtao Jiang, Fei Wang. Computer Modeling in Engineering & Sciences, 2025, Issue 9, pp. 2709-2778 (70 pages)
The application of generative artificial intelligence (AI) is bringing about notable changes in anime creation. This paper surveys recent advancements and applications of diffusion and language models in anime generation, focusing on their demonstrated potential to enhance production efficiency through automation and personalization. Despite these benefits, it is crucial to acknowledge the substantial initial computational investments required for training and deploying these models. We conduct an in-depth survey of cutting-edge generative AI technologies, encompassing models such as Stable Diffusion and GPT, and appraise pivotal large-scale datasets alongside quantifiable evaluation metrics. Review of the surveyed literature indicates the achievement of considerable maturity in the capacity of AI models to synthesize high-quality, aesthetically compelling anime visual images from textual prompts, alongside discernible progress in the generation of coherent narratives. However, achieving perfect long-form consistency, mitigating artifacts like flickering in video sequences, and enabling fine-grained artistic control remain critical ongoing challenges. Building upon these advancements, research efforts have increasingly pivoted towards the synthesis of higher-dimensional content, such as video and three-dimensional assets, with recent studies demonstrating significant progress in this burgeoning field. Nevertheless, formidable challenges endure amidst these advancements. Foremost among these are the substantial computational exigencies requisite for training and deploying these sophisticated models, particularly pronounced in the realm of high-dimensional generation such as video synthesis. Additional persistent hurdles include maintaining spatial-temporal consistency across complex scenes and addressing ethical concerns surrounding bias and the preservation of human creative autonomy. This research underscores the transformative potential and inherent complexities of AI-driven synergy within the creative industries. We posit that future research should be dedicated to the synergistic fusion of diffusion and autoregressive models, the integration of multimodal inputs, and the balanced consideration of ethical implications, particularly regarding bias and the preservation of human creative autonomy, thereby establishing a robust foundation for the advancement of anime creation and the broader landscape of AI-driven content generation.
Keywords: diffusion models; language models; anime generation; image synthesis; video generation; stable diffusion; AIGC
Predicting unsteady hydrodynamic performance of seaplanes based on diffusion models
7
Authors: Xinlong YU, Miao PENG, Mingzhen WANG, Junlong ZHANG, Jian YU, Hongqiang LYU, Xuejun LIU. Chinese Journal of Aeronautics, 2025, Issue 10, pp. 327-346 (20 pages)
Obtaining unsteady hydrodynamic performance is of great significance for seaplane design. Common methods for obtaining unsteady hydrodynamic performance data include tank tests and Computational Fluid Dynamics (CFD) numerical simulation, which are costly and time-consuming. Therefore, it is necessary to obtain unsteady hydrodynamic performance in a low-cost and high-precision manner. Due to the strong nonlinearity, complex data distribution, and temporal characteristics of unsteady hydrodynamic performance, predicting it is challenging. This paper proposes a Temporal Convolutional Diffusion Model (TCDM) for predicting the unsteady hydrodynamic performance of seaplanes given design parameters. Under the framework of a classifier-free guided diffusion model, TCDM learns the distribution patterns of unsteady hydrodynamic performance data with the designed denoising module based on a temporal convolutional network and captures the temporal features of unsteady hydrodynamic performance data. Using CFD simulation data, the proposed method is compared with alternative methods to demonstrate its accuracy and generalization. This paper provides a method that enables the rapid and accurate prediction of unsteady hydrodynamic performance data and is expected to shorten the design cycle of seaplanes.
Keywords: Seaplanes; Unsteady hydrodynamic performance; Classifier-free guided diffusion model; Temporal convolutional network; Temporal data
Fixed Neural Network Image Steganography Based on Secure Diffusion Models
8
Authors: Yixin Tang, Minqing Zhang, Peizheng Lai, Ya Yue, Fuqiang Di. Computers, Materials & Continua, 2025, Issue 9, pp. 5733-5750 (18 pages)
Traditional steganography conceals information by modifying cover data, but steganalysis tools easily detect such alterations, while deep learning-based steganography often involves high training costs and complex deployment. Diffusion model-based methods face security vulnerabilities, particularly due to potential information leakage during generation. We propose a fixed neural network image steganography framework based on secure diffusion models to address these challenges. Unlike conventional approaches, our method minimizes cover modifications through neural network optimization, achieving superior steganographic performance in human visual perception and computer vision analyses. The cover images are generated in an anime style using state-of-the-art diffusion models, ensuring the transmitted images appear more natural. This study introduces fixed neural network technology that allows senders to transmit only minimal critical information alongside stego-images. Recipients can accurately reconstruct secret images using this compact data, significantly reducing transmission overhead compared to conventional deep steganography. Furthermore, our framework innovatively integrates ElGamal, a cryptographic algorithm, to protect critical information during transmission, enhancing overall system security and ensuring end-to-end information protection. This dual optimization of payload reduction and cryptographic reinforcement establishes a new paradigm for secure and efficient image steganography.
Keywords: Image steganography; fixed neural network; secure diffusion models; ElGamal
A general framework for airfoil flow field reconstruction based on transformer-guided diffusion models
9
Authors: Jinhua LOU, Rongqian CHEN, Zelun LIN, Jiaqi LIU, Yue BAO, Hao WU, Yancheng YOU. Chinese Journal of Aeronautics, 2025, Issue 12, pp. 214-244 (31 pages)
High-Resolution (HR) data on flow fields are critical for accurately evaluating the aerodynamic performance of aircraft. However, acquiring such data through large-scale numerical simulations or wind tunnel experiments is highly resource intensive. This paper proposes a FlowViT-Diff framework that integrates a Vision Transformer (ViT) with an enhanced denoising diffusion probabilistic model for the Super-Resolution (SR) reconstruction of HR flow fields based on low-resolution inputs. It provides a quick initial prediction of the HR flow field by optimizing the ViT architecture, and incorporates this preliminary output as guidance within an enhanced diffusion model. The latter captures the Gaussian noise distribution during forward diffusion and progressively removes it during backward diffusion to generate the flow field. Experiments on various supercritical airfoils under different flow conditions show that FlowViT-Diff can robustly reconstruct the flow field across multiple levels of downsampling. It obtains more consistent global and local features than traditional SR methods, and yields a 3.6-fold increase in its training speed via transfer learning. Its flow field reconstruction accuracy is 99.7% under ultra-low downsampling. The results demonstrate that FlowViT-Diff not only exhibits effective flow field reconstruction capabilities, but also provides two reconstruction strategies, both of which show effective transferability.
Keywords: Flow fields; Vision Transformer (ViT); Denoising diffusion probabilistic model; Supercritical airfoil; Transfer learning
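The forward diffusion step mentioned in the abstract above, where Gaussian noise is added to a field before a network learns to remove it, can be sketched in closed form. This is the standard DDPM forward process, not the paper's enhanced model: the linear schedule, its end points, the step count, and the random array standing in for a flow field are all assumptions.

```python
import numpy as np

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Per-step noise variances; the end points are illustrative defaults."""
    return np.linspace(beta_start, beta_end, num_steps)

def forward_diffuse(x0, t, alpha_bar, rng):
    """Sample x_t from q(x_t | x_0) = N(sqrt(abar_t) * x0, (1 - abar_t) * I)."""
    noise = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * noise
    return x_t, noise

betas = linear_beta_schedule(1000)
alpha_bar = np.cumprod(1.0 - betas)  # cumulative signal retention per step

rng = np.random.default_rng(0)
x0 = rng.uniform(-1.0, 1.0, size=(32, 32))  # hypothetical normalised field
x_early, _ = forward_diffuse(x0, 10, alpha_bar, rng)   # still close to x0
x_late, _ = forward_diffuse(x0, 999, alpha_bar, rng)   # close to pure noise
```

A denoising network is trained to predict `noise` from `x_t` and `t`; the backward pass then removes the predicted noise step by step.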
Combining transformer and 3DCNN models to achieve co-design of structures and sequences of antibodies in a diffusional manner
10
Authors: Yue Hu, Feng Tao, Jiajie Xu, Wen-Jun Lan, Jing Zhang, Wei Lan. Journal of Pharmaceutical Analysis, 2025, Issue 6, pp. 1406-1408 (3 pages)
AlphaPanda (AlphaFold2 [1] inspired protein-specific antibody design in a diffusional manner) is an advanced algorithm for designing complementarity-determining regions (CDRs) of antibodies targeting a specific epitope, combining transformer [2] models, 3DCNN [3], and diffusion [4] generative models.
Keywords: advanced algorithm; diffusion generative models; 3DCNN; epitope targeting; antibody design; complementarity-determining regions (CDRs); transformer models
Efficient generation of digital rock CT images using LoRA-enhanced stable diffusion models (cited by: 1)
11
Authors: Kunyao Li, Haijiang Li, Ali Khudhair, Jun Yan, Bin Wang. Intelligent Geoengineering, 2025, Issue 2, pp. 96-108 (13 pages)
Digital rock analysis (DRA) is fundamental for geo-energy research, enabling the characterisation of microstructures for applications like hydrocarbon recovery, carbon storage, and groundwater modelling. Although 2D CT images provide valuable pore-scale data, the scarcity of real-world datasets limits the effectiveness of advanced analysis. Generative AI presents a promising approach for synthesizing high-quality rock images but faces key challenges, including high computational demands, insufficient evaluation metrics, and the trade-off between image fidelity and diversity. To address these limitations, this study proposes the use of Low-Rank Adaptation (LoRA) for fine-tuning stable diffusion models, significantly reducing computational requirements while maintaining image quality. A systematic investigation was conducted to evaluate the influence of LoRA training parameters, including rank and learning rate, on the quality of generated images. Image outputs were assessed using both standard generative metrics, such as Kernel Inception Distance (KID), and domain-specific metrics, including porosity, pore count, and pore area distributions. The optimised LoRA-enhanced diffusion model achieved a 92.6% reduction in KID relative to baseline models, while also improving inference speed. Building on these advancements, this study demonstrates that the LoRA-enhanced diffusion model significantly improves neural network extrapolation in incomplete data scenarios through statistically consistent synthetic generation. Despite control challenges, this approach reduces costs and enables diverse applications, bridging fundamental rock physics with practical energy research.
Keywords: Digital Rock Analysis; Stable diffusion; Low-Rank Adaptation (LoRA); Computed Tomography (CT); Generative AI; Computational Geoscience
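To make the role of the LoRA rank in the abstract above concrete, here is a minimal sketch of a low-rank update on a single linear layer. It is not the paper's training setup: the layer shape, the rank `r`, the scaling `alpha`, and the function name `lora_forward` are hypothetical, and the `alpha / r` scaling follows the common LoRA convention.

```python
import numpy as np

def lora_forward(x, W, A, B, alpha):
    """y = x W^T + (alpha / r) * x A^T B^T, with W frozen and only A, B trainable."""
    r = A.shape[0]
    return x @ W.T + (alpha / r) * (x @ A.T) @ B.T

rng = np.random.default_rng(0)
d_out, d_in, r = 320, 768, 8                 # hypothetical layer shape and rank
W = rng.standard_normal((d_out, d_in))       # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01    # trainable down-projection
B = np.zeros((d_out, r))                     # zero init: adapter starts as a no-op

x = rng.standard_normal((4, d_in))
y = lora_forward(x, W, A, B, alpha=16)

full_params = W.size                         # parameters updated by full fine-tuning
lora_params = A.size + B.size                # parameters updated by the adapter
print(full_params, lora_params)
```

Because only `A` and `B` receive gradients, the trainable parameter count grows linearly with the rank, which is why rank is one of the training parameters the study varies.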
Motion In-Betweening via Frequency-Domain Diffusion Model
12
Authors: Qiang Zhang, Shuo Feng, Shanxiong Chen, Teng Wan, Ying Qi. Computers, Materials & Continua, 2026, Issue 1, pp. 275-296 (22 pages)
Human motion modeling is a core technology in computer animation, game development, and human-computer interaction. In particular, generating natural and coherent in-between motion using only the initial and terminal frames remains a fundamental yet unresolved challenge. Existing methods typically rely on dense keyframe inputs or complex prior structures, making it difficult to balance motion quality and plausibility under conditions such as sparse constraints, long-term dependencies, and diverse motion styles. To address this, we propose a motion generation framework based on a frequency-domain diffusion model, which aims to better model complex motion distributions and enhance generation stability under sparse conditions. Our method maps motion sequences to the frequency domain via the Discrete Cosine Transform (DCT), enabling more effective modeling of low-frequency motion structures while suppressing high-frequency noise. A denoising network based on self-attention is introduced to capture long-range temporal dependencies and improve global structural awareness. Additionally, a multi-objective loss function is employed to jointly optimize motion smoothness, pose diversity, and anatomical consistency, enhancing the realism and physical plausibility of the generated sequences. Comparative experiments on the Human3.6M and LaFAN1 datasets demonstrate that our method outperforms state-of-the-art approaches across multiple performance metrics, showing stronger capabilities in generating intermediate motion frames. This research offers a new perspective and methodology for human motion generation and holds promise for applications in character animation, game development, and virtual interaction.
Keywords: Motion generation; diffusion model; frequency domain; human motion synthesis; self-attention network; 3D motion interpolation
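The frequency-domain mapping described in the abstract above can be sketched with an orthonormal DCT-II applied along the time axis. This is only an illustration of the transform itself, not the paper's pipeline: the function names, the `keep` cutoff, and the synthetic trajectory are assumptions.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis; row k is the k-th cosine basis over n frames."""
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    basis = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    basis[0] /= np.sqrt(2.0)  # rescale the constant row so the matrix is orthonormal
    return basis

def to_frequency(motion):
    """motion: (frames, features) -> DCT coefficients along the time axis."""
    return dct_matrix(motion.shape[0]) @ motion

def to_motion(coeffs):
    """Inverse transform; the transpose inverts an orthonormal basis."""
    return dct_matrix(coeffs.shape[0]).T @ coeffs

def low_pass(motion, keep):
    """Zero every coefficient above index `keep`, then return to the time domain."""
    coeffs = to_frequency(motion)
    coeffs[keep:] = 0.0
    return to_motion(coeffs)

# Hypothetical 60-frame, 3-feature trajectory with added jitter.
rng = np.random.default_rng(0)
frames = np.linspace(0.0, np.pi, 60)[:, None]
clean = np.sin(frames) * np.array([1.0, 0.5, 0.25])
noisy = clean + 0.1 * rng.standard_normal(clean.shape)
smoothed = low_pass(noisy, keep=8)
```

Keeping only the first few coefficients retains the slow, large-scale motion and discards frame-to-frame jitter, which is the property the abstract relies on.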
A Trajectory-Guided Diffusion Model for Consistent and Realistic Video Synthesis in Autonomous Driving
13
Authors: Beike Yu, Dafang Wang. Computer Modeling in Engineering & Sciences, 2026, Issue 1, pp. 1075-1091 (17 pages)
Scalable simulation leveraging real-world data plays an essential role in advancing autonomous driving, owing to its efficiency and applicability in both training and evaluating algorithms. Consequently, there has been increasing attention on generating highly realistic and consistent driving videos, particularly those involving viewpoint changes guided by the control commands or trajectories of ego vehicles. However, current reconstruction approaches, such as Neural Radiance Fields and 3D Gaussian Splatting, frequently suffer from limited generalization and depend on substantial input data. Meanwhile, 2D generative models, though capable of producing unknown scenes, still have room for improvement in terms of coherence and visual realism. To overcome these challenges, we introduce GenScene, a world model that synthesizes front-view driving videos conditioned on trajectories. A new temporal module is presented to improve video consistency by extracting the global context of each frame, calculating relationships of frames using these global representations, and fusing frame contexts accordingly. Moreover, we propose an innovative attention mechanism that computes relations of pixels within each frame and pixels in the corresponding window range of the initial frame. Extensive experiments show that our approach surpasses various state-of-the-art models in driving video generation, and the introduced modules contribute significantly to model performance. This work establishes a new paradigm for goal-oriented video synthesis in autonomous driving, which facilitates on-demand simulation to expedite algorithm development.
Keywords: Video generation; autonomous vehicle; diffusion model; trajectory
Diffusion-Driven Generation of Synthetic Complex Concrete Crack Images for Segmentation Tasks
14
Authors: Pengwei Guo, Xiao Tan, Yiming Liu. Structural Durability & Health Monitoring, 2026, Issue 1, pp. 47-69 (23 pages)
Crack detection accuracy in computer vision is often constrained by limited annotated datasets. Although Generative Adversarial Networks (GANs) have been applied for data augmentation, they frequently introduce blurs and artifacts. To address this challenge, this study leverages Denoising Diffusion Probabilistic Models (DDPMs) to generate high-quality synthetic crack images, enriching the training set with diverse and structurally consistent samples that enhance crack segmentation. The proposed framework involves a two-stage pipeline: first, DDPMs are used to synthesize high-fidelity crack images that capture fine structural details. Second, these generated samples are combined with real data to train segmentation networks, thereby improving accuracy and robustness in crack detection. Compared with GAN-based approaches, DDPM achieved the best fidelity, with the highest Structural Similarity Index (SSIM) (0.302) and lowest Learned Perceptual Image Patch Similarity (LPIPS) (0.461), producing artifact-free images that preserve fine crack details. To validate its effectiveness, six segmentation models were tested, among which LinkNet consistently achieved the best performance, excelling in both region-level accuracy and structural continuity. Incorporating DDPM-augmented data further enhanced segmentation outcomes, increasing F1 scores by up to 1.1% and IoU by 1.7%, while also improving boundary alignment and skeleton continuity compared with models trained on real images alone. Experiments with varying augmentation ratios showed consistent improvements, with F1 rising from 0.946 (no augmentation) to 0.957 and IoU from 0.897 to 0.913 at the highest ratio. These findings demonstrate the effectiveness of diffusion-based augmentation for complex crack detection in structural health monitoring.
Keywords: Crack monitoring; complex cracks; denoising diffusion models; generative artificial intelligence; synthetic data augmentation
Graph Guide Diffusion Solvers with Noises for Travelling Salesman Problem
15
Authors: Yan Kong, Xinpeng Guo, Chih-Hsien Hsia. Computers, Materials & Continua, 2026, Issue 3, pp. 689-707 (19 pages)
With the development of technology, diffusion model-based solvers have shown significant promise in solving Combinatorial Optimization (CO) problems, particularly in tackling Non-deterministic Polynomial-time hard (NP-hard) problems such as the Traveling Salesman Problem (TSP). However, existing diffusion model-based solvers typically employ a fixed, uniform noise schedule (e.g., linear or cosine annealing) across all training instances, failing to fully account for the unique characteristics of each problem instance. To address this challenge, we present Graph-Guided Diffusion Solvers (GGDS), an enhanced method for improving graph-based diffusion models. GGDS leverages Graph Neural Networks (GNNs) to capture graph structural information embedded in node coordinates and adjacency matrices, dynamically adjusting the noise levels in the diffusion model. This study investigates the TSP by examining two distinct time-step noise generation strategies: cosine annealing and a Neural Network (NN)-based approach. We evaluate their performance across different problem scales, particularly after integrating graph structural information. Experimental results indicate that GGDS outperforms previous methods with average performance improvements of 18.7%, 6.3%, and 88.7% on TSP-500, TSP-100, and TSP-50, respectively. Specifically, GGDS demonstrates superior performance on TSP-500 and TSP-50, while its performance on TSP-100 is either comparable to or slightly better than that of previous methods, depending on the chosen noise schedule and decoding strategy.
Keywords: Combinatorial optimization problem, diffusion model, noise schedule, traveling salesman problem
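For readers unfamiliar with the fixed noise schedules the abstract contrasts against, the following is a minimal sketch of two common ones: a linear schedule and a cosine-shaped schedule. It is a generic illustration, not the GGDS method. The function names, the default values (`beta_start`, `beta_end`, `s`, `max_beta`) and the continuous-variance formulation are assumptions chosen for illustration; the paper's own schedule and its graph-conditioned adjustment may differ.

```python
import math


def linear_betas(num_steps, beta_start=1e-4, beta_end=0.02):
    """Per-step noise variances spaced linearly from beta_start to beta_end."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cosine_betas(num_steps, s=0.008, max_beta=0.999):
    """Per-step variances derived from a cosine-shaped cumulative signal level."""

    def alpha_bar(t):
        # Fraction of signal remaining at normalized time t in [0, 1].
        return math.cos((t + s) / (1 + s) * math.pi / 2) ** 2

    betas = []
    for i in range(num_steps):
        t1 = i / num_steps
        t2 = (i + 1) / num_steps
        # Clip so the final step never removes the signal entirely.
        betas.append(min(1 - alpha_bar(t2) / alpha_bar(t1), max_beta))
    return betas


def cumulative_alphas(betas):
    """Running product of (1 - beta_t): the signal retained after each step."""
    retained, product = [], 1.0
    for beta in betas:
        product *= 1.0 - beta
        retained.append(product)
    return retained


if __name__ == "__main__":
    for name, schedule in (("linear", linear_betas(10)), ("cosine", cosine_betas(10))):
        print(name, [round(a, 4) for a in cumulative_alphas(schedule)])
```

Both schedules here are the same for every problem instance, which is the limitation the abstract describes; an instance-aware approach would replace these fixed lists with values predicted from the graph.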
Global Stability of Traveling Wavefronts for a Belousov-Zhabotinsky Model with Mixed Nonlocal and Degenerate Diffusions
16
Authors: Yuting YANG, Guobao ZHANG. Journal of Mathematical Research with Applications, 2026, Issue 1, pp. 87-102 (16 pages)
In this paper, we are concerned with the stability of traveling wavefronts of a Belousov-Zhabotinsky model with mixed nonlocal and degenerate diffusions. Such a system can be used to study the competition among nonlocally diffusive species and degenerately diffusive species. We prove that the traveling wavefronts are exponentially stable when the initial perturbation around the traveling waves decays exponentially as x → -∞; in other locations, the initial data can be arbitrarily large. The proof uses the weighted energy method combined with the comparison principle and a squeezing technique.
Keywords: Belousov-Zhabotinsky model, nonlocal diffusion, stability, comparison principle, weighted energy
A decision framework for rural domestic sewage treatment models and process: Evidence from Inner Mongolia Autonomous Region, China (Cited by: 1)
17
Authors: Ying Yan, Pengyu Li, Zixuan Wang, Yubo Tan, Tianlong Zheng, Jianguo Liu, Xiaoxia Yang, Junxin Liu. Journal of Environmental Sciences, 2026, Issue 1, pp. 302-311 (10 pages)
Rural domestic sewage treatment is critical for environmental protection. This study defines the spatial pattern of villages from the perspective of rural sewage treatment and develops an integrated decision-making system to propose a sewage treatment mode and scheme suitable for local conditions. By considering the village spatial layout and terrain factors, a decision tree model of residential density and terrain type was constructed with accuracies of 76.47% and 96.00%, respectively. Combined with binary classification probability unit regression, an appropriate sewage treatment mode for the village was determined with 87.00% accuracy. The Analytic Hierarchy Process (AHP), combined with the Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS) model, formed the basis for optimal treatment process selection under different emission standards. Verification was conducted in 542 villages across three counties of the Inner Mongolia Autonomous Region, focusing on the standard effluent effect (0.3773), low investment cost (0.3196), and high standard effluent effect (0.5115) to determine the best treatment process for the same emission standard under different needs. The annual environmental and carbon emission benefits of sewage treatment in these villages were estimated. This model matches village density, geographic features, and social development level, and provides scientific support and a theoretical basis for rural sewage treatment decision-making.
Keywords: Rural domestic sewage, Sewage treatment model, Decision-making, Environmental-economic benefits, Inner Mongolia
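The TOPSIS ranking step named in the abstract can be sketched as follows. This is a minimal textbook version, assuming vector normalization and criterion weights supplied from elsewhere (for example from an AHP step); the function name `topsis`, the two criteria and all numbers in the example are hypothetical and are not taken from the study.

```python
import math


def topsis(matrix, weights, benefit):
    """Return a closeness score in [0, 1] for each alternative (higher is better).

    matrix  : one row per alternative, one column per criterion
    weights : criterion weights (assumed to be given, e.g. by AHP)
    benefit : True where larger values are better, False for cost criteria
    Assumes no criterion column is all zeros and the alternatives are not all identical.
    """
    n_criteria = len(weights)
    # Vector-normalize each criterion column, then apply its weight.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_criteria)]
    weighted = [
        [weights[j] * row[j] / norms[j] for j in range(n_criteria)] for row in matrix
    ]
    columns = list(zip(*weighted))
    # Ideal and anti-ideal points, taking the criterion direction into account.
    ideal = [max(col) if benefit[j] else min(col) for j, col in enumerate(columns)]
    anti = [min(col) if benefit[j] else max(col) for j, col in enumerate(columns)]
    scores = []
    for row in weighted:
        d_pos = math.dist(row, ideal)  # distance to the ideal point
        d_neg = math.dist(row, anti)   # distance to the anti-ideal point
        scores.append(d_neg / (d_pos + d_neg))
    return scores


if __name__ == "__main__":
    # Hypothetical processes scored on (effluent quality, investment cost).
    processes = [[0.9, 100.0], [0.6, 50.0], [0.9, 50.0]]
    print(topsis(processes, weights=[0.5, 0.5], benefit=[True, False]))
```

In this made-up example the third process matches the best value on both criteria, so it coincides with the ideal point and receives the highest score.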
A Stable Diffusion-based method for the form design of rural houses and its application: the case of Jianqiao Village, Shaoxing
18
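Authors: 金雷雷, 楼瑛浩, 刘子琛. 《建筑与文化》, 2026, Issue 3, pp. 269-272 (4 pages)
To address the tension in current rural house design between a widespread tendency toward template-based solutions and the demand for personalized, regionally distinctive ones, this article systematically examines the application of generative AI tools, represented by Stable Diffusion, to the form design of rural houses. The study establishes a relatively complete technical workflow covering rural house data collection and processing, dedicated model training and testing, scheme generation and iteration, detailed design, and on-site implementation with feedback. Taking a practical project in Jianqiao Village, Shaoxing, Zhejiang Province as an example, it verifies that the method can efficiently generate rural house schemes that combine local vernacular character with the owners' individual needs, and that it markedly improves design efficiency and consistency with the local character.
Keywords: AI-generated content, rural houses, form design, Stable Diffusion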
Edge-enhancement-based digital image optimization assisted by Stable Diffusion AI
19
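Authors: 臧德龙, 汤陈燕. 《兰州文理学院学报(自然科学版)》, 2026, Issue 2, pp. 45-50, 65 (7 pages)
To address problems of traditional digital image optimization methods, such as insufficient feature representation capability, a Stable Diffusion AI-assisted digital image optimization method within a deep learning framework is proposed. First, the digital image is preprocessed in two steps: image normalization and distortion correction. Second, within the deep learning framework, a convolutional neural network and an attention mechanism are used to extract image features. Then, the Stable Diffusion model is used to remove image noise, and missing image regions are compensated according to the extracted texture features. Finally, taking the extracted edge features as the processing object, edge enhancement is applied to two parts, the low-frequency contours and the high-frequency details. Optimization is completed by enhancing brightness and contrast. Experimental results show that, compared with traditional methods, the proposed method raises the peak signal-to-noise ratio of the processed images by 2.87, reduces the histogram blank-interval ratio by 0.11, and achieves a structural similarity above 96%, demonstrating better processing performance.
Keywords: deep framework, Stable Diffusion software, digital image, image processing, image optimization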
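The peak signal-to-noise ratio used as an evaluation metric above can be computed as in the following minimal sketch. It assumes 8-bit grayscale images stored as nested lists of pixel values; the function name `psnr` and the `max_value` default are illustrative choices, not details given in the article.

```python
import math


def psnr(reference, processed, max_value=255.0):
    """Peak signal-to-noise ratio in decibels between two equally sized images."""
    ref_pixels = [p for row in reference for p in row]
    out_pixels = [p for row in processed for p in row]
    if len(ref_pixels) != len(out_pixels):
        raise ValueError("images must contain the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(ref_pixels, out_pixels)) / len(ref_pixels)
    if mse == 0:
        return math.inf  # identical images: no noise at all
    return 10 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    clean = [[52, 55], [61, 59]]   # hypothetical reference patch
    noisy = [[54, 55], [60, 57]]   # hypothetical processed patch
    print(psnr(clean, noisy))
```

A higher value means the processed image is closer to the reference, which is why the abstract reports an increase in this metric as an improvement.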
Stylized transfer of Baima Caogai masks under the Stable Diffusion diffusion mechanism
20
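Authors: 朱光良, 张鑫. 《染整技术》, 2026, Issue 3, pp. 39-42, 52 (5 pages)
The development of generative artificial intelligence and the accelerating iteration of AI image generation technology have driven breakthroughs in multimodal content generation models in both parameter scale and semantic representation capability. Among these, diffusion models represented by Stable Diffusion, through latent-space denoising sampling and cross-modal semantic alignment, have shown strong advantages in text-to-image cross-domain generation tasks. On this basis, and using the Stable Diffusion design platform as the medium, this study applies dual guidance from style images and text instructions to attempt a stylized transfer and a reworking of the pictorial language of Baima Caogai masks, promoting a more diverse presentation of their visual image. By linking with the cultural and creative industries, the study provides collaborative scenarios and connecting pathways for the digital twinning of the masks' visual symbol semantics and for the innovative application of digital cultural assets.
Keywords: AI image generation technology, Stable Diffusion, style transfer, Caogai masks, clothing design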