Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectra...Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectral similarity between buildings and backgrounds,sensor variations,and insufficient computational efficiency.To address these challenges,this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network(MewCDNet),which integrates the advantages of Convolutional Neural Networks and Transformers,balances computational costs,and achieves high-performance building change detection.The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction,integrates multi-level feature maps through a multi-scale fusion strategy,and incorporates two key modules:Cross-temporal Difference Detection(CTDD)and Cross-scale Wavelet Refinement(CSWR).CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semanticaware Euclidean distance weighting to enhance the distinction between true changes and background noise.CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms,enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes.Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods:achieving F1 scores of 91.54%on LEVIR,93.70%on WHUCD,and 64.96%on S2Looking for building change detection.Furthermore,MewCDNet exhibits optimal performance on the multi-class⋅SYSU dataset(F1:82.71%),highlighting its exceptional generalization capability.展开更多
Images taken in dim environments frequently exhibit issues like insufficient brightness,noise,color shifts,and loss of detail.These problems pose significant challenges to dark image enhancement tasks.Current approach...Images taken in dim environments frequently exhibit issues like insufficient brightness,noise,color shifts,and loss of detail.These problems pose significant challenges to dark image enhancement tasks.Current approaches,while effective in global illumination modeling,often struggle to simultaneously suppress noise and preserve structural details,especially under heterogeneous lighting.Furthermore,misalignment between luminance and color channels introduces additional challenges to accurate enhancement.In response to the aforementioned difficulties,we introduce a single-stage framework,M2ATNet,using the multi-scale multi-attention and Transformer architecture.First,to address the problems of texture blurring and residual noise,we design a multi-scale multi-attention denoising module(MMAD),which is applied separately to the luminance and color channels to enhance the structural and texture modeling capabilities.Secondly,to solve the non-alignment problem of the luminance and color channels,we introduce the multi-channel feature fusion Transformer(CFFT)module,which effectively recovers the dark details and corrects the color shifts through cross-channel alignment and deep feature interaction.To guide the model to learn more stably and efficiently,we also fuse multiple types of loss functions to form a hybrid loss term.We extensively evaluate the proposed method on various standard datasets,including LOL-v1,LOL-v2,DICM,LIME,and NPE.Evaluation in terms of numerical metrics and visual quality demonstrate that M2ATNet consistently outperforms existing advanced approaches.Ablation studies further confirm the critical roles played by the MMAD and CFFT modules to detail preservation and visual fidelity under challenging illumination-deficient environments.展开更多
Vehicle-induced response separation is a crucial issue in structural health monitoring(SHM).This paper proposes a block-wise sliding recursive wavelet transform algorithm to meet the real-time processing requirements ...Vehicle-induced response separation is a crucial issue in structural health monitoring(SHM).This paper proposes a block-wise sliding recursive wavelet transform algorithm to meet the real-time processing requirements of monitoring data.To extend the separation target from a fixed dataset to a continuously updating data stream,a block-wise sliding framework is first developed.This framework is further optimized considering the characteristics of real-time data streams,and its advantage in computational efficiency is theoretically demonstrated.During the decomposition and reconstruction processes,information from neighboring data blocks is fully utilized to reduce algorithmic complexity.In addition,a delay-setting strategy is introduced for each processing window to mitigate boundary effects,thereby balancing accuracy and efficiency.Simulated signal experiments are conducted to determine the optimal delay configuration and to verify the algorithm’s superior performance,achieving a lower Root Mean Square Error(RMSE)and only 0.0249 times the average computational time compared with the original algorithm.Furthermore,strain signals from the Lieshi River Bridge are employed to validate the method.The proposed algorithm successfully separates the static trend from vehicle-induced responses in real time across different sampling frequencies,demonstrating its effectiveness and applicability in real-time bridge monitoring.展开更多
This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi...This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi-scale encoding significantly enhances the model’s ability to capture both fine-grained and global features,while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance.Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets,yielding notable improvements.Specifically,on the ISIC-2018 dataset,our method achieves an F1-Score improvement of+4.84% compared to the standard ViT,with a precision increase of+5.46% for melanoma(MEL).On the ChestX-ray14 dataset,the method delivers an F1-Score improvement of 5.3%over the conventional ViT,with precision gains of+5.0% for pneumonia(PNEU)and+5.4%for fibrosis(FIB).Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants,particularly in retrieving relevant medical cases and enhancing diagnostic accuracy.These findings highlight the potential of the proposedmethod for large-scalemedical image analysis,offering improved tools for clinical decision-making through superior classification and case comparison.展开更多
The capacity to diagnose faults in rolling bearings is of significant practical importance to ensure the normal operation of the equipment.Frequency-domain features can effectively enhance the identification of fault ...The capacity to diagnose faults in rolling bearings is of significant practical importance to ensure the normal operation of the equipment.Frequency-domain features can effectively enhance the identification of fault modes.However,existing methods often suffer from insufficient frequency-domain representation in practical applications,which greatly affects diagnostic performance.Therefore,this paper proposes a rolling bearing fault diagnosismethod based on aMulti-Scale FusionNetwork(MSFN)using the Time-Division Fourier Transform(TDFT).The method constructs multi-scale channels to extract time-domain and frequency-domain features of the signal in parallel.A multi-level,multi-scale filter-based approach is designed to extract frequency-domain features in a segmented manner.A cross-attention mechanism is introduced to facilitate the fusion of the extracted time-frequency domain features.The performance of the proposed method is validated using the CWRU and Ottawa datasets.The results show that the average accuracy of MSFN under complex noisy signals is 97.75%and 94.41%.The average accuracy under variable load conditions is 98.68%.This demonstrates its significant application potential compared to existing methods.展开更多
Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete v...Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete vessel segmentation and poor continuity.In this study,we propose CT-MFENet to address the aforementioned issues.First,the use of context transformer(CT)allows for the integration of contextual feature information,which helps establish the connection between pixels and solve the problem of incomplete vessel continuity.Second,multi-scale dense residual networks are used instead of traditional CNN to address the issue of inadequate local feature extraction when the model encounters vessels at multiple scales.In the decoding stage,we introduce a local-global fusion module.It enhances the localization of vascular information and reduces the semantic gap between high-and low-level features.To address the class imbalance in retinal images,we propose a hybrid loss function that enhances the segmentation ability of the model for topological structures.We conducted experiments on the publicly available DRIVE,CHASEDB1,STARE,and IOSTAR datasets.The experimental results show that our CT-MFENet performs better than most existing methods,including the baseline U-Net.展开更多
BACKGROUND: Recent studies have focused on various methods of wavelet transformation for electroencephalogram (EEG) signals. However, there are very few studies reporting characteristics of multi-scale phase waves ...BACKGROUND: Recent studies have focused on various methods of wavelet transformation for electroencephalogram (EEG) signals. However, there are very few studies reporting characteristics of multi-scale phase waves during epileptic discharge.OBJECTIVE: To extract multi-scale phase average waveforms from childhood absence epilepsy EEG signals between time and frequency domains using wavelet transformation, and to compare EEG signals of absence seizure with pre-epileptic seizure and normal children, and to quantify multi-scale phase average waveforms from childhood absence epilepsy EEG signals. DESIGN, TIME AND SETTING: The case-comparative experiment was performed at the Department of Neuroelectrophysiology, Tianjin Medical University from August 2002 to May 2005. PARTICIPANTS: A total of 15 patients with childhood absence epilepsy from the General Hospital of Tianjin Medical University were enrolled in the study. The patients were not administered anti-epileptic drugs or sedatives prior to EEG testing. In addition, 12 healthy, age- and gender-matched children were also enrolled.METHODS: EEG signals were tested on 15 patients with childhood absence epilepsy and 12 normal children. Epileptic discharge signals during clinical and subclinical seizures were collected 10 and 20 times, respectively. The collected EEG signals were treated with wavelet transformation to extract multi-scale characteristics during absence epilepsy seizure using a conditional sampling method. Multi-scale phase average waveforms were collected using a conditional phase averaging technique. Amplitude of phase average waveform from EEG signals of epilepsy seizure, subclinical epileptic discharge, and EEG signals of normal children were compared and statistically analyzed in the first half-cycle.MAIN OUTCOME MEASURES: Multi-scale wavelet coefficient and the evolution of EEG signals were observed during childhood absence epilepsy seizures using wavelet transformation. Multi-scale phase average waveforms from EEG signals were observed using a conditional sampling method and phase averaging technique.RESULTS: Multi-scale characteristics of EEG signals demonstrated that 12-scale (3 Hz) rhythmical activity was significantly enhanced during childhood absence epilepsy seizure and co-existed with background structure (〈1 Hz, low frequency discharge). The phase average wave exhibited opposed phase abnormal rhythm at 3 Hz. Prior to childhood absence epilepsy seizure, EEG detected opposed abnormal a rhythm and 3 Hz composition, which were not detected with traditional EEG. Compared to EEG signals from normal children, epileptic discharges from clinical and subclinical childhood absence epilepsy seizures were positive and amplitude was significantly greater (P〈0.05).CONCLUSION: Wavelet transformation was used to analyze EEG signals from childhood absence epilepsy to obtain multi-scale quantitative characteristics and phase average waveforms. Multi-scale wavelet coefficients of EEG signals correlated with childhood absence epilepsy seizure, and multi-scale waveforms prior to epilepsy seizure were similar to characteristics during the onset period. Compared to normal children, EEG signals during epilepsy seizure exhibited an opposed phase model.展开更多
Noise has traditionally been suppressed or eliminated in seismic data sets by the use of Fourier filters and, to a lesser degree, nonlinear statistical filters. Although these methods are quite useful under specific c...Noise has traditionally been suppressed or eliminated in seismic data sets by the use of Fourier filters and, to a lesser degree, nonlinear statistical filters. Although these methods are quite useful under specific conditions, they may produce undesirable effects for the low signal to noise ratio data. In this paper, a new method, multi-scale ridgelet transform, is used in the light of the theory of ridgelet transform. We employ wavelet transform to do sub-band decomposition for the signals and then use non-linear thresholding in ridgelet domain for every block. In other words, it is based on the idea of partition, at sufficiently fine scale, a curving singularity looks straight, and so ridgelet transform can work well in such cases. Applications on both synthetic data and actual seismic data from Sichuan basin, South China, show that the new method eliminates the noise portion of the signal more efficiently and retains a greater amount of geologic data than other methods, the quality and consecutiveness of seismic event are improved obviously as well as the quality of section is improved.展开更多
Urban faults in Shenzhen are potential threats to city security and sustainable development. In consideration of the importance of the Shenzhen fault zone, the author provide a detailed interpretation on gravity data ...Urban faults in Shenzhen are potential threats to city security and sustainable development. In consideration of the importance of the Shenzhen fault zone, the author provide a detailed interpretation on gravity data model. Bouguer gravity covering the whole Shenzhen City was calculated with a 1-km resolution. Wavelet multi-scale analysis(MSA) was applied to the Bouguer gravity data to obtain the multilayer residual anomalies corresponding to different depths. In addition, 2D gravity models were constructed along three profiles. The Bouguer gravity anomaly shows an NE-striking high-low-high pattern from northwest to southeast, strongly related to the main faults. According to the results of MSA, the correlation between gravity anomaly and faults is particularly significant from 4 to 12 km depth. The residual gravity with small amplitude in each layer indicates weak tectonic activity in the crust. In the upper layers, positive anomalies along most of faults reveal the upwelling of high-density materials during the past tectonic movements. The multilayer residual anomalies also yield important information about the faults, such as the vertical extension and the dip direction. The maximum depth of the faults is about 20 km. In general, NE-striking faults extend deeper than NW-striking faults and have a larger dip angle.展开更多
The Ying-Qiong Basin is located on the northwestern margin of the South China Sea and at the junction of the South China Block and the Indochina Block.It is characterized by complex geological structures.The existing ...The Ying-Qiong Basin is located on the northwestern margin of the South China Sea and at the junction of the South China Block and the Indochina Block.It is characterized by complex geological structures.The existing seismic data in the study area is sparse due to the lack of earthquake activities.Because of the limited source energy and poor coverage of seismic data,the knowledge of deep structures in the area,including the spatial distribution of deep faults,is incomplete.Contrarily,satellite gravity data cover the entire study area and can reveal the spatial distribution of faults.Based on the wavelet multi-scale decomposition method,the Bouguer gravity field in the Ying-Qiong Basin was decomposed and reconstructed to obtain the detailed images of the first-to sixth-order gravitational fields.By incorporating the known geological features,the gravitational field responses of the main faults in the Ying-Qiong Basin were identified in the detailed fields,and the power spectrum analysis yielded the depths of 1.4,8,15,26.5,and 39 km for the average burial depths of the bottom surfaces from the first-to fifth-order detailed fields,respectively.The four main faults in the Yinggehai Basin all have a large active depth range:fault A(No.1)is between 5 and 39 km,fault B is between 26.5 and 39 km,and faults C and D are between 15 and 39 km.However,the depth of active faults in the Qiongdongnan Basin is relatively shallow,mainly between 8 and 26.5 km.展开更多
In modern electromagnetic environment, radar emitter signal recognition is an important research topic. On the basis of multi-resolution wavelet analysis, an adaptive radar emitter signal recognition method based on m...In modern electromagnetic environment, radar emitter signal recognition is an important research topic. On the basis of multi-resolution wavelet analysis, an adaptive radar emitter signal recognition method based on multi-scale wavelet entropy feature extraction and feature weighting was proposed. With the only priori knowledge of signal to noise ratio(SNR), the method of extracting multi-scale wavelet entropy features of wavelet coefficients from different received signals were combined with calculating uneven weight factor and stability weight factor of the extracted multi-dimensional characteristics. Radar emitter signals of different modulation types and different parameters modulated were recognized through feature weighting and feature fusion. Theoretical analysis and simulation results show that the presented algorithm has a high recognition rate. Additionally, when the SNR is greater than-4 d B, the correct recognition rate is higher than 93%. Hence, the proposed algorithm has great application value.展开更多
Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhanc...Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhancement and visual improvement.To deal with these problems,a sub-regional infrared-visible image fusion method(SRF)is proposed.First,morphology and threshold segmentation is applied to extract targets interested in infrared images.Second,the infrared back-ground is reconstructed based on extracted targets and the visible image.Finally,target and back-ground regions are fused using a multi-scale transform.Experimental results are obtained using public data for comparison and evaluation,which demonstrate that the proposed SRF has poten-tial benefits over other methods.展开更多
The high-frequency components in the traditional multi-scale transform method are approximately sparse, which can represent different information of the details. But in the low-frequency component, the coefficients ar...The high-frequency components in the traditional multi-scale transform method are approximately sparse, which can represent different information of the details. But in the low-frequency component, the coefficients around the zero value are very few, so we cannot sparsely represent low-frequency image information. The low-frequency component contains the main energy of the image and depicts the profile of the image. Direct fusion of the low-frequency component will not be conducive to obtain highly accurate fusion result. Therefore, this paper presents an infrared and visible image fusion method combining the multi-scale and top-hat transforms. On one hand, the new top-hat-transform can effectively extract the salient features of the low-frequency component. On the other hand, the multi-scale transform can extract highfrequency detailed information in multiple scales and from diverse directions. The combination of the two methods is conducive to the acquisition of more characteristics and more accurate fusion results. Among them, for the low-frequency component, a new type of top-hat transform is used to extract low-frequency features, and then different fusion rules are applied to fuse the low-frequency features and low-frequency background; for high-frequency components, the product of characteristics method is used to integrate the detailed information in high-frequency. Experimental results show that the proposed algorithm can obtain more detailed information and clearer infrared target fusion results than the traditional multiscale transform methods. Compared with the state-of-the-art fusion methods based on sparse representation, the proposed algorithm is simple and efficacious, and the time consumption is significantly reduced.展开更多
This paper proposes an image segmentation method based on the combination of the wavelet multi-scale edge detection and the entropy iterative threshold selection.Image for segmentation is divided into two parts by hig...This paper proposes an image segmentation method based on the combination of the wavelet multi-scale edge detection and the entropy iterative threshold selection.Image for segmentation is divided into two parts by high- and low-frequency.In the high-frequency part the wavelet multiscale was used for the edge detection,and the low-frequency part conducted on segmentation using the entropy iterative threshold selection method.Through the consideration of the image edge and region,a CT image of the thorax was chosen to test the proposed method for the segmentation of the lungs.Experimental results show that the method is efficient to segment the interesting region of an image compared with conventional methods.展开更多
In the vision transformer(ViT)architecture,image data are transformed into sequential data for processing,which may result in the loss of spatial positional information.While the self-attention mechanism enhances the ...In the vision transformer(ViT)architecture,image data are transformed into sequential data for processing,which may result in the loss of spatial positional information.While the self-attention mechanism enhances the capacity of ViT to capture global features,it compromises the preservation of fine-grained local feature information.To address these challenges,we propose a spatial positional enhancement module and a wavelet transform enhancement module tailored for ViT models.These modules aim to reduce spatial positional information loss during the patch embedding process and enhance the model’s feature extraction capabilities.The spatial positional enhancement module reinforces spatial information in sequential data through convolutional operations and multi-scale feature extraction.Meanwhile,the wavelet transform enhancement module utilizes the multi-scale analysis and frequency decomposition to improve the ViT’s understanding of global and local image structures.This enhancement also improves the ViT’s ability to process complex structures and intricate image details.Experiments on CIFAR-10,CIFAR-100 and ImageNet-1k datasets are done to compare the proposed method with advanced classification methods.The results show that the proposed model achieves a higher classification accuracy,confirming its effectiveness and competitive advantage.展开更多
Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion siz...Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes.To overcome these limitations,we introduce MSAMamba-UNet,a lightweight model that integrates two novel architectures:Multi-Scale Mamba(MSMamba)and Adaptive Dynamic Gating Block(ADGB).MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets.ADGB dynamically selects convolutional kernels with varying receptive fields based on input features,improving the model’s capacity to accommodate diverse lesion textures and scales.Additionally,we introduce a Mix Attention Fusion Block(MAF)to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms.Extensive evaluation of MSAMamba-UNet on the ISIC 2016,ISIC 2017,and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs.Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%,85.47%,and 82.22%,as well as DSC scores of 92.20%,92.17%,and 90.24%,respectively.These results underscore the lightweight design and effectiveness of MSAMamba-UNet.展开更多
Fluorescence microscopy is indispensable in life science research,yet denoising remains challenging due to varied biological samples and imaging conditions.We introduce a wavelet-enhanced transformer based on DnCNN th...Fluorescence microscopy is indispensable in life science research,yet denoising remains challenging due to varied biological samples and imaging conditions.We introduce a wavelet-enhanced transformer based on DnCNN that fuses wavelet preprocessing with a dual-branch transformer-convolutional neural network(CNN)architecture.Wavelet decomposition separates highand low-frequency components for targeted noise reduction;the CNN branch restores local details,whereas the transformer branch captures global context;and an adaptive loss balances quantitative fidelity with perceptual quality.On the fluorescence microscopy denoising benchmark,our method surpasses leading CNNand transformer-based approaches,improving peak signal-to-noise ratio by 2.34%and 0.88%and structural similarity index measure by 0.53%and 1.07%,respectively.This framework offers enhanced generalization and practical gains for fluorescence image denoising.展开更多
基金supported by the Henan Province Key R&D Project under Grant 241111210400the Henan Provincial Science and Technology Research Project under Grants 252102211047,252102211062,252102211055 and 232102210069+2 种基金the Jiangsu Provincial Scheme Double Initiative Plan JSS-CBS20230474,the XJTLU RDF-21-02-008the Science and Technology Innovation Project of Zhengzhou University of Light Industry under Grant 23XNKJTD0205the Higher Education Teaching Reform Research and Practice Project of Henan Province under Grant 2024SJGLX0126。
文摘Accurate and efficient detection of building changes in remote sensing imagery is crucial for urban planning,disaster emergency response,and resource management.However,existing methods face challenges such as spectral similarity between buildings and backgrounds,sensor variations,and insufficient computational efficiency.To address these challenges,this paper proposes a novel Multi-scale Efficient Wavelet-based Change Detection Network(MewCDNet),which integrates the advantages of Convolutional Neural Networks and Transformers,balances computational costs,and achieves high-performance building change detection.The network employs EfficientNet-B4 as the backbone for hierarchical feature extraction,integrates multi-level feature maps through a multi-scale fusion strategy,and incorporates two key modules:Cross-temporal Difference Detection(CTDD)and Cross-scale Wavelet Refinement(CSWR).CTDD adopts a dual-branch architecture that combines pixel-wise differencing with semanticaware Euclidean distance weighting to enhance the distinction between true changes and background noise.CSWR integrates Haar-based Discrete Wavelet Transform with multi-head cross-attention mechanisms,enabling cross-scale feature fusion while significantly improving edge localization and suppressing spurious changes.Extensive experiments on four benchmark datasets demonstrate MewCDNet’s superiority over comparison methods:achieving F1 scores of 91.54%on LEVIR,93.70%on WHUCD,and 64.96%on S2Looking for building change detection.Furthermore,MewCDNet exhibits optimal performance on the multi-class⋅SYSU dataset(F1:82.71%),highlighting its exceptional generalization capability.
基金funded by the National Natural Science Foundation of China,grant numbers 52374156 and 62476005。
文摘Images taken in dim environments frequently exhibit issues like insufficient brightness,noise,color shifts,and loss of detail.These problems pose significant challenges to dark image enhancement tasks.Current approaches,while effective in global illumination modeling,often struggle to simultaneously suppress noise and preserve structural details,especially under heterogeneous lighting.Furthermore,misalignment between luminance and color channels introduces additional challenges to accurate enhancement.In response to the aforementioned difficulties,we introduce a single-stage framework,M2ATNet,using the multi-scale multi-attention and Transformer architecture.First,to address the problems of texture blurring and residual noise,we design a multi-scale multi-attention denoising module(MMAD),which is applied separately to the luminance and color channels to enhance the structural and texture modeling capabilities.Secondly,to solve the non-alignment problem of the luminance and color channels,we introduce the multi-channel feature fusion Transformer(CFFT)module,which effectively recovers the dark details and corrects the color shifts through cross-channel alignment and deep feature interaction.To guide the model to learn more stably and efficiently,we also fuse multiple types of loss functions to form a hybrid loss term.We extensively evaluate the proposed method on various standard datasets,including LOL-v1,LOL-v2,DICM,LIME,and NPE.Evaluation in terms of numerical metrics and visual quality demonstrate that M2ATNet consistently outperforms existing advanced approaches.Ablation studies further confirm the critical roles played by the MMAD and CFFT modules to detail preservation and visual fidelity under challenging illumination-deficient environments.
基金the support of the Major Science and Technology Project of Yunnan Province,China(Grant No.202502AD080007)the National Natural Science Foundation of China(Grant No.52378288)。
文摘Vehicle-induced response separation is a crucial issue in structural health monitoring(SHM).This paper proposes a block-wise sliding recursive wavelet transform algorithm to meet the real-time processing requirements of monitoring data.To extend the separation target from a fixed dataset to a continuously updating data stream,a block-wise sliding framework is first developed.This framework is further optimized considering the characteristics of real-time data streams,and its advantage in computational efficiency is theoretically demonstrated.During the decomposition and reconstruction processes,information from neighboring data blocks is fully utilized to reduce algorithmic complexity.In addition,a delay-setting strategy is introduced for each processing window to mitigate boundary effects,thereby balancing accuracy and efficiency.Simulated signal experiments are conducted to determine the optimal delay configuration and to verify the algorithm’s superior performance,achieving a lower Root Mean Square Error(RMSE)and only 0.0249 times the average computational time compared with the original algorithm.Furthermore,strain signals from the Lieshi River Bridge are employed to validate the method.The proposed algorithm successfully separates the static trend from vehicle-induced responses in real time across different sampling frequencies,demonstrating its effectiveness and applicability in real-time bridge monitoring.
基金funded by the Deanship of Research and Graduate Studies at King Khalid University through small group research under grant number RGP1/278/45.
文摘This paper introduces a novel method for medical image retrieval and classification by integrating a multi-scale encoding mechanism with Vision Transformer(ViT)architectures and a dynamic multi-loss function.The multi-scale encoding significantly enhances the model’s ability to capture both fine-grained and global features,while the dynamic loss function adapts during training to optimize classification accuracy and retrieval performance.Our approach was evaluated on the ISIC-2018 and ChestX-ray14 datasets,yielding notable improvements.Specifically,on the ISIC-2018 dataset,our method achieves an F1-Score improvement of+4.84% compared to the standard ViT,with a precision increase of+5.46% for melanoma(MEL).On the ChestX-ray14 dataset,the method delivers an F1-Score improvement of 5.3%over the conventional ViT,with precision gains of+5.0% for pneumonia(PNEU)and+5.4%for fibrosis(FIB).Experimental results demonstrate that our approach outperforms traditional CNN-based models and existing ViT variants,particularly in retrieving relevant medical cases and enhancing diagnostic accuracy.These findings highlight the potential of the proposedmethod for large-scalemedical image analysis,offering improved tools for clinical decision-making through superior classification and case comparison.
基金fully supported by the Frontier Exploration Projects of Longmen Laboratory(No.LMQYTSKT034)Key Research and Development and Promotion of Special(Science and Technology)Project of Henan Province,China(No.252102210158)。
文摘The capacity to diagnose faults in rolling bearings is of significant practical importance to ensure the normal operation of the equipment.Frequency-domain features can effectively enhance the identification of fault modes.However,existing methods often suffer from insufficient frequency-domain representation in practical applications,which greatly affects diagnostic performance.Therefore,this paper proposes a rolling bearing fault diagnosismethod based on aMulti-Scale FusionNetwork(MSFN)using the Time-Division Fourier Transform(TDFT).The method constructs multi-scale channels to extract time-domain and frequency-domain features of the signal in parallel.A multi-level,multi-scale filter-based approach is designed to extract frequency-domain features in a segmented manner.A cross-attention mechanism is introduced to facilitate the fusion of the extracted time-frequency domain features.The performance of the proposed method is validated using the CWRU and Ottawa datasets.The results show that the average accuracy of MSFN under complex noisy signals is 97.75%and 94.41%.The average accuracy under variable load conditions is 98.68%.This demonstrates its significant application potential compared to existing methods.
基金the National Natural Science Foundation of China(No.62266025)。
文摘Segmentation of the retinal vessels in the fundus is crucial for diagnosing ocular diseases.Retinal vessel images often suffer from category imbalance and large scale variations.This ultimately results in incomplete vessel segmentation and poor continuity.In this study,we propose CT-MFENet to address the aforementioned issues.First,the use of context transformer(CT)allows for the integration of contextual feature information,which helps establish the connection between pixels and solve the problem of incomplete vessel continuity.Second,multi-scale dense residual networks are used instead of traditional CNN to address the issue of inadequate local feature extraction when the model encounters vessels at multiple scales.In the decoding stage,we introduce a local-global fusion module.It enhances the localization of vascular information and reduces the semantic gap between high-and low-level features.To address the class imbalance in retinal images,we propose a hybrid loss function that enhances the segmentation ability of the model for topological structures.We conducted experiments on the publicly available DRIVE,CHASEDB1,STARE,and IOSTAR datasets.The experimental results show that our CT-MFENet performs better than most existing methods,including the baseline U-Net.
基金the National Natural Science Foundation of China,No. 60703045
文摘BACKGROUND: Recent studies have focused on various methods of wavelet transformation for electroencephalogram (EEG) signals. However, there are very few studies reporting characteristics of multi-scale phase waves during epileptic discharge.OBJECTIVE: To extract multi-scale phase average waveforms from childhood absence epilepsy EEG signals between time and frequency domains using wavelet transformation, and to compare EEG signals of absence seizure with pre-epileptic seizure and normal children, and to quantify multi-scale phase average waveforms from childhood absence epilepsy EEG signals. DESIGN, TIME AND SETTING: The case-comparative experiment was performed at the Department of Neuroelectrophysiology, Tianjin Medical University from August 2002 to May 2005. PARTICIPANTS: A total of 15 patients with childhood absence epilepsy from the General Hospital of Tianjin Medical University were enrolled in the study. The patients were not administered anti-epileptic drugs or sedatives prior to EEG testing. In addition, 12 healthy, age- and gender-matched children were also enrolled.METHODS: EEG signals were tested on 15 patients with childhood absence epilepsy and 12 normal children. Epileptic discharge signals during clinical and subclinical seizures were collected 10 and 20 times, respectively. The collected EEG signals were treated with wavelet transformation to extract multi-scale characteristics during absence epilepsy seizure using a conditional sampling method. Multi-scale phase average waveforms were collected using a conditional phase averaging technique. Amplitude of phase average waveform from EEG signals of epilepsy seizure, subclinical epileptic discharge, and EEG signals of normal children were compared and statistically analyzed in the first half-cycle.MAIN OUTCOME MEASURES: Multi-scale wavelet coefficient and the evolution of EEG signals were observed during childhood absence epilepsy seizures using wavelet transformation. Multi-scale phase average waveforms from EEG signals were observed using a conditional sampling method and phase averaging technique.RESULTS: Multi-scale characteristics of EEG signals demonstrated that 12-scale (3 Hz) rhythmical activity was significantly enhanced during childhood absence epilepsy seizure and co-existed with background structure (〈1 Hz, low frequency discharge). The phase average wave exhibited opposed phase abnormal rhythm at 3 Hz. Prior to childhood absence epilepsy seizure, EEG detected opposed abnormal a rhythm and 3 Hz composition, which were not detected with traditional EEG. Compared to EEG signals from normal children, epileptic discharges from clinical and subclinical childhood absence epilepsy seizures were positive and amplitude was significantly greater (P〈0.05).CONCLUSION: Wavelet transformation was used to analyze EEG signals from childhood absence epilepsy to obtain multi-scale quantitative characteristics and phase average waveforms. Multi-scale wavelet coefficients of EEG signals correlated with childhood absence epilepsy seizure, and multi-scale waveforms prior to epilepsy seizure were similar to characteristics during the onset period. Compared to normal children, EEG signals during epilepsy seizure exhibited an opposed phase model.
基金supported by China Petrochemical key project during the 11th Five-year Plan as well as the Doctorate Fund of Ministry of Education of China (No.20050491504)
文摘Noise has traditionally been suppressed or eliminated in seismic data sets by the use of Fourier filters and, to a lesser degree, nonlinear statistical filters. Although these methods are quite useful under specific conditions, they may produce undesirable effects for the low signal to noise ratio data. In this paper, a new method, multi-scale ridgelet transform, is used in the light of the theory of ridgelet transform. We employ wavelet transform to do sub-band decomposition for the signals and then use non-linear thresholding in ridgelet domain for every block. In other words, it is based on the idea of partition, at sufficiently fine scale, a curving singularity looks straight, and so ridgelet transform can work well in such cases. Applications on both synthetic data and actual seismic data from Sichuan basin, South China, show that the new method eliminates the noise portion of the signal more efficiently and retains a greater amount of geologic data than other methods, the quality and consecutiveness of seismic event are improved obviously as well as the quality of section is improved.
基金supported by the National Natural Science Foundation of China (Nos.41504015,41429401)the National 973 Project of China (No.2013CB733302)+2 种基金 China Postdoctoral Science Foundation (No.2015M572146)the National High Technology Research and Development Program of China (No.2011AA060503)the Surveying and Mapping Basic Research Program of National Administration of Surveying,Mapping and Geoinformation (No.15-01-08)
文摘Urban faults in Shenzhen are potential threats to city security and sustainable development. In consideration of the importance of the Shenzhen fault zone, the author provide a detailed interpretation on gravity data model. Bouguer gravity covering the whole Shenzhen City was calculated with a 1-km resolution. Wavelet multi-scale analysis(MSA) was applied to the Bouguer gravity data to obtain the multilayer residual anomalies corresponding to different depths. In addition, 2D gravity models were constructed along three profiles. The Bouguer gravity anomaly shows an NE-striking high-low-high pattern from northwest to southeast, strongly related to the main faults. According to the results of MSA, the correlation between gravity anomaly and faults is particularly significant from 4 to 12 km depth. The residual gravity with small amplitude in each layer indicates weak tectonic activity in the crust. In the upper layers, positive anomalies along most of faults reveal the upwelling of high-density materials during the past tectonic movements. The multilayer residual anomalies also yield important information about the faults, such as the vertical extension and the dip direction. The maximum depth of the faults is about 20 km. In general, NE-striking faults extend deeper than NW-striking faults and have a larger dip angle.
基金sup-ported by the National Natural Science Foundation of China(Nos.41530963,91858215 and 41906048)the Fundamental Research Funds for the Central Universities(No.201964015)the Laboratory for Marine Mineral Resources,Qingdao National Laboratory for Marine Science and Technology(No.MMRZZ201801).
文摘The Ying-Qiong Basin is located on the northwestern margin of the South China Sea and at the junction of the South China Block and the Indochina Block.It is characterized by complex geological structures.The existing seismic data in the study area is sparse due to the lack of earthquake activities.Because of the limited source energy and poor coverage of seismic data,the knowledge of deep structures in the area,including the spatial distribution of deep faults,is incomplete.Contrarily,satellite gravity data cover the entire study area and can reveal the spatial distribution of faults.Based on the wavelet multi-scale decomposition method,the Bouguer gravity field in the Ying-Qiong Basin was decomposed and reconstructed to obtain the detailed images of the first-to sixth-order gravitational fields.By incorporating the known geological features,the gravitational field responses of the main faults in the Ying-Qiong Basin were identified in the detailed fields,and the power spectrum analysis yielded the depths of 1.4,8,15,26.5,and 39 km for the average burial depths of the bottom surfaces from the first-to fifth-order detailed fields,respectively.The four main faults in the Yinggehai Basin all have a large active depth range:fault A(No.1)is between 5 and 39 km,fault B is between 26.5 and 39 km,and faults C and D are between 15 and 39 km.However,the depth of active faults in the Qiongdongnan Basin is relatively shallow,mainly between 8 and 26.5 km.
基金Project(61301095)supported by the National Natural Science Foundation of ChinaProject(QC2012C070)supported by Heilongjiang Provincial Natural Science Foundation for the Youth,ChinaProjects(HEUCF130807,HEUCFZ1129)supported by the Fundamental Research Funds for the Central Universities of China
文摘In modern electromagnetic environment, radar emitter signal recognition is an important research topic. On the basis of multi-resolution wavelet analysis, an adaptive radar emitter signal recognition method based on multi-scale wavelet entropy feature extraction and feature weighting was proposed. With the only priori knowledge of signal to noise ratio(SNR), the method of extracting multi-scale wavelet entropy features of wavelet coefficients from different received signals were combined with calculating uneven weight factor and stability weight factor of the extracted multi-dimensional characteristics. Radar emitter signals of different modulation types and different parameters modulated were recognized through feature weighting and feature fusion. Theoretical analysis and simulation results show that the presented algorithm has a high recognition rate. Additionally, when the SNR is greater than-4 d B, the correct recognition rate is higher than 93%. Hence, the proposed algorithm has great application value.
基金supported by the China Postdoctoral Science Foundation Funded Project(No.2021M690385)the National Natural Science Foundation of China(No.62101045).
文摘Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhancement and visual improvement.To deal with these problems,a sub-regional infrared-visible image fusion method(SRF)is proposed.First,morphology and threshold segmentation is applied to extract targets interested in infrared images.Second,the infrared back-ground is reconstructed based on extracted targets and the visible image.Finally,target and back-ground regions are fused using a multi-scale transform.Experimental results are obtained using public data for comparison and evaluation,which demonstrate that the proposed SRF has poten-tial benefits over other methods.
基金Project supported by the National Natural Science Foundation of China(Grant No.61402368)Aerospace Support Fund,China(Grant No.2017-HT-XGD)Aerospace Science and Technology Innovation Foundation,China(Grant No.2017 ZD 53047)
文摘The high-frequency components in the traditional multi-scale transform method are approximately sparse, which can represent different information of the details. But in the low-frequency component, the coefficients around the zero value are very few, so we cannot sparsely represent low-frequency image information. The low-frequency component contains the main energy of the image and depicts the profile of the image. Direct fusion of the low-frequency component will not be conducive to obtain highly accurate fusion result. Therefore, this paper presents an infrared and visible image fusion method combining the multi-scale and top-hat transforms. On one hand, the new top-hat-transform can effectively extract the salient features of the low-frequency component. On the other hand, the multi-scale transform can extract highfrequency detailed information in multiple scales and from diverse directions. The combination of the two methods is conducive to the acquisition of more characteristics and more accurate fusion results. Among them, for the low-frequency component, a new type of top-hat transform is used to extract low-frequency features, and then different fusion rules are applied to fuse the low-frequency features and low-frequency background; for high-frequency components, the product of characteristics method is used to integrate the detailed information in high-frequency. Experimental results show that the proposed algorithm can obtain more detailed information and clearer infrared target fusion results than the traditional multiscale transform methods. Compared with the state-of-the-art fusion methods based on sparse representation, the proposed algorithm is simple and efficacious, and the time consumption is significantly reduced.
基金Science Research Foundation of Yunnan Fundamental Research Foundation of Applicationgrant number:2009ZC049M+1 种基金Science Research Foundation for the Overseas Chinese Scholars,State Education Ministrygrant number:2010-1561
文摘This paper proposes an image segmentation method based on the combination of the wavelet multi-scale edge detection and the entropy iterative threshold selection.Image for segmentation is divided into two parts by high- and low-frequency.In the high-frequency part the wavelet multiscale was used for the edge detection,and the low-frequency part conducted on segmentation using the entropy iterative threshold selection method.Through the consideration of the image edge and region,a CT image of the thorax was chosen to test the proposed method for the segmentation of the lungs.Experimental results show that the method is efficient to segment the interesting region of an image compared with conventional methods.
基金National Natural Science Foundation of China(No.62176052)。
文摘In the vision transformer(ViT)architecture,image data are transformed into sequential data for processing,which may result in the loss of spatial positional information.While the self-attention mechanism enhances the capacity of ViT to capture global features,it compromises the preservation of fine-grained local feature information.To address these challenges,we propose a spatial positional enhancement module and a wavelet transform enhancement module tailored for ViT models.These modules aim to reduce spatial positional information loss during the patch embedding process and enhance the model’s feature extraction capabilities.The spatial positional enhancement module reinforces spatial information in sequential data through convolutional operations and multi-scale feature extraction.Meanwhile,the wavelet transform enhancement module utilizes the multi-scale analysis and frequency decomposition to improve the ViT’s understanding of global and local image structures.This enhancement also improves the ViT’s ability to process complex structures and intricate image details.Experiments on CIFAR-10,CIFAR-100 and ImageNet-1k datasets are done to compare the proposed method with advanced classification methods.The results show that the proposed model achieves a higher classification accuracy,confirming its effectiveness and competitive advantage.
基金supported in part by the National Natural Science Foundation of China under Grant 62201201the Foundation of Henan Educational Committee under Grant 242102211042.
文摘Segmenting skin lesions is critical for early skin cancer detection.Existing CNN and Transformer-based methods face challenges such as high computational complexity and limited adaptability to variations in lesion sizes.To overcome these limitations,we introduce MSAMamba-UNet,a lightweight model that integrates two novel architectures:Multi-Scale Mamba(MSMamba)and Adaptive Dynamic Gating Block(ADGB).MSMamba utilizes multi-scale decomposition and a parallel hierarchical structure to enhance the delineation of irregular lesion boundaries and sensitivity to small targets.ADGB dynamically selects convolutional kernels with varying receptive fields based on input features,improving the model’s capacity to accommodate diverse lesion textures and scales.Additionally,we introduce a Mix Attention Fusion Block(MAF)to enhance shallow feature representation by integrating parallel channel and pixel attention mechanisms.Extensive evaluation of MSAMamba-UNet on the ISIC 2016,ISIC 2017,and ISIC 2018 datasets demonstrates competitive segmentation accuracy with only 0.056 M parameters and 0.069 GFLOPs.Our experiments revealed that MSAMamba-UNet achieved IoU scores of 85.53%,85.47%,and 82.22%,as well as DSC scores of 92.20%,92.17%,and 90.24%,respectively.These results underscore the lightweight design and effectiveness of MSAMamba-UNet.
基金supported by the National Natural Science Foundation of China(Grant No.62275210)the National Leading Talent Program,the National Young Talent Program,the Key Research and Development Program of Shaanxi(Grant No.2024SF2-GJHX-25)+5 种基金the Scientific Research Program Funded by the Education Department of Shaanxi Provincial Government(Grant No.24JS016)the Xidian University Specially Funded Project for Interdisciplinary Exploration(Grant No.TZJHF202523)the Fundamental Research Funds for Central Universities(Grant No.YJSJ25014)the Guangdong Provincial General Colleges and Universities Young Innovative Talents Research Project(Grant No.2024KQNCX172)the Shenzhen Science and Technology Program(Grant No.GJHZ20210705141805015)the Key Research Areas Support Science and Technology Project of Shenzhen Institute of Information Technology(Grant No.SZIIT2024KJ056).
文摘Fluorescence microscopy is indispensable in life science research,yet denoising remains challenging due to varied biological samples and imaging conditions.We introduce a wavelet-enhanced transformer based on DnCNN that fuses wavelet preprocessing with a dual-branch transformer-convolutional neural network(CNN)architecture.Wavelet decomposition separates highand low-frequency components for targeted noise reduction;the CNN branch restores local details,whereas the transformer branch captures global context;and an adaptive loss balances quantitative fidelity with perceptual quality.On the fluorescence microscopy denoising benchmark,our method surpasses leading CNNand transformer-based approaches,improving peak signal-to-noise ratio by 2.34%and 0.88%and structural similarity index measure by 0.53%and 1.07%,respectively.This framework offers enhanced generalization and practical gains for fluorescence image denoising.