A heart attack disrupts the normal flow of blood to the heart muscle,potentially causing severe damage or death if not treated promptly.It can lead to long-term health complications,reduce quality of life,and signific...A heart attack disrupts the normal flow of blood to the heart muscle,potentially causing severe damage or death if not treated promptly.It can lead to long-term health complications,reduce quality of life,and significantly impact daily activities and overall well-being.Despite the growing popularity of deep learning,several drawbacks persist,such as complexity and the limitation of single-model learning.In this paper,we introduce a residual learning-based feature fusion technique to achieve high accuracy in differentiating abnormal cardiac rhythms heart sound.Combining MobileNet with DenseNet201 for feature fusion leverages MobileNet lightweight,efficient architecture with DenseNet201,dense connections,resulting in enhanced feature extraction and improved model performance with reduced computational cost.To further enhance the fusion,we employed residual learning to optimize the hierarchical features of heart abnormal sounds during training.The experimental results demonstrate that the proposed fusion method achieved an accuracy of 95.67%on the benchmark PhysioNet-2016 Spectrogram dataset.To further validate the performance,we applied it to the BreakHis dataset with a magnification level of 100X.The results indicate that the model maintains robust performance on the second dataset,achieving an accuracy of 96.55%.it highlights its consistent performance,making it a suitable for various applications.展开更多
A new lighting and enlargement on phase spectrogram (PS) and frequency spectrogram (FS) is presented in this paper. These representations result from the coupling of power spectrogram and short time Fourier transf...A new lighting and enlargement on phase spectrogram (PS) and frequency spectrogram (FS) is presented in this paper. These representations result from the coupling of power spectrogram and short time Fourier transform (STFT). The main contribution is the construction of the 3D phase spectrogram (3DPS) and the 3D frequency spectrogram (3DFS). These new tools allow such specific test signals as small slope linear chirp, phase jump case of musical signal analysis is reported. The main objective is to and small frequency jump to be analyzed. An application detect small frequency and phase variations in order to characterize each type of sound attack without losing the amplitude information given by power spectrogram展开更多
Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods su...Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods such as ultrasonic method and pulse current method.However,due to the sensitivity of the acoustic array sensor and the influence of the equipment operation site interference,the acoustic array sensor device for partial discharge type diagnosis by phase resolved partial discharge(PRPD)map might occasionally presents incorrect results,thus affecting the power equipment operation and maintenance strategy.The acoustic array sensor detection device for power equipment developed in this paper applies the array design model of equal-area multi-arm spiral with machine learning fast fourier transform clean(FFT-CLEAN)sound source localization identification algorithm to avoid the interference factors in the noise acquisition system using a single microphone and conventional beam forming algorithm,improves the spatial resolution of the acoustic array sensor device,and proposes an acoustic array sensor device based on the acoustic spectrogram.The analysis and diagnosis method of discharge type of acoustic array sensor device can effectively reduce the system misjudgment caused by factors such as the resolution of the acoustic imaging device and the time domain pulse of the digital signal,and reduce the false alarm rate of the acoustic array sensor device.The proposed method is tested by selecting power cables as the object,and its effectiveness is proved by laboratory verification and field verification.展开更多
Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technol...Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technology(5G)based mobile devices.The EMG signals generated inside the body with unique individual characteristics are being studied as a part of nextgeneration user recognition methods.However,there is a limitation when applying EMG signals to user recognition systems as the same operation needs to be repeated while maintaining a constant strength of muscle over time.Hence,it is necessary to conduct research on multidimensional feature transformation that includes changes in frequency features over time.In this paper,we propose a user recognition system that applies EMG signals to the short-time fourier transform(STFT),and converts the signals into EMG spectrogram images while adjusting the time-frequency resolution to extract multidimensional features.The proposed system is composed of a data pre-processing and normalization process,spectrogram image conversion process,and final classification process.The experimental results revealed that the proposed EMG spectrogram image-based user recognition system has a 95.4%accuracy performance,which is 13%higher than the EMGsignal-based system.Such a user recognition accuracy improvement was achieved by using multidimensional features,in the time-frequency domain.展开更多
In-process damage to a cutting tool degrades the surfacenish of the job shaped by machining and causes a signicantnancial loss.This stimulates the need for Tool Condition Monitoring(TCM)t...In-process damage to a cutting tool degrades the surfacenish of the job shaped by machining and causes a signicantnancial loss.This stimulates the need for Tool Condition Monitoring(TCM)to assist detection of failure before it extends to the worse phase.Machine Learning(ML)based TCM has been extensively explored in the last decade.However,most of the research is now directed toward Deep Learning(DL).The“Deep”formulation,hierarchical compositionality,distributed representation and end-to-end learning of Neural Nets need to be explored to create a generalized TCM framework to perform eciently in a high-noise environment of cross-domain machining.With this motivation,the design of dierent CNN(Convolutional Neural Network)architectures such as AlexNet,ResNet-50,LeNet-5,and VGG-16 is presented in this paper.Real-time spindle vibrations corresponding to healthy and various faulty congurations of milling cutter were acquired.This data was transformed into the time-frequency domain and further processed by proposed architectures in graphical form,i.e.,spectrogram.The model is trained,tested,and validated considering dierent datasets and showcased promising results.展开更多
Cardiovascular diseases(CVDs)remain one of the foremost causes of death globally;hence,the need for several must-have,advanced automated diagnostic solutions towards early detection and intervention.Traditional auscul...Cardiovascular diseases(CVDs)remain one of the foremost causes of death globally;hence,the need for several must-have,advanced automated diagnostic solutions towards early detection and intervention.Traditional auscultation of cardiovascular sounds is heavily reliant on clinical expertise and subject to high variability.To counter this limitation,this study proposes an AI-driven classification system for cardiovascular sounds whereby deep learning techniques are engaged to automate the detection of an abnormal heartbeat.We employ FastAI vision-learner-based convolutional neural networks(CNNs)that include ResNet,DenseNet,VGG,ConvNeXt,SqueezeNet,and AlexNet to classify heart sound recordings.Instead of raw waveform analysis,the proposed approach transforms preprocessed cardiovascular audio signals into spectrograms,which are suited for capturing temporal and frequency-wise patterns.The models are trained on the PASCAL Cardiovascular Challenge dataset while taking into consideration the recording variations,noise levels,and acoustic distortions.To demonstrate generalization,external validation using Google’s Audio set Heartbeat Sound dataset was performed using a dataset rich in cardiovascular sounds.Comparative analysis revealed that DenseNet-201,ConvNext Large,and ResNet-152 could deliver superior performance to the other architectures,achieving an accuracy of 81.50%,a precision of 85.50%,and an F1-score of 84.50%.In the process,we performed statistical significance testing,such as the Wilcoxon signed-rank test,to validate performance improvements over traditional classification methods.Beyond the technical contributions,the research underscores clinical integration,outlining a pathway in which the proposed system can augment conventional electronic stethoscopes and telemedicine platforms in the AI-assisted diagnostic workflows.We also discuss in detail issues of computational efficiency,model interpretability,and ethical considerations,particularly concerning algorithmic bias stemming from imbalanced datasets and the need for real-time processing in clinical settings.The study describes a scalable,automated system combining deep learning,feature extraction using spectrograms,and external validation that can assist healthcare providers in the early and accurate detection of cardiovascular disease.AI-driven solutions can be viable in improving access,reducing delays in diagnosis,and ultimately even the continued global burden of heart disease.展开更多
This study examines the variations in noise levels across various subway lines in Singapore and three other cities,and provides a detailed overview of the trends and factors influencing subway noise.Most of the equiva...This study examines the variations in noise levels across various subway lines in Singapore and three other cities,and provides a detailed overview of the trends and factors influencing subway noise.Most of the equivalent sound pressure level(Leq)in typical subway cabins across the Singapore subway lines are below 85 dBA,with some notable exceptions.These variations in noise levels are influenced by several factors,including rolling stock structure,track conditions and environmental and aerodynamic factors.The spectrogram analysis indicates that the cabin noise is mostly concentrated below the frequency of 1,000 Hz.This study also analyzes cabin noise in subway systems in Suzhou,Seoul,and Tokyo to allow for broader comparisons.It studies the impact of factors such as stock materials,track conditions including the quality of the rails,the presence of curves or irregularities,and maintenance frequency on cabin noise.展开更多
Frequency-modulated continuous-wave radar enables the non-contact and privacy-preserving recognition of human behavior.However,the accuracy of behavior recognition is directly influenced by the spatial relationship be...Frequency-modulated continuous-wave radar enables the non-contact and privacy-preserving recognition of human behavior.However,the accuracy of behavior recognition is directly influenced by the spatial relationship between human posture and the radar.To address the issue of low accuracy in behavior recognition when the human body is not directly facing the radar,a method combining local outlier factor with Doppler information is proposed for the correction of multi-classifier recognition results.Initially,the information such as distance,velocity,and micro-Doppler spectrogram of the target is obtained using the fast Fourier transform and histogram of oriented gradients-support vector machine methods,followed by preliminary recognition.Subsequently,Platt scaling is employed to transform recognition results into confidence scores,and finally,the Doppler-local outlier factor method is utilized to calibrate the confidence scores,with the highest confidence classifier result considered as the recognition outcome.Experimental results demonstrate that this approach achieves an average recognition accuracy of 96.23%for comprehensive human behavior recognition in various orientations.展开更多
文摘A heart attack disrupts the normal flow of blood to the heart muscle,potentially causing severe damage or death if not treated promptly.It can lead to long-term health complications,reduce quality of life,and significantly impact daily activities and overall well-being.Despite the growing popularity of deep learning,several drawbacks persist,such as complexity and the limitation of single-model learning.In this paper,we introduce a residual learning-based feature fusion technique to achieve high accuracy in differentiating abnormal cardiac rhythms heart sound.Combining MobileNet with DenseNet201 for feature fusion leverages MobileNet lightweight,efficient architecture with DenseNet201,dense connections,resulting in enhanced feature extraction and improved model performance with reduced computational cost.To further enhance the fusion,we employed residual learning to optimize the hierarchical features of heart abnormal sounds during training.The experimental results demonstrate that the proposed fusion method achieved an accuracy of 95.67%on the benchmark PhysioNet-2016 Spectrogram dataset.To further validate the performance,we applied it to the BreakHis dataset with a magnification level of 100X.The results indicate that the model maintains robust performance on the second dataset,achieving an accuracy of 96.55%.it highlights its consistent performance,making it a suitable for various applications.
文摘A new lighting and enlargement on phase spectrogram (PS) and frequency spectrogram (FS) is presented in this paper. These representations result from the coupling of power spectrogram and short time Fourier transform (STFT). The main contribution is the construction of the 3D phase spectrogram (3DPS) and the 3D frequency spectrogram (3DFS). These new tools allow such specific test signals as small slope linear chirp, phase jump case of musical signal analysis is reported. The main objective is to and small frequency jump to be analyzed. An application detect small frequency and phase variations in order to characterize each type of sound attack without losing the amplitude information given by power spectrogram
基金This work was supported by the science and technology project of State Grid Shanghai Municipal Electric Power Company(No.52090020007F)National Key R&D Program of China(2017YFB0902800).
文摘Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods such as ultrasonic method and pulse current method.However,due to the sensitivity of the acoustic array sensor and the influence of the equipment operation site interference,the acoustic array sensor device for partial discharge type diagnosis by phase resolved partial discharge(PRPD)map might occasionally presents incorrect results,thus affecting the power equipment operation and maintenance strategy.The acoustic array sensor detection device for power equipment developed in this paper applies the array design model of equal-area multi-arm spiral with machine learning fast fourier transform clean(FFT-CLEAN)sound source localization identification algorithm to avoid the interference factors in the noise acquisition system using a single microphone and conventional beam forming algorithm,improves the spatial resolution of the acoustic array sensor device,and proposes an acoustic array sensor device based on the acoustic spectrogram.The analysis and diagnosis method of discharge type of acoustic array sensor device can effectively reduce the system misjudgment caused by factors such as the resolution of the acoustic imaging device and the time domain pulse of the digital signal,and reduce the false alarm rate of the acoustic array sensor device.The proposed method is tested by selecting power cables as the object,and its effectiveness is proved by laboratory verification and field verification.
基金supported by Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(No.2017R1A6A1A03015496)the National Research Foundation of Korea(NRF)grant funded by the Korea government(MSIT)(No.NRF-2021R1A2C1014033).
文摘Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technology(5G)based mobile devices.The EMG signals generated inside the body with unique individual characteristics are being studied as a part of nextgeneration user recognition methods.However,there is a limitation when applying EMG signals to user recognition systems as the same operation needs to be repeated while maintaining a constant strength of muscle over time.Hence,it is necessary to conduct research on multidimensional feature transformation that includes changes in frequency features over time.In this paper,we propose a user recognition system that applies EMG signals to the short-time fourier transform(STFT),and converts the signals into EMG spectrogram images while adjusting the time-frequency resolution to extract multidimensional features.The proposed system is composed of a data pre-processing and normalization process,spectrogram image conversion process,and final classification process.The experimental results revealed that the proposed EMG spectrogram image-based user recognition system has a 95.4%accuracy performance,which is 13%higher than the EMGsignal-based system.Such a user recognition accuracy improvement was achieved by using multidimensional features,in the time-frequency domain.
文摘In-process damage to a cutting tool degrades the surfacenish of the job shaped by machining and causes a signicantnancial loss.This stimulates the need for Tool Condition Monitoring(TCM)to assist detection of failure before it extends to the worse phase.Machine Learning(ML)based TCM has been extensively explored in the last decade.However,most of the research is now directed toward Deep Learning(DL).The“Deep”formulation,hierarchical compositionality,distributed representation and end-to-end learning of Neural Nets need to be explored to create a generalized TCM framework to perform eciently in a high-noise environment of cross-domain machining.With this motivation,the design of dierent CNN(Convolutional Neural Network)architectures such as AlexNet,ResNet-50,LeNet-5,and VGG-16 is presented in this paper.Real-time spindle vibrations corresponding to healthy and various faulty congurations of milling cutter were acquired.This data was transformed into the time-frequency domain and further processed by proposed architectures in graphical form,i.e.,spectrogram.The model is trained,tested,and validated considering dierent datasets and showcased promising results.
基金funded by the deanship of scientific research(DSR),King Abdulaziz University,Jeddah,under grant No.(G-1436-611-309).
文摘Cardiovascular diseases(CVDs)remain one of the foremost causes of death globally;hence,the need for several must-have,advanced automated diagnostic solutions towards early detection and intervention.Traditional auscultation of cardiovascular sounds is heavily reliant on clinical expertise and subject to high variability.To counter this limitation,this study proposes an AI-driven classification system for cardiovascular sounds whereby deep learning techniques are engaged to automate the detection of an abnormal heartbeat.We employ FastAI vision-learner-based convolutional neural networks(CNNs)that include ResNet,DenseNet,VGG,ConvNeXt,SqueezeNet,and AlexNet to classify heart sound recordings.Instead of raw waveform analysis,the proposed approach transforms preprocessed cardiovascular audio signals into spectrograms,which are suited for capturing temporal and frequency-wise patterns.The models are trained on the PASCAL Cardiovascular Challenge dataset while taking into consideration the recording variations,noise levels,and acoustic distortions.To demonstrate generalization,external validation using Google’s Audio set Heartbeat Sound dataset was performed using a dataset rich in cardiovascular sounds.Comparative analysis revealed that DenseNet-201,ConvNext Large,and ResNet-152 could deliver superior performance to the other architectures,achieving an accuracy of 81.50%,a precision of 85.50%,and an F1-score of 84.50%.In the process,we performed statistical significance testing,such as the Wilcoxon signed-rank test,to validate performance improvements over traditional classification methods.Beyond the technical contributions,the research underscores clinical integration,outlining a pathway in which the proposed system can augment conventional electronic stethoscopes and telemedicine platforms in the AI-assisted diagnostic workflows.We also discuss in detail issues of computational efficiency,model interpretability,and ethical considerations,particularly concerning algorithmic bias stemming from imbalanced datasets and the need for real-time processing in clinical settings.The study describes a scalable,automated system combining deep learning,feature extraction using spectrograms,and external validation that can assist healthcare providers in the early and accurate detection of cardiovascular disease.AI-driven solutions can be viable in improving access,reducing delays in diagnosis,and ultimately even the continued global burden of heart disease.
文摘This study examines the variations in noise levels across various subway lines in Singapore and three other cities,and provides a detailed overview of the trends and factors influencing subway noise.Most of the equivalent sound pressure level(Leq)in typical subway cabins across the Singapore subway lines are below 85 dBA,with some notable exceptions.These variations in noise levels are influenced by several factors,including rolling stock structure,track conditions and environmental and aerodynamic factors.The spectrogram analysis indicates that the cabin noise is mostly concentrated below the frequency of 1,000 Hz.This study also analyzes cabin noise in subway systems in Suzhou,Seoul,and Tokyo to allow for broader comparisons.It studies the impact of factors such as stock materials,track conditions including the quality of the rails,the presence of curves or irregularities,and maintenance frequency on cabin noise.
基金the National Key Research and Development Program of China(No.2022YFC3601400)。
文摘Frequency-modulated continuous-wave radar enables the non-contact and privacy-preserving recognition of human behavior.However,the accuracy of behavior recognition is directly influenced by the spatial relationship between human posture and the radar.To address the issue of low accuracy in behavior recognition when the human body is not directly facing the radar,a method combining local outlier factor with Doppler information is proposed for the correction of multi-classifier recognition results.Initially,the information such as distance,velocity,and micro-Doppler spectrogram of the target is obtained using the fast Fourier transform and histogram of oriented gradients-support vector machine methods,followed by preliminary recognition.Subsequently,Platt scaling is employed to transform recognition results into confidence scores,and finally,the Doppler-local outlier factor method is utilized to calibrate the confidence scores,with the highest confidence classifier result considered as the recognition outcome.Experimental results demonstrate that this approach achieves an average recognition accuracy of 96.23%for comprehensive human behavior recognition in various orientations.