期刊文献+
共找到748篇文章
< 1 2 38 >
每页显示 20 50 100
Detection of Abnormal Cardiac Rhythms Using Feature Fusion Technique with Heart Sound Spectrograms
1
作者 Saif Ur Rehman Khan Zia Khan 《Journal of Bionic Engineering》 2025年第4期2030-2049,共20页
A heart attack disrupts the normal flow of blood to the heart muscle,potentially causing severe damage or death if not treated promptly.It can lead to long-term health complications,reduce quality of life,and signific... A heart attack disrupts the normal flow of blood to the heart muscle,potentially causing severe damage or death if not treated promptly.It can lead to long-term health complications,reduce quality of life,and significantly impact daily activities and overall well-being.Despite the growing popularity of deep learning,several drawbacks persist,such as complexity and the limitation of single-model learning.In this paper,we introduce a residual learning-based feature fusion technique to achieve high accuracy in differentiating abnormal cardiac rhythms heart sound.Combining MobileNet with DenseNet201 for feature fusion leverages MobileNet lightweight,efficient architecture with DenseNet201,dense connections,resulting in enhanced feature extraction and improved model performance with reduced computational cost.To further enhance the fusion,we employed residual learning to optimize the hierarchical features of heart abnormal sounds during training.The experimental results demonstrate that the proposed fusion method achieved an accuracy of 95.67%on the benchmark PhysioNet-2016 Spectrogram dataset.To further validate the performance,we applied it to the BreakHis dataset with a magnification level of 100X.The results indicate that the model maintains robust performance on the second dataset,achieving an accuracy of 96.55%.it highlights its consistent performance,making it a suitable for various applications. 展开更多
关键词 Cardiac rhythms Feature fusion Residual learning BreakHis spectrogram sound
在线阅读 下载PDF
Continuous frequency and phase spectrograms: a study of their 2D and 3D capabilities and application to musical signal analysis 被引量:1
2
作者 Laurent NAVARRO Guy COURBEBAISSE Jean-Charles PINOLI 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2008年第2期199-206,共8页
A new lighting and enlargement on phase spectrogram (PS) and frequency spectrogram (FS) is presented in this paper. These representations result from the coupling of power spectrogram and short time Fourier transf... A new lighting and enlargement on phase spectrogram (PS) and frequency spectrogram (FS) is presented in this paper. These representations result from the coupling of power spectrogram and short time Fourier transform (STFT). The main contribution is the construction of the 3D phase spectrogram (3DPS) and the 3D frequency spectrogram (3DFS). These new tools allow such specific test signals as small slope linear chirp, phase jump case of musical signal analysis is reported. The main objective is to and small frequency jump to be analyzed. An application detect small frequency and phase variations in order to characterize each type of sound attack without losing the amplitude information given by power spectrogram 展开更多
关键词 Frequency spectrogram (FS) Phase spectrogram (PS) Time-frequency representations Musical signals
在线阅读 下载PDF
Research on data diagnosis method of acoustic array sensor device based on spectrogram 被引量:4
3
作者 Xing Lei Hang Ji +3 位作者 Qiang Xu Ting Ye Shengfu Zhang Chengjun Huang 《Global Energy Interconnection》 EI CAS CSCD 2022年第4期418-433,共16页
Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods su... Acoustic array sensor device for partial discharge detection is widely used in power equipment inspection with the advantages of non-contact and precise positioning compared with partial discharge detection methods such as ultrasonic method and pulse current method.However,due to the sensitivity of the acoustic array sensor and the influence of the equipment operation site interference,the acoustic array sensor device for partial discharge type diagnosis by phase resolved partial discharge(PRPD)map might occasionally presents incorrect results,thus affecting the power equipment operation and maintenance strategy.The acoustic array sensor detection device for power equipment developed in this paper applies the array design model of equal-area multi-arm spiral with machine learning fast fourier transform clean(FFT-CLEAN)sound source localization identification algorithm to avoid the interference factors in the noise acquisition system using a single microphone and conventional beam forming algorithm,improves the spatial resolution of the acoustic array sensor device,and proposes an acoustic array sensor device based on the acoustic spectrogram.The analysis and diagnosis method of discharge type of acoustic array sensor device can effectively reduce the system misjudgment caused by factors such as the resolution of the acoustic imaging device and the time domain pulse of the digital signal,and reduce the false alarm rate of the acoustic array sensor device.The proposed method is tested by selecting power cables as the object,and its effectiveness is proved by laboratory verification and field verification. 展开更多
关键词 Acoustic array sensor device Acoustic spectrogram Partial discharge Power equipment False alarm rate
在线阅读 下载PDF
Health Monitoring of Milling Tool Inserts Using CNN Architectures Trained by Vibration Spectrograms 被引量:2
4
作者 Sonali S.Patil Sujit S.Pardeshi Abhishek D.Patange 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第7期177-199,共23页
In-process damage to a cutting tool degrades the surface􀀀nish of the job shaped by machining and causes a signi􀀀cant􀀀nancial loss.This stimulates the need for Tool Condition Monitoring(TCM)t... In-process damage to a cutting tool degrades the surface􀀀nish of the job shaped by machining and causes a signi􀀀cant􀀀nancial loss.This stimulates the need for Tool Condition Monitoring(TCM)to assist detection of failure before it extends to the worse phase.Machine Learning(ML)based TCM has been extensively explored in the last decade.However,most of the research is now directed toward Deep Learning(DL).The“Deep”formulation,hierarchical compositionality,distributed representation and end-to-end learning of Neural Nets need to be explored to create a generalized TCM framework to perform eciently in a high-noise environment of cross-domain machining.With this motivation,the design of dierent CNN(Convolutional Neural Network)architectures such as AlexNet,ResNet-50,LeNet-5,and VGG-16 is presented in this paper.Real-time spindle vibrations corresponding to healthy and various faulty con􀀀gurations of milling cutter were acquired.This data was transformed into the time-frequency domain and further processed by proposed architectures in graphical form,i.e.,spectrogram.The model is trained,tested,and validated considering dierent datasets and showcased promising results. 展开更多
关键词 Milling tool inserts health monitoring vibration spectrograms deep learning convolutional neural network
在线阅读 下载PDF
User Recognition System Based on Spectrogram Image Conversion Using EMG Signals 被引量:2
5
作者 Jae Myung Kim Gyu Ho Choi +1 位作者 Min-Gu Kim Sung Bum Pan 《Computers, Materials & Continua》 SCIE EI 2022年第7期1213-1227,共15页
Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technol... Recently,user recognitionmethods to authenticate personal identity has attracted significant attention especially with increased availability of various internet of things(IoT)services through fifth-generation technology(5G)based mobile devices.The EMG signals generated inside the body with unique individual characteristics are being studied as a part of nextgeneration user recognition methods.However,there is a limitation when applying EMG signals to user recognition systems as the same operation needs to be repeated while maintaining a constant strength of muscle over time.Hence,it is necessary to conduct research on multidimensional feature transformation that includes changes in frequency features over time.In this paper,we propose a user recognition system that applies EMG signals to the short-time fourier transform(STFT),and converts the signals into EMG spectrogram images while adjusting the time-frequency resolution to extract multidimensional features.The proposed system is composed of a data pre-processing and normalization process,spectrogram image conversion process,and final classification process.The experimental results revealed that the proposed EMG spectrogram image-based user recognition system has a 95.4%accuracy performance,which is 13%higher than the EMGsignal-based system.Such a user recognition accuracy improvement was achieved by using multidimensional features,in the time-frequency domain. 展开更多
关键词 EMG user recognition spectrogram CNN
在线阅读 下载PDF
基于改进EfficientNetV2的铝液泄漏声音识别与预警机制
6
作者 梁艳辉 温承杰 +2 位作者 闫军威 周璇 张洪涛 《华南理工大学学报(自然科学版)》 北大核心 2026年第2期38-51,共14页
铝液泄漏是导致铝加工深井铸造爆炸事故的直接原因。为解决实际工程中铝液泄漏判断方法滞后性强、准确率低和监测范围受限等问题,该文提出了基于改进EfficientNetV2的铝液泄漏声音识别方法。该方法通过声音特征判断铝液泄漏,以扩大监测... 铝液泄漏是导致铝加工深井铸造爆炸事故的直接原因。为解决实际工程中铝液泄漏判断方法滞后性强、准确率低和监测范围受限等问题,该文提出了基于改进EfficientNetV2的铝液泄漏声音识别方法。该方法通过声音特征判断铝液泄漏,以扩大监测范围;同时通过优化堆叠因子、引入高效通道注意力机制改进EfficientNetV2结构,以进一步提升识别速率与准确率。首先,利用拾音器采集不同场景下的声音数据,构建包含7类声音场景的声音数据库;然后,从声音信号中提取对数梅尔语谱图作为特征集,输入到改进的EfficientNetV2模型进行训练与验证,最终得到铝液泄漏声音识别模型。实验结果表明:改进的EfficientNetV2识别准确率达95.48%;与原始EfficientNetV2、ResNet、 RegNet及DenseNet相比,改进模型的浮点运算次数分别为上述模型的12.34%、8.64%、11.14%和10.80%,参数量分别为上述模型的11.37%、9.55%、15.95%和17.24%,CPU环境下每秒处理图像帧数分别为上述模型的6.53倍、6.14倍、4.41倍和8.00倍,说明改进的EfficientNetV2具有快速准确的识别性能。此外,基于该文提出的铝液泄漏声音识别方法,构建了铝液泄漏风险预警机制,并将该机制应用于铸造单元的实时风险监测。实践结果验证了所提识别方法与预警机制的有效性,可为铝加工深井铸造爆炸事故的预防提供技术参考。 展开更多
关键词 铝加工深井铸造 铝液泄漏 声音识别 风险预警 改进的EfficientNetV2 对数梅尔语谱图
在线阅读 下载PDF
An Improved Forest Fire Detection Model Using Audio Classification and Machine Learning
7
作者 Kemahyanto Exaudi Deris Stiawan +4 位作者 Bhakti Yudho Suprapto Hanif Fakhrurroja MohdYazid Idris Tami AAlghamdi Rahmat Budiarto 《Computers, Materials & Continua》 2026年第1期2062-2085,共24页
Sudden wildfires cause significant global ecological damage.While satellite imagery has advanced early fire detection and mitigation,image-based systems face limitations including high false alarm rates,visual obstruc... Sudden wildfires cause significant global ecological damage.While satellite imagery has advanced early fire detection and mitigation,image-based systems face limitations including high false alarm rates,visual obstructions,and substantial computational demands,especially in complex forest terrains.To address these challenges,this study proposes a novel forest fire detection model utilizing audio classification and machine learning.We developed an audio-based pipeline using real-world environmental sound recordings.Sounds were converted into Mel-spectrograms and classified via a Convolutional Neural Network(CNN),enabling the capture of distinctive fire acoustic signatures(e.g.,crackling,roaring)that are minimally impacted by visual or weather conditions.Internet of Things(IoT)sound sensors were crucial for generating complex environmental parameters to optimize feature extraction.The CNN model achieved high performance in stratified 5-fold cross-validation(92.4%±1.6 accuracy,91.2%±1.8 F1-score)and on test data(94.93%accuracy,93.04%F1-score),with 98.44%precision and 88.32%recall,demonstrating reliability across environmental conditions.These results indicate that the audio-based approach not only improves detection reliability but also markedly reduces computational overhead compared to traditional image-based methods.The findings suggest that acoustic sensing integrated with machine learning offers a powerful,low-cost,and efficient solution for real-time forest fire monitoring in complex,dynamic environments. 展开更多
关键词 Audio classification convolutional neural network(CNN) environmental science forest fire detection machine learning spectrogram analysis IOT
在线阅读 下载PDF
基于双分支残差网络的病理语音识别
8
作者 程愉凯 段淑斐 +3 位作者 贾海蓉 李付江 LIANG Huizhi 张卫 《科学技术与工程》 北大核心 2026年第2期663-672,共10页
针对现有研究对病理语音特征提取不充分,导致病理语音识别率低的问题,提出了一种基于双分支残差网络的病理语音识别算法。根据构音障碍患者复杂多样的语音症状,采用宽带和窄带频谱图作为网络输入;提出了自适应特征提取残差块,通过全维... 针对现有研究对病理语音特征提取不充分,导致病理语音识别率低的问题,提出了一种基于双分支残差网络的病理语音识别算法。根据构音障碍患者复杂多样的语音症状,采用宽带和窄带频谱图作为网络输入;提出了自适应特征提取残差块,通过全维动态像素注意力卷积从位置、通道、滤波和像素多个维度全面捕捉病理特征;提出了双流互补融合模块,通过加权融合后的特征不仅保留了各分支的关键信息,还通过跨维度交互实现了更优的特征表达,提升了病理语音识别的准确率。在中文病理语音数据集THE-POSSD和西方公开病理语音数据集UA-Speech上进行实验,其结果验证了所提算法的有效性和泛化能力。 展开更多
关键词 病理语音识别 构音障碍 残差网络 动态卷积 加权融合 频谱图
在线阅读 下载PDF
Joint spectrogram segmentation and ridge-extraction method for separating multimodal guided waves in long bones 被引量:10
9
作者 ZHANG ZhengGang XU KaiLiang +1 位作者 TA DeAn WANG WeiQi 《Science China(Physics,Mechanics & Astronomy)》 SCIE EI CAS 2013年第7期1317-1323,共7页
Ultrasonic guided waves (GWs) can be used to evaluate long bones effectively because of the ability to provide the information of the whole bone. In this study, a joint spectrogram segmentation and ridge-extraction (J... Ultrasonic guided waves (GWs) can be used to evaluate long bones effectively because of the ability to provide the information of the whole bone. In this study, a joint spectrogram segmentation and ridge-extraction (JSSRE) method was proposed to separate multiple modes in long bones. First, the Gabor time-frequency transform was applied to obtain the spectrogram of multimodal signals. Then, a multi-class image segmentation algorithm was used to find the corresponding region of each mode in the spectrogram, including an improved watershed transform and a region growing procedure. Finally, the ridges were extracted and the time domain signals representing individual modes were reconstructed from these ridges in each region. The validations of this method were discussed by simulated multimodal signals with different signal-to-noise ratios (SNR). The correlation coefficients between the original signals without noise and the reconstructed signals were calculated to analyze the results quantitatively. The results showed that the extracted ridges were in good agreement with generated theoretical dispersion curves, and the reconstructed signals were highly related to the original signals, even under the SNR=3 dB situation. 展开更多
关键词 multimodal guided waves long bone spectrogram SEGMENTATION
原文传递
基于多通道声发射信号融合的水电机组空化故障诊断
10
作者 肖龙 肖湘曲 +3 位作者 何志宏 师博威 徐恺 李超顺 《水利学报》 北大核心 2026年第2期293-305,共13页
针对水电机组空化故障因信号单一及噪声干扰而难以识别的问题,本文提出一种基于多通道声发射信号融合的水电机组空化故障诊断方法。首先,在水电机组空化模拟试验台采集空化试验的多通道声发射信号,将多通道声发射信号经数据压缩处理形... 针对水电机组空化故障因信号单一及噪声干扰而难以识别的问题,本文提出一种基于多通道声发射信号融合的水电机组空化故障诊断方法。首先,在水电机组空化模拟试验台采集空化试验的多通道声发射信号,将多通道声发射信号经数据压缩处理形成水电机组空化故障数据集;再将声发射信号变换成梅尔时频图,对频率进行加权处理,以去除高频信号中的噪声和突出低频信号中的特征;最后,结合卷积块注意力模块(CBAM)和D-S证据理论构建出基于决策级融合的多通道深度卷积神经网络模型,进行水电机组空化故障样本的训练和测试,得到故障诊断结果。结果表明,该方法能有效区分不同工况下的空化故障,与其他模型方法对比,具有较高的诊断精度和良好的抗噪能力,对实际中的水电机组空化故障诊断应用有较大参考作用。 展开更多
关键词 多通道信号融合 声发射信号 水电机组空化故障诊断 梅尔时频图 深度卷积神经网络
在线阅读 下载PDF
Wheeze detecting method based on spectrogram entropy analysis 被引量:5
11
作者 LI Jiarui HONG Ying 《Chinese Journal of Acoustics》 CSCD 2016年第4期508-515,共8页
In order to eliminate the subjectivity of wheeze diagnosis and improve the accuracy of objective detecting methods,this paper introduces a wheeze detecting method based on spectrogram entropy analysis.This algorithm m... In order to eliminate the subjectivity of wheeze diagnosis and improve the accuracy of objective detecting methods,this paper introduces a wheeze detecting method based on spectrogram entropy analysis.This algorithm mainly comprises three steps which are preprocessing,features extracting and wheeze detecting based on support vector machine(SVM).Herein,the preprocessing consists of the short-time Fourier transform(STFT) decomposition and detrending.The features are extracted from the entropy of spectrograms.The step of detrending makes the difference of the features between wheeze and normal lung sounds more obvious.Moreover,compared with the method whose decision is based on the empirical threshold,there is no uncertain detecting result any more.Results of two testing experiments show that the detecting accuracy(AC) are 97.1%and 95.7%,respectively,which proves that the proposed method could be an efficient way to detect wheeze. 展开更多
关键词 NLS Wheeze detecting method based on spectrogram entropy analysis STFT SVM
原文传递
基于双低秩调整训练的船舶辐射噪声识别
12
作者 马治勋 汤宁 +1 位作者 李璇 郝程鹏 《水下无人系统学报》 2026年第1期47-56,共10页
针对深度学习模型在船舶辐射噪声识别中由数据短缺、域偏移导致的泛化能力受限问题,文中提出了一种权重-特征双低秩自适应迁移学习框架。该框架从模型权重和特征表达2个维度协同开展低秩优化:在权重空间,冻结预训练权重,通过轻量化低秩... 针对深度学习模型在船舶辐射噪声识别中由数据短缺、域偏移导致的泛化能力受限问题,文中提出了一种权重-特征双低秩自适应迁移学习框架。该框架从模型权重和特征表达2个维度协同开展低秩优化:在权重空间,冻结预训练权重,通过轻量化低秩权重调整(WLoRA)模块构建可学习低秩权重参数,以较少参数量完成权重微调,从而降低过拟合风险;在特征空间,基于船舶辐射噪声Mel时频谱的内在低秩结构,通过低秩特征调整(FLoRA)模块对特征进行压缩和重构,从而显式约束模型学习低秩特征。该框架充分考虑了Mel时频谱的固有低秩结构,深入挖掘预训练模型潜力,有效提升了迁移学习性能。通过在ShipsEar和Deepship公开数据集上的实验表明,相对于直接微调预训练模型,所提方法能够有效提升迁移学习在船舶辐射嗓声分类模型中的性能。进一步的消融实验验证了2个低秩模块的有效性。 展开更多
关键词 船舶辐射噪声 双低秩 迁移学习 Mel时频谱
在线阅读 下载PDF
Speech endpoint detection in low-SNRs environment based on perception spectrogram structure boundary parameter 被引量:9
13
作者 WU Di ZHAO Heming +4 位作者 HUANG Chengwei XIAO Zhongzhe ZHANG Xiaojun XU Yishen TAO Zhi 《Chinese Journal of Acoustics》 2014年第4期428-440,共13页
The Perception Spectrogram Structure Boundary(PSSB)parameter is proposed for speech endpoint detection as a preprocess of speech or speaker recognition.At first a hearing perception speech enhancement is carried out... The Perception Spectrogram Structure Boundary(PSSB)parameter is proposed for speech endpoint detection as a preprocess of speech or speaker recognition.At first a hearing perception speech enhancement is carried out.Then the two-dimensional enhancement is performed upon the sound spectrogram according to the difference between the determinacy distribution characteristic of speech and the random distribution characteristic of noise.Finally a decision for endpoint was made by the PSSB parameter.Experimental results show that,in a low SNR environment from-10 dB to 10 dB,the algorithm proposed in this paper may achieve higher accuracy than the extant endpoint detection algorithms.The detection accuracy of 75.2%can be reached even in the extremely low SNR at-10 dB.Therefore it is suitable for speech endpoint detection in low-SNRs environment. 展开更多
关键词 Speech endpoint detection in low-SNRs environment based on perception spectrogram structure boundary parameter
原文传递
抽水蓄能电动机励磁绕组匝间短路的环流特性分析
14
作者 李泽同 李永刚 +1 位作者 马明晗 齐鹏 《内蒙古大学学报(自然科学版)》 2026年第1期23-33,共11页
围绕抽水蓄能电动机励磁绕组早期匝间短路难以识别的难题,提出一种以定子并联支路环流特性为基础的方法。首先,从电磁场理论出发,在电动机运行条件下,建立并推导出励磁绕组匝间短路与定子同相支路环流谐波之间的定量关系式。然后,利用... 围绕抽水蓄能电动机励磁绕组早期匝间短路难以识别的难题,提出一种以定子并联支路环流特性为基础的方法。首先,从电磁场理论出发,在电动机运行条件下,建立并推导出励磁绕组匝间短路与定子同相支路环流谐波之间的定量关系式。然后,利用有限元软件建立抽水蓄能电动机的二维仿真模型,模拟正常、轻微及严重短路3种工况,并对气隙磁密和支路环流进行频谱分析。研究发现,匝间短路故障会在定子支路环流中激发出特定的分数次谐波,且这些特征谐波的幅值与故障严重程度呈显著正相关,同时故障磁极处的气隙磁密会相应减小。该方法通过监测环流中的特征谐波,可实现对电动机励磁绕组早期匝间短路的灵敏度、无扰性进行在线检测,为保障机组安全稳定运行提供了有效的技术手段。 展开更多
关键词 抽水蓄能电动机 励磁绕组 匝间短路 环流时频谱图
原文传递
基于DenseNet和迁移学习的声纹识别方法
15
作者 陈润强 王卫辰 +1 位作者 徐亚博 李烈 《现代电子技术》 北大核心 2026年第2期171-177,共7页
传统的声纹识别方法受环境噪声和个体变化等因素的影响,准确率难以进一步提升。为此,提出一种基于DenseNet和迁移学习的语谱图声纹识别方法,以进一步提高声纹识别系统的性能。使用DenseNet的声纹识别模型对源域语音进行训练;采用迁移学... 传统的声纹识别方法受环境噪声和个体变化等因素的影响,准确率难以进一步提升。为此,提出一种基于DenseNet和迁移学习的语谱图声纹识别方法,以进一步提高声纹识别系统的性能。使用DenseNet的声纹识别模型对源域语音进行训练;采用迁移学习将源域训练的DenseNet模型迁移到目标域训练数据;在目标域测试数据上验证迁移后模型的性能,并对比分析迁移前后DenseNet模型和ResNet模型的声纹识别性能。实验结果表明,与原始ResNet模型、DenseNet模型和经迁移学习的ResNet模型相比,经迁移学习的DenseNet模型的识别准确率分别提高了3.89%、6.67%和3.34%,且具有较快的收敛速度。 展开更多
关键词 声纹识别 DenseNet 迁移学习 语谱图 ResNet 语音信号处理
在线阅读 下载PDF
Manifestation of attosecond XUV fields temporal structures in attosecond streaking spectrogram
16
作者 陈光龙 曹云玖 Dong Eon Kim 《Chinese Optics Letters》 SCIE EI CAS CSCD 2011年第6期100-103,共4页
The features of an attosecond extreme ultraviolet (XUV) field are encoded in the attosecond XUV spectrogram. We investigate the effect of the temporal structures of attosecond XUV fields on the attosecond streaking ... The features of an attosecond extreme ultraviolet (XUV) field are encoded in the attosecond XUV spectrogram. We investigate the effect of the temporal structures of attosecond XUV fields on the attosecond streaking spectrogram. Factors such as the number of attosecond XUV pulses and the temporal chirp of attosecond XUV pulses are considered. Results indicate that unlike the attosecond streaking spectrogram for an attosecond XUV field with two pulses of a half-cycle separation of streaking field, the spectrogram for the attosecond XUV field with three pulses demonstrates fine spectral fringes in separated traces. 展开更多
关键词 Manifestation of attosecond XUV fields temporal structures in attosecond streaking spectrogram NIR
原文传递
基于LTE多普勒谱图的手势识别方法
17
作者 乔媛 苗苗 +2 位作者 贺伟杰 李金保 邬晶淼 《内蒙古大学学报(自然科学版)》 2026年第1期34-47,共14页
针对LTE信号在手势识别中因随机相位偏移导致手势特征提取困难的问题,提出一种基于多普勒谱图的手势识别方法。首先,计算不同天线间信道频率响应(CFR)的商,用来消除因载波频率偏移(CFO)和采样频率偏移(SFO)引起的随机相位偏移,并滤除高... 针对LTE信号在手势识别中因随机相位偏移导致手势特征提取困难的问题,提出一种基于多普勒谱图的手势识别方法。首先,计算不同天线间信道频率响应(CFR)的商,用来消除因载波频率偏移(CFO)和采样频率偏移(SFO)引起的随机相位偏移,并滤除高频噪声;提取信号的切线相位变化,计算由手势运动引起的信号传播路径变化。然后,采用连续小波变换(CWT)计算多普勒谱图,并通过一阶时间微分消除静态干扰。最后,利用卷积神经网络对不同手势的多普勒谱图进行分类,从而实现手势识别。实验结果表明,该方法能够有效抑制CFO和SFO引起的随机相位偏移,精准提取多普勒特征。在1.5 m距离的径向方向下,4个目标在两个场景下的平均识别准确率达到94%,展现出卓越的手势识别能力。 展开更多
关键词 LTE信号 多普勒谱图 卷积神经网络 手势识别
原文传递
A METHOD OF DISPLAYING COLOR SPECTROGRAM OF SPEECH BY USE OF MICROCOMPUTER
18
作者 SUN Jincheng and LU Shinan( Institute of Aconsties , Academia Sinica ) 《Chinese Journal of Acoustics》 1989年第4期355-358,共4页
A method of drawing color spectrogram of speech by using microcomputer is described in this paper , and referred to the metod of drawing spectrogram by computer . With the software and no addition any other aqripment.... A method of drawing color spectrogram of speech by using microcomputer is described in this paper , and referred to the metod of drawing spectrogram by computer . With the software and no addition any other aqripment., we can draw color three - dimension spectrogram ( or black -white spectrogram without color monitor ), and it is similar to spectrogram of sonagrapher . 展开更多
关键词 A METHOD OF DISPLAYING COLOR spectrogram OF SPEECH BY USE OF MICROCOMPUTER
原文传递
基于改进CBAM注意力机制的MobileNetV3风扇异常状况识别研究 被引量:2
19
作者 刘明 王荣燕 +3 位作者 王汝旭 武高旭 张佳宁 梁俊祥 《工业控制计算机》 2025年第3期90-92,共3页
工业风扇在生产设施中起着至关重要的作用,关键风扇的突然停机对安全生产影响巨大。通过分析在-6 dB噪声环境中的故障风扇发出的声音,提取声音样本的语谱图,采用MobileNetV3模型,针对该模型注意力模块SE(Squeeze-and-Excitation)存在的... 工业风扇在生产设施中起着至关重要的作用,关键风扇的突然停机对安全生产影响巨大。通过分析在-6 dB噪声环境中的故障风扇发出的声音,提取声音样本的语谱图,采用MobileNetV3模型,针对该模型注意力模块SE(Squeeze-and-Excitation)存在的参数化程度较低问题,采用空洞卷积(Dilated Convolution)优化的卷积块注意力模块CBAM(Convolutional Block Attention Module)予以替代,提出了改进后的MobileNetV3模型。实验结果显示,该模型的分类准确率达到了98%,相较于原MobileNetV3模型,准确率提升了2.07个百分点。 展开更多
关键词 空洞卷积 CBAM MobileNetV3 迁移学习 spectrogram
在线阅读 下载PDF
基于改进EfficientNet的煤矸音频分类方法 被引量:1
20
作者 宋庆军 焦守悦 +2 位作者 姜海燕 宋庆辉 郝文超 《工矿自动化》 北大核心 2025年第1期138-144,共7页
针对煤矸音频特征提取过程中设备运行噪声干扰严重及单一提取方法易导致信息丢失的问题,提出了一种基于改进EfficientNet的煤矸音频分类方法。采用基于Mel频谱和Gammatone倒谱系数的特征提取方法,有效捕捉矸石声音中的低频信息和细节特... 针对煤矸音频特征提取过程中设备运行噪声干扰严重及单一提取方法易导致信息丢失的问题,提出了一种基于改进EfficientNet的煤矸音频分类方法。采用基于Mel频谱和Gammatone倒谱系数的特征提取方法,有效捕捉矸石声音中的低频信息和细节特征。选择EfficientNet-B0作为骨干网络,并对其进行以下改进:将原有的多尺度通道注意力模块换成卷积块注意力模块,得到卷积注意力特征融合(CAFF)模块,通过网络自学习为不同空间位置的特征分配不同的权重信息,生成新的有效特征;在原有的MBConv模块中并行嵌入频域通道注意力(FCA)模块,加强特征图的表达能力,从而提高整个网络的性能。实验结果表明:引入CAFF模块后,模型准确率提升了0.61%,F1得分提升了0.52%,且模型收敛更快,说明CAFF模块有效提升了模型对频谱特征的捕捉能力;引入FCA模块后,准确率提升了0.45%,F1得分提升了0.62%,说明模块的叠加可以进一步提高模型的泛化能力和处理复杂特征的能力;改进EfficientNe模型的准确率为91.90%,标准差为0.108,显著优于同类对比音频分类模型。 展开更多
关键词 综放开采 煤矸识别 音频特征提取 EfficientNet Mel频谱特征 Gammatone倒谱系数 注意力机制
在线阅读 下载PDF
上一页 1 2 38 下一页 到第
使用帮助 返回顶部