Accurate diagnosis of apple leaf diseases is crucial for improving the quality of apple production and promoting the development of the apple industry. However, apple leaf diseases do not differ significantly from ima...Accurate diagnosis of apple leaf diseases is crucial for improving the quality of apple production and promoting the development of the apple industry. However, apple leaf diseases do not differ significantly from image texture and structural information. The difficulties in disease feature extraction in complex backgrounds slow the related research progress. To address the problems, this paper proposes an improved multi-scale inverse bottleneck residual network model based on a triplet parallel attention mechanism, which is built upon ResNet-50, while improving and combining the inception module and ResNext inverse bottleneck blocks, to recognize seven types of apple leaf(including six diseases of alternaria leaf spot, brown spot, grey spot, mosaic, rust, scab, and one healthy). First, the 3×3 convolutions in some of the residual modules are replaced by multi-scale residual convolutions, the convolution kernels of different sizes contained in each branch of the multi-scale convolution are applied to extract feature maps of different sizes, and the outputs of these branches are multi-scale fused by summing to enrich the output features of the images. Second, the global layer-wise dynamic coordinated inverse bottleneck structure is used to reduce the network feature loss. The inverse bottleneck structure makes the image information less lossy when transforming from different dimensional feature spaces. The fusion of multi-scale and layer-wise dynamic coordinated inverse bottlenecks makes the model effectively balances computational efficiency and feature representation capability, and more robust with a combination of horizontal and vertical features in the fine identification of apple leaf diseases. Finally, after each improved module, a triplet parallel attention module is integrated with cross-dimensional interactions among channels through rotations and residual transformations, which improves the parallel search efficiency of important features and the recognition rate of the network with relatively small computational costs while the dimensional dependencies are improved. To verify the validity of the model in this paper, we uniformly enhance apple leaf disease images screened from the public data sets of Plant Village, Baidu Flying Paddle, and the Internet. The final processed image count is 14,000. The ablation study, pre-processing comparison, and method comparison are conducted on the processed datasets. The experimental results demonstrate that the proposed method reaches 98.73% accuracy on the adopted datasets, which is 1.82% higher than the classical ResNet-50 model, and 0.29% better than the apple leaf disease datasets before preprocessing. It also achieves competitive results in apple leaf disease identification compared to some state-ofthe-art methods.展开更多
Automatic modulation classification(AMC) technology is one of the cutting-edge technologies in cognitive radio communications. AMC based on deep learning has recently attracted much attention due to its superior perfo...Automatic modulation classification(AMC) technology is one of the cutting-edge technologies in cognitive radio communications. AMC based on deep learning has recently attracted much attention due to its superior performances in classification accuracy and robustness. In this paper, we propose a novel, high resolution and multi-scale feature fusion convolutional neural network model with a squeeze-excitation block, referred to as HRSENet,to classify different kinds of modulation signals.The proposed model establishes a parallel computing mechanism of multi-resolution feature maps through the multi-layer convolution operation, which effectively reduces the information loss caused by downsampling convolution. Moreover, through dense skipconnecting at the same resolution and up-sampling or down-sampling connection at different resolutions, the low resolution representation of the deep feature maps and the high resolution representation of the shallow feature maps are simultaneously extracted and fully integrated, which is benificial to mine signal multilevel features. Finally, the feature squeeze and excitation module embedded in the decoder is used to adjust the response weights between channels, further improving classification accuracy of proposed model.The proposed HRSENet significantly outperforms existing methods in terms of classification accuracy on the public dataset “Over the Air” in signal-to-noise(SNR) ranging from-2dB to 20dB. The classification accuracy in the proposed model achieves 85.36% and97.30% at 4dB and 10dB, respectively, with the improvement by 9.71% and 5.82% compared to LWNet.Furthermore, the model also has a moderate computation complexity compared with several state-of-the-art methods.展开更多
Imaging quality is a critical component of compressive imaging in real applications. In this study, we propose a compressive imaging method based on multi-scale modulation and reconstruction in the spatial frequency d...Imaging quality is a critical component of compressive imaging in real applications. In this study, we propose a compressive imaging method based on multi-scale modulation and reconstruction in the spatial frequency domain. Theoretical analysis and simulation show the relation between the measurement matrix resolution and compressive sensing(CS)imaging quality. The matrix design is improved to provide multi-scale modulations, followed by individual reconstruction of images of different spatial frequencies. Compared with traditional single-scale CS imaging, the multi-scale method provides high quality imaging in both high and low frequencies, and effectively decreases the overall reconstruction error.Experimental results confirm the feasibility of this technique, especially at low sampling rate. The method may thus be helpful in promoting the implementation of compressive imaging in real applications.展开更多
To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease rec...To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease recognition is proposed.Based on the deep residual network(ResNet18),the multi-scale feature extraction layer is constructed by group convolution to realize the compression model and improve the extraction ability of different sizes of lesion features.By improving the identity mapping structure to reduce information loss.By introducing the efficient channel attention module(ECANet)to suppress noise from a complex background.The experimental results show that the average precision,recall and F1-score of the LW-ResNet on the test set are 97.80%,97.92%and 97.85%,respectively.The parameter memory is 2.32 MB,which is 94%less than that of ResNet18.Compared with the classic lightweight networks SqueezeNet and MobileNetV2,LW-ResNet has obvious advantages in recognition performance,speed,parameter memory requirement and time complexity.The proposed model has the advantages of low computational cost,low storage cost,strong real-time performance,high identification accuracy,and strong practicability,which can meet the needs of real-time identification task of apple leaf disease on resource-constrained devices.展开更多
A unique nest-type catalyst has been designed with a nest of oxygen capture surrounding catalytic Pt centers, which shows much promoted performance, on the base of Pt/C catalyst, for oxygen reduction reaction(ORR). Th...A unique nest-type catalyst has been designed with a nest of oxygen capture surrounding catalytic Pt centers, which shows much promoted performance, on the base of Pt/C catalyst, for oxygen reduction reaction(ORR). The nest is constructed with nitrogen-doped carbon matrix(NCM), derived from the controlled carbonization of PANI precursor, to cover Pt/C catalyst. The unique structure of the catalyst(denoted as NCM■ Pt/C) has many merits. Firstly, it can capture oxygen both in air and in acidic electrolyte. Compared with naked Pt/C, it is found that, in air, the oxygen concentration within the porous nest of NCM surrounding Pt/C particles is ~13 times higher than atmospheric oxygen concentration and, in acidic electrolyte, the concentration of activated oxygen over the catalyst NCM■ Pt/C rise to~1.9 times. Secondly, the NCM nest offers a special electronic modulation on Pt centers toward modified ORR kinetics and then catalytic performances. With these merits, compared with Pt/C, the NCM■ Pt/C catalyst shows 3.2 times higher turnover frequency value and 2.9 times enhanced specific activity for ORR with half-wave potential at 0.894 V. After 50,000 sweeping cycles, the NCM■ Pt/C catalyst retains~66% mass activity and still has advantages over the fresh Pt/C catalyst. We envision that the nest-type catalyst provides a new idea for progress of practical Pt/C ORR catalyst.展开更多
Whole brain functional connectivity(FC)patterns obtained from resting-state functional magnetic resonance imaging(rs-fMRI)have been widely used in the diagnosis of brain disorders such as autism spectrum disorder(ASD)...Whole brain functional connectivity(FC)patterns obtained from resting-state functional magnetic resonance imaging(rs-fMRI)have been widely used in the diagnosis of brain disorders such as autism spectrum disorder(ASD).Recently,an increasing number of studies have focused on employing deep learning techniques to analyze FC patterns for brain disease classification.However,the high dimensionality of the FC features and the interpretation of deep learning results are issues that need to be addressed in the FC-based brain disease classification.In this paper,we proposed a multi-scale attention-based deep neural network(MSA-DNN)model to classify FC patterns for the ASD diagnosis.The model was implemented by adding a flexible multi-scale attention(MSA)module to the auto-encoder based backbone DNN,which can extract multi-scale features of the FC patterns and change the level of attention for different FCs by continuous learning.Our model will reinforce the weights of important FC features while suppress the unimportant FCs to ensure the sparsity of the model weights and enhance the model interpretability.We performed systematic experiments on the large multi-sites ASD dataset with both ten-fold and leaveone-site-out cross-validations.Results showed that our model outperformed classical methods in brain disease classification and revealed robust intersite prediction performance.We also localized important FC features and brain regions associated with ASD classification.Overall,our study further promotes the biomarker detection and computer-aided classification for ASD diagnosis,and the proposed MSA module is flexible and easy to implement in other classification networks.展开更多
“双碳”目标背景下,为实现综合能源系统(integrated energy systems,IES)多能耦合利用和低碳化,文中提出含光热模块的先进绝热压缩空气(advanced adiabatic compressed air energy storage,AA-CAES)储能电站和电转气(power to gas,P2G...“双碳”目标背景下,为实现综合能源系统(integrated energy systems,IES)多能耦合利用和低碳化,文中提出含光热模块的先进绝热压缩空气(advanced adiabatic compressed air energy storage,AA-CAES)储能电站和电转气(power to gas,P2G)与储液式碳捕集(carbon capture system,CCS)协同运行的IES低碳优化调度模型。论文建立光热模块与AA-CAES电站耦合模型并将其引入至含P2G-CCS的IES中;提出风-光-碳捕集电厂联合供能碳捕集设备运行策略及碳交易模型,以净碳排放量、综合成本最小化为目标函数构建IES低碳优化调度模型。通过算例对比,验证了含光热模块AA-CAES储能电站与P2G-CCS协同运行能够进一步降低总成本,减少碳排放。展开更多
在“双碳”战略背景下,碳捕集、利用与封存(Carbon Capture,Utilization and Storage,CCUS)技术的重要性日益凸显。作为CCUS技术中的关键环节,CO_(2)分离已成为当前的研究热点。在众多分离法中,膜分离技术凭借低能耗、可连续操作的优势...在“双碳”战略背景下,碳捕集、利用与封存(Carbon Capture,Utilization and Storage,CCUS)技术的重要性日益凸显。作为CCUS技术中的关键环节,CO_(2)分离已成为当前的研究热点。在众多分离法中,膜分离技术凭借低能耗、可连续操作的优势脱颖而出。本工作全面综述了国内外各类CO_(2)分离膜在膜法碳捕集领域的研究现状,重点介绍了混合基质膜的CO_(2)气体渗透分离性能及制备方法的优化。目前,绝大多数混合基质膜的CO_(2)渗透速率介于0~1000 Barrer之间,CO_(2)/N_(2)的分离系数在20~120范围内。研究表明,通过使用修饰改性的晶态多孔填料或直接使用非晶态多孔填料,可有效提升聚合物基底与多孔填料的兼容性,是优化混合基质膜制备方法的可行途径。从后续工业应用的实际需求出发,本工作剖析了不同CO_(2)分离膜组件的优势与不足,明确螺旋卷式是当前最适用于膜法碳捕集领域的组件形式。混合基质膜具备优异的机械与热稳定性、杰出的抗塑化能力及卓越的CO_(2)渗透分离性能,因此在利用螺旋卷式膜组件实现膜法碳捕集应用中展现最大潜力。此外,在添加吹扫气等合适的捕集工艺条件下,基于当前混合基质膜的性能,CO_(2)捕集成本可控制在23 USD/t CO_(2)以内。充分证明了混合基质膜在膜法碳捕集技术中应用的可行性,本工作旨在为CO_(2)膜分离在CCUS技术中的广泛应用提供有力指导。展开更多
Neural networks excel at capturing local spatial patterns through convolutional modules,but they may struggle to identify and effectively utilize the morphological and amplitude periodic nature of physiological signal...Neural networks excel at capturing local spatial patterns through convolutional modules,but they may struggle to identify and effectively utilize the morphological and amplitude periodic nature of physiological signals.In this work,we propose a novel network named filtering module fully convolutional network(FM-FCN),which fuses traditional filtering techniques with neural networks to amplify physiological signals and suppress noise.First,instead of using a fully connected layer,we use an FCN to preserve the time-dimensional correlation information of physiological signals,enabling multiple cycles of signals in the network and providing a basis for signal processing.Second,we introduce the FM as a network module that adapts to eliminate unwanted interference,leveraging the structure of the filter.This approach builds a bridge between deep learning and signal processing methodologies.Finally,we evaluate the performance of FM-FCN using remote photoplethysmography.Experimental results demonstrate that FM-FCN outperforms the second-ranked method in terms of both blood volume pulse(BVP)signal and heart rate(HR)accuracy.It substantially improves the quality of BVP waveform reconstruction,with a decrease of 20.23%in mean absolute error(MAE)and an increase of 79.95%in signal-to-noise ratio(SNR).Regarding HR estimation accuracy,FM-FCN achieves a decrease of 35.85%in MAE,29.65%in error standard deviation,and 32.88%decrease in 95%limits of agreement width,meeting clinical standards for HR accuracy requirements.The results highlight its potential in improving the accuracy and reliability of vital sign measurement through high-quality BVP signal extraction.The codes and datasets are available online at https://github.com/zhaoqi106/FM-FCN.展开更多
As one of the key technologies of intelligent vehicles, traffic sign detection is still a challenging task because of the tiny size of its target object. To address the challenge, we present a novel detection network ...As one of the key technologies of intelligent vehicles, traffic sign detection is still a challenging task because of the tiny size of its target object. To address the challenge, we present a novel detection network improved from yolo-v3 for the tiny traffic sign with high precision in real-time. First, a visual multi-scale attention module(MSAM), a light-weight yet effective module, is devised to fuse the multi-scale feature maps with channel weights and spatial masks. It increases the representation power of the network by emphasizing useful features and suppressing unnecessary ones. Second, we exploit effectively fine-grained features about tiny objects from the shallower layers through modifying backbone Darknet-53 and adding one prediction head to yolo-v3. Finally, a receptive field block is added into the neck of the network to broaden the receptive field. Experiments prove the effectiveness of our network in both quantitative and qualitative aspects. The m AP@0.5 of our network reaches 0.965 and its detection speed is55.56 FPS for 512 × 512 images on the challenging Tsinghua-Tencent 100 k(TT100 k) dataset.展开更多
To improve the accuracy of modulated signal recognition in variable environments and reduce the impact of factors such as lack of prior knowledge on recognition results,researchers have gradually adopted deep learning...To improve the accuracy of modulated signal recognition in variable environments and reduce the impact of factors such as lack of prior knowledge on recognition results,researchers have gradually adopted deep learning techniques to replace traditional modulated signal processing techniques.To address the problem of low recognition accuracy of the modulated signal at low signal-to-noise ratios,we have designed a novel modulation recognition network of multi-scale analysis with deep threshold noise elimination to recognize the actually collected modulated signals under a symmetric cross-entropy function of label smoothing.The network consists of a denoising encoder with deep adaptive threshold learning and a decoder with multi-scale feature fusion.The two modules are skip-connected to work together to improve the robustness of the overall network.Experimental results show that this method has better recognition accuracy at low signal-to-noise ratios than previous methods.The network demonstrates a flexible self-learning capability for different noise thresholds and the effectiveness of the designed feature fusion module in multi-scale feature acquisition for various modulation types.展开更多
基金supported in part by the General Program Hunan Provincial Natural Science Foundation of 2022,China(2022JJ31022)the Undergraduate Education Reform Project of Hunan Province,China(HNJG-20210532)the National Natural Science Foundation of China(62276276)。
文摘Accurate diagnosis of apple leaf diseases is crucial for improving the quality of apple production and promoting the development of the apple industry. However, apple leaf diseases do not differ significantly from image texture and structural information. The difficulties in disease feature extraction in complex backgrounds slow the related research progress. To address the problems, this paper proposes an improved multi-scale inverse bottleneck residual network model based on a triplet parallel attention mechanism, which is built upon ResNet-50, while improving and combining the inception module and ResNext inverse bottleneck blocks, to recognize seven types of apple leaf(including six diseases of alternaria leaf spot, brown spot, grey spot, mosaic, rust, scab, and one healthy). First, the 3×3 convolutions in some of the residual modules are replaced by multi-scale residual convolutions, the convolution kernels of different sizes contained in each branch of the multi-scale convolution are applied to extract feature maps of different sizes, and the outputs of these branches are multi-scale fused by summing to enrich the output features of the images. Second, the global layer-wise dynamic coordinated inverse bottleneck structure is used to reduce the network feature loss. The inverse bottleneck structure makes the image information less lossy when transforming from different dimensional feature spaces. The fusion of multi-scale and layer-wise dynamic coordinated inverse bottlenecks makes the model effectively balances computational efficiency and feature representation capability, and more robust with a combination of horizontal and vertical features in the fine identification of apple leaf diseases. Finally, after each improved module, a triplet parallel attention module is integrated with cross-dimensional interactions among channels through rotations and residual transformations, which improves the parallel search efficiency of important features and the recognition rate of the network with relatively small computational costs while the dimensional dependencies are improved. To verify the validity of the model in this paper, we uniformly enhance apple leaf disease images screened from the public data sets of Plant Village, Baidu Flying Paddle, and the Internet. The final processed image count is 14,000. The ablation study, pre-processing comparison, and method comparison are conducted on the processed datasets. The experimental results demonstrate that the proposed method reaches 98.73% accuracy on the adopted datasets, which is 1.82% higher than the classical ResNet-50 model, and 0.29% better than the apple leaf disease datasets before preprocessing. It also achieves competitive results in apple leaf disease identification compared to some state-ofthe-art methods.
基金supported by the Beijing Natural Science Foundation (L202003)National Natural Science Foundation of China (No. 31700479)。
文摘Automatic modulation classification(AMC) technology is one of the cutting-edge technologies in cognitive radio communications. AMC based on deep learning has recently attracted much attention due to its superior performances in classification accuracy and robustness. In this paper, we propose a novel, high resolution and multi-scale feature fusion convolutional neural network model with a squeeze-excitation block, referred to as HRSENet,to classify different kinds of modulation signals.The proposed model establishes a parallel computing mechanism of multi-resolution feature maps through the multi-layer convolution operation, which effectively reduces the information loss caused by downsampling convolution. Moreover, through dense skipconnecting at the same resolution and up-sampling or down-sampling connection at different resolutions, the low resolution representation of the deep feature maps and the high resolution representation of the shallow feature maps are simultaneously extracted and fully integrated, which is benificial to mine signal multilevel features. Finally, the feature squeeze and excitation module embedded in the decoder is used to adjust the response weights between channels, further improving classification accuracy of proposed model.The proposed HRSENet significantly outperforms existing methods in terms of classification accuracy on the public dataset “Over the Air” in signal-to-noise(SNR) ranging from-2dB to 20dB. The classification accuracy in the proposed model achieves 85.36% and97.30% at 4dB and 10dB, respectively, with the improvement by 9.71% and 5.82% compared to LWNet.Furthermore, the model also has a moderate computation complexity compared with several state-of-the-art methods.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61601442,61605218,and 61575207)the National Key Research and Development Program of China(Grant No.2018YFB0504302)the Youth Innovation Promotion Association of the Chinese Academy of Sciences(Grant Nos.2015124 and 2019154)。
文摘Imaging quality is a critical component of compressive imaging in real applications. In this study, we propose a compressive imaging method based on multi-scale modulation and reconstruction in the spatial frequency domain. Theoretical analysis and simulation show the relation between the measurement matrix resolution and compressive sensing(CS)imaging quality. The matrix design is improved to provide multi-scale modulations, followed by individual reconstruction of images of different spatial frequencies. Compared with traditional single-scale CS imaging, the multi-scale method provides high quality imaging in both high and low frequencies, and effectively decreases the overall reconstruction error.Experimental results confirm the feasibility of this technique, especially at low sampling rate. The method may thus be helpful in promoting the implementation of compressive imaging in real applications.
基金funded by the Science and Technology Development Program of Jilin Province(20190301024NY)the Precision Agriculture and Big Data Engineering Research Center of Jilin Province(2020C005).
文摘To solve the problem of difficulty in identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks,a lightweight ResNet(LW-ResNet)model for apple disease recognition is proposed.Based on the deep residual network(ResNet18),the multi-scale feature extraction layer is constructed by group convolution to realize the compression model and improve the extraction ability of different sizes of lesion features.By improving the identity mapping structure to reduce information loss.By introducing the efficient channel attention module(ECANet)to suppress noise from a complex background.The experimental results show that the average precision,recall and F1-score of the LW-ResNet on the test set are 97.80%,97.92%and 97.85%,respectively.The parameter memory is 2.32 MB,which is 94%less than that of ResNet18.Compared with the classic lightweight networks SqueezeNet and MobileNetV2,LW-ResNet has obvious advantages in recognition performance,speed,parameter memory requirement and time complexity.The proposed model has the advantages of low computational cost,low storage cost,strong real-time performance,high identification accuracy,and strong practicability,which can meet the needs of real-time identification task of apple leaf disease on resource-constrained devices.
基金supported by the National Natural Science Foundation of China(91963206,21932004)the Ministry of Science and Technology of China(2017YFB0702800)the China Postdoctoral Science Foundation(2021M691512)。
文摘A unique nest-type catalyst has been designed with a nest of oxygen capture surrounding catalytic Pt centers, which shows much promoted performance, on the base of Pt/C catalyst, for oxygen reduction reaction(ORR). The nest is constructed with nitrogen-doped carbon matrix(NCM), derived from the controlled carbonization of PANI precursor, to cover Pt/C catalyst. The unique structure of the catalyst(denoted as NCM■ Pt/C) has many merits. Firstly, it can capture oxygen both in air and in acidic electrolyte. Compared with naked Pt/C, it is found that, in air, the oxygen concentration within the porous nest of NCM surrounding Pt/C particles is ~13 times higher than atmospheric oxygen concentration and, in acidic electrolyte, the concentration of activated oxygen over the catalyst NCM■ Pt/C rise to~1.9 times. Secondly, the NCM nest offers a special electronic modulation on Pt centers toward modified ORR kinetics and then catalytic performances. With these merits, compared with Pt/C, the NCM■ Pt/C catalyst shows 3.2 times higher turnover frequency value and 2.9 times enhanced specific activity for ORR with half-wave potential at 0.894 V. After 50,000 sweeping cycles, the NCM■ Pt/C catalyst retains~66% mass activity and still has advantages over the fresh Pt/C catalyst. We envision that the nest-type catalyst provides a new idea for progress of practical Pt/C ORR catalyst.
基金This work was supported by the National Natural Science Foundation of China(No.61906006).
文摘Whole brain functional connectivity(FC)patterns obtained from resting-state functional magnetic resonance imaging(rs-fMRI)have been widely used in the diagnosis of brain disorders such as autism spectrum disorder(ASD).Recently,an increasing number of studies have focused on employing deep learning techniques to analyze FC patterns for brain disease classification.However,the high dimensionality of the FC features and the interpretation of deep learning results are issues that need to be addressed in the FC-based brain disease classification.In this paper,we proposed a multi-scale attention-based deep neural network(MSA-DNN)model to classify FC patterns for the ASD diagnosis.The model was implemented by adding a flexible multi-scale attention(MSA)module to the auto-encoder based backbone DNN,which can extract multi-scale features of the FC patterns and change the level of attention for different FCs by continuous learning.Our model will reinforce the weights of important FC features while suppress the unimportant FCs to ensure the sparsity of the model weights and enhance the model interpretability.We performed systematic experiments on the large multi-sites ASD dataset with both ten-fold and leaveone-site-out cross-validations.Results showed that our model outperformed classical methods in brain disease classification and revealed robust intersite prediction performance.We also localized important FC features and brain regions associated with ASD classification.Overall,our study further promotes the biomarker detection and computer-aided classification for ASD diagnosis,and the proposed MSA module is flexible and easy to implement in other classification networks.
文摘“双碳”目标背景下,为实现综合能源系统(integrated energy systems,IES)多能耦合利用和低碳化,文中提出含光热模块的先进绝热压缩空气(advanced adiabatic compressed air energy storage,AA-CAES)储能电站和电转气(power to gas,P2G)与储液式碳捕集(carbon capture system,CCS)协同运行的IES低碳优化调度模型。论文建立光热模块与AA-CAES电站耦合模型并将其引入至含P2G-CCS的IES中;提出风-光-碳捕集电厂联合供能碳捕集设备运行策略及碳交易模型,以净碳排放量、综合成本最小化为目标函数构建IES低碳优化调度模型。通过算例对比,验证了含光热模块AA-CAES储能电站与P2G-CCS协同运行能够进一步降低总成本,减少碳排放。
基金supported by Ministry of Science and Technology of the People’s Republic of China(STI2030-Major Projects 2021ZD0201900)National Natural Science Foundation of China(grant mo.12090052)+2 种基金Natural Science Foundation of Liaoning Province(grant no.2023-MS-288)Fundamental Research Funds for the Central Universities(grant no.20720230017)Basic Public Welfare Research Program of Zhejiang Province(grant no.LGF20F030005).
文摘Neural networks excel at capturing local spatial patterns through convolutional modules,but they may struggle to identify and effectively utilize the morphological and amplitude periodic nature of physiological signals.In this work,we propose a novel network named filtering module fully convolutional network(FM-FCN),which fuses traditional filtering techniques with neural networks to amplify physiological signals and suppress noise.First,instead of using a fully connected layer,we use an FCN to preserve the time-dimensional correlation information of physiological signals,enabling multiple cycles of signals in the network and providing a basis for signal processing.Second,we introduce the FM as a network module that adapts to eliminate unwanted interference,leveraging the structure of the filter.This approach builds a bridge between deep learning and signal processing methodologies.Finally,we evaluate the performance of FM-FCN using remote photoplethysmography.Experimental results demonstrate that FM-FCN outperforms the second-ranked method in terms of both blood volume pulse(BVP)signal and heart rate(HR)accuracy.It substantially improves the quality of BVP waveform reconstruction,with a decrease of 20.23%in mean absolute error(MAE)and an increase of 79.95%in signal-to-noise ratio(SNR).Regarding HR estimation accuracy,FM-FCN achieves a decrease of 35.85%in MAE,29.65%in error standard deviation,and 32.88%decrease in 95%limits of agreement width,meeting clinical standards for HR accuracy requirements.The results highlight its potential in improving the accuracy and reliability of vital sign measurement through high-quality BVP signal extraction.The codes and datasets are available online at https://github.com/zhaoqi106/FM-FCN.
基金supported by the National Key R&D Program of China(Grant Nos.2018YFB2101100 and 2019YFB2101600)the National Natural Science Foundation of China(Grant No.62176016)+2 种基金the Guizhou Province Science and Technology Project:Research and Demonstration of Science and Technology Big Data Mining Technology Based on Knowledge Graph(Qiankehe[2021]General 382)the Training Program of the Major Research Plan of the National Natural Science Foundation of China(Grant No.92046015)the Beijing Natural Science Foundation Program and Scientific Research Key Program of Beijing Municipal Commission of Education(Grant No.KZ202010025047)。
文摘As one of the key technologies of intelligent vehicles, traffic sign detection is still a challenging task because of the tiny size of its target object. To address the challenge, we present a novel detection network improved from yolo-v3 for the tiny traffic sign with high precision in real-time. First, a visual multi-scale attention module(MSAM), a light-weight yet effective module, is devised to fuse the multi-scale feature maps with channel weights and spatial masks. It increases the representation power of the network by emphasizing useful features and suppressing unnecessary ones. Second, we exploit effectively fine-grained features about tiny objects from the shallower layers through modifying backbone Darknet-53 and adding one prediction head to yolo-v3. Finally, a receptive field block is added into the neck of the network to broaden the receptive field. Experiments prove the effectiveness of our network in both quantitative and qualitative aspects. The m AP@0.5 of our network reaches 0.965 and its detection speed is55.56 FPS for 512 × 512 images on the challenging Tsinghua-Tencent 100 k(TT100 k) dataset.
基金Project supported by the National Key R&D Program of China(No.2020YFF01015000ZL)the Fundamental Research Funds for the Central Universities,China(No.3072022CF0806)。
文摘To improve the accuracy of modulated signal recognition in variable environments and reduce the impact of factors such as lack of prior knowledge on recognition results,researchers have gradually adopted deep learning techniques to replace traditional modulated signal processing techniques.To address the problem of low recognition accuracy of the modulated signal at low signal-to-noise ratios,we have designed a novel modulation recognition network of multi-scale analysis with deep threshold noise elimination to recognize the actually collected modulated signals under a symmetric cross-entropy function of label smoothing.The network consists of a denoising encoder with deep adaptive threshold learning and a decoder with multi-scale feature fusion.The two modules are skip-connected to work together to improve the robustness of the overall network.Experimental results show that this method has better recognition accuracy at low signal-to-noise ratios than previous methods.The network demonstrates a flexible self-learning capability for different noise thresholds and the effectiveness of the designed feature fusion module in multi-scale feature acquisition for various modulation types.