期刊文献+
共找到1,322篇文章
< 1 2 67 >
每页显示 20 50 100
Tensor Low-Rank Orthogonal Compression for Convolutional Neural Networks
1
作者 Yaping He Xin Luo 《IEEE/CAA Journal of Automatica Sinica》 2026年第1期227-229,共3页
Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression... Dear Editor,The letter proposes a tensor low-rank orthogonal compression(TLOC)model for a convolutional neural network(CNN),which facilitates its efficient and highly-accurate low-rank representation.Model compression is crucial for deploying deep neural network(DNN)models on resource-constrained embedded devices. 展开更多
关键词 model compression convolutional neural network cnn which tensor low rank orthogonal compression deep neural network dnn models embedded devices convolutional neural networks
在线阅读 下载PDF
Physics-informed Neural Network-based Prediction of Multi-factor Coupled Thermal-oxidative Aging Behavior in Polyamide66-Glass Fiber Composites
2
作者 Hui Zhan Jie Liu +2 位作者 Sen-Hua Zhan Bo Wu Tong-Fei Shi 《Chinese Journal of Polymer Science》 2026年第2期437-449,I0013,共14页
Accelerated aging tests are widely used to rapidly evaluate the durability of materials,of which thermal-oxidative aging is the most common approach.To quantitatively predict the effects of multiple coupled factors,th... Accelerated aging tests are widely used to rapidly evaluate the durability of materials,of which thermal-oxidative aging is the most common approach.To quantitatively predict the effects of multiple coupled factors,this study takes polyamide66 reinforced with glass fiber(PA66-GF)as a model system and proposed a high-precision paradigm for coupled thermal-oxidative aging.By integrating Arrhenius-type reaction kinetics with oxygen diffusion,a predictive formula that holistically captures the nonlinear synergistic effects of multiple factors was developed,thereby overcoming the limitations of traditional single-variable models.A systematic evaluation of the stepwise improved formulas through nonlinear fitting showed that the coefficient of determination(R^(2))increased from 0.223 to 0.803,elucidating the fundamental reason why conventional approaches fail in quantitative prediction.These formulae were further embedded as physical constraints into a physics-informed neural network(PINN),which further enhanced the predictive performance,with the proposed formula achieving a peak R^(2)of 0.946.The results highlight that robust data fitting alone is insufficient;the decisive factor for the success of PINN lies in whether the embedded formula faithfully reflects the underlying physical mechanisms.When applied to polyamide 6 reinforced with glass fiber(PA6-GF),the Formula-constrained PINN maintained a high level of accuracy(R^(2)=0.916),demonstrating its strong cross-system generalizability.In summary,this work establishes a robust hybrid physics-machine learning framework that combines high accuracy with transferability for predicting the thermal-oxidative aging behavior of composite material systems. 展开更多
关键词 PA66-GF composites Accelerated aging Modified Arrhenius model DIMENSIONLESS Physics-informed neural network(PINN)
原文传递
探索非零位置约束:算法-硬件协同设计的DNN稀疏训练方法
3
作者 王淼 张盛兵 张萌 《西北工业大学学报》 北大核心 2025年第1期119-127,共9页
设备上的学习使得边缘设备能连续适应人工智能应用的新数据。利用稀疏性消除训练过程中的冗余计算和存储占用是提高边缘深度神经网络(deep neural network,DNN)学习效率的关键途径。然而由于缺乏对非零位置的假设,往往需要昂贵的代价用... 设备上的学习使得边缘设备能连续适应人工智能应用的新数据。利用稀疏性消除训练过程中的冗余计算和存储占用是提高边缘深度神经网络(deep neural network,DNN)学习效率的关键途径。然而由于缺乏对非零位置的假设,往往需要昂贵的代价用于实时地识别和分配零的位置以及对不规则计算的负载均衡,这使得现有稀疏训练工作难以接近理想加速比。如果能提前预知训练过程中操作数的非零位置约束规则,就可以跳过这些处理开销,从而提升稀疏训练性能和能效比。针对稀疏训练过程,面向边缘场景中典型的3类激活函数探索操作数之间的位置约束规则,提出:①一个硬件友好的稀疏训练算法以减少3个阶段的计算量和存储压力;②一个高能效的稀疏训练加速器,能预估非零位置使得实时处理代价被并行执行掩盖。实验表明所提出的方法比密集加速器和2个其他稀疏训练工作的能效比分别提升了2.2倍,1.38倍和1.46倍。 展开更多
关键词 稀疏训练 非零位置约束 dnn 稀疏加速器
在线阅读 下载PDF
A Survey of Accelerator Architectures for Deep Neural Networks 被引量:10
4
作者 Yiran Chen Yuan Xie +2 位作者 Linghao Song Fan Chen Tianqi Tang 《Engineering》 SCIE EI 2020年第3期264-274,共11页
Recently,due to the availability of big data and the rapid growth of computing power,artificial intelligence(AI)has regained tremendous attention and investment.Machine learning(ML)approaches have been successfully ap... Recently,due to the availability of big data and the rapid growth of computing power,artificial intelligence(AI)has regained tremendous attention and investment.Machine learning(ML)approaches have been successfully applied to solve many problems in academia and in industry.Although the explosion of big data applications is driving the development of ML,it also imposes severe challenges of data processing speed and scalability on conventional computer systems.Computing platforms that are dedicatedly designed for AI applications have been considered,ranging from a complement to von Neumann platforms to a“must-have”and stand-alone technical solution.These platforms,which belong to a larger category named“domain-specific computing,”focus on specific customization for AI.In this article,we focus on summarizing the recent advances in accelerator designs for deep neural networks(DNNs)-that is,DNN accelerators.We discuss various architectures that support DNN executions in terms of computing units,dataflow optimization,targeted network topologies,architectures on emerging technologies,and accelerators for emerging applications.We also provide our visions on the future trend of AI chip designs. 展开更多
关键词 Deep neural network Domain-specific architecture accelerator
在线阅读 下载PDF
FPGA implementation of neural network accelerator for pulse information extraction in high energy physics 被引量:2
5
作者 Jun-Ling Chen Peng-Cheng Ai +5 位作者 Dong Wang Hui Wang Ni Fang De-Li Xu Qi Gong Yuan-Kang Yang 《Nuclear Science and Techniques》 SCIE CAS CSCD 2020年第5期27-35,共9页
Extracting the amplitude and time information from the shaped pulse is an important step in nuclear physics experiments.For this purpose,a neural network can be an alternative in off-line data processing.For processin... Extracting the amplitude and time information from the shaped pulse is an important step in nuclear physics experiments.For this purpose,a neural network can be an alternative in off-line data processing.For processing the data in real time and reducing the off-line data storage required in a trigger event,we designed a customized neural network accelerator on a field programmable gate array platform to implement specific layers in a convolutional neural network.The latter is then used in the front-end electronics of the detector.With fully reconfigurable hardware,a tested neural network structure was used for accurate timing of shaped pulses common in front-end electronics.This design can handle up to four channels of pulse signals at once.The peak performance of each channel is 1.665 Giga operations per second at a working frequency of 25 MHz. 展开更多
关键词 Convolutional neural networks PULSE SHAPING ACCELERATION FRONT-END ELECTRONICS
在线阅读 下载PDF
基于POD-DNN降阶模型的油浸式变压器绕组稳态温升快速计算方法 被引量:2
6
作者 赵庆贤 刘云鹏 +3 位作者 刘刚 傅榕韵 邹莹 武卫革 《中国电机工程学报》 北大核心 2025年第6期2423-2436,I0033,共15页
为解决油浸式变压器绕组稳态温升计算耗时久的问题,该文提出一种基于POD-DNN降阶模型的快速计算方法。首先,通过绕组稳态温升全阶模型构建快照矩阵,并基于本征正交分解(proper orthogonal decomposition,POD)获得物理系统的模态及模态... 为解决油浸式变压器绕组稳态温升计算耗时久的问题,该文提出一种基于POD-DNN降阶模型的快速计算方法。首先,通过绕组稳态温升全阶模型构建快照矩阵,并基于本征正交分解(proper orthogonal decomposition,POD)获得物理系统的模态及模态系数。然后,建立工况参数与模态系数间的深度神经网络(deep neural networks,DNN)代理模型,解决POD方法中非线性项求解效率低和控制方程依赖强的局限,同时设计网络正则化策略,避免小样本下模型过拟合。最后,将DNN代理模型预测的模态系数与对应的POD模态线性加权,重构绕组温度场。经验证,POD-DNN求解的绕组温升结果与Fluent仿真和试验测量高度一致,计算效率相较于全阶模型和Fluent仿真分别提升了247478倍和23056倍,该算法能够为变压器的在线监测、运行维护和绝缘设计提供技术支撑。 展开更多
关键词 本征正交分解 深度神经网络 绕组稳态温升 快速计算 降阶模型
原文传递
A survey of neural network accelerator with software development environments
7
作者 Jin Song Xuemeng Wang +2 位作者 Zhipeng Zhao Wei Li Tian Zhi 《Journal of Semiconductors》 EI CAS CSCD 2020年第2期20-28,共9页
Recent years,the deep learning algorithm has been widely deployed from cloud servers to terminal units.And researchers proposed various neural network accelerators and software development environments.In this article... Recent years,the deep learning algorithm has been widely deployed from cloud servers to terminal units.And researchers proposed various neural network accelerators and software development environments.In this article,we have reviewed the representative neural network accelerators.As an entirety,the corresponding software stack must consider the hardware architecture of the specific accelerator to enhance the end-to-end performance.And we summarize the programming environments of neural network accelerators and optimizations in software stack.Finally,we comment the future trend of neural network accelerator and programming environments. 展开更多
关键词 neural network accelerator compiling optimization programming environments
在线阅读 下载PDF
LACC:a hardware and software co-design accelerator for deep neural networks
8
作者 Yu Yong Zhi Tian Zhou Shengyuan 《High Technology Letters》 EI CAS 2021年第1期62-67,共6页
With the increasing of data size and model size,deep neural networks(DNNs)show outstanding performance in many artificial intelligence(AI)applications.But the big model size makes it a challenge for high-performance a... With the increasing of data size and model size,deep neural networks(DNNs)show outstanding performance in many artificial intelligence(AI)applications.But the big model size makes it a challenge for high-performance and low-power running DNN on processors,such as central processing unit(CPU),graphics processing unit(GPU),and tensor processing unit(TPU).This paper proposes a LOGNN data representation of 8 bits and a hardware and software co-design deep neural network accelerator LACC to meet the challenge.LOGNN data representation replaces multiply operations to add and shift operations in running DNN.LACC accelerator achieves higher efficiency than the state-of-the-art DNN accelerators by domain specific arithmetic computing units.Finally,LACC speeds up the performance per watt by 1.5 times,compared to the state-of-the-art DNN accelerators on average. 展开更多
关键词 deep neural network(dnn) domain specific accelerator domain specific data type
在线阅读 下载PDF
Energy-optimal DNN model placement in UAV-enabled edge computing networks
9
作者 Jianhang Tang Guoquan Wu +3 位作者 Mohammad Mussadiq Jalalzai Lin Wang Bing Zhang Yi Zhou 《Digital Communications and Networks》 SCIE CSCD 2024年第4期827-836,共10页
Unmanned aerial vehicle(UAV)-enabled edge computing is emerging as a potential enabler for Artificial Intelligence of Things(AIoT)in the forthcoming sixth-generation(6G)communication networks.With the use of flexible ... Unmanned aerial vehicle(UAV)-enabled edge computing is emerging as a potential enabler for Artificial Intelligence of Things(AIoT)in the forthcoming sixth-generation(6G)communication networks.With the use of flexible UAVs,massive sensing data is gathered and processed promptly without considering geographical locations.Deep neural networks(DNNs)are becoming a driving force to extract valuable information from sensing data.However,the lightweight servers installed on UAVs are not able to meet the extremely high requirements of inference tasks due to the limited battery capacities of UAVs.In this work,we investigate a DNN model placement problem for AIoT applications,where the trained DNN models are selected and placed on UAVs to execute inference tasks locally.It is impractical to obtain future DNN model request profiles and system operation states in UAV-enabled edge computing.The Lyapunov optimization technique is leveraged for the proposed DNN model placement problem.Based on the observed system overview,an advanced online placement(AOP)algorithm is developed to solve the transformed problem in each time slot,which can reduce DNN model transmission delay and disk I/O energy cost simultaneously while keeping the input data queues stable.Finally,extensive simulations are provided to depict the effectiveness of the AOP algorithm.The numerical results demonstrate that the AOP algorithm can reduce 18.14%of the model placement cost and 29.89%of the input data queue backlog on average by comparing it with benchmark algorithms. 展开更多
关键词 UAV-Enabled edge computing dnn model Placement 6G networks Inference tasks
在线阅读 下载PDF
An FPGA-Based Resource-Saving Hardware Accelerator for Deep Neural Network
10
作者 Han Jia Xuecheng Zou 《International Journal of Intelligence Science》 2021年第2期57-69,共13页
With the development of computer vision researches, due to the state-of-the-art performance on image and video processing tasks, deep neural network (DNN) has been widely applied in various applications (autonomous ve... With the development of computer vision researches, due to the state-of-the-art performance on image and video processing tasks, deep neural network (DNN) has been widely applied in various applications (autonomous vehicles, weather forecasting, counter-terrorism, surveillance, traffic management, etc.). However, to achieve such performance, DNN models have become increasingly complicated and deeper, and result in heavy computational stress. Thus, it is not sufficient for the general central processing unit (CPU) processors to meet the real-time application requirements. To deal with this bottleneck, research based on hardware acceleration solution for DNN attracts great attention. Specifically, to meet various real-life applications, DNN acceleration solutions mainly focus on issue of hardware acceleration with intense memory and calculation resource. In this paper, a novel resource-saving architecture based on Field Programmable Gate Array (FPGA) is proposed. Due to the novel designed processing element (PE), the proposed architecture </span><span style="font-family:Verdana;">achieves good performance with the extremely limited calculating resource. The on-chip buffer allocation helps enhance resource-saving performance on memory. Moreover, the accelerator improves its performance by exploiting</span> <span style="font-family:Verdana;">the sparsity property of the input feature map. Compared to other state-of-the-art</span><span style="font-family:Verdana;"> solutions based on FPGA, our architecture achieves good performance, with quite limited resource consumption, thus fully meet the requirement of real-time applications. 展开更多
关键词 Deep Neural network RESOURCE-SAVING Hardware accelerator Data Flow
在线阅读 下载PDF
Design space exploration of neural network accelerator based on transfer learning
11
作者 吴豫章 ZHI Tian +1 位作者 SONG Xinkai LI Xi 《High Technology Letters》 EI CAS 2023年第4期416-426,共11页
With the increasing demand of computational power in artificial intelligence(AI)algorithms,dedicated accelerators have become a necessity.However,the complexity of hardware architectures,vast design search space,and c... With the increasing demand of computational power in artificial intelligence(AI)algorithms,dedicated accelerators have become a necessity.However,the complexity of hardware architectures,vast design search space,and complex tasks of accelerators have posed significant challenges.Tra-ditional search methods can become prohibitively slow if the search space continues to be expanded.A design space exploration(DSE)method is proposed based on transfer learning,which reduces the time for repeated training and uses multi-task models for different tasks on the same processor.The proposed method accurately predicts the latency and energy consumption associated with neural net-work accelerator design parameters,enabling faster identification of optimal outcomes compared with traditional methods.And compared with other DSE methods by using multilayer perceptron(MLP),the required training time is shorter.Comparative experiments with other methods demonstrate that the proposed method improves the efficiency of DSE without compromising the accuracy of the re-sults. 展开更多
关键词 design space exploration(DSE) transfer learning neural network accelerator multi-task learning
在线阅读 下载PDF
Optimizing deep learning inference on mobile devices with neural network accelerators
12
作者 Zeng Xi Xu Yunlong Zhi Tian 《High Technology Letters》 EI CAS 2019年第4期417-425,共9页
Deep learning has now been widely used in intelligent apps of mobile devices.In pursuit of ultra-low power and latency,integrating neural network accelerators(NNA)to mobile phones has become a trend.However,convention... Deep learning has now been widely used in intelligent apps of mobile devices.In pursuit of ultra-low power and latency,integrating neural network accelerators(NNA)to mobile phones has become a trend.However,conventional deep learning programming frameworks are not well-developed to support such devices,leading to low computing efficiency and high memory-occupation.To address this problem,a 2-stage pipeline is proposed for optimizing deep learning model inference on mobile devices with NNAs in terms of both speed and memory-footprint.The 1 st stage reduces computation workload via graph optimization,including splitting and merging nodes.The 2 nd stage goes further by optimizing at compilation level,including kernel fusion and in-advance compilation.The proposed optimizations on a commercial mobile phone with an NNA is evaluated.The experimental results show that the proposed approaches achieve 2.8×to 26×speed up,and reduce the memory-footprint by up to 75%. 展开更多
关键词 machine learning inference neural network accelerator(NNA) low latency kernel fusion in-advance compilation
在线阅读 下载PDF
NoC加速器中的高效DNN动态切片与智能映射算法
13
作者 齐芸 欧阳一鸣 《电信科学》 北大核心 2025年第10期151-160,共10页
针对深度神经网络(deep neural network,DNN)模型在传统切片与映射方法中存在的资源调度和数据传输瓶颈问题,提出了一种基于片上网络(network on chip,NoC)加速器的高效DNN动态切片与智能映射优化算法。该算法通过动态切片技术灵活划分... 针对深度神经网络(deep neural network,DNN)模型在传统切片与映射方法中存在的资源调度和数据传输瓶颈问题,提出了一种基于片上网络(network on chip,NoC)加速器的高效DNN动态切片与智能映射优化算法。该算法通过动态切片技术灵活划分DNN模型的计算任务,并结合智能映射策略优化NoC架构中的任务分配与数据流管理。实验结果表明,与传统方法相比,该算法在计算吞吐量、NoC传输时延、外部内存访问次数和计算能效等方面均显著提升,尤其在复杂模型上表现突出。 展开更多
关键词 NoC加速器 dnn切片 智能映射
在线阅读 下载PDF
Research on High-Precision Stochastic Computing VLSI Structures for Deep Neural Network Accelerators
14
作者 WU Jingguo ZHU Jingwei +3 位作者 XIONG Xiankui YAO Haidong WANG Chengchen CHEN Yun 《ZTE Communications》 2024年第4期9-17,共9页
Deep neural networks(DNN)are widely used in image recognition,image classification,and other fields.However,as the model size increases,the DNN hardware accelerators face the challenge of higher area overhead and ener... Deep neural networks(DNN)are widely used in image recognition,image classification,and other fields.However,as the model size increases,the DNN hardware accelerators face the challenge of higher area overhead and energy consumption.In recent years,stochastic computing(SC)has been considered a way to realize deep neural networks and reduce hardware consumption.A probabilistic compensation algorithm is proposed to solve the accuracy problem of stochastic calculation,and a fully parallel neural network accelerator based on a deterministic method is designed.The software simulation results show that the accuracy of the probability compensation algorithm on the CIFAR-10 data set is 95.32%,which is 14.98%higher than that of the traditional SC algorithm.The accuracy of the deterministic algorithm on the CIFAR-10 dataset is 95.06%,which is 14.72%higher than that of the traditional SC algorithm.The results of Very Large Scale Integration Circuit(VLSI)hardware tests show that the normalized energy efficiency of the fully parallel neural network accelerator based on the deterministic method is improved by 31%compared with the circuit based on binary computing. 展开更多
关键词 stochastic computing hardware accelerator deep neural network
在线阅读 下载PDF
基于DNN-GRU-SVM的深度学习组合模型的网络入侵检测方法
15
作者 刘虎鹏 颜辉 +5 位作者 于萍 许晓晴 龙蕴鑫 耿晓中 龙多 赵禺 《电脑与信息技术》 2025年第4期64-70,共7页
针对现代大数据环境中网络入侵检测系统(network intrusion detection system,NIDS)难以应对复杂网络攻击的问题,提出了一种基于深度神经网络(Deep Neural Network,DNN)-门控循环单元(Gated Recurrent Unit,GRU)-支持向量机(Support Vec... 针对现代大数据环境中网络入侵检测系统(network intrusion detection system,NIDS)难以应对复杂网络攻击的问题,提出了一种基于深度神经网络(Deep Neural Network,DNN)-门控循环单元(Gated Recurrent Unit,GRU)-支持向量机(Support Vector Machine,SVM)的组合模型DNN-GRU-SVM。该模型结合了DNN、GRU与SVM的优势,首先利用DNN提取网络数据特征,通过调整学习率与批量归一化来加速训练并减少过拟合;采用GRU捕捉序列数据中的时间依赖性;通过SVM实现精确分类。在KDD Cup'99数据集上的实验表明,DNNGRU-SVM组合模型取得了显著的性能提升,其检测准确率达94.53%,精确度为99.8%,召回率为92.8%,F1分数为96.2%,显著优于传统机器学习算法及单一的深度神经网络。实验结果表明,该模型能够有效提高网络入侵检测的准确率和适应性,为复杂网络环境下的入侵检测提供了可靠的解决方案。 展开更多
关键词 网络入侵检测 机器学习 深度学习 dnn-GRU-SVM
在线阅读 下载PDF
基于MAHAKIL与AM-DNN的煤层识别方法
16
作者 马晓易 段中钰 《北京信息科技大学学报(自然科学版)》 2025年第2期93-98,共6页
针对煤层识别中数据不平衡导致的精度下降问题,提出一种基于过采样算法MAHAKIL的融合注意力机制(attention mechanism,AM)的深度神经网络(deep neural network,DNN)模型MAHAKIL-AM-DNN。首先,使用改进的MAHAKIL算法生成具有多样性的合... 针对煤层识别中数据不平衡导致的精度下降问题,提出一种基于过采样算法MAHAKIL的融合注意力机制(attention mechanism,AM)的深度神经网络(deep neural network,DNN)模型MAHAKIL-AM-DNN。首先,使用改进的MAHAKIL算法生成具有多样性的合成样本;然后,使用注意力机制强化关键特征权重,优化深度神经网络的识别能力。实验结果表明,相较于不使用过采样技术的DNN方法以及使用合成少数类过采样技术(synthetic minority over-sampling technique,SMOTE)的SMOTE-DNN方法,该方法性能更优,F1值分别提高了58.5和4.8百分点,提升了煤层识别精度。 展开更多
关键词 煤层识别 遗传算法 过采样 注意力机制 深度神经网络
在线阅读 下载PDF
An Adaptive Features Fusion Convolutional Neural Network for Multi-Class Agriculture Pest Detection
17
作者 Muhammad Qasim Syed MAdnan Shah +4 位作者 Qamas Gul Khan Safi Danish Mahmood Adeel Iqbal Ali Nauman Sung Won Kim 《Computers, Materials & Continua》 2025年第6期4429-4445,共17页
Grains are the most important food consumed globally,yet their yield can be severely impacted by pest infestations.Addressing this issue,scientists and researchers strive to enhance the yield-to-seed ratio through eff... Grains are the most important food consumed globally,yet their yield can be severely impacted by pest infestations.Addressing this issue,scientists and researchers strive to enhance the yield-to-seed ratio through effective pest detection methods.Traditional approaches often rely on preprocessed datasets,but there is a growing need for solutions that utilize real-time images of pests in their natural habitat.Our study introduces a novel twostep approach to tackle this challenge.Initially,raw images with complex backgrounds are captured.In the subsequent step,feature extraction is performed using both hand-crafted algorithms(Haralick,LBP,and Color Histogram)and modified deep-learning architectures.We propose two models for this purpose:PestNet-EF and PestNet-LF.PestNet-EF uses an early fusion technique to integrate handcrafted and deep learning features,followed by adaptive feature selection methods such as CFS and Recursive Feature Elimination(RFE).PestNet-LF utilizes a late fusion technique,incorporating three additional layers(fully connected,softmax,and classification)to enhance performance.These models were evaluated across 15 classes of pests,including five classes each for rice,corn,and wheat.The performance of our suggested algorithms was tested against the IP102 dataset.Simulation demonstrates that the Pestnet-EF model achieved an accuracy of 96%,and the PestNet-LF model with majority voting achieved the highest accuracy of 94%,while PestNet-LF with the average model attained an accuracy of 92%.Also,the proposed approach was compared with existing methods that rely on hand-crafted and transfer learning techniques,showcasing the effectiveness of our approach in real-time pest detection for improved agricultural yield. 展开更多
关键词 Artificial neural network(ANN) support vector machine(SVM) deep neural network(dnn) transfer learning(TL)
在线阅读 下载PDF
改进DDPG的端边DNN协同推理策略
18
作者 和涛 栗娟 《计算机工程与应用》 北大核心 2025年第2期304-315,共12页
当前基于端边的深度神经网络(deep neural network,DNN)协同推理策略仅关注于优化时延敏感型任务的推理时延,而未考虑能耗敏感型任务的推理能耗成本,以及DNN划分后在异构边缘服务器之间的高效卸载问题。基于此,提出一种改进深度确定性... 当前基于端边的深度神经网络(deep neural network,DNN)协同推理策略仅关注于优化时延敏感型任务的推理时延,而未考虑能耗敏感型任务的推理能耗成本,以及DNN划分后在异构边缘服务器之间的高效卸载问题。基于此,提出一种改进深度确定性策略梯度(deep deterministic policy gradients,DDPG)的端边DNN协同推理策略,综合考虑任务对时延与能耗的敏感度,进而对推理成本进行综合优化。该策略将DNN划分与计算卸载问题分离,对不同协同设备建立预测模型,去预测出协同推理DNN的最优划分点与推理综合成本;根据预测的推理综合成本建立奖励函数,使用DDPG算法制定每个DNN推理任务的卸载策略,进而进行协同推理。实验结果证明,相比其他DNN协同推理策略,该策略在复杂的DNN协同推理环境下决策更高效,推理时延平均减少了46%,推理能耗平均减少了44%,推理综合成本平均降低了46%。 展开更多
关键词 边缘智能 深度神经网络(dnn) 协同推理 深度确定性策略梯度 任务卸载 能耗优化
在线阅读 下载PDF
Acceleration Response Reconstruction for Structural Health Monitoring Based on Fully Convolutional Networks
19
作者 Wenda Ma Qizhi Tang +2 位作者 Huang Lei Longfei Chang Chen Wang 《Structural Durability & Health Monitoring》 2025年第5期1265-1286,共22页
Lost acceleration response reconstruction is crucial for assessing structural conditions in structural health monitoring(SHM).However,traditional methods struggle to address the reconstruction of acceleration response... Lost acceleration response reconstruction is crucial for assessing structural conditions in structural health monitoring(SHM).However,traditional methods struggle to address the reconstruction of acceleration responses with complex features,resulting in a lower reconstruction accuracy.This paper addresses this challenge by leveraging the advanced feature extraction and learning capabilities of fully convolutional networks(FCN)to achieve precise reconstruction of acceleration responses.In the designed network architecture,the incorporation of skip connections preserves low-level details of the network,greatly facilitating the flow of information and improving training efficiency and accuracy.Dropout techniques are employed to reduce computational load and enhance feature extraction.The proposed FCN model automatically extracts high-level features from the input data and establishes a nonlinearmapping relationship between the input and output responses.Finally,the accuracy of the FCN for structural response reconstructionwas evaluated using acceleration data from an experimental arch rib and comparedwith several traditional methods.Additionally,this approach was applied to reconstruct actual acceleration responses measured by an SHM system on a long-span bridge.Through parameter analysis,the feasibility and accuracy of aspects such as available response positions,the number of available channels,and multi-channel response reconstruction were explored.The results indicate that this method exhibits high-precision response reconstruction capability in both time and frequency domains.,with performance surpassing that of other networks,confirming its effectiveness in reconstructing responses under various sensor data loss scenarios. 展开更多
关键词 Structural health monitoring acceleration response reconstruction fully convolutional network experimental validation large-scale structural application
在线阅读 下载PDF
基于DNN的声学模型自适应实验 被引量:5
20
作者 张宇 计哲 +3 位作者 万辛 张震 葛凤培 颜永红 《天津大学学报(自然科学与工程技术版)》 EI CAS CSCD 北大核心 2015年第9期765-770,共6页
声学模型自适应算法研究目的是缓解由测试数据和训练数据不匹配而引起的识别性能下降问题.基于深度神经网络(DNN)模型框架的自适应技术中,重训练是最直接的方法,但极容易出现过拟合现象,尤其是自适应数据稀疏的情况下.文章针对领域相关... 声学模型自适应算法研究目的是缓解由测试数据和训练数据不匹配而引起的识别性能下降问题.基于深度神经网络(DNN)模型框架的自适应技术中,重训练是最直接的方法,但极容易出现过拟合现象,尤其是自适应数据稀疏的情况下.文章针对领域相关的自动语音识别任务,对典型的两种声学模型自适应算法进行了尝试,实验了基于线性变换网络的自适应方法和基于相对熵正则化准则的自适应方法,并对两种算法进行了详尽的系统性能比较.结果表明,在不同的自适应数据量下,相对熵正则化自适应方法均能表现出较好的性能. 展开更多
关键词 声学模型自适应 语音识别 深度神经网络
在线阅读 下载PDF
上一页 1 2 67 下一页 到第
使用帮助 返回顶部