期刊文献+
共找到208篇文章
< 1 2 11 >
每页显示 20 50 100
A leap forward in compute-in-memory system for neural network inference
1
作者 Liang Chu Wenjun Li 《Journal of Semiconductors》 2025年第4期5-7,共3页
Developing efficient neural network(NN)computing systems is crucial in the era of artificial intelligence(AI).Traditional von Neumann architectures have both the issues of"memory wall"and"power wall&quo... Developing efficient neural network(NN)computing systems is crucial in the era of artificial intelligence(AI).Traditional von Neumann architectures have both the issues of"memory wall"and"power wall",limiting the data transfer between memory and processing units[1,2].Compute-in-memory(CIM)technologies,particularly analogue CIM with memristor crossbars,are promising because of their high energy efficiency,computational parallelism,and integration density for NN computations[3].In practical applications,analogue CIM excels in tasks like speech recognition and image classification,revealing its unique advantages.For instance,it efficiently processes vast amounts of audio data in speech recognition,achieving high accuracy with minimal power consumption.In image classification,the high parallelism of analogue CIM significantly speeds up feature extraction and reduces processing time.With the boosting development of AI applications,the demands for computational accuracy and task complexity are rising continually.However,analogue CIM systems are limited in handling complex regression tasks with needs of precise floating-point(FP)calculations.They are primarily suited for the classification tasks with low data precision and a limited dynamic range[4]. 展开更多
关键词 neural network von neumann architectures compute memory inference MEMRISTOR artificial intelligence ai traditional memristor crossbarsare analogue cim
在线阅读 下载PDF
Adaptive model switching of collaborative inference for multi-CNN streams in UAV swarm
2
作者 Yu LI Yuben QU +3 位作者 Chao DONG Zhen QIN Lei ZHANG Qihui WU 《Chinese Journal of Aeronautics》 2025年第8期485-497,共13页
Unmanned Aerial Vehicles(UAVs)coupled with deep learning such as Convolutional Neural Networks(CNNs)have been widely applied across numerous domains,including agriculture,smart city monitoring,and fire rescue operatio... Unmanned Aerial Vehicles(UAVs)coupled with deep learning such as Convolutional Neural Networks(CNNs)have been widely applied across numerous domains,including agriculture,smart city monitoring,and fire rescue operations,owing to their malleability and versatility.However,the computation-intensive and latency-sensitive natures of CNNs present a formidable obstacle to their deployment on resource-constrained UAVs.Some early studies have explored a hybrid approach that dynamically switches between lightweight and complex models to balance accuracy and latency.However,they often overlook scenarios involving multiple concurrent CNN streams,where competition for resources between streams can substantially impact latency and overall system performance.In this paper,we first investigate the deployment of both lightweight and complex models for multiple CNN streams in UAV swarm.Specifically,we formulate an optimization problem to minimize the total latency across multiple CNN streams,under the constraints on UAV memory and the accuracy requirement of each stream.To address this problem,we propose an algorithm called Adaptive Model Switching of collaborative inference for MultiCNN streams(AMSM)to identify the inference strategy with a low latency.Simulation results demonstrate that the proposed AMSM algorithm consistently achieves the lowest latency while meeting the accuracy requirements compared to benchmark algorithms. 展开更多
关键词 UAV swarmEdge computing Collaborative inference Model switching Multi-CNN streams
原文传递
Federated Experiments:Generative Causal Inference Powered by LLM-based Agents Simulation and RAG-based Domain Docking
3
作者 By De-Yu Zhou Xiao Xue +5 位作者 Qun Ma Chao Guo Li-Zhen Cui Yong-Lin Tian Jing Yang Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 2025年第7期1301-1304,共4页
COMPUTATIONAL experiments method is an essential tool for analyzing,designing,managing,and integrating complex systems.However,a significant challenge arises in constructing agents with human-like characteristics to f... COMPUTATIONAL experiments method is an essential tool for analyzing,designing,managing,and integrating complex systems.However,a significant challenge arises in constructing agents with human-like characteristics to form an AI society.Agent modeling typically encompasses four levels:1)The autonomy features of agents,e.g.,perception,behavior,and decision-making;2)The evolutionary features of agents,e.g.,bounded rationality,heterogeneity,and learning evolution;3)The social features of agents,e.g.,interaction,cooperation,and competition;4)The emergent features of agents,e.g.,gaming with environments or regulatory strategies.Traditional modeling techniques primarily derive from ABMs(Agent-based Models)and incorporate various emerging technologies(e.g.,machine learning,big data,and social networks),which can enhance modeling capabilities,while amplifying the complexity[1]. 展开更多
关键词 autonomy features generative causal inference complex systems llm based agents simulation federated experiments rag based domain docking computational experiments method agent modeling
在线阅读 下载PDF
Fuzzy inference systems with no any rule base and linearly parameter growth 被引量:2
4
作者 ShitongWANC KorrisF.L.CHUNG +2 位作者 JiepingLU BinHAN DewenHU 《控制理论与应用(英文版)》 EI 2004年第2期185-192,共8页
A class of new fuzzy inference systems New-FISs is presented.Compared with the standard fuzzy system, New-FIS is still a universal approximator and has no fuzzy rule base and linearly parameter growth. Thus, it effect... A class of new fuzzy inference systems New-FISs is presented.Compared with the standard fuzzy system, New-FIS is still a universal approximator and has no fuzzy rule base and linearly parameter growth. Thus, it effectively overcomes the second "curse of dimensionality":there is an exponential growth in the number of parameters of a fuzzy system as the number of input variables,resulting in surprisingly reduced computational complexity and being especially suitable for applications,where the complexity is of the first importance with respect to the approximation accuracy. 展开更多
关键词 Fuzzy inference Fuzzy systems Universal approximation computational complexity Linearly parameter growth
在线阅读 下载PDF
Communication-Efficient Edge AI Inference over Wireless Networks 被引量:2
5
作者 YANG Kai ZHOU Yong +1 位作者 YANG Zhanpeng SHI Yuanming 《ZTE Communications》 2020年第2期31-39,共9页
Given the fast growth of intelligent devices, it is expected that a large number of high-stakes artificial intelligence (AI) applications, e. g., drones, autonomous cars, and tac?tile robots, will be deployed at the e... Given the fast growth of intelligent devices, it is expected that a large number of high-stakes artificial intelligence (AI) applications, e. g., drones, autonomous cars, and tac?tile robots, will be deployed at the edge of wireless networks in the near future. Therefore, the intelligent communication networks will be designed to leverage advanced wireless tech?niques and edge computing technologies to support AI-enabled applications at various end devices with limited communication, computation, hardware and energy resources. In this article, we present the principles of efficient deployment of model inference at network edge to provide low-latency and energy-efficient AI services. This includes the wireless distribut?ed computing framework for low-latency device distributed model inference as well as the wireless cooperative transmission strategy for energy-efficient edge cooperative model infer?ence. The communication efficiency of edge inference systems is further improved by build?ing up a smart radio propagation environment via intelligent reflecting surface. 展开更多
关键词 communication efficiency cooperative transmission distributed computing edge AI edge inference
在线阅读 下载PDF
A New Way to Implement Quantum Computation
6
作者 Gennaro Auletta 《Journal of Quantum Information Science》 2013年第4期127-137,共11页
In this paper, I shall sketch a new way to consider a Lindenbaum-Tarski algebra as a 3D logical space in which any one (of the 256 statements) occupies a well-defined position and it is identified by a numerical ID. T... In this paper, I shall sketch a new way to consider a Lindenbaum-Tarski algebra as a 3D logical space in which any one (of the 256 statements) occupies a well-defined position and it is identified by a numerical ID. This allows pure mechanical computation both for generating rules and inferences. It is shown that this abstract formalism can be geometrically represented with logical spaces and subspaces allowing a vectorial representation. Finally, it shows the application to quantum computing through the example of three coupled harmonic oscillators. 展开更多
关键词 Lindenbaum-Tarski ALGEBRA 3D Logical Space Mechanical computation inference Quantum Com-puting RAISING OPERATORS Lowering OPERATORS
在线阅读 下载PDF
Parallel Inference for Real-Time Machine Learning Applications
7
作者 Sultan Al Bayyat Ammar Alomran +3 位作者 Mohsen Alshatti Ahmed Almousa Rayyan Almousa Yasir Alguwaifli 《Journal of Computer and Communications》 2024年第1期139-146,共8页
Hyperparameter tuning is a key step in developing high-performing machine learning models, but searching large hyperparameter spaces requires extensive computation using standard sequential methods. This work analyzes... Hyperparameter tuning is a key step in developing high-performing machine learning models, but searching large hyperparameter spaces requires extensive computation using standard sequential methods. This work analyzes the performance gains from parallel versus sequential hyperparameter optimization. Using scikit-learn’s Randomized SearchCV, this project tuned a Random Forest classifier for fake news detection via randomized grid search. Setting n_jobs to -1 enabled full parallelization across CPU cores. Results show the parallel implementation achieved over 5× faster CPU times and 3× faster total run times compared to sequential tuning. However, test accuracy slightly dropped from 99.26% sequentially to 99.15% with parallelism, indicating a trade-off between evaluation efficiency and model performance. Still, the significant computational gains allow more extensive hyperparameter exploration within reasonable timeframes, outweighing the small accuracy decrease. Further analysis could better quantify this trade-off across different models, tuning techniques, tasks, and hardware. 展开更多
关键词 Machine Learning Models computational Efficiency Parallel computing Systems Random Forest inference Hyperparameter Tuning Python Frameworks (TensorFlow PyTorch Scikit-Learn) High-Performance computing
在线阅读 下载PDF
Adaptive layer splitting forwireless large language model inference in edge computing:amodel-based reinforcement learning approach
8
作者 Yuxuan CHEN Rongpeng LI +2 位作者 Xiaoxue YU Zhifeng ZHAO Honggang ZHANG 《Frontiers of Information Technology & Electronic Engineering》 2025年第2期278-292,共15页
Optimizing the deployment of large language models(LLMs)in edge computing environments is critical for enhancing privacy and computational efficiency.In the path toward efficient wireless LLM inference in edge computi... Optimizing the deployment of large language models(LLMs)in edge computing environments is critical for enhancing privacy and computational efficiency.In the path toward efficient wireless LLM inference in edge computing,this study comprehensively analyzes the impact of different splitting points in mainstream open-source LLMs.Accordingly,this study introduces a framework taking inspiration from model-based reinforcement learning to determine the optimal splitting point across the edge and user equipment.By incorporating a reward surrogate model,our approach significantly reduces the computational cost of frequent performance evaluations.Extensive simulations demonstrate that this method effectively balances inference performance and computational load under varying network conditions,providing a robust solution for LLM deployment in decentralized settings. 展开更多
关键词 Large language models(LLMs) Edge computing Model-based reinforcement learning(MBRL) Split inference Transformer
原文传递
A Novel Forensic Computing Model 被引量:1
9
作者 XU Yunfeng LU Yansheng 《Wuhan University Journal of Natural Sciences》 CAS 2006年第6期1865-1868,共4页
According to the requirement of computer forensic and network forensic, a novel forensic computing model is presented, which exploits XML/OEM/RM data model, Data fusion technology, forensic knowledgebase, inference me... According to the requirement of computer forensic and network forensic, a novel forensic computing model is presented, which exploits XML/OEM/RM data model, Data fusion technology, forensic knowledgebase, inference mechanism of expert system and evidence mining engine. This model takes advantage of flexility and openness, so it can be widely used in mining evidence. 展开更多
关键词 forensic computing data fusion inference mechanism hidden Markov model petri network
在线阅读 下载PDF
Energy-optimal DNN model placement in UAV-enabled edge computing networks
10
作者 Jianhang Tang Guoquan Wu +3 位作者 Mohammad Mussadiq Jalalzai Lin Wang Bing Zhang Yi Zhou 《Digital Communications and Networks》 SCIE CSCD 2024年第4期827-836,共10页
Unmanned aerial vehicle(UAV)-enabled edge computing is emerging as a potential enabler for Artificial Intelligence of Things(AIoT)in the forthcoming sixth-generation(6G)communication networks.With the use of flexible ... Unmanned aerial vehicle(UAV)-enabled edge computing is emerging as a potential enabler for Artificial Intelligence of Things(AIoT)in the forthcoming sixth-generation(6G)communication networks.With the use of flexible UAVs,massive sensing data is gathered and processed promptly without considering geographical locations.Deep neural networks(DNNs)are becoming a driving force to extract valuable information from sensing data.However,the lightweight servers installed on UAVs are not able to meet the extremely high requirements of inference tasks due to the limited battery capacities of UAVs.In this work,we investigate a DNN model placement problem for AIoT applications,where the trained DNN models are selected and placed on UAVs to execute inference tasks locally.It is impractical to obtain future DNN model request profiles and system operation states in UAV-enabled edge computing.The Lyapunov optimization technique is leveraged for the proposed DNN model placement problem.Based on the observed system overview,an advanced online placement(AOP)algorithm is developed to solve the transformed problem in each time slot,which can reduce DNN model transmission delay and disk I/O energy cost simultaneously while keeping the input data queues stable.Finally,extensive simulations are provided to depict the effectiveness of the AOP algorithm.The numerical results demonstrate that the AOP algorithm can reduce 18.14%of the model placement cost and 29.89%of the input data queue backlog on average by comparing it with benchmark algorithms. 展开更多
关键词 UAV-Enabled edge computing DNN model Placement 6G networks inference tasks
在线阅读 下载PDF
Intelligent Robust Control of Redundant Smart Robotic Arm Pt II: Quantum Computing KB Optimizer Supremacy
11
作者 Alena V.Nikolaeva Sergey V.Ulyanov 《Artificial Intelligence Advances》 2020年第2期32-67,共36页
This article is a continuation of the work“Intelligent robust control of redundant smart robotic arm Pt I:Soft computing KB optimizer-deep machine learning IT”.In the first part of the paper,we examined control syst... This article is a continuation of the work“Intelligent robust control of redundant smart robotic arm Pt I:Soft computing KB optimizer-deep machine learning IT”.In the first part of the paper,we examined control systems with constant coefficients of the conventional PID controller(based on genetic algorithm)and intelligent control systems based on soft computing technologies.For demonstration,MatLab/Simulink models and a test benchmark of the robot manipulator demonstrated.Advantages and limitations of intelligent control systems based on soft computing technology discussed.Intelligent main element of the control system based on soft computing is a fuzzy controller with a knowledge base in it.In the first part of the article,two ways to implement fuzzy controllers showed.First way applyied one controller for all links of the manipulator and showed the best performance.However,such an implementation is not possible in complex control objects,such as a manipulator with seven degrees of freedom(7DOF).The second way use of separated control when an independent fuzzy controller controls each link.The control decomposition due to a slight decrease in the quality of management has greatly simplified the processes of creating and placing knowledge bases.In this Pt II,to eliminate the mismatch of the work of separate independent fuzzy controllers,methods for organizing coordination control based on quantum computing technologies to create robust intelligent control systems for robotic manipulators with 3DOF and 7DOF described.Quantum supremacy of developed end-to-end IT design of robust intelligent control systems demonstrated. 展开更多
关键词 Quantum computing supremacy Quantum-classical correlation Knowledge base Fuzzy controller Quantum fuzzy inference
在线阅读 下载PDF
Intelligent Control of Mobile Robot with Redundant Manipulator & Stereovision: Quantum / Soft Computing Toolkit
12
作者 Kirill V.Koshelev Alena V.Nikolaeva +1 位作者 Andrey G.Reshetnikov Sergey V.Ulyanov 《Artificial Intelligence Advances》 2020年第2期1-31,共31页
The task of an intelligent control system design applying soft and quantum computational intelligence technologies discussed.An example of a control object as a mobile robot with redundant robotic manipulator and ster... The task of an intelligent control system design applying soft and quantum computational intelligence technologies discussed.An example of a control object as a mobile robot with redundant robotic manipulator and stereovision introduced.Design of robust knowledge bases is performed using a developed computational intelligence-quantum/soft computing toolkit(QC/SCOptKBTM).The knowledge base self-organization process of fuzzy homogeneous regulators through the application of end-to-end IT of quantum computing described.The coordination control between the mobile robot and redundant manipulator with stereovision based on soft computing described.The general design methodology of a generalizing control unit based on the physical laws of quantum computing(quantum information-thermodynamic trade-off of control quality distribution and knowledge base self-organization goal)is considered.The modernization of the pattern recognition system based on stereo vision technology presented.The effectiveness of the proposed methodology is demonstrated in comparison with the structures of control systems based on soft computing for unforeseen control situations with sensor system.The main objective of this article is to demonstrate the advantages of the approach based on quantum/soft computing. 展开更多
关键词 Quantum/Soft computing optimizer Knowledge base Fuzzy controller Quantum fuzzy inference Multi-agent systems Mobile robot stereo vision
在线阅读 下载PDF
面向边缘智能的协同推理方法研究综述 被引量:7
13
作者 赵婵婵 吕飞 +3 位作者 石宝 尉晓敏 杨星辰 岳效灿 《计算机工程与应用》 北大核心 2025年第3期1-20,共20页
随着边缘智能的兴起,协同推理技术通过云、边缘和终端设备之间的协作在提升智能应用的效率和性能方面取得了明显的进展。阐述了边缘智能的性能指标和应用场景及挑战,并以边缘智能的评级架构引出协同推理技术下的四种推理范式:端端协同... 随着边缘智能的兴起,协同推理技术通过云、边缘和终端设备之间的协作在提升智能应用的效率和性能方面取得了明显的进展。阐述了边缘智能的性能指标和应用场景及挑战,并以边缘智能的评级架构引出协同推理技术下的四种推理范式:端端协同、边端协同、边边协同和云边端协同推理。根据协同推理技术应用场景的局限性和差异性,对不同推理范式中协同推理技术的优势、局限性、原理及优化目标进行了全面分析对比。详细探讨了协同推理技术在不同应用场景下所解决的计算资源分配、推理时延优化和吞吐量优化等问题,指出了边缘智能中协同推理技术在隐私安全、通信服务资源管理、协同训练方面的挑战,并对其未来的发展趋势和研究方向进行了讨论,为该领域的研究提供参考和借鉴。 展开更多
关键词 边缘计算 边缘智能 协同推理 机器学习
在线阅读 下载PDF
计算思维与批判性思维毕业要求观测点的课程支撑设计
14
作者 段斌 周静 +1 位作者 石金晶 旷怡 《电气技术》 2025年第11期27-33,共7页
为适应信息化、智能化时代的需求,《工程教育认证标准》新增“计算思维”和“批判性思维”观测点,旨在培养学生在解决复杂工程问题中运用计算思维的能力,并通过批判性思维实现创新和持续学习。然而,课程设计面临混杂因子干扰、静态评价... 为适应信息化、智能化时代的需求,《工程教育认证标准》新增“计算思维”和“批判性思维”观测点,旨在培养学生在解决复杂工程问题中运用计算思维的能力,并通过批判性思维实现创新和持续学习。然而,课程设计面临混杂因子干扰、静态评价体系、缺乏动态优化机制等挑战,难以有效支撑这两个观测点的达成。为此,本文基于因果科学理论,提出“引入中介消除混杂”方法,精准量化教学干预的因果效应,并以能源信息物理系统课程为例,具体展示如何引入中介变量消除混杂因子的干扰,量化教学干预效果并实现课程的持续改进。 展开更多
关键词 计算思维 批判性思维 因果推断 消除混杂 课程设计
在线阅读 下载PDF
异构算力人工智能推理基础设施的机遇与挑战
15
作者 崔慧敏 李广力 +3 位作者 杜臻 赵家程 刘颖 冯晓兵 《计算》 2025年第4期18-24,共7页
随着人工智能(artificial intelligence,AI)模型规模持续扩大与芯片架构日益异构化,AI推理基础设施面临跨平台兼容性差、算力利用率低与运行时行为高度动态等挑战。系统分析其优化路径,涵盖统一抽象、多层融合、自适应机制与服务场景分... 随着人工智能(artificial intelligence,AI)模型规模持续扩大与芯片架构日益异构化,AI推理基础设施面临跨平台兼容性差、算力利用率低与运行时行为高度动态等挑战。系统分析其优化路径,涵盖统一抽象、多层融合、自适应机制与服务场景分化,并探讨关键技术方向与未来机遇. 展开更多
关键词 人工智能推理基础设施 异构算力 编译优化 统一抽象 多层融合 运行时自适应
在线阅读 下载PDF
一种基于频域内推理计算的长短期记忆神经网络硬件加速器设计
16
作者 靳松 陈诗琪 《计算机学报》 北大核心 2025年第8期1781-1794,共14页
长短期记忆神经网络(Long Short-Term Memory,LSTM)可以捕捉到序列数据间长距离的依赖关系,因此在时间序列预测、自然语言分析和语音识别等领域得到广泛应用。然而,LSTM网络独特的门控机制和状态更新过程导致其推理计算的复杂度较高,参... 长短期记忆神经网络(Long Short-Term Memory,LSTM)可以捕捉到序列数据间长距离的依赖关系,因此在时间序列预测、自然语言分析和语音识别等领域得到广泛应用。然而,LSTM网络独特的门控机制和状态更新过程导致其推理计算的复杂度较高,参数量较大,对其在资源受限的边缘设备上的部署形成挑战。本文提出一种基于频域内推理计算的长短期记忆神经网络硬件加速器设计。采用循环分块矩阵对网络的权重参数进行压缩存储,结合快速傅里叶变换(Fast Fourier Transform,FFT)和频域激活函数实现频域内网络推理计算,避免在处理不同时间样本时频繁的时域-频域切换开销。采用坐标旋转数字计算机算法(Coordinate Rotation Digital Computer,CORDIC)替换频域内的乘法运算和超函数计算,实现LSTM的低功耗硬件部署。提出的硬件加速器在PYNQ-Z2开发板上进行了原型实现。面向开源时间序列数据集的实验结果表明,加速器实现了63.6μs的网络平均推理延迟,功耗1.743 W,相比时域LSTM推理计算延迟降低了44.2%,功耗降低6.4%。同时,BRAM和FIFO的资源占用率仅为5%和2%,相比时域LSTM推理计算分别降低了83%和91.2%。 展开更多
关键词 长短期记忆神经网络 分块循环矩阵 坐标旋转数字计算机 频域推理计算 快速傅里叶变换
在线阅读 下载PDF
面向AIoT的协同智能综述 被引量:2
17
作者 罗宇哲 李玲 +5 位作者 侯朋朋 于佳耕 程丽敏 张常有 武延军 赵琛 《计算机研究与发展》 北大核心 2025年第1期179-206,共28页
深度学习和物联网的融合发展有力地促进了AIoT生态的繁荣.一方面AIoT设备为深度学习提供了海量数据资源,另一方面深度学习使得AIoT设备更加智能化.为保护用户数据隐私和克服单个AIoT设备的资源瓶颈,联邦学习和协同推理成为了深度学习在A... 深度学习和物联网的融合发展有力地促进了AIoT生态的繁荣.一方面AIoT设备为深度学习提供了海量数据资源,另一方面深度学习使得AIoT设备更加智能化.为保护用户数据隐私和克服单个AIoT设备的资源瓶颈,联邦学习和协同推理成为了深度学习在AIoT应用场景中广泛应用的重要支撑.联邦学习能在保护隐私的前提下有效利用用户的数据资源来训练深度学习模型,协同推理能借助多个设备的计算资源来提升推理的性能.引入了面向AIoT的协同智能的基本概念,围绕实现高效、安全的知识传递与算力供给,总结了近十年来联邦学习和协同推理算法以及架构和隐私安全3个方面的相关技术进展,介绍了联邦学习和协同推理在AIoT应用场景中的内在联系.从设备共用、模型共用、隐私安全机制协同和激励机制协同等方面展望了面向AIoT的协同智能的未来发展. 展开更多
关键词 协同智能 联邦学习 协同推理 智能物联网 智能计算系统
在线阅读 下载PDF
基于多方计算的安全拜占庭弹性联邦学习 被引量:3
18
作者 高鸿峰 黄浩 田有亮 《通信学报》 北大核心 2025年第2期108-122,共15页
为了解决联邦学习中梯度隐私保护、服务器推理攻击和客户端数据投毒导致的低准确率等问题,针对服务器-客户端的两层架构,提出了一种基于多方计算的安全拜占庭弹性联邦学习方案。首先,提出了一种基于加法秘密共享的两方密文计算方法,对... 为了解决联邦学习中梯度隐私保护、服务器推理攻击和客户端数据投毒导致的低准确率等问题,针对服务器-客户端的两层架构,提出了一种基于多方计算的安全拜占庭弹性联邦学习方案。首先,提出了一种基于加法秘密共享的两方密文计算方法,对本地模型梯度进行拆分,来抵抗服务器的推理攻击。其次,设计了一种密态数据下的投毒检测算法和客户端筛选机制来抵御投毒攻击。最后,在MNIST数据集和CIFAR-10数据集上进行实验来验证方案的可行性。与传统的Trim-mean和Median方法相比,当拜占庭参与者比例达到40%时,模型的准确率提升了3%~6%。综上所述,所提方案既能抵御推理攻击和投毒攻击,又能提高全局模型的准确率,足以证明方案的有效性。 展开更多
关键词 联邦学习 隐私保护 多方计算 推理攻击 投毒攻击
在线阅读 下载PDF
面向深度学习的高效安全推理研究综述 被引量:1
19
作者 胡鹏 孙磊 +4 位作者 胡翠云 郭松 王晶雯 王志鸿 姚敬怡 《通信学报》 北大核心 2025年第5期238-257,共20页
基于同态加密、安全多方计算等隐私保护密码技术实现深度学习模型安全推理的同时,也引入了巨大的计算和通信开销。针对如何加速安全推理,对现有研究成果进行了总结与梳理。首先,对实现安全推理的2个关键环节——安全协议和推理模型的技... 基于同态加密、安全多方计算等隐私保护密码技术实现深度学习模型安全推理的同时,也引入了巨大的计算和通信开销。针对如何加速安全推理,对现有研究成果进行了总结与梳理。首先,对实现安全推理的2个关键环节——安全协议和推理模型的技术路线和优化方法进行了系统性对比分析。针对安全协议,区分底层的线性和非线性运算,对不同密码原语方案的性能和效率进行对比分析;针对推理模型,就如何平衡安全推理的性能和效率,讨论分析了现有的主要优化方法。其次,增加了对安全推理方案构建成本和主流隐私保护框架的讨论分析,从实际应用角度出发进一步充实了高效安全推理研究的关注范畴。最后,通过分析安全推理面临的问题挑战,面向实际应用需求提出未来深度学习安全推理的探索方向。 展开更多
关键词 深度学习 隐私保护 安全推理 同态加密 安全多方计算
在线阅读 下载PDF
面向边缘智能的交错式算子划分协同推理加速策略
20
作者 刘志邦 吴凡 +2 位作者 徐朝农 张自晓 马丹 《模式识别与人工智能》 北大核心 2025年第9期851-860,共10页
协同推理是在资源受限的边缘设备上实现模型部署并加速推理的有效方案,但目前的算子划分策略在设备间通信开销仍然较高.因此,文中提出面向边缘智能的交错式算子划分(Interleaved Operator Partitioning,IOP)协同推理加速策略,将相邻算... 协同推理是在资源受限的边缘设备上实现模型部署并加速推理的有效方案,但目前的算子划分策略在设备间通信开销仍然较高.因此,文中提出面向边缘智能的交错式算子划分(Interleaved Operator Partitioning,IOP)协同推理加速策略,将相邻算子分别沿输入通道维度和输出通道维度进行划分,匹配前后级算子的通道数量,减少算子输出激活的拼接过程,降低协同推理的时间开销.首先,通过模型中的算子信息建模设备的计算开销和通信开销,建立最小化协同推理时间的整数规划模型.然后,设计启发式的算子配对算法,自前向后枚举相邻算子,对比IOP策略与OCP(Output Channel Partitioning)策略的推理时间开销,选择收益最高的算子进行配对.最后,对配对后的算子实施交错式划分和分散部署.实验表明,IOP策略在推理延迟时间、内存占用及能耗上均较优,同时在面对突发链路波动时仍具有良好的鲁棒性. 展开更多
关键词 深度学习 边缘智能 协同推理 并行计算
在线阅读 下载PDF
上一页 1 2 11 下一页 到第
使用帮助 返回顶部