期刊文献+
共找到1,086篇文章
< 1 2 55 >
每页显示 20 50 100
Resource Allocation in V2X Networks:A Double Deep Q-Network Approach with Graph Neural Networks
1
作者 Zhengda Huan Jian Sun +3 位作者 Zeyu Chen Ziyi Zhang Xiao Sun Zenghui Xiao 《Computers, Materials & Continua》 2025年第9期5427-5443,共17页
With the advancement of Vehicle-to-Everything(V2X)technology,efficient resource allocation in dynamic vehicular networks has become a critical challenge for achieving optimal performance.Existing methods suffer from h... With the advancement of Vehicle-to-Everything(V2X)technology,efficient resource allocation in dynamic vehicular networks has become a critical challenge for achieving optimal performance.Existing methods suffer from high computational complexity and decision latency under high-density traffic and heterogeneous network conditions.To address these challenges,this study presents an innovative framework that combines Graph Neural Networks(GNNs)with a Double Deep Q-Network(DDQN),utilizing dynamic graph structures and reinforcement learning.An adaptive neighbor sampling mechanism is introduced to dynamically select the most relevant neighbors based on interference levels and network topology,thereby improving decision accuracy and efficiency.Meanwhile,the framework models communication links as nodes and interference relationships as edges,effectively capturing the direct impact of interference on resource allocation while reducing computational complexity and preserving critical interaction information.Employing an aggregation mechanism based on the Graph Attention Network(GAT),it dynamically adjusts the neighbor sampling scope and performs attention-weighted aggregation based on node importance,ensuring more efficient and adaptive resource management.This design ensures reliable Vehicle-to-Vehicle(V2V)communication while maintaining high Vehicle-to-Infrastructure(V2I)throughput.The framework retains the global feature learning capabilities of GNNs and supports distributed network deployment,allowing vehicles to extract low-dimensional graph embeddings from local observations for real-time resource decisions.Experimental results demonstrate that the proposed method significantly reduces computational overhead,mitigates latency,and improves resource utilization efficiency in vehicular networks under complex traffic scenarios.This research not only provides a novel solution to resource allocation challenges in V2X networks but also advances the application of DDQN in intelligent transportation systems,offering substantial theoretical significance and practical value. 展开更多
关键词 Resource allocation V2X double deep q-network graph neural network
在线阅读 下载PDF
Dimensionality-Decomposition Based Deep Learning Approach for Non-Equilibrium Electric Double Layer Modeling
2
作者 Weijie Li Yajie Li +1 位作者 Maxim Avdeev Siqi Shi 《Chinese Physics Letters》 2025年第12期381-401,共21页
The electric double layer(EDL),formed by charge adsorption at the electrolyte–electrode interface,constitutes the microenvironment governing electrochemical reactions.However,due to scale mismatch between the EDL thi... The electric double layer(EDL),formed by charge adsorption at the electrolyte–electrode interface,constitutes the microenvironment governing electrochemical reactions.However,due to scale mismatch between the EDL thickness and electrode topography,solving the two-dimensional(2D)nonhomogeneous Poisson–Nernst–Planck(N-PNP)equations remains computationally intractable.This limitation hinders understanding of fundamental phenomena such as curvature-driven instabilities in 2D EDL.Here,we propose a dimensionality-decomposition strategy embedding a fully connected neural network(FCNN)to solve 2D N-PNP equations,in which the FCNN is trained on key electrochemical parameters by reducing the electrostatic boundary into multiple equivalent 1D representations.Through a representative case of LiPF6 reduction on lithium metal half-cell,nucleus size is unexpectedly found to have an important influence on dendrite morphology and tip kinetics.This work paves the way for bridging nanoscale and macroscale simulations with expandability to 2D situations of other 1D EDL models. 展开更多
关键词 non equilibrium dimensionality decomposition Poisson Nernst Planck equations electric double layer edl formed electrolyte electrode interfaceconstitutes charge adsorption electrochemical reactionshoweverdue deep learning
原文传递
Transformer-Aided Deep Double Dueling Spatial-Temporal Q-Network for Spatial Crowdsourcing Analysis
3
作者 Yu Li Mingxiao Li +2 位作者 Dongyang Ou Junjie Guo Fangyuan Pan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第4期893-909,共17页
With the rapid development ofmobile Internet,spatial crowdsourcing has becomemore andmore popular.Spatial crowdsourcing consists of many different types of applications,such as spatial crowd-sensing services.In terms ... With the rapid development ofmobile Internet,spatial crowdsourcing has becomemore andmore popular.Spatial crowdsourcing consists of many different types of applications,such as spatial crowd-sensing services.In terms of spatial crowd-sensing,it collects and analyzes traffic sensing data from clients like vehicles and traffic lights to construct intelligent traffic prediction models.Besides collecting sensing data,spatial crowdsourcing also includes spatial delivery services like DiDi and Uber.Appropriate task assignment and worker selection dominate the service quality for spatial crowdsourcing applications.Previous research conducted task assignments via traditional matching approaches or using simple network models.However,advanced mining methods are lacking to explore the relationship between workers,task publishers,and the spatio-temporal attributes in tasks.Therefore,in this paper,we propose a Deep Double Dueling Spatial-temporal Q Network(D3SQN)to adaptively learn the spatialtemporal relationship between task,task publishers,and workers in a dynamic environment to achieve optimal allocation.Specifically,D3SQNis revised through reinforcement learning by adding a spatial-temporal transformer that can estimate the expected state values and action advantages so as to improve the accuracy of task assignments.Extensive experiments are conducted over real data collected fromDiDi and ELM,and the simulation results verify the effectiveness of our proposed models. 展开更多
关键词 Historical behavior analysis spatial crowdsourcing deep double dueling q-networks
在线阅读 下载PDF
Double Sarsa and Double Expected Sarsa with Shallow and Deep Learning 被引量:10
4
作者 Michael Ganger Ethan Duryea Wei Hu 《Journal of Data Analysis and Information Processing》 2016年第4期159-176,共18页
Double Q-learning has been shown to be effective in reinforcement learning scenarios when the reward system is stochastic. We apply the idea of double learning that this algorithm uses to Sarsa and Expected Sarsa, pro... Double Q-learning has been shown to be effective in reinforcement learning scenarios when the reward system is stochastic. We apply the idea of double learning that this algorithm uses to Sarsa and Expected Sarsa, producing two new algorithms called Double Sarsa and Double Expected Sarsa that are shown to be more robust than their single counterparts when rewards are stochastic. We find that these algorithms add a significant amount of stability in the learning process at only a minor computational cost, which leads to higher returns when using an on-policy algorithm. We then use shallow and deep neural networks to approximate the actionvalue, and show that Double Sarsa and Double Expected Sarsa are much more stable after convergence and can collect larger rewards than the single versions. 展开更多
关键词 double Sarsa double Expected Sarsa Reinforcement Learning deep Learning
在线阅读 下载PDF
Hydrodynamic deep drawing of double layered conical cups 被引量:4
5
作者 Alireza JALIL Mohammad HOSEINPOUR GOLLO +1 位作者 M.Morad SHEIKHI S.M.Hossein SEYEDKASHI 《Transactions of Nonferrous Metals Society of China》 SCIE EI CAS CSCD 2016年第1期237-247,共11页
Hydrodynamic deep drawing assisted by radial pressure is an advanced sheet forming technology with great advantages such as higher drawing ratio, good surface quality and higher dimensional accuracy. In this process, ... Hydrodynamic deep drawing assisted by radial pressure is an advanced sheet forming technology with great advantages such as higher drawing ratio, good surface quality and higher dimensional accuracy. In this process, both the bottom surface and the peripheral edge of sheets are under hydrodynamic pressure, so that the forming procedure is more uniform with low failure probability. Multi-layered sheets with complex geometries could be formed more easily with this technique compared with other traditional methods. Rupture is the main irrecoverable failure form in sheet forming processes. Prediction of rupture occurrence is of great importance for determining and optimizing the proper process parameters. In this research, a theoretical model was proposed to calculate the critical rupture pressure in production of double layered conical parts with hydrodynamic deep drawing process assisted by radial pressure. The effects of other process parameters on critical rupture pressure, such as punch tip radius, drawing ratio, coefficient of friction, sheet thickness and material properties were also discussed. The proposed model was compared with finite element simulation and validated by experiments on Al1050/St13 double layered sheets, where a good agreement was found with analytical results. 展开更多
关键词 hydrodynamic deep drawing radial pressure double layered sheet conical cup critical pressure
在线阅读 下载PDF
Magnetic resonance imaging-based deep learning model to predict multiple firings in double-stapled colorectal anastomosis 被引量:8
6
作者 Zheng-Hao Cai Qun Zhang +7 位作者 Zhan-Wei Fu Abraham Fingerhut Jing-Wen Tan Lu Zang Feng Dong Shu-Chun Li Shi-Lin Wang Jun-Jun Ma 《World Journal of Gastroenterology》 SCIE CAS 2023年第3期536-548,共13页
BACKGROUND Multiple linear stapler firings during double stapling technique(DST)after laparoscopic low anterior resection(LAR)are associated with an increased risk of anastomotic leakage(AL).However,it is difficult to... BACKGROUND Multiple linear stapler firings during double stapling technique(DST)after laparoscopic low anterior resection(LAR)are associated with an increased risk of anastomotic leakage(AL).However,it is difficult to predict preoperatively the need for multiple linear stapler cartridges during DST anastomosis.AIM To develop a deep learning model to predict multiple firings during DST anastomosis based on pelvic magnetic resonance imaging(MRI).METHODS We collected 9476 MR images from 328 mid-low rectal cancer patients undergoing LAR with DST anastomosis,which were randomly divided into a training set(n=260)and testing set(n=68).Binary logistic regression was adopted to create a clinical model using six factors.The sequence of fast spin-echo T2-weighted MRI of the entire pelvis was segmented and analyzed.Pure-image and clinical-image integrated deep learning models were constructed using the mask region-based convolutional neural network segmentation tool and three-dimensional convolutional networks.Sensitivity,specificity,accuracy,positive predictive value(PPV),and area under the receiver operating characteristic curve(AUC)was calculated for each model.RESULTS The prevalence of≥3 linear stapler cartridges was 17.7%(58/328).The prevalence of AL was statistically significantly higher in patients with≥3 cartridges compared to those with≤2 cartridges(25.0%vs 11.8%,P=0.018).Preoperative carcinoembryonic antigen level>5 ng/mL(OR=2.11,95%CI 1.08-4.12,P=0.028)and tumor size≥5 cm(OR=3.57,95%CI 1.61-7.89,P=0.002)were recognized as independent risk factors for use of≥3 linear stapler cartridges.Diagnostic performance was better with the integrated model(accuracy=94.1%,PPV=87.5%,and AUC=0.88)compared with the clinical model(accuracy=86.7%,PPV=38.9%,and AUC=0.72)and the image model(accuracy=91.2%,PPV=83.3%,and AUC=0.81).CONCLUSION MRI-based deep learning model can predict the use of≥3 linear stapler cartridges during DST anastomosis in laparoscopic LAR surgery.This model might help determine the best anastomosis strategy by avoiding DST when there is a high probability of the need for≥3 linear stapler cartridges. 展开更多
关键词 deep learning Image-reading artificial intelligence Magnetic resonance imaging Predictive model double stapling technique Linear stapler Rectal cancer Laparoscopic surgery Low anterior resection Anastomotic leakage
暂未订购
Displacement of surrounding rock in a deep circular hole considering double moduli and strength-stiffness degradation 被引量:2
7
作者 Zenghui ZHAO WeiSUN +2 位作者 Shaojie CHEN Yuanhui FENG Weiming WANG 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI CSCD 2020年第12期1847-1860,共14页
The problem of cavity stability widely exists in deep underground engineering and energy exploitation.First,the stress field of the surrounding rock under the uniform stress field is deduced based on a post-peak stren... The problem of cavity stability widely exists in deep underground engineering and energy exploitation.First,the stress field of the surrounding rock under the uniform stress field is deduced based on a post-peak strength drop model considering the rock’s characteristics of constant modulus and double moduli.Then,the orthogonal non-associative flow rule is used to establish the displacement of the surrounding rock under constant modulus and double moduli,respectively,considering the stiffness degradation and dilatancy effects in the plastic region and assuming that the elastic strain in the plastic region satisfies the elastic constitutive relationship.Finally,the evolution of the displacement in the surrounding rock is analyzed under the effects of the double modulus characteristics,the strength drop,the stiffness degradation,and the dilatancy.The results show that the displacement solutions of the surrounding rock under constant modulus and double moduli have a unified expression.The coefficients of the expression are related to the stress field of the original rock,the elastic constant of the surrounding rock,the strength parameters,and the dilatancy angle.The strength drop,the stiffness degradation,and the dilatancy effects all have effects on the displacement.The effects can be characterized by quantitative relationships. 展开更多
关键词 deep rock double moduli strength-stiffness degradation circular hole displacement solution
在线阅读 下载PDF
A deep trench super-junction LDMOS with double charge compensation layer 被引量:2
8
作者 Lijuan Wu Shaolian Su +2 位作者 Xing Chen Jinsheng Zeng Haifeng Wu 《Journal of Semiconductors》 EI CAS CSCD 2022年第10期103-108,共6页
A deep trench super-junction LDMOS with double charge compensation layer(DC DT SJ LDMOS)is proposed in this paper.Due to the capacitance effect of the deep trench which is known as silicon-insulator-silicon(SIS)capaci... A deep trench super-junction LDMOS with double charge compensation layer(DC DT SJ LDMOS)is proposed in this paper.Due to the capacitance effect of the deep trench which is known as silicon-insulator-silicon(SIS)capacitance,the charge balance in the super-junction region of the conventional deep trench SJ LDMOS(Con.DT SJ LDMOS)device will be broken,resulting in breakdown voltage(BV)of the device drops.DC DT SJ LDMOS solves the SIS capacitance effect by adding a vertical variable doped charge compensation layer and a triangular charge compensation layer inside the Con.DT SJ LDMOS device.Therefore,the drift region reaches an ideal charge balance state again.The electric field is optimized by double charge compensation and gate field plate so that the breakdown voltage of the proposed device is improved sharply,meanwhile the enlarged on-current region reduces its specific on-resistance.The simulation results show that compared with the Con.DT SJ LD-MOS,the BV of the DC DT SJ LDMOS has been increased from 549.5 to 705.5 V,and the R_(on,sp) decreased to 23.7 mΩ·cm^(2). 展开更多
关键词 double charge compensation layer super-junction deep trench SIS capacitance
在线阅读 下载PDF
Double-directional control bolt support technology and engineering application at large span Y-type intersections in deep coal mines 被引量:13
9
作者 GUO, Zhibiao SHI, Jianjun +2 位作者 WANG, Jiong CAI, Feng WANG, Fuqiang 《Mining Science and Technology》 EI CAS 2010年第2期254-259,共6页
Under deep and complex geological conditions,severe deformation occurs at intersection points of Y-type roadways with large cross sections during engineering projects in coal mines,especially at junction arches.Based ... Under deep and complex geological conditions,severe deformation occurs at intersection points of Y-type roadways with large cross sections during engineering projects in coal mines,especially at junction arches.Based on in-situ investigations and theoretical studies,we have summarized typical forms of destruction and identified high stress and unrestricted support at both sides of junction arch as its main causes.In this study,we also presented double-directional control bolt support technology for a large Y-type span intersection,applied to deep intersection engineering in the Jiahe Coal Mine,which has proved effective. 展开更多
关键词 Y-type intersection double-directional control bolt support deep coal mines
在线阅读 下载PDF
Study on Sediment Removal Method of Reservoir Based on Double Branch Convolution
10
作者 Hailong Wang Junchao Shi Xinjie Li 《Computers, Materials & Continua》 2025年第2期2951-2967,共17页
In response to the limitations and low computational efficiency of one-dimensional water and sediment models in representing complex hydrological conditions, this paper proposes a dual branch convolution method based ... In response to the limitations and low computational efficiency of one-dimensional water and sediment models in representing complex hydrological conditions, this paper proposes a dual branch convolution method based on deep learning. This method utilizes the ability of deep learning to extract data features and introduces a dual branch convolutional network to handle the non-stationary and nonlinear characteristics of noise and reservoir sediment transport data. This method combines permutation variant structure to preserve the original time series information, constructs a corresponding time series model, models and analyzes the changes in the outbound water and sediment sequence, and can more accurately predict the future trend of outbound sediment changes based on the current sequence changes. The experimental results show that the DCON model established in this paper has good predictive performance in monthly, bimonthly, seasonal, and semi-annual predictions, with determination coefficients of 0.891, 0.898, 0.921, and 0.931, respectively. The results can provide more reference schemes for personnel formulating reservoir scheduling plans. Although this study has shown good applicability in predicting sediment discharge, it has not been able to make timely predictions for some non-periodic events in reservoirs. Therefore, future research will gradually incorporate monitoring devices to obtain more comprehensive data, in order to further validate and expand the conclusions of this study. 展开更多
关键词 Prediction of reservoir sediment discharge double-branch convolution double prediction head deep learning
在线阅读 下载PDF
基于Double Deep Q-learning的无线通信节点覆盖优化 被引量:1
11
作者 李忠涛 《电子技术与软件工程》 2021年第14期1-3,共3页
本文拟采用Double Deep Q-learning模型进行算法设计,该算法是强化学习中的一种values-based算法,实现一种神经网络模型来代替表格Q-Table,解决了系统状态过多导致的Q-Table过大问题。
关键词 无线通信节点 最优路径 double deep Q-learning
在线阅读 下载PDF
Improved Double Deep Q Network Algorithm Based on Average Q-Value Estimation and Reward Redistribution for Robot Path Planning
12
作者 Yameng Yin Lieping Zhang +3 位作者 Xiaoxu Shi Yilin Wang Jiansheng Peng Jianchu Zou 《Computers, Materials & Continua》 SCIE EI 2024年第11期2769-2790,共22页
By integrating deep neural networks with reinforcement learning,the Double Deep Q Network(DDQN)algorithm overcomes the limitations of Q-learning in handling continuous spaces and is widely applied in the path planning... By integrating deep neural networks with reinforcement learning,the Double Deep Q Network(DDQN)algorithm overcomes the limitations of Q-learning in handling continuous spaces and is widely applied in the path planning of mobile robots.However,the traditional DDQN algorithm suffers from sparse rewards and inefficient utilization of high-quality data.Targeting those problems,an improved DDQN algorithm based on average Q-value estimation and reward redistribution was proposed.First,to enhance the precision of the target Q-value,the average of multiple previously learned Q-values from the target Q network is used to replace the single Q-value from the current target Q network.Next,a reward redistribution mechanism is designed to overcome the sparse reward problem by adjusting the final reward of each action using the round reward from trajectory information.Additionally,a reward-prioritized experience selection method is introduced,which ranks experience samples according to reward values to ensure frequent utilization of high-quality data.Finally,simulation experiments are conducted to verify the effectiveness of the proposed algorithm in fixed-position scenario and random environments.The experimental results show that compared to the traditional DDQN algorithm,the proposed algorithm achieves shorter average running time,higher average return and fewer average steps.The performance of the proposed algorithm is improved by 11.43%in the fixed scenario and 8.33%in random environments.It not only plans economic and safe paths but also significantly improves efficiency and generalization in path planning,making it suitable for widespread application in autonomous navigation and industrial automation. 展开更多
关键词 double deep Q Network path planning average Q-value estimation reward redistribution mechanism reward-prioritized experience selection method
在线阅读 下载PDF
Energy Optimization for Autonomous Mobile Robot Path Planning Based on Deep Reinforcement Learning
13
作者 Longfei Gao Weidong Wang Dieyun Ke 《Computers, Materials & Continua》 2026年第1期984-998,共15页
At present,energy consumption is one of the main bottlenecks in autonomous mobile robot development.To address the challenge of high energy consumption in path planning for autonomous mobile robots navigating unknown ... At present,energy consumption is one of the main bottlenecks in autonomous mobile robot development.To address the challenge of high energy consumption in path planning for autonomous mobile robots navigating unknown and complex environments,this paper proposes an Attention-Enhanced Dueling Deep Q-Network(ADDueling DQN),which integrates a multi-head attention mechanism and a prioritized experience replay strategy into a Dueling-DQN reinforcement learning framework.A multi-objective reward function,centered on energy efficiency,is designed to comprehensively consider path length,terrain slope,motion smoothness,and obstacle avoidance,enabling optimal low-energy trajectory generation in 3D space from the source.The incorporation of a multihead attention mechanism allows the model to dynamically focus on energy-critical state features—such as slope gradients and obstacle density—thereby significantly improving its ability to recognize and avoid energy-intensive paths.Additionally,the prioritized experience replay mechanism accelerates learning from key decision-making experiences,suppressing inefficient exploration and guiding the policy toward low-energy solutions more rapidly.The effectiveness of the proposed path planning algorithm is validated through simulation experiments conducted in multiple off-road scenarios.Results demonstrate that AD-Dueling DQN consistently achieves the lowest average energy consumption across all tested environments.Moreover,the proposed method exhibits faster convergence and greater training stability compared to baseline algorithms,highlighting its global optimization capability under energy-aware objectives in complex terrains.This study offers an efficient and scalable intelligent control strategy for the development of energy-conscious autonomous navigation systems. 展开更多
关键词 Autonomous mobile robot deep reinforcement learning energy optimization multi-attention mechanism prioritized experience replay dueling deep q-network
在线阅读 下载PDF
Simulation and Optimization of Double Row Piles Supporting Structure for Deep Foundation Pit
14
作者 SHEN Xiaoyan 《外文科技期刊数据库(文摘版)工程技术》 2021年第7期108-110,共5页
In Chinese modernization and social development level enhances unceasingly, and under the background of deepening the process of urbanization, urban development level in China has been an unprecedented increase, espec... In Chinese modernization and social development level enhances unceasingly, and under the background of deepening the process of urbanization, urban development level in China has been an unprecedented increase, especially with the constant development of information technology, make our country construction technology has been constantly strengthened, all kinds of tunnel construction, underground construction, high-rise buildings appear constantly, higher and more strict requirements are put forward for deep foundation pit engineering in terms of quantity and construction quality. In this paper, a detailed analysis is carried out on the simulation and optimization of the double-row pile supporting structure of deep foundation pit, which lays a solid foundation for the further improvement of the modern construction technology level in China. 展开更多
关键词 deep foundation pit double row pile support structure SIMULATION optimization approach
原文传递
Double Pruning Structure Design for Deep Stochastic Configuration Networks Based on Mutual Information and Relevance
15
作者 YAN Aijun LI Jiale TANG Jian 《Instrumentation》 2022年第4期26-39,共14页
Deep stochastic configuration networks(DSCNs)produce redundant hidden nodes and connections during training,which complicates their model structures.Aiming at the above problems,this paper proposes a double pruning st... Deep stochastic configuration networks(DSCNs)produce redundant hidden nodes and connections during training,which complicates their model structures.Aiming at the above problems,this paper proposes a double pruning structure design algorithm for DSCNs based on mutual information and relevance.During the training process,the mutual information algorithm is used to calculate and sort the importance scores of the nodes in each hidden layer in a layer-by-layer manner,the node pruning rate of each layer is set according to the depth of the DSCN at the current time,the nodes that contribute little to the model are deleted,and the network-related parameters are updated.When the model completes the configuration procedure,the correlation evaluation strategy is used to sort the global connection weights and delete insignificance connections;then,the network parameters are updated after pruning is completed.The experimental results show that the proposed structure design method can effectively compress the scale of a DSCN model and improve its modeling speed;the model accuracy loss is small,and fine-tuning for accuracy restoration is not needed.The obtained DSCN model has certain application value in the field of regression analysis. 展开更多
关键词 deep Stochastic Configuration Networks Mutual Information RELEVANCE Hidden Node double Pruning
原文传递
基于Dueling Double DQN的交通信号控制方法 被引量:4
16
作者 叶宝林 陈栋 +2 位作者 刘春元 陈滨 吴维敏 《计算机测量与控制》 2024年第7期154-161,共8页
为了提高交叉口通行效率缓解交通拥堵,深入挖掘交通状态信息中所包含的深层次隐含特征信息,提出了一种基于Dueling Double DQN(D3QN)的单交叉口交通信号控制方法;构建了一个基于深度强化学习Double DQN(DDQN)的交通信号控制模型,对动作... 为了提高交叉口通行效率缓解交通拥堵,深入挖掘交通状态信息中所包含的深层次隐含特征信息,提出了一种基于Dueling Double DQN(D3QN)的单交叉口交通信号控制方法;构建了一个基于深度强化学习Double DQN(DDQN)的交通信号控制模型,对动作-价值函数的估计值和目标值迭代运算过程进行了优化,克服基于深度强化学习DQN的交通信号控制模型存在收敛速度慢的问题;设计了一个新的Dueling Network解耦交通状态和相位动作的价值,增强Double DQN(DDQN)提取深层次特征信息的能力;基于微观仿真平台SUMO搭建了一个单交叉口模拟仿真框架和环境,开展仿真测试;仿真测试结果表明,与传统交通信号控制方法和基于深度强化学习DQN的交通信号控制方法相比,所提方法能够有效减少车辆平均等待时间、车辆平均排队长度和车辆平均停车次数,明显提升交叉口通行效率。 展开更多
关键词 交通信号控制 深度强化学习 Dueling double DQN Dueling Network
在线阅读 下载PDF
Multi-Agent Path Planning Method Based on Improved Deep Q-Network in Dynamic Environments 被引量:2
17
作者 LI Shuyi LI Minzhe JING Zhongliang 《Journal of Shanghai Jiaotong university(Science)》 EI 2024年第4期601-612,共12页
The multi-agent path planning problem presents significant challenges in dynamic environments,primarily due to the ever-changing positions of obstacles and the complex interactions between agents’actions.These factor... The multi-agent path planning problem presents significant challenges in dynamic environments,primarily due to the ever-changing positions of obstacles and the complex interactions between agents’actions.These factors contribute to a tendency for the solution to converge slowly,and in some cases,diverge altogether.In addressing this issue,this paper introduces a novel approach utilizing a double dueling deep Q-network(D3QN),tailored for dynamic multi-agent environments.A novel reward function based on multi-agent positional constraints is designed,and a training strategy based on incremental learning is performed to achieve collaborative path planning of multiple agents.Moreover,the greedy and Boltzmann probability selection policy is introduced for action selection and avoiding convergence to local extremum.To match radar and image sensors,a convolutional neural network-long short-term memory(CNN-LSTM)architecture is constructed to extract the feature of multi-source measurement as the input of the D3QN.The algorithm’s efficacy and reliability are validated in a simulated environment,utilizing robot operating system and Gazebo.The simulation results show that the proposed algorithm provides a real-time solution for path planning tasks in dynamic scenarios.In terms of the average success rate and accuracy,the proposed method is superior to other deep learning algorithms,and the convergence speed is also improved. 展开更多
关键词 MULTI-AGENT path planning deep reinforcement learning deep q-network
原文传递
Evaluation of “top-down” treatment of early Crohn's disease by double balloon enteroscopy 被引量:3
18
作者 Rong Fan Jie Zhong +3 位作者 Zheng-Ting Wang Shu-Yi Li Jie Zhou Yong-Hua Tang 《World Journal of Gastroenterology》 SCIE CAS 2014年第39期14479-14487,共9页
AIM: To assess &#x0201c;top-down&#x0201d; treatment for deep remission of early moderate to severe Crohn&#x02019;s disease (CD) by double balloon enteroscopy.
关键词 Crohn’ s disease Top-down treatment deep remission double balloon enteroscopy Mucosal healing
暂未订购
Convolutional Neural Network-Based Deep Q-Network (CNN-DQN) Resource Management in Cloud Radio Access Network 被引量:3
19
作者 Amjad Iqbal Mau-Luen Tham Yoong Choon Chang 《China Communications》 SCIE CSCD 2022年第10期129-142,共14页
The recent surge of mobile subscribers and user data traffic has accelerated the telecommunication sector towards the adoption of the fifth-generation (5G) mobile networks. Cloud radio access network (CRAN) is a promi... The recent surge of mobile subscribers and user data traffic has accelerated the telecommunication sector towards the adoption of the fifth-generation (5G) mobile networks. Cloud radio access network (CRAN) is a prominent framework in the 5G mobile network to meet the above requirements by deploying low-cost and intelligent multiple distributed antennas known as remote radio heads (RRHs). However, achieving the optimal resource allocation (RA) in CRAN using the traditional approach is still challenging due to the complex structure. In this paper, we introduce the convolutional neural network-based deep Q-network (CNN-DQN) to balance the energy consumption and guarantee the user quality of service (QoS) demand in downlink CRAN. We first formulate the Markov decision process (MDP) for energy efficiency (EE) and build up a 3-layer CNN to capture the environment feature as an input state space. We then use DQN to turn on/off the RRHs dynamically based on the user QoS demand and energy consumption in the CRAN. Finally, we solve the RA problem based on the user constraint and transmit power to guarantee the user QoS demand and maximize the EE with a minimum number of active RRHs. In the end, we conduct the simulation to compare our proposed scheme with nature DQN and the traditional approach. 展开更多
关键词 energy efficiency(EE) markov decision process(MDP) convolutional neural network(CNN) cloud RAN deep q-network(DQN)
在线阅读 下载PDF
Manufacturing Resource Scheduling Based on Deep Q-Network 被引量:1
20
作者 ZHANG Yufei Zou Yuanhao ZHAO Xiaodong 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2022年第6期531-538,共8页
To optimize machine allocation and task dispatching in smart manufacturing factories, this paper proposes a manufacturing resource scheduling framework based on reinforcement learning(RL). The framework formulates the... To optimize machine allocation and task dispatching in smart manufacturing factories, this paper proposes a manufacturing resource scheduling framework based on reinforcement learning(RL). The framework formulates the entire scheduling process as a multi-stage sequential decision problem, and further obtains the scheduling order by the combination of deep convolutional neural network(CNN) and improved deep Q-network(DQN). Specifically, with respect to the representation of the Markov decision process(MDP), the feature matrix is considered as the state space and a set of heuristic dispatching rules are denoted as the action space. In addition, the deep CNN is employed to approximate the state-action values, and the double dueling deep Qnetwork with prioritized experience replay and noisy network(D3QPN2) is adopted to determine the appropriate action according to the current state. In the experiments, compared with the traditional heuristic method, the proposed method is able to learn high-quality scheduling policy and achieve shorter makespan on the standard public datasets. 展开更多
关键词 smart manufacturing job shop scheduling convolutional neural network deep q-network
原文传递
上一页 1 2 55 下一页 到第
使用帮助 返回顶部