期刊文献+
共找到138篇文章
< 1 2 7 >
每页显示 20 50 100
基于Double Q-Learning的改进蝗虫算法求解分布式柔性作业车间逆调度问题
1
作者 胡旭伦 唐红涛 《机床与液压》 北大核心 2025年第20期52-63,共12页
针对分布式柔性作业车间中存在的资源分配不均和调度稳定性不足问题,构建以最小化最大完工时间、机器总能耗和偏离度为目标的逆调度数学模型,提出一种基于Double Q-Learning的改进多目标蝗虫优化算法(DQIGOA)。针对该问题设计一种混合... 针对分布式柔性作业车间中存在的资源分配不均和调度稳定性不足问题,构建以最小化最大完工时间、机器总能耗和偏离度为目标的逆调度数学模型,提出一种基于Double Q-Learning的改进多目标蝗虫优化算法(DQIGOA)。针对该问题设计一种混合三层编码方式;提出一种基于逆调度特点的种群初始化方式以提高种群质量;引入权重平衡因子来提高非支配解存档中解集的多样性;将强化学习中的Double Q-Learning机制融入非支配解的选择过程,通过动态动作策略优化目标解的选取,提升调度方案的全局搜索能力与局部优化效率。最后构建26组算例,通过策略有效性分析证明了所提策略可显著提升DQIGOA算法的性能,并通过与NSGA-II、DE和SPEA-II算法进行对比证明DQIGOA算法的有效性。结果表明:相比NSGA-II、DE和SPEA-II算法,DQIGOA算法在HV、IGD、SP指标上均有优势,证明了DQIGOA能够有效提升解的收敛速度和多样性分布,在动态扰动条件下表现出更强的鲁棒性。 展开更多
关键词 分布式柔性作业车间 逆调度 蝗虫算法 double q-learning机制
在线阅读 下载PDF
基于softmax的加权Double Q-Learning算法 被引量:5
2
作者 钟雨昂 袁伟伟 关东海 《计算机科学》 CSCD 北大核心 2024年第S01期46-50,共5页
强化学习作为机器学习的一个分支,用于描述和解决智能体在与环境的交互过程中,通过学习策略以达成回报最大化的问题。Q-Learning作为无模型强化学习的经典方法,存在过估计引起的最大化偏差问题,并且在环境中奖励存在噪声时表现不佳。Dou... 强化学习作为机器学习的一个分支,用于描述和解决智能体在与环境的交互过程中,通过学习策略以达成回报最大化的问题。Q-Learning作为无模型强化学习的经典方法,存在过估计引起的最大化偏差问题,并且在环境中奖励存在噪声时表现不佳。Double Q-Learning(DQL)的出现解决了过估计问题,但同时造成了低估问题。为解决以上算法的高低估问题,提出了基于softmax的加权Q-Learning算法,并将其与DQL相结合,提出了一种新的基于softmax的加权Double Q-Learning算法(WDQL-Softmax)。该算法基于加权双估计器的构造,对样本期望值进行softmax操作得到权重,使用权重估计动作价值,有效平衡对动作价值的高估和低估问题,使估计值更加接近理论值。实验结果表明,在离散动作空间中,相比于Q-Learning算法、DQL算法和WDQL算法,WDQL-Softmax算法的收敛速度更快且估计值与理论值的误差更小。 展开更多
关键词 强化学习 q-learning double q-learning Softmax
在线阅读 下载PDF
Entropic noise induced stability and double entropic stochastic resonance induced by correlated noises 被引量:2
3
作者 曾春华 王华 王辉涛 《Chinese Physics B》 SCIE EI CAS CSCD 2011年第5期146-152,共7页
For the activated dynamics of a Brownian particle moving in a confined system with the presence of entropic barriers, this paper investigates a periodic driving and correlations between two noises. Within the two-stat... For the activated dynamics of a Brownian particle moving in a confined system with the presence of entropic barriers, this paper investigates a periodic driving and correlations between two noises. Within the two-state approximation, the explicit expressions of the mean first passage time (MFPT) and the spectral power amplification (SPA) axe obtained, respectively. Based on the numerical computations, it is found that: (i) The MFPT as a function of the noise intensity exhibits a maximum with the positive correlations between two noises (λ〉0), this maximum for MFPT shows the characteristic of the entropic noise induced stability (ENIS) effect. The intensity A of correlations between two noises can enhance the ENIS effect. (ii) The SPA as a function of the noise intensity exhibits a double-peak by tuning the noise correlation intensity λ, i.e., the existence of a double-peak behaviour is the identifying characteristic of the double entropic stochastic resonance phenomenon. 展开更多
关键词 correlated noises confined system entropic noise induced stability double entropic stochastic resonance ]
原文传递
Upper bound for the time derivative of entropy for a stochastic dynamical system with double singularities driven by non-Gaussian noise 被引量:2
4
作者 郭培荣 徐伟 刘迪 《Chinese Physics B》 SCIE EI CAS CSCD 2010年第3期233-238,共6页
A stochastic dynamical system with double singularities driven by non-Gaussian noise is investigated. The Fokker Plank equation of the system is obtained through the path-integral approach and the method of transforma... A stochastic dynamical system with double singularities driven by non-Gaussian noise is investigated. The Fokker Plank equation of the system is obtained through the path-integral approach and the method of transformation. Based on the definition of Shannon's information entropy and the Schwartz inequality principle, the upper bound for the time derivative of entropy is calculated both in the absence and in the presence of non-equilibrium constraint. The present calculations can be used to interpret the effects of the system dissipative parameter, the system singularity strength parameter, the noise correlation time and the noise deviation parameter on the upper bound. 展开更多
关键词 non-Gaussian noise stochastic dynamical system with double singularities informationentropy upper bound for the time derivative of entropy
原文传递
Supervisory control of the hybrid off-highway vehicle for fuel economy improvement using predictive double Q-learning with backup models 被引量:2
5
作者 SHUAI Bin LI Yan-fei +2 位作者 ZHOU Quan XU Hong-ming SHUAI Shi-jin 《Journal of Central South University》 SCIE EI CAS CSCD 2022年第7期2266-2278,共13页
This paper studied a supervisory control system for a hybrid off-highway electric vehicle under the chargesustaining(CS)condition.A new predictive double Q-learning with backup models(PDQL)scheme is proposed to optimi... This paper studied a supervisory control system for a hybrid off-highway electric vehicle under the chargesustaining(CS)condition.A new predictive double Q-learning with backup models(PDQL)scheme is proposed to optimize the engine fuel in real-world driving and improve energy efficiency with a faster and more robust learning process.Unlike the existing“model-free”methods,which solely follow on-policy and off-policy to update knowledge bases(Q-tables),the PDQL is developed with the capability to merge both on-policy and off-policy learning by introducing a backup model(Q-table).Experimental evaluations are conducted based on software-in-the-loop(SiL)and hardware-in-the-loop(HiL)test platforms based on real-time modelling of the studied vehicle.Compared to the standard double Q-learning(SDQL),the PDQL only needs half of the learning iterations to achieve better energy efficiency than the SDQL at the end learning process.In the SiL under 35 rounds of learning,the results show that the PDQL can improve the vehicle energy efficiency by 1.75%higher than SDQL.By implementing the PDQL in HiL under four predefined real-world conditions,the PDQL can robustly save more than 5.03%energy than the SDQL scheme. 展开更多
关键词 supervisory charge-sustaining control hybrid electric vehicle reinforcement learning predictive double q-learning
在线阅读 下载PDF
Time evolution of information entropy for a stochastic system with double singularities driven by quasimonochromatic noise 被引量:1
6
作者 郭永峰 谭建国 《Chinese Physics B》 SCIE EI CAS CSCD 2012年第12期99-103,共5页
This paper deals with the time evolution of information entropy for a stochastic system with double singularities driven by quasimonochromatic noise. The dimension of the Fokker Planck equation is reduced by the linea... This paper deals with the time evolution of information entropy for a stochastic system with double singularities driven by quasimonochromatic noise. The dimension of the Fokker Planck equation is reduced by the linear transfor- mation. The exact expression of the time dependence of information entropy is obtained based on the definition of Shannon's information entropy. The relationships between the properties of dissipative parameters, system singularity strength parameter, quasimonochromatic noise, and their effects on information entropy are discussed. 展开更多
关键词 information entropy quasimonochromatic noise Fokker-Planck equation stochastic sys-tem with double singularities
原文传递
Double BP Q-Learning Algorithm for Local Path Planning of Mobile Robot 被引量:1
7
作者 Guoming Liu Caihong Li +2 位作者 Tengteng Gao Yongdi Li Xiaopei He 《Journal of Computer and Communications》 2021年第6期138-157,共20页
Aiming at the dimension disaster problem, poor model generalization ability and deadlock problem in special obstacles environment caused by the increase of state information in the local path planning process of mobil... Aiming at the dimension disaster problem, poor model generalization ability and deadlock problem in special obstacles environment caused by the increase of state information in the local path planning process of mobile robot, this paper proposed a Double BP Q-learning algorithm based on the fusion of Double Q-learning algorithm and BP neural network. In order to solve the dimensional disaster problem, two BP neural network fitting value functions with the same network structure were used to replace the two <i>Q</i> value tables in Double Q-Learning algorithm to solve the problem that the <i>Q</i> value table cannot store excessive state information. By adding the mechanism of priority experience replay and using the parameter transfer to initialize the model parameters in different environments, it could accelerate the convergence rate of the algorithm, improve the learning efficiency and the generalization ability of the model. By designing specific action selection strategy in special environment, the deadlock state could be avoided and the mobile robot could reach the target point. Finally, the designed Double BP Q-learning algorithm was simulated and verified, and the probability of mobile robot reaching the target point in the parameter update process was compared with the Double Q-learning algorithm under the same condition of the planned path length. The results showed that the model trained by the improved Double BP Q-learning algorithm had a higher success rate in finding the optimal or sub-optimal path in the dense discrete environment, besides, it had stronger model generalization ability, fewer redundant sections, and could reach the target point without entering the deadlock zone in the special obstacles environment. 展开更多
关键词 Mobile Robot Local Path Planning double BP q-learning BP Neural Network Transfer Learning
在线阅读 下载PDF
基于Double Deep Q-learning的无线通信节点覆盖优化 被引量:1
8
作者 李忠涛 《电子技术与软件工程》 2021年第14期1-3,共3页
本文拟采用Double Deep Q-learning模型进行算法设计,该算法是强化学习中的一种values-based算法,实现一种神经网络模型来代替表格Q-Table,解决了系统状态过多导致的Q-Table过大问题。
关键词 无线通信节点 最优路径 double Deep q-learning
在线阅读 下载PDF
Double Pruning Structure Design for Deep Stochastic Configuration Networks Based on Mutual Information and Relevance
9
作者 YAN Aijun LI Jiale TANG Jian 《Instrumentation》 2022年第4期26-39,共14页
Deep stochastic configuration networks(DSCNs)produce redundant hidden nodes and connections during training,which complicates their model structures.Aiming at the above problems,this paper proposes a double pruning st... Deep stochastic configuration networks(DSCNs)produce redundant hidden nodes and connections during training,which complicates their model structures.Aiming at the above problems,this paper proposes a double pruning structure design algorithm for DSCNs based on mutual information and relevance.During the training process,the mutual information algorithm is used to calculate and sort the importance scores of the nodes in each hidden layer in a layer-by-layer manner,the node pruning rate of each layer is set according to the depth of the DSCN at the current time,the nodes that contribute little to the model are deleted,and the network-related parameters are updated.When the model completes the configuration procedure,the correlation evaluation strategy is used to sort the global connection weights and delete insignificance connections;then,the network parameters are updated after pruning is completed.The experimental results show that the proposed structure design method can effectively compress the scale of a DSCN model and improve its modeling speed;the model accuracy loss is small,and fine-tuning for accuracy restoration is not needed.The obtained DSCN model has certain application value in the field of regression analysis. 展开更多
关键词 Deep stochastic Configuration Networks Mutual Information RELEVANCE Hidden Node double Pruning
原文传递
Stochastic period-doubling bifurcation analysis of stochastic Bonhoeffer-van der Pol system 被引量:3
10
作者 张莹 徐伟 +1 位作者 方同 徐旭林 《Chinese Physics B》 SCIE EI CAS CSCD 2007年第7期1923-1933,共11页
In this paper, the Chebyshev polynomial approximation is applied to the problem of stochastic period-doubling bifurcation of a stochastic Bonhoeffer-van der Pol (BVP for short) system with a bounded random parameter... In this paper, the Chebyshev polynomial approximation is applied to the problem of stochastic period-doubling bifurcation of a stochastic Bonhoeffer-van der Pol (BVP for short) system with a bounded random parameter. In the analysis, the stochastic BVP system is transformed by the Chebyshev polynomial approximation into an equivalent deterministic system, whose response can be readily obtained by conventional numerical methods. In this way we have explored plenty of stochastic period-doubling bifurcation phenomena of the stochastic BVP system. The numerical simulations show that the behaviour of the stochastic period-doubling bifurcation in the stochastic BVP system is by and large similar to that in the deterministic mean-parameter BVP system, but there are still some featured differences between them. For example, in the stochastic dynamic system the period-doubling bifurcation point diffuses into a critical interval and the location of the critical interval shifts with the variation of intensity of the random parameter. The obtained results show that Chebyshev polynomial approximation is an effective approach to dynamical problems in some typical nonlinear systems with a bounded random parameter of an arch-like probability density function. 展开更多
关键词 Chebyshev polynomial approximation stochastic Bonhoeffer-van der Pol system stochastic period-doubling bifurcation bounded random parameter
原文传递
Stochastic period-doubling bifurcation analysis of a Rssler system with a bounded random parameter
11
作者 倪菲 徐伟 +1 位作者 方同 岳晓乐 《Chinese Physics B》 SCIE EI CAS CSCD 2010年第1期189-196,共8页
This paper aims to study the stochastic period-doubling bifurcation of the three-dimensional Rossler system with an arch-like bounded random parameter. First, we transform the stochastic RSssler system into its equiva... This paper aims to study the stochastic period-doubling bifurcation of the three-dimensional Rossler system with an arch-like bounded random parameter. First, we transform the stochastic RSssler system into its equivalent deterministic one in the sense of minimal residual error by the Chebyshev polynomial approximation method. Then, we explore the dynamical behaviour of the stochastic RSssler system through its equivalent deterministic system by numerical simulations. The numerical results show that some stochastic period-doubling bifurcation, akin to the conventional one in the deterministic case, may also appear in the stochastic Rossler system. In addition, we also examine the influence of the random parameter intensity on bifurcation phenomena in the stochastic Rossler system. 展开更多
关键词 Chebyshev polynomial approximation stochastic RSssler system stochastic period doubling bifurcation bounded random parameter
原文传递
Stochastic period-doubling bifurcation in biharmonic driven Duffing system with random parameter
12
作者 徐伟 马少娟 谢文贤 《Chinese Physics B》 SCIE EI CAS CSCD 2008年第3期857-864,共8页
Stochastic period-doubling bifurcation is explored in a forced Duffing system with a bounded random parameter as an additional weak harmonic perturbation added to the system. Firstly, the biharmonic driven Duffing sys... Stochastic period-doubling bifurcation is explored in a forced Duffing system with a bounded random parameter as an additional weak harmonic perturbation added to the system. Firstly, the biharmonic driven Duffing system with a random parameter is reduced to its equivalent deterministic one, and then the responses of the stochastic system can be obtained by available effective numerical methods. Finally, numerical simulations show that the phase of the additional weak harmonic perturbation has great influence on the stochastic period-doubling bifurcation in the biharmonic driven Duffing system. It is emphasized that, different from the deterministic biharmonic driven Duffing system, the intensity of random parameter in the Duffing system can also be taken as a bifurcation parameter, which can lead to the stochastic period-doubling bifurcations. 展开更多
关键词 random parameter stochastic Duffing system stochastic period-doubling bifurcation orthogonal polynomial approximation
原文传递
基于双估计器的改进Speedy Q-learning算法 被引量:7
13
作者 郑帅 罗飞 +2 位作者 顾春华 丁炜超 卢海峰 《计算机科学》 CSCD 北大核心 2020年第7期179-185,共7页
Q-learning算法是一种经典的强化学习算法,更新策略由于保守和过估计的原因,存在收敛速度慢的问题。Speedy Q-learning算法和Double Q-learning算法是Q-learning算法的两个变种,分别用于解决Q-learning算法收敛速度慢和过估计的问题。... Q-learning算法是一种经典的强化学习算法,更新策略由于保守和过估计的原因,存在收敛速度慢的问题。Speedy Q-learning算法和Double Q-learning算法是Q-learning算法的两个变种,分别用于解决Q-learning算法收敛速度慢和过估计的问题。文中基于Speedy Q-learning算法Q值的更新规则和蒙特卡洛强化学习的更新策略,通过理论分析及数学证明提出了其等价形式,从该等价形式可以看到,Speedy Q-learning算法由于将当前Q值的估计函数作为历史Q值的估计,虽然整体上提升了智能体的收敛速度,但是同样存在过估计问题,使得算法在迭代初期的收敛速度较慢。针对该问题,文中基于Double Q-learning算法中双估计器可以改善智能体收敛速度的特性,提出了一种改进算法Double Speedy Q-learning。其通过双估计器,分离最优动作和最大Q值的选择,改善了Speedy Q-learning算法在迭代初期的学习策略,提升了Speedy Q-learning算法的整体收敛速度。在不同规模的格子世界中进行实验,分别采用线性学习率和多项式学习率,来对比Q-learning算法及其改进算法在迭代初期的收敛速度和整体收敛速度。实验结果表明,Double Speedy Q-learning算法在迭代初期的收敛速度快于Speedy Q-learning算法,且其整体收敛速度明显快于对比算法,其实际平均奖励值和期望奖励值之间的差值最小。 展开更多
关键词 q-learning double q-learning Speedy q-learning 强化学习
在线阅读 下载PDF
Analysis of stochastic bifurcation and chaos in stochastic Duffing-van der Pol system via Chebyshev polynomial approximation 被引量:5
14
作者 马少娟 徐伟 +1 位作者 李伟 方同 《Chinese Physics B》 SCIE EI CAS CSCD 2006年第6期1231-1238,共8页
The Chebyshev polynomial approximation is applied to investigate the stochastic period-doubling bifurcation and chaos problems of a stochastic Duffing-van der Pol system with bounded random parameter of exponential pr... The Chebyshev polynomial approximation is applied to investigate the stochastic period-doubling bifurcation and chaos problems of a stochastic Duffing-van der Pol system with bounded random parameter of exponential probability density function subjected to a harmonic excitation. Firstly the stochastic system is reduced into its equivalent deterministic one, and then the responses of stochastic system can be obtained by numerical methods. Nonlinear dynamical behaviour related to stochastic period-doubling bifurcation and chaos in the stochastic system is explored. Numerical simulations show that similar to its counterpart in deterministic nonlinear system of stochastic period-doubling bifurcation and chaos may occur in the stochastic Duffing-van der Pol system even for weak intensity of random parameter. Simply increasing the intensity of the random parameter may result in the period-doubling bifurcation which is absent from the deterministic system. 展开更多
关键词 stochastic Duffing-van der Pol system Chebyshev polynomial approximation stochastic period-doubling bifurcation stochastic chaos
原文传递
A Double Shot Noise Process and Its Application in Insurance 被引量:2
15
作者 Angelos Dassios Jiwook Jang 《Journal of Mathematics and System Science》 2012年第2期82-93,共12页
The authors consider a compound Cox model of insurance risk with the additional economic assumption of a positive interest rate. As the authors note a duality result relating a compound Cox model of insurance risk wit... The authors consider a compound Cox model of insurance risk with the additional economic assumption of a positive interest rate. As the authors note a duality result relating a compound Cox model of insurance risk with a positive interest rate and a double shot noise process, the authors analyze a double shot noise process systematically for its theoretical distributional properties, based on the piecewise deterministic Markov process theory, and the martingale methodology. The authors also obtain the moments of aggregate accumulated/discounted claims where the claim arrival process follows a Cox process with shot noise intensity. Removing the parameters in a double shot noise process gradually, the authors show that it becomes a compound Cox process with shot noise intensity, a single shot noise process and a compound Poisson process. Numerical comparisons are shown between the moments (i.e. means and variances) of a compound Poisson model and their counterparts of a compound Cox model with/without considering a positive interest rate. For that purpose, the authors assume that claim sizes and primary event sizes follow an exponential distribution, respectively. 展开更多
关键词 double shot noise process a Cox process stochastic intensity and time value of claims aggregate accumulated/discounted claims.
在线阅读 下载PDF
Optimal investment with transaction costs based on exponential utility function:a parabolic double obstacle problem
16
作者 BAO Qun-fang YANG Jing-yang +1 位作者 SUN Chao LI Sheng-hong 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2011年第4期483-492,共10页
This paper concerns optimal investment problem with proportional transaction costs and finite time horizon based on exponential utility function. Using a partial differential equation approach, we reveal that the prob... This paper concerns optimal investment problem with proportional transaction costs and finite time horizon based on exponential utility function. Using a partial differential equation approach, we reveal that the problem is equivalent to a parabolic double obstacle problem involving two free boundaries that correspond to the optimal buying and selling policies. Numerical examples are obtained by the binomial method. 展开更多
关键词 Optimal investment transaction costs double obstacle problem stochastic control exponential utility function.
在线阅读 下载PDF
EXPONENTIAL CONVERGENCE FOR NONLINEAR SPDES WITH DOUBLE REFLECTING WALLS
17
作者 Dengdi CHEN Yan ZHENG 《Acta Mathematica Scientia》 SCIE CSCD 2024年第6期2465-2484,共20页
The present article is devoted to nonlinear stochastic partial differential equations with double reflecting walls driven by possibly degenerate,multiplicative noise.We prove that the corresponding Markov semigroup po... The present article is devoted to nonlinear stochastic partial differential equations with double reflecting walls driven by possibly degenerate,multiplicative noise.We prove that the corresponding Markov semigroup possesses an exponentially attracting invariant measure through asymptotic coupling,in which Foias-Prodi estimation and the truncation technique are crucial for the realization of the Girsanov transform. 展开更多
关键词 stochastic partial differential equations with double reflecting walls exponential mixing asymptotic coupling Girsanov transform
在线阅读 下载PDF
A firm-specific Malmquist productivity index model for stochastic data envelopment analysis:an application to commercial banks
18
作者 Alireza Amirteimoori Tofigh Allahviranloo Maryam Nematizadeh 《Financial Innovation》 2024年第1期1512-1538,共27页
In the data envelopment analysis(DEA)literature,productivity change captured by the Malmquist productivity index,especially in terms of a deterministic environment and stochastic variability in inputs and outputs,has ... In the data envelopment analysis(DEA)literature,productivity change captured by the Malmquist productivity index,especially in terms of a deterministic environment and stochastic variability in inputs and outputs,has been somewhat ignored.Therefore,this study developed a firm-specific,DEA-based Malmquist index model to examine the efficiency and productivity change of banks in a stochastic environment.First,in order to estimate bank-specific efficiency,we employed a two-stage double bootstrap DEA procedure.Specifically,in the first stage,the technical efficiency scores of banks were calculated by the classic DEA model,while in the second stage,the double bootstrap DEA model was applied to determine the effect of the contextual variables on bank efficiency.Second,we applied a two-stage procedure for measuring productivity change in which the first stage included the estimation of stochastic technical efficiency and the second stage included the regression of the estimated efficiency scores on a set of explanatory variables that influence relative performance.Finally,an empirical investigation of the Iranian banking sector,consisting of 120 bank-year observations of 15 banks from 2014 to 2021,was performed to measure their efficiency and productivity change.Based on the findings,the explanatory variables(i.e.,the nonperforming loan ratio and the number of branches)indicated an inverse relationship with stochastic technical efficiency and productivity change.The implication of the findings is that,in order to improve the efficiency and productivity of banks,it is important to optimize these factors. 展开更多
关键词 stochastic data envelopment analysis stochastic Malmquist productivity index double bootstrap procedure Technical efficiency BANKING
在线阅读 下载PDF
Option Pricing under the Double Exponential Jump-Diffusion Model with Stochastic Volatility and Interest Rate 被引量:3
19
作者 Rongda Chen Zexi Li +3 位作者 Liyuan Zeng Lean Yu Qi Lin Jia Liu 《Journal of Management Science and Engineering》 2017年第4期252-289,共38页
This paper proposes an efficient option pricing model that incorporates stochastic interest rate(SIR),stochastic volatility(SV),and double exponential jump into the jump-diffusion settings.The model comprehensively co... This paper proposes an efficient option pricing model that incorporates stochastic interest rate(SIR),stochastic volatility(SV),and double exponential jump into the jump-diffusion settings.The model comprehensively considers the leptokurtosis and heteroscedasticity of the underlying asset’s returns,rare events,and an SIR.Using the model,we deduce the pricing characteristic function and pricing formula of a European option.Then,we develop the Markov chain Monte Carlo method with latent variable to solve the problem of parameter estimation under the double exponential jump-diffusion model with SIR and SV.For verification purposes,we conduct time efficiency analysis,goodness of fit analysis,and jump/drift term analysis of the proposed model.In addition,we compare the pricing accuracy of the proposed model with those of the Black-Scholes and the Kou(2002)models.The empirical results show that the proposed option pricing model has high time efficiency,and the goodness of fit and pricing accuracy are significantly higher than those of the other two models. 展开更多
关键词 Option pricing model stochastic interest rate stochastic volatility double exponential jump Markov Chain Monte Carlo with Latent Variable
原文传递
Optimized Trajectory Design in UAV Based Cellular Networks for 3D Users: A Double Q-Learning Approach
20
作者 Xuanlin Liu Mingzhe Chen Changchuan Yin 《Journal of Communications and Information Networks》 CSCD 2019年第1期24-32,共9页
In this paper,the problem of trajectory de-sign of unmanned aerial vehicles(UAVs)for maximizing the number of satisfied users is studied in a UAV based cellular network where the UAV works as a flying base station tha... In this paper,the problem of trajectory de-sign of unmanned aerial vehicles(UAVs)for maximizing the number of satisfied users is studied in a UAV based cellular network where the UAV works as a flying base station that serves users,and the user indicates its satis-faction in terms of completion of its data request within an allowable maximum waiting time.The trajectory design is formulated as an optimization problem whose goal is to maximize the number of satisfied users.To solve this problem,a machine learning framework based on double Q-learning algorithm is proposed.The algorithm enables the UAV tofind the optimal trajectory that maximizes the number of satisfied users.Compared to the traditional learning algorithms,such as Q-learning that selects and evaluates the action using the same Q-table,the proposed algorithm can decouple the selection from the evaluation,therefore avoid overestimation which leads to sub-optimal policies.Simulation results show that the proposed algorithm can achieve up to 19.4% and 14.1% gains in terms of the number of satisfied users compared to random algorithm and Q-learning algorithm. 展开更多
关键词 UAV communication trajectory design double q-learning algorithm user satisfaction cellular network
原文传递
上一页 1 2 7 下一页 到第
使用帮助 返回顶部