13 articles found
AoI-aware transmission control in real-time mmwave energy harvesting systems: a risk-sensitive reinforcement learning approach
1
Authors: Marzieh Sheikhi, Vesal Hakami 《Digital Communications and Networks》 2025, No. 3, pp. 850-865 (16 pages)
The evolution of enabling technologies in wireless communications has paved the way for supporting novel applications with more demanding QoS requirements, but at the cost of increasing the complexity of optimizing the digital communication chain. In particular, Millimeter Wave (mmWave) communications provide an abundance of bandwidth, and energy harvesting supplies the network with a continual source of energy to facilitate self-sustainability; however, harnessing these technologies is challenging due to the stochastic dynamics of the mmWave channel as well as the random, sporadic nature of the harvested energy. In this paper, we aim at the dynamic optimization of update transmissions in mmWave energy harvesting systems in terms of Age of Information (AoI). AoI has recently been introduced to quantify information freshness and is a more stringent QoS metric than conventional delay and throughput. However, most prior art has only addressed average-based AoI metrics, which can be insufficient to capture the occurrence of rare but high-impact freshness violation events in time-critical scenarios. We formulate a control problem that aims to minimize the long-term entropic risk measure of AoI samples by configuring the “sense & transmit” of updates. Due to the high complexity of the exponential cost function, we reformulate the problem with an approximated mean-variance risk measure as the new objective. Under unknown system statistics, we propose a two-timescale model-free risk-sensitive reinforcement learning algorithm to compute a control policy that adapts to the trio of channel, energy, and AoI states. We evaluate the efficiency of the proposed scheme through extensive simulations.
Keywords: Age of information; Millimeter wave; Energy harvesting; Risk-sensitive; Model-free reinforcement learning
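The abstract above replaces an entropic risk measure with a mean-variance approximation. The sketch below illustrates that relationship on a plain list of samples. It is not the authors' algorithm: the function names, the parameter `theta`, and the sample values are assumptions made for illustration, and the approximation used is the standard second-order expansion (1/θ) log E[exp(θX)] ≈ E[X] + (θ/2) Var[X], which holds for small θ.

```python
import math
from statistics import fmean, pvariance


def entropic_risk(samples, theta):
    """Empirical entropic risk (1/theta) * log E[exp(theta * X)].

    A log-sum-exp shift keeps the exponentials from overflowing.
    `theta` (hypothetical name) is the risk-sensitivity parameter, theta != 0.
    """
    shift = max(theta * x for x in samples)
    mean_exp = fmean([math.exp(theta * x - shift) for x in samples])
    return (shift + math.log(mean_exp)) / theta


def mean_variance_risk(samples, theta):
    """Second-order approximation: mean + (theta / 2) * variance."""
    return fmean(samples) + 0.5 * theta * pvariance(samples)


# Illustrative AoI-like samples (made up): mostly small, one rare large value.
aoi_samples = [1.0, 2.0, 3.0, 10.0]
small_theta = 0.01
exact = entropic_risk(aoi_samples, small_theta)
approx = mean_variance_risk(aoi_samples, small_theta)
```

For small `theta` the two values nearly coincide; as `theta` grows, the entropic measure puts more weight on the rare large sample than the mean-variance surrogate does.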
A RISK-SENSITIVE STOCHASTIC MAXIMUM PRINCIPLE FOR OPTIMAL CONTROL OF JUMP DIFFUSIONS AND ITS APPLICATIONS (Cited by: 1)
2
Authors: 史敬涛, 吴臻 《Acta Mathematica Scientia》 SCIE CSCD 2011, No. 2, pp. 419-433 (15 pages)
A stochastic maximum principle for the risk-sensitive optimal control problem of jump diffusion processes with an exponential-of-integral cost functional is derived, assuming that the value function is smooth, where the diffusion and jump terms may both depend on the control. The form of the maximum principle is similar to its risk-neutral counterpart, but the adjoint equations and the maximum condition depend heavily on the risk-sensitive parameter. As an application, a linear-quadratic risk-sensitive control problem is solved using the derived maximum principle, and an explicit optimal control is obtained.
Keywords: risk-sensitive control; jump diffusions; maximum principle; adjoint equation
Risk-sensitive reinforcement learning algorithms with generalized average criterion
3
Authors: 殷苌茗, 王汉兴, 赵飞 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2007, No. 3, pp. 405-416 (12 pages)
A new algorithm is proposed that potentially sacrifices the optimality of control policies in order to obtain robust solutions. Robustness may become a very important property for a learning system when the theoretical model does not match the practical physical system, when the practical system is not static, or when the availability of a control action changes over time. The main contribution is a set of approximation algorithms together with their convergence results. A generalized average operator, instead of the usual optimal operator max (or min), is applied to study a class of important learning algorithms, dynamic programming algorithms, and their convergence is discussed from a theoretical point of view. The purpose of this research is to improve the robustness of reinforcement learning algorithms theoretically.
Keywords: reinforcement learning; risk-sensitive; generalized average; algorithm; convergence
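The abstract above does not say which generalized average operator the authors use. As a rough illustration of the idea only, the sketch below swaps the max in a one-step value backup for a log-mean-exp average, which is one common choice and an assumption here. The names `generalized_average`, `backup`, `beta`, and the numbers are all hypothetical. With this operator, a large positive `beta` approaches max, a large negative `beta` approaches min, and `beta = 0` gives the arithmetic mean.

```python
import math


def generalized_average(values, beta):
    """Log-mean-exp average of `values` (an assumed stand-in operator).

    beta -> +inf approaches max(values), beta -> -inf approaches min(values),
    and beta == 0 is defined as the arithmetic mean.
    """
    if beta == 0:
        return sum(values) / len(values)
    shift = max(beta * v for v in values)  # shift avoids overflow in exp
    total = sum(math.exp(beta * v - shift) for v in values)
    return (shift + math.log(total / len(values))) / beta


def backup(reward, gamma, next_q_values, beta):
    """One-step backup with the max over next actions replaced by the average."""
    return reward + gamma * generalized_average(next_q_values, beta)


next_q = [1.0, 2.0, 3.0]                    # made-up action values
near_max = generalized_average(next_q, 50.0)
near_min = generalized_average(next_q, -50.0)
softened = backup(1.0, 0.9, next_q, 1.0)    # lies between the min and max backups
```

Moving `beta` away from the max end trades some optimality of the backup for less sensitivity to a single over-estimated action value, which is the kind of trade-off the abstract describes.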
Optimal Risk-Sensitive Filtering for System Stochastic of Second and Third Degree
4
Authors: Ma Aracelia Alcorta-Garcia, Sonia Gpe Anguiano Rostro, Mauricio Torres Torres 《Intelligent Control and Automation》 2011, No. 1, pp. 47-56 (10 pages)
The risk-sensitive filtering design problem with respect to the exponential mean-square cost criterion is considered for stochastic Gaussian systems with polynomial drift terms of second and third degree and intensity parameters multiplying the diffusion terms in the state and observation equations. The closed-form optimal filtering equations are obtained using quadratic value functions as solutions to the corresponding Fokker-Planck-Kolmogorov equation. The performance of the obtained risk-sensitive filtering equations for stochastic polynomial systems of second and third degree is verified in a numerical example against the optimal polynomial filtering equations (and the extended Kalman-Bucy filter for the second-degree polynomial system) by comparing the exponential mean-square cost criterion values. The simulation results reveal strong advantages in favor of the designed risk-sensitive equations for some values of the intensity parameters.
Keywords: Optimal nonlinear filtering; Risk-sensitive filtering; Extended Kalman-Bucy filtering
Risk-Sensitive Linear-Quadratic Mean-Field Games: Asymptotic Solvability and Decentralized O(1/N)-Nash Equilibria (In honour of the 80th birthday of Professor Peter Caines)
5
Authors: WANG Yu, HUANG Minyi 《Journal of Systems Science & Complexity》 2025, No. 1, pp. 436-459 (24 pages)
This paper considers risk-sensitive linear-quadratic mean-field games. By the so-called direct approach via dynamic programming, the authors determine the feedback Nash equilibrium in an N-player game. Subsequently, the authors design a set of decentralized strategies by passing to the mean-field limit. The authors prove that the set of decentralized strategies constitutes an O(1/N)-Nash equilibrium when applied by the N players, and hence obtain the tightest equilibrium error bounds so far for this class of models.
Keywords: Asymptotic Nash equilibria; decentralized strategies; linear-quadratic mean-field games; risk-sensitive costs
Design of satisfaction output feedback controls for stochastic nonlinear systems under quadratic tracking risk-sensitive index (Cited by: 8)
6
Authors: 刘允刚, 张纪峰, 潘子刚 《Science in China(Series F)》 2003, No. 2, pp. 126-144 (19 pages)
In this paper, the design problem of satisfaction output feedback controls for stochastic nonlinear systems in strict feedback form under a long-term tracking risk-sensitive index is investigated. The index function adopted here is of the quadratic form usually encountered in practice, rather than the quartic form used to sidestep the essential difficulty in controller design and performance analysis of the closed-loop systems. For any given risk-sensitive parameter and desired index value, an output feedback control is constructively designed by the integrator backstepping method so that the closed-loop system is bounded in probability and the risk-sensitive index is upper bounded by the desired value.
Keywords: integrator backstepping; nonlinear system; stochastic disturbance; risk-sensitive index; output feedback
RISK-SENSITIVE FIXED-POINT SMOOTHING ESTIMATION FOR LINEAR DISCRETE-TIME SYSTEMS WITH MULTIPLE OUTPUT DELAYS (Cited by: 1)
7
Authors: ZHAO Hongguo, CUI Peng 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2013, No. 2, pp. 137-150 (14 pages)
This paper investigates risk-sensitive fixed-point smoothing estimation for linear discrete-time systems with multiple time-delay measurements. The problem considered can be converted into an optimization problem in indefinite space. The risk-sensitive fixed-point smoother is then obtained by solving the optimization problem via innovation analysis theory in indefinite space. Necessary and sufficient conditions guaranteeing the existence of the risk-sensitive smoother are also given when the risk-sensitive parameter is negative. Compared with the conventional approach, a significant advantage of the presented approach is its lower computational cost.
Keywords: Innovation; Riccati equation; risk-sensitive; time-delay
Partially Observed Risk-Sensitive Stochastic Control Problems with Non-Convexity Restriction
8
Authors: MA Heping, LI Ruijing 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2023, No. 2, pp. 672-685 (14 pages)
The paper considers partially observed optimal control problems for risk-sensitive stochastic systems, where the control domain is non-convex and the diffusion term contains the control v. Utilizing Girsanov's theorem, the spike variational technique, and the duality method, the authors obtain four adjoint equations and establish a maximum principle under partial information. As an application, an example is presented to demonstrate the result.
Keywords: Girsanov's theorem; maximum principle; partial information; risk-sensitive optimal control
Data-Driven Direct Adaptive Risk-Sensitive Control of Stochastic Systems
9
Authors: QIAO Nan, LI Tao 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2024, No. 4, pp. 1446-1469 (24 pages)
The authors propose a data-driven direct adaptive control law based on the adaptive dynamic programming (ADP) algorithm for continuous-time stochastic linear systems with partially unknown system dynamics and infinite-horizon quadratic risk-sensitive indices. The authors use online data of the system to iteratively solve the generalized algebraic Riccati equation (GARE) and to learn the optimal control law directly. For the case with measurable system noises, the authors show that the adaptive control law approximates the optimal control law as time goes on. For the case with unmeasurable system noises, the authors use the least-squares solution calculated only from the measurable data, instead of the real solution of the regression equation, to iteratively solve the GARE. The authors also study the influences of the intensity of the system noises, the intensity of the exploration noises, the initial iterative matrix, and the sampling period on the convergence of the ADP algorithm. Finally, the authors present two numerical simulation examples to demonstrate the effectiveness of the proposed algorithms.
Keywords: Adaptive dynamic programming; direct adaptive control; generalized algebraic Riccati equation; risk-sensitive control
G-stochastic maximum principle for risk-sensitive control problem and its applications
10
Authors: Meriyam Dassa, Adel Chala 《Probability, Uncertainty and Quantitative Risk》 2023, No. 4, pp. 463-484 (22 pages)
This study advances the G-stochastic maximum principle (G-SMP) from a risk-neutral framework to a risk-sensitive one. A salient feature of this advancement is its applicability to systems governed by stochastic differential equations under G-Brownian motion (G-SDEs), where the control variable may influence all terms. We aim to generalize our findings from a risk-neutral context to a risk-sensitive performance cost. Initially, we introduced an auxiliary process to address risk-sensitive performance costs within the G-expectation framework. Subsequently, we established and validated the correlation between the G-expected exponential utility and the G-quadratic backward stochastic differential equation. Furthermore, we simplified the G-adjoint process from a dual-component structure to a singular component. Moreover, we explained the necessary optimality conditions for this model by considering a convex set of admissible controls. To describe the main findings, we present two examples: the first addresses the linear-quadratic problem and the second examines a Merton-type problem characterized by power utility.
Keywords: Stochastic optimal control; G-expectation; G-Brownian motion; G-stochastic differential equation; G-stochastic maximum principle; Risk-sensitive control; Logarithmic transformation
Linear Exponential Quadratic Stochastic Differential Games and Applications
11
Authors: Su Qing, Zhao Jirui 《南开大学学报(自然科学版)》 PKU Core 2025, No. 5, pp. 110-120 (11 pages)
The two-player nonzero-sum linear-exponential-quadratic stochastic differential game is studied. The game takes into account the players' attitudes to risk. Nonlinear transformations and change-of-probability-measure techniques are used to study the existence of both open-loop and closed-loop Nash equilibria for the game. Some examples are constructed to illustrate their differences. Furthermore, the theoretical results are applied to solve the risk-sensitive portfolio game problem in the financial market and to show the effects of risk attitudes and economic performance on equilibria.
Keywords: risk-sensitive; stochastic differential games; linear-quadratic problem; Nash equilibria; open-loop and closed-loop
Stressed portfolio optimization with semiparametric method (Cited by: 1)
12
Authors: Chuan-Hsiang Han, Kun Wang 《Financial Innovation》 2022, No. 1, pp. 821-854 (34 pages)
Tail risk is a classic topic in stressed portfolio optimization to treat unprecedented risks, while the traditional mean–variance approach may fail to perform well. This study proposes an innovative semiparametric method consisting of two modeling components: the nonparametric estimation and copula method for each marginal distribution of the portfolio and their joint distribution, respectively. We then focus on the optimal weights of the stressed portfolio and its optimal scale beyond the Gaussian restriction. Empirical studies include statistical estimation for the semiparametric method, risk measure minimization for optimal weights, and value measure maximization for the optimal scale to enlarge the investment. From the outputs of short-term and long-term data analysis, optimal stressed portfolios demonstrate the advantages of model flexibility to account for tail risk over the traditional mean–variance method.
Keywords: Portfolio optimization; Tail risk; Semiparametric method; Kernel method; Copula method; Risk measure; Risk-sensitive value measure; Scaling effect
Robust Designs Through Risk Sensitivity: An Overview
13
Author: BASAR Tamer 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2021, No. 5, pp. 1634-1665 (32 pages)
This is an overview paper on the relationship between risk-averse designs based on exponential loss functions, with or without an additional unknown (adversarial) term, and some classes of stochastic games. In particular, the paper discusses the equivalences between risk-averse controller and filter designs and saddle-point solutions of some corresponding risk-neutral stochastic differential games with different information structures for the players. One of the by-products of these analyses is that risk-averse controllers and filters (or estimators) for control and signal-measurement models are robust, through stochastic dissipation inequalities, to unmodeled perturbations in the controlled system dynamics as well as in the signal and measurement processes. The paper also discusses equivalences between risk-sensitive stochastic zero-sum differential games and some corresponding risk-neutral three-player stochastic zero-sum differential games, as well as robustness issues in stochastic nonzero-sum differential games with finite and infinite populations of players, with the latter belonging to the domain of mean-field games.
Keywords: Mean-field games; risk-sensitive control; risk-sensitive filtering; risk-sensitive games; risk sensitivity; robustness