Funding: Supported by the National Natural Science Foundation of China (61272454)
Abstract: Conventional resource provisioning algorithms focus on maximizing resource utilization while meeting a fixed response-time constraint written into the service level agreement (SLA). Unfortunately, the expected response time is highly variable and is usually longer than the SLA value. This leads to poor resource utilization and unnecessary server migrations. We develop a framework for customer-driven dynamic resource allocation in cloud computing, termed CDSMS (customer-driven service manage system). The framework's contributions are twofold. First, it reduces the total number of migrations by dynamically adjusting the response-time parameters according to customers' profiles. Second, it automatically chooses the best resource provisioning algorithm for each scenario to improve resource utilization. Finally, we perform a series of experiments on a real cloud computing platform. Experimental results show that CDSMS provides a satisfactory solution for predicting the expected response time and the interval between two tasks, and that it reduces the total resource usage cost.
Abstract: Achieving robust walking on different stairs is one of the most challenging tasks for quadruped robots in the real world. Traditional model-based methods rely heavily on environmental factors, are burdened by intricate modelling complexities, and lack generalizability. Progress in adaptive locomotion control, often impeded by complex modelling processes, can be substantially advanced through the application of Reinforcement Learning (RL). In this paper, a learning-based method is proposed to enhance, in a targeted way, the stair-climbing skill of quadruped robots under different stair conditions. First, a general policy model based on proprioceptive perception is trained as a pre-training model. Then, training is initialized from the pre-training model, and different terrain information from the stairs is introduced for customized training to enhance the stair-climbing skill without affecting the existing locomotion performance. Finally, the customized control policy is deployed on the real robot to realize motion control in real environments. The experimental results demonstrate that the customized control policy significantly improves the motion performance of quadruped robots on complex stair terrains and generalizes to some extent to other complex terrains. The proposed algorithm can be extended to various terrestrial environments.
Abstract: In this paper, we present a new method for finding a fixed local-optimal policy for computing the customer lifetime value. The method is developed for a class of ergodic controllable finite Markov chains. We propose an approach based on a non-converging state-value function that fluctuates (increases and decreases) between states of the dynamic process. We prove that this function can be represented in a recursive format using a one-step-ahead fixed-optimal policy. Then, we provide an analytical formula for the numerical realization of the fixed local-optimal strategy. We also present a second approach to the same problem, based on linear programming, which implements the c-variable method to make the problem computationally tractable. Finally, we show that these two approaches are related: after a finite number of iterations, our proposed approach converges to the same result as the linear programming method. We also present a non-traditional approach for ergodicity verification. The validity of the proposed methods is demonstrated theoretically and by simulated credit-card marketing experiments that compute the customer lifetime value for both an optimization and a game theory approach.
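To make the setting concrete, the following is a minimal sketch of choosing a fixed (stationary) policy for a small controllable ergodic Markov chain by maximizing the long-run average reward per period. It is not the recursive method or the c-variable linear program described in the abstract: it is a brute-force enumeration over deterministic policies, swapped in only for illustration. The function names, the two-state customer model (active, lapsed), the two actions (no promotion, promotion), and all numbers are hypothetical.

```python
from itertools import product


def stationary_distribution(P, iters=10_000, tol=1e-12):
    """Power iteration for the stationary distribution of a finite chain.

    Assumes the chain is irreducible and aperiodic so the iteration converges.
    P[i][j] is the probability of moving from state i to state j.
    """
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


def best_fixed_policy(P, R):
    """Enumerate deterministic stationary policies and keep the best one.

    P[a][i][j]: transition probability from state i to j under action a.
    R[a][i]:    expected one-period reward in state i under action a.
    Returns (policy, gain), where policy[i] is the action used in state i
    and gain is the long-run average reward per period.
    """
    n_actions, n_states = len(P), len(P[0])
    best = None
    for policy in product(range(n_actions), repeat=n_states):
        # Transition matrix induced by this policy: row i comes from action policy[i].
        P_pol = [P[policy[i]][i] for i in range(n_states)]
        pi = stationary_distribution(P_pol)
        gain = sum(pi[i] * R[policy[i]][i] for i in range(n_states))
        if best is None or gain > best[1]:
            best = (policy, gain)
    return best


# Hypothetical data: state 0 = active customer, state 1 = lapsed customer.
P = [
    [[0.8, 0.2], [0.1, 0.9]],  # action 0: no promotion
    [[0.9, 0.1], [0.4, 0.6]],  # action 1: promotion
]
R = [
    [10.0, 0.0],   # action 0: revenue per period in each state
    [7.0, -3.0],   # action 1: revenue net of promotion cost
]

policy, gain = best_fixed_policy(P, R)
print(policy, round(gain, 4))
```

Enumeration grows exponentially with the number of states, which is one reason a linear programming formulation is attractive for larger chains.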
Funding: Supported by the National Natural Science Foundation of China (No. 11201408) and the Natural Science Foundation of Hebei Province (No. A2013203148)
Abstract: This paper investigates a fluid model driven by an M/M/1 queue with working vacations and the RCE (removal of customers at the end) policy for negative customers. In the external environment, a negative customer is not served by the server; it only removes the positive customer at the end of the queue, one-to-one. We establish a fluid flow model based on this stochastic process and obtain the mean buffer content and the probability of an empty buffer for this fluid queue using the Laplace transform (LT) method. Moreover, several special cases of the model are derived. Finally, some numerical examples are presented to demonstrate the effects of the parameters on the performance indices of the fluid model.
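As background for the fluid-queue setting, here is a minimal sketch of the simplest driving process: a plain M/M/1 queue, without the working vacations or negative customers treated in the paper. It computes the standard stationary distribution of the queue length and the mean drift of a fluid buffer whose net input rate depends only on whether the queue is empty. The function names and the two-rate structure (`rate_empty`, `rate_busy`) are assumptions made for illustration; a negative mean drift is the usual stability condition for such a buffer.

```python
def mm1_stationary(lam, mu, n_max):
    """Stationary probabilities P(N = n), n = 0..n_max, of an M/M/1 queue.

    lam: arrival rate, mu: service rate; requires lam < mu for stability.
    """
    rho = lam / mu
    if not 0 <= rho < 1:
        raise ValueError("M/M/1 is stable only when 0 <= lam/mu < 1")
    return [(1 - rho) * rho ** n for n in range(n_max + 1)]


def mean_drift(lam, mu, rate_empty, rate_busy):
    """Long-run average net input rate of the fluid buffer.

    rate_empty: net rate while the queue is empty (typically negative).
    rate_busy:  net rate while the queue is non-empty (typically positive).
    The queue is empty with probability 1 - rho and busy with probability rho.
    """
    rho = lam / mu
    if not 0 <= rho < 1:
        raise ValueError("M/M/1 is stable only when 0 <= lam/mu < 1")
    return (1 - rho) * rate_empty + rho * rate_busy


# Hypothetical parameters: the buffer drains at rate 2 when the queue is
# empty and fills at rate 1 when it is busy.
probs = mm1_stationary(lam=1.0, mu=2.0, n_max=2)
drift = mean_drift(lam=1.0, mu=2.0, rate_empty=-2.0, rate_busy=1.0)
print(probs, drift)
```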