期刊文献+
共找到25篇文章
< 1 2 >
每页显示 20 50 100
Starlet:Network defense resource allocation with multi-armed bandits for cloud-edge crowd sensing in IoT
1
作者 Hui Xia Ning Huang +2 位作者 Xuecai Feng Rui Zhang Chao Liu 《Digital Communications and Networks》 SCIE CSCD 2024年第3期586-596,共11页
The cloud platform has limited defense resources to fully protect the edge servers used to process crowd sensing data in Internet of Things.To guarantee the network's overall security,we present a network defense ... The cloud platform has limited defense resources to fully protect the edge servers used to process crowd sensing data in Internet of Things.To guarantee the network's overall security,we present a network defense resource allocation with multi-armed bandits to maximize the network's overall benefit.Firstly,we propose the method for dynamic setting of node defense resource thresholds to obtain the defender(attacker)benefit function of edge servers(nodes)and distribution.Secondly,we design a defense resource sharing mechanism for neighboring nodes to obtain the defense capability of nodes.Subsequently,we use the decomposability and Lipschitz conti-nuity of the defender's total expected utility to reduce the difference between the utility's discrete and continuous arms and analyze the difference theoretically.Finally,experimental results show that the method maximizes the defender's total expected utility and reduces the difference between the discrete and continuous arms of the utility. 展开更多
关键词 Internet of things Defense resource sharing multi-armed bandits Defense resource allocation
在线阅读 下载PDF
Distributed Weighted Data Aggregation Algorithm in End-to-Edge Communication Networks Based on Multi-armed Bandit 被引量:1
2
作者 Yifei ZOU Senmao QI +1 位作者 Cong'an XU Dongxiao YU 《计算机科学》 CSCD 北大核心 2023年第2期13-22,共10页
As a combination of edge computing and artificial intelligence,edge intelligence has become a promising technique and provided its users with a series of fast,precise,and customized services.In edge intelligence,when ... As a combination of edge computing and artificial intelligence,edge intelligence has become a promising technique and provided its users with a series of fast,precise,and customized services.In edge intelligence,when learning agents are deployed on the edge side,the data aggregation from the end side to the designated edge devices is an important research topic.Considering the various importance of end devices,this paper studies the weighted data aggregation problem in a single hop end-to-edge communication network.Firstly,to make sure all the end devices with various weights are fairly treated in data aggregation,a distributed end-to-edge cooperative scheme is proposed.Then,to handle the massive contention on the wireless channel caused by end devices,a multi-armed bandit(MAB)algorithm is designed to help the end devices find their most appropriate update rates.Diffe-rent from the traditional data aggregation works,combining the MAB enables our algorithm a higher efficiency in data aggregation.With a theoretical analysis,we show that the efficiency of our algorithm is asymptotically optimal.Comparative experiments with previous works are also conducted to show the strength of our algorithm. 展开更多
关键词 Weighted data aggregation End-to-edge communication multi-armed bandit Edge intelligence
在线阅读 下载PDF
Cross-linking Copolymerization of Acrylic Acid and Multi-armed Cross-linkers
3
作者 Qiang CHEN Ying GUAN +2 位作者 Xian Min ZHANG Yu Xing PENG Jian XU 《Chinese Chemical Letters》 SCIE CAS CSCD 2001年第11期1029-1032,共4页
The overall cross-linking copolymerization of acrylic acid and multi-armed cross-linkers are investigated by in situ interferometry. The results show that the more arms the cross-linkers have, the higher the polymeriz... The overall cross-linking copolymerization of acrylic acid and multi-armed cross-linkers are investigated by in situ interferometry. The results show that the more arms the cross-linkers have, the higher the polymerization rate is. However, they also mean the existence of less cross-linking efficiency and some defects in gel network. 展开更多
关键词 multi-armed cross-linker acrylic acid in situ interferometry
在线阅读 下载PDF
Training a Quantum Neural Network to Solve the Contextual Multi-Armed Bandit Problem
4
作者 Wei Hu James Hu 《Natural Science》 2019年第1期17-27,共11页
Artificial intelligence has permeated all aspects of our lives today. However, to make AI behave like real AI, the critical bottleneck lies in the speed of computing. Quantum computers employ the peculiar and unique p... Artificial intelligence has permeated all aspects of our lives today. However, to make AI behave like real AI, the critical bottleneck lies in the speed of computing. Quantum computers employ the peculiar and unique properties of quantum states such as superposition, entanglement, and interference to process information in ways that classical computers cannot. As a new paradigm of computation, quantum computers are capable of performing tasks intractable for classical processors, thus providing a quantum leap in AI research and making the development of real AI a possibility. In this regard, quantum machine learning not only enhances the classical machine learning approach but more importantly it provides an avenue to explore new machine learning models that have no classical counterparts. The qubit-based quantum computers cannot naturally represent the continuous variables commonly used in machine learning, since the measurement outputs of qubit-based circuits are generally discrete. Therefore, a continuous-variable (CV) quantum architecture based on a photonic quantum computing model is selected for our study. In this work, we employ machine learning and optimization to create photonic quantum circuits that can solve the contextual multi-armed bandit problem, a problem in the domain of reinforcement learning, which demonstrates that quantum reinforcement learning algorithms can be learned by a quantum device. 展开更多
关键词 Continuous-Variable QUANTUM COMPUTERS QUANTUM Machine LEARNING QUANTUM Reinforcement LEARNING CONTEXTUAL multi-armed BANDIT PROBLEM
暂未订购
Strict greedy design paradigm applied to the stochastic multi-armed bandit problem
5
作者 Joey Hong 《机床与液压》 北大核心 2015年第6期1-6,共6页
The process of making decisions is something humans do inherently and routinely,to the extent that it appears commonplace. However,in order to achieve good overall performance,decisions must take into account both the... The process of making decisions is something humans do inherently and routinely,to the extent that it appears commonplace. However,in order to achieve good overall performance,decisions must take into account both the outcomes of past decisions and opportunities of future ones. Reinforcement learning,which is fundamental to sequential decision-making,consists of the following components: 1 A set of decisions epochs; 2 A set of environment states; 3 A set of available actions to transition states; 4 State-action dependent immediate rewards for each action.At each decision,the environment state provides the decision maker with a set of available actions from which to choose. As a result of selecting a particular action in the state,the environment generates an immediate reward for the decision maker and shifts to a different state and decision. The ultimate goal for the decision maker is to maximize the total reward after a sequence of time steps.This paper will focus on an archetypal example of reinforcement learning,the stochastic multi-armed bandit problem. After introducing the dilemma,I will briefly cover the most common methods used to solve it,namely the UCB and εn- greedy algorithms. I will also introduce my own greedy implementation,the strict-greedy algorithm,which more tightly follows the greedy pattern in algorithm design,and show that it runs comparably to the two accepted algorithms. 展开更多
关键词 Greedy algorithms Allocation strategy Stochastic multi-armed bandit problem
在线阅读 下载PDF
Diversity-Based Recruitment in Crowdsensing by Combinatorial Multi-Armed Bandits
6
作者 Abdalaziz Sawwan Jie Wu 《Tsinghua Science and Technology》 2025年第2期732-747,共16页
Mobile Crowdsensing(MCS)represents a transformative approach to collecting data from the environment as it utilizes the ubiquity and sensory capabilities of mobile devices with human participants.This paradigm enables... Mobile Crowdsensing(MCS)represents a transformative approach to collecting data from the environment as it utilizes the ubiquity and sensory capabilities of mobile devices with human participants.This paradigm enables scales of data collection critical for applications ranging from environmental monitoring to urban planning.However,the effective harnessing of this distributed data collection capability faces significant challenges.One of the most significant challenges is the variability in the sensing qualities of the participating devices while they are initially unknown and must be learned over time to optimize task assignments.This paper tackles the dual challenges of managing task diversity to mitigate data redundancy and optimizing task assignment amidst the inherent variability of worker performance.We introduce a novel model that dynamically adjusts task weights based on assignment frequency to promote diversity and incorporates a flexible approach to account for the different qualities of task completion,especially in scenarios with overlapping task assignments.Our strategy aims to maximize the overall weighted quality of data collected within the constraints of a predefined budget.Our strategy leverages a combinatorial multi-armed bandit framework with an upper confidence bound approach to guide decision-making.We demonstrate the efficacy of our approach through a combination of regret analysis and simulations grounded in realistic scenarios. 展开更多
关键词 diverse allocation mobile crowdsensing multi-agent systems multi-armed bandits online learning worker recruitment
原文传递
Mobility-Aware User Scheduling in Wireless Federated Learning with Contextual Multi-Armed Bandit
7
作者 Li Jun Sun Haiyang +4 位作者 Deng Xiumei Wei Kang Shi Long Liang Le Chen Wen 《China Communications》 2025年第11期256-272,共17页
Federated learning(FL)is an intricate and privacy-preserving technique that enables distributed mobile devices to collaboratively train a machine learning model.However,in real-world FL scenarios,the training performa... Federated learning(FL)is an intricate and privacy-preserving technique that enables distributed mobile devices to collaboratively train a machine learning model.However,in real-world FL scenarios,the training performance is affected by a combination of factors such as the mobility of user devices,limited communication and computational resources,thus making the user scheduling problem crucial.To tackle this problem,we jointly consider the user mobility,communication and computational capacities,and develop a stochastic optimization problem to minimize the convergence time.Specifically,we first establish a convergence bound on the training performance based on the heterogeneity of users’data,and then leverage this bound to derive the participation rate for each user.After deriving the user-specific participation rate,we aim to minimize the training latency by optimizing user scheduling under the constraints of the energy consumption and participation rate.Afterward,we transform this optimization problem to the contextual multi-armed bandit framework based on the Lyapunov method and solve it with the submodular reward enhanced linear upper confidence bound(SR-linUCB)algorithm.Experimental results demonstrate the superiority of our proposed algorithm on the training performance and time consumption compared with stateof-the-art algorithms for both independent and identically distributed(IID)and non-IID settings. 展开更多
关键词 contextual multi-armed bandit federated learning resource allocation upper confidence bound user scheduling
在线阅读 下载PDF
An in situ dressing material containing a multi-armed antibiotic for healing irregular wounds 被引量:1
8
作者 Ruihua Dong Mian Chen +4 位作者 Yuexiao Jia Hao Tang Ziyin Xiong Yunze Long Xingyu Jiang 《Aggregate》 EI CAS 2024年第3期248-257,共10页
Acute and infected wounds resulting from accidents,battlefield trauma,or surgical interventions have become a global healthcare burden due to the complex bacterial infection environment.However,conventional gauze dres... Acute and infected wounds resulting from accidents,battlefield trauma,or surgical interventions have become a global healthcare burden due to the complex bacterial infection environment.However,conventional gauze dressings present insufficient contact with irregular wounds and lack antibacterial activity against multi-drug-resistant bacteria.In this study,we develop in situ nanofibrous dressings tailored tofit wounds of various shapes and sizes while providing nanoscale comfort and excellent antibacterial properties.Our approach involves the fabrication of these dressings using a handheld electrospinning device that allows for the direct depo-sition of nanofiber dressings onto specific irregular wound sites,resulting in perfect conformal wound closure without any mismatch in 2 min.The nanofibrous dressings are loaded with multi-armed antibiotics that exhibit outstanding antibacterial activ-ity against Staphylococcus aureus(S.aureus)and methicillin-resistant S.aureus.Compared to conventional vancomycin,this in situ nanofibrous dressing shows great antibacterial performance against up to 98%of multi-drug-resistant bacteria.In vitro and in vivo experiments demonstrate the ability of in situ nanofibrous dressings to prevent multi-drug-resistant bacterial infection,greatly alleviate inflammation,and promote wound healing.Ourfindings highlight the potential of these personalized nanofibrous dressings for clinical applications,including emergency,accident,and surgical healthcare treatment. 展开更多
关键词 deposited nanofibrous dressing multi-armed antibiotics personalized healthcare portable electrospin-ning device
在线阅读 下载PDF
Demonstration-enhanced policy search for space multi-arm robot collaborative skill learning
9
作者 Tian GAO Chengfei YUE +1 位作者 Xiaozhe JU Tao LIN 《Chinese Journal of Aeronautics》 2025年第3期462-473,共12页
The increasing complexity of on-orbit tasks imposes great demands on the flexible operation of space robotic arms, prompting the development of space robots from single-arm manipulation to multi-arm collaboration. In ... The increasing complexity of on-orbit tasks imposes great demands on the flexible operation of space robotic arms, prompting the development of space robots from single-arm manipulation to multi-arm collaboration. In this paper, a combined approach of Learning from Demonstration (LfD) and Reinforcement Learning (RL) is proposed for space multi-arm collaborative skill learning. The combination effectively resolves the trade-off between learning efficiency and feasible solution in LfD, as well as the time-consuming pursuit of the optimal solution in RL. With the prior knowledge of LfD, space robotic arms can achieve efficient guided learning in high-dimensional state-action space. Specifically, an LfD approach with Probabilistic Movement Primitives (ProMP) is firstly utilized to encode and reproduce the demonstration actions, generating a distribution as the initialization of policy. Then in the RL stage, a Relative Entropy Policy Search (REPS) algorithm modified in continuous state-action space is employed for further policy improvement. More importantly, the learned behaviors can maintain and reflect the characteristics of demonstrations. In addition, a series of supplementary policy search mechanisms are designed to accelerate the exploration process. The effectiveness of the proposed method has been verified both theoretically and experimentally. Moreover, comparisons with state-of-the-art methods have confirmed the outperformance of the approach. 展开更多
关键词 Space multi-arm collaboration Demonstrations .Reinforcement Learning Probabilistic Movement Primitives Relative Entropy Policy Search Policy search mechanism
原文传递
Residential HVAC Aggregation Based on Risk-averse Multi-armed Bandit Learning for Secondary Frequency Regulation 被引量:8
10
作者 Xinyi Chen Qinran Hu +3 位作者 Qingxin Shi Xiangjun Quan Zaijun Wu Fangxing Li 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2020年第6期1160-1167,共8页
As the penetration of renewable energy continues to increase,stochastic and intermittent generation resources gradually replace the conventional generators,bringing significant challenges in stabilizing power system f... As the penetration of renewable energy continues to increase,stochastic and intermittent generation resources gradually replace the conventional generators,bringing significant challenges in stabilizing power system frequency.Thus,aggregating demand-side resources for frequency regulation attracts attentions from both academia and industry.However,in practice,conventional aggregation approaches suffer from random and uncertain behaviors of the users such as opting out control signals.The risk-averse multi-armed bandit learning approach is adopted to learn the behaviors of the users and a novel aggregation strategy is developed for residential heating,ventilation,and air conditioning(HVAC)to provide reliable secondary frequency regulation.Compared with the conventional approach,the simulation results show that the risk-averse multiarmed bandit learning approach performs better in secondary frequency regulation with fewer users being selected and opting out of the control.Besides,the proposed approach is more robust to random and changing behaviors of the users. 展开更多
关键词 HEATING ventilation and air conditioning(HVAC) load control multi-armed bandit online learning secondary frequency regulation
原文传递
Simultaneously inducing synthesis of semiconductor selenium multi-armed nanorods and nanobars through bio-membrane bi-templates 被引量:3
11
作者 LI Li1, WU Qingsheng1 DING Yaping2 & LI Ping3 1. Department of Chemistry, Tongji University, Shanghai 200092, China 2. Department of Chemistry, Shanghai University, Shanghai 200436, China 3. Department of Food, Anhui Agricultural University, Hefei 230036, China 《Science China Chemistry》 SCIE EI CAS 2004年第6期507-511,共5页
Multi-armed nanorods and nanobars of semiconductor selenium were simultane- ously synthesized in the light of biomineralization process through bio-membrane bi-templates of rush at room temperature. The multi-armed na... Multi-armed nanorods and nanobars of semiconductor selenium were simultane- ously synthesized in the light of biomineralization process through bio-membrane bi-templates of rush at room temperature. The multi-armed nanorods are 60 nm in diameter and 1.5 μm in length; the nanobars are 150 nm in diameter and 1000—1100 nm in length. The XRD pattern indicates that these nanocrystals were crystallized in the hexagonal structure with lattice constants a = 0.437 nm, c = 0.495 nm. The possible formation mechanism was investigated. 展开更多
关键词 selenium multi-armed nanorods nanobars bio-membrane bi-templates.
原文传递
Channel estimation based on multi-armed approach for maritime OFDM wireless communications
12
作者 Zhang Qianqian Xu Yanli 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2023年第4期75-85,120,共12页
With the development of maritime informatization and the increased generation of marine data,the demands of efficient and reliable maritime communication surge.However,harsh and dynamic marine communication environmen... With the development of maritime informatization and the increased generation of marine data,the demands of efficient and reliable maritime communication surge.However,harsh and dynamic marine communication environmentcan distort transmission signal,which significantly weaken the communication performance.Therefore,for maritime wireless communication system,the channel estimation is often required to detect the channel suffered from the impacts of changing factors.Since there is no universal maritime communication channel model and channel varies dynamically,channel estimation method needs to make decision dynamically without pre-knowledge of channel distribution.This paper studies the radio channel estimation problem of wireless communications over the sea surface.To improve the estimation accuracy,this paper utilizes multi-armed bandit(MAB)problem to deal with the uncertainty of channel state information(CSI),then proposes a dynamic channel estimation algorithm to explore the global changing channel information,and asymptotically minimize the estimation error.By the aid of MAB,the estimation is not only dynamic according to channel variation,but also does not need to know the channel distribution.Simulation results show that the proposed algorithm can achieve higher estimation accuracy compared to matching pursuit(MP)-based and fractional Fourier transform(FrFT)-based methods. 展开更多
关键词 MARITIME WIRELESS COMMUNICATIONS channel estimation multi-armed BANDIT
原文传递
Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs
13
作者 Yifan Lin Yuhao Wang Enlu Zhou 《Journal of Systems Science and Systems Engineering》 SCIE EI CSCD 2023年第3期267-288,共22页
In this paper we consider the contextual multi-armed bandit problem for linear payoffs under a risk-averse criterion.At each round,contexts are revealed for each arm,and the decision maker chooses one arm to pull and ... In this paper we consider the contextual multi-armed bandit problem for linear payoffs under a risk-averse criterion.At each round,contexts are revealed for each arm,and the decision maker chooses one arm to pull and receives the corresponding reward.In particular,we consider mean-variance as the risk criterion,and the best arm is the one with the largest mean-variance reward.We apply the Thompson sampling algorithm for the disjoint model,and provide a comprehensive regret analysis for a variant of the proposed algorithm.For T rounds,K actions,and d-dimensional feature vectors,we prove a regret bound of O((1+ρ+1/ρ)d In T ln K/δ√dKT^(1+2∈)ln K/δ1/e)that holds with probability 1-δunder the mean-variance criterion with risk tolerance p,for any 0<ε<1/2,0<δ<1.The empirical performance of our proposed algorithms is demonstrated via a portfolio selection problem. 展开更多
关键词 multi-armed bandit CONTEXT RISK-AVERSE Thompson sampling
原文传递
Apo E基因multi-ARMS与PCR-RFLP分型法的比较研究
14
作者 唐慧 董虹 +3 位作者 高建梅 华映昆 张丽平 严新民 《大理学院学报(综合版)》 CAS 2002年第3期1-4,共4页
目的 :以聚合酶链反应—限制性酶切片段长度多态性(PCR -RFLP)方法作为参照 ,建立一种ApoE基因分型方法。方法 :以基因组DNA为模板 ,用PCR -RFLP法和等位基因特异性复合PCR方法(MultiplexAmplificationRefractoryMutationSystem,multi-A... 目的 :以聚合酶链反应—限制性酶切片段长度多态性(PCR -RFLP)方法作为参照 ,建立一种ApoE基因分型方法。方法 :以基因组DNA为模板 ,用PCR -RFLP法和等位基因特异性复合PCR方法(MultiplexAmplificationRefractoryMutationSystem,multi-ARMS)检测了75例个体的ApoE基因型。结果 :共检测出5种常见的ApoE基因型 :ε2/2 ,ε3/3 ,ε2/3,ε3/4 ,ε2/4。两种方法所得的结果完全一致。结论 :说明multi-ARMS方法快速简便、准确可靠 。 展开更多
关键词 APOE基因 multi-armS PCR-RFLP分型法 比较研究 基因分型
在线阅读 下载PDF
Impedance control of multi-arm space robot for the capture of non-cooperative targets 被引量:7
15
作者 GE Dongming SUN Guanghui +1 位作者 ZOU Yuanjie SHI Jixin 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2020年第5期1051-1061,共11页
Robotic systems are expected to play an increasingly important role in future space activities. The robotic on-orbital service, whose key is the capturing technology, becomes a research hot spot in recent years. This ... Robotic systems are expected to play an increasingly important role in future space activities. The robotic on-orbital service, whose key is the capturing technology, becomes a research hot spot in recent years. This paper studies the dynamics modeling and impedance control of a multi-arm free-flying space robotic system capturing a non-cooperative target. Firstly, a control-oriented dynamics model is essential in control algorithm design and code realization. Unlike a numerical algorithm, an analytical approach is suggested. Using a general and a quasi-coordinate Lagrangian formulation, the kinematics and dynamics equations are derived.Then, an impedance control algorithm is developed which allows coordinated control of the multiple manipulators to capture a target.Through enforcing a reference impedance, end-effectors behave like a mass-damper-spring system fixed in inertial space in reaction to any contact force between the capture hands and the target. Meanwhile, the position and the attitude of the base are maintained stably by using gas jet thrusters to work against the manipulators' reaction. Finally, a simulation by using a space robot with two manipulators and a free-floating non-cooperative target is illustrated to verify the effectiveness of the proposed method. 展开更多
关键词 multi-arm space robot impedance control non-cooperative target CAPTURE
在线阅读 下载PDF
Stochastic programming based multi-arm bandit offloading strategy for internet of things
16
作者 Bin Cao Tingyong Wu Xiang Bai 《Digital Communications and Networks》 SCIE CSCD 2023年第5期1200-1211,共12页
In order to solve the high latency of traditional cloud computing and the processing capacity limitation of Internet of Things(IoT)users,Multi-access Edge Computing(MEC)migrates computing and storage capabilities from... In order to solve the high latency of traditional cloud computing and the processing capacity limitation of Internet of Things(IoT)users,Multi-access Edge Computing(MEC)migrates computing and storage capabilities from the remote data center to the edge of network,providing users with computation services quickly and directly.In this paper,we investigate the impact of the randomness caused by the movement of the IoT user on decision-making for offloading,where the connection between the IoT user and the MEC servers is uncertain.This uncertainty would be the main obstacle to assign the task accurately.Consequently,if the assigned task cannot match well with the real connection time,a migration(connection time is not enough to process)would be caused.In order to address the impact of this uncertainty,we formulate the offloading decision as an optimization problem considering the transmission,computation and migration.With the help of Stochastic Programming(SP),we use the posteriori recourse to compensate for inaccurate predictions.Meanwhile,in heterogeneous networks,considering multiple candidate MEC servers could be selected simultaneously due to overlapping,we also introduce the Multi-Arm Bandit(MAB)theory for MEC selection.The extensive simulations validate the improvement and effectiveness of the proposed SP-based Multi-arm bandit Method(SMM)for offloading in terms of reward,cost,energy consumption and delay.The results showthat SMMcan achieve about 20%improvement compared with the traditional offloading method that does not consider the randomness,and it also outperforms the existing SP/MAB based method for offloading. 展开更多
关键词 Multi-access computing Internet of things OFFLOADING Stochastic programming multi-arm bandit
在线阅读 下载PDF
Meta-Analysis of Multi-Arm Trials Using Binomial Approach
17
作者 Hathaikan Chootrakool Pichet Treewai 《Open Journal of Statistics》 2022年第1期15-32,共18页
Most meta-analysis has concentrated on combining of treatment effect measures based on comparisons of two treatments. Meta-analysis of multi-arm trials is a key component of submission to summarize evidence from all p... Most meta-analysis has concentrated on combining of treatment effect measures based on comparisons of two treatments. Meta-analysis of multi-arm trials is a key component of submission to summarize evidence from all possible studies. In this paper, an exact binomial model is proposed by using logistic regression model to compare different treatment in multi-arm trials. Two approaches such as unconditional maximum likelihood and conditional maximum likelihood have been determined and compared for the logistic regression model. The proposed models are performed using the data from 27 randomized clinical trials (RCTs) which determine the efficacy of antiplatelet therapy in reduction venous thrombosis and pulmonary embolism. 展开更多
关键词 META-ANALYSIS multi-arm Trials Binomial Approach
在线阅读 下载PDF
An Underwater Biomimetic Robot that can Swim,Bipedal Walk and Grasp
18
作者 Qiuxuan Wu Liwei Pan +5 位作者 FuLin Du ZhaoSheng Wu XiaoNi Chi FaRong Gao Jian Wang Anton A.Zhilenkov 《Journal of Bionic Engineering》 SCIE EI CSCD 2024年第3期1223-1237,共15页
In developing and exploring extreme and harsh underwater environments,underwater robots can effectively replace humans to complete tasks.To meet the requirements of underwater flexible motion and comprehensive subsea ... In developing and exploring extreme and harsh underwater environments,underwater robots can effectively replace humans to complete tasks.To meet the requirements of underwater flexible motion and comprehensive subsea operation,a novel octopus-inspired robot with eight soft limbs was designed and developed.This robot possesses the capabilities of underwater bipedal walking,multi-arm swimming,and grasping objects.To closely interact with the underwater seabed environment and minimize disturbance,the robot employs a cable-driven flexible arm for its walking in underwater floor through a bipedal walking mode.The multi-arm swimming offers a means of three-dimensional spatial movement,allowing the robot to swiftly explore and navigate over large areas,thereby enhancing its flexibility.Furthermore,the robot’s walking arm enables it to grasp and transport objects underwater,thereby enhancing its practicality in underwater environments.A simplified motion models and gait generation strategies were proposed for two modes of robot locomotion:swimming and walking,inspired by the movement characteristics of octopus-inspired multi-arm swimming and bipedal walking.Through experimental verification,the robot’s average speed of underwater bipedal walking reaches 7.26 cm/s,while the horizontal movement speed for multi-arm swimming is 8.6 cm/s. 展开更多
关键词 Underwater soft robots Underwater bipedal walking multi-arm swimming Cable drive
在线阅读 下载PDF
Matching while Learning: Wireless Scheduling for Age of Information Optimization at the Edge 被引量:3
19
作者 Kun Guo Hao Yang +2 位作者 Peng Yang Wei Feng Tony Q.S.Quek 《China Communications》 SCIE CSCD 2023年第3期347-360,共14页
In this paper,we investigate the minimization of age of information(AoI),a metric that measures the information freshness,at the network edge with unreliable wireless communications.Particularly,we consider a set of u... In this paper,we investigate the minimization of age of information(AoI),a metric that measures the information freshness,at the network edge with unreliable wireless communications.Particularly,we consider a set of users transmitting status updates,which are collected by the user randomly over time,to an edge server through unreliable orthogonal channels.It begs a natural question:with random status update arrivals and obscure channel conditions,can we devise an intelligent scheduling policy that matches the users and channels to stabilize the queues of all users while minimizing the average AoI?To give an adequate answer,we define a bipartite graph and formulate a dynamic edge activation problem with stability constraints.Then,we propose an online matching while learning algorithm(MatL)and discuss its implementation for wireless scheduling.Finally,simulation results demonstrate that the MatL is reliable to learn the channel states and manage the users’buffers for fresher information at the edge. 展开更多
关键词 information freshness Lyapunov opti-mization multi-armed bandit wireless scheduling
在线阅读 下载PDF
Optimal index shooting policy for layered missile defense system 被引量:2
20
作者 LI Longyue FAN Chengli +2 位作者 XING Qinghua XU Hailong ZHAO Huizhen 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2020年第1期118-129,共12页
In order to cope with the increasing threat of the ballistic missile(BM)in a shorter reaction time,the shooting policy of the layered defense system needs to be optimized.The main decisionmaking problem of shooting op... In order to cope with the increasing threat of the ballistic missile(BM)in a shorter reaction time,the shooting policy of the layered defense system needs to be optimized.The main decisionmaking problem of shooting optimization is how to choose the next BM which needs to be shot according to the previous engagements and results,thus maximizing the expected return of BMs killed or minimizing the cost of BMs penetration.Motivated by this,this study aims to determine an optimal shooting policy for a two-layer missile defense(TLMD)system.This paper considers a scenario in which the TLMD system wishes to shoot at a collection of BMs one at a time,and to maximize the return obtained from BMs killed before the system demise.To provide a policy analysis tool,this paper develops a general model for shooting decision-making,the shooting engagements can be described as a discounted reward Markov decision process.The index shooting policy is a strategy that can effectively balance the shooting returns and the risk that the defense mission fails,and the goal is to maximize the return obtained from BMs killed before the system demise.The numerical results show that the index policy is better than a range of competitors,especially the mean returns and the mean killing BM number. 展开更多
关键词 Gittins index shooting policy layered missile defense multi-armed bandits problem Markov decision process
在线阅读 下载PDF
上一页 1 2 下一页 到第
使用帮助 返回顶部