期刊文献+
共找到23篇文章
< 1 2 >
每页显示 20 50 100
Physically informed hierarchical learning based soft sensing for aero-engine health management unit
1
作者 Aina WANG Pan QIN +2 位作者 Yunbo YUAN Guang ZHAO Ximing SUN 《Chinese Journal of Aeronautics》 2025年第3期374-385,共12页
Partial Differential Equations(PDEs)are model candidates of soft sensing for aero-engine health management units.The existing Physics-Informed Neural Networks(PINNs)have made achievements.However,unmeasurable aero-eng... Partial Differential Equations(PDEs)are model candidates of soft sensing for aero-engine health management units.The existing Physics-Informed Neural Networks(PINNs)have made achievements.However,unmeasurable aero-engine driving sources lead to unknown PDE driving terms,which weaken PINNs feasibility.To this end,Physically Informed Hierarchical Learning followed by Recurrent-Prediction Term(PIHL-RPT)is proposed.First,PIHL is proposed for learning nonhomogeneous PDE solutions,in which two networks NetU and NetG are constructed.NetU is for learning solutions satisfying PDEs;NetG is for learning driving terms to regularize NetU training.Then,we propose a hierarchical learning strategy to optimize and couple NetU and NetG,which are integrated into a data-physics-hybrid loss function.Besides,we prove PIHL-RPT can iteratively generate a series of networks converging to a function,which can approximate a solution to well-posed PDE.Furthermore,RPT is proposed for prediction improvement of PIHL,in which network NetU-RP is constructed to compensate for information loss caused by data sampling and driving sources’immeasurability.Finally,artificial datasets and practical vibration process datasets from our wear experiment platform are used to verify the feasibility and effectiveness of PIHL-RPT based soft sensing.Meanwhile,comparisons with relevant methods,discussions,and PIHL-RPT based health monitoring example are given. 展开更多
关键词 hierarchical learning strategy Monitoring:Partial differen tial equations with unmeasurable driving terms Physically informed hierarchical learning followed by recurrent-prediction term Soft sensing
原文传递
Continuous-time hierarchical reinforcement learning for satellite pursuit decision
2
作者 Linsen WEI Xin NING +3 位作者 Xiaobin LIAN Feng WANG Gaopeng ZHANG Mingpei LIN 《Chinese Journal of Aeronautics》 2025年第12期363-375,共13页
The satellite orbital pursuit game focuses on studying spacecraft maneuvering strategies in space.Traditional numerical methods often face real-time inadequacies and adaptability limitations when dealing with highly n... The satellite orbital pursuit game focuses on studying spacecraft maneuvering strategies in space.Traditional numerical methods often face real-time inadequacies and adaptability limitations when dealing with highly nonlinear problems.With the advancement of Deep Reinforcement Learning(DRL)technology,continuous-time orbital control capabilities have significantly improved.Despite this,the existing DRL technologies still need adjustments in action delay and discretization structure to better adapt to practical application scenarios.Combining continuous learning and model planning demonstrates the adaptability of these methods in continuous-time decision problems.Additionally,to more effectively handle action delay issues,a new scheduled action execution technique has been developed.This technique optimizes action execution timing through real-time policy adjustments,thus adapting to the dynamic changes in the orbital environment.A Hierarchical Reinforcement Learning(HRL)strategy was also adopted to simplify the decision-making process for long-distance pursuit tasks by setting phased subgoals to gradually approach the target.The effectiveness of the proposed strategy in practical satellite pursuit scenarios has been verified through simulations of two different tasks. 展开更多
关键词 Continuous-time decision hierarchical reinforcement learning Intelligent decision Orbital pursuit game Trajectory planning
原文传递
Multi-missile coordinated penetration strategy based on hierarchical reinforcement learning in reduced space
3
作者 Yaoluo HUI Xiumin LI +2 位作者 Chen LIANG Zenghui ZHANG Jianing YAN 《Chinese Journal of Aeronautics》 2025年第9期304-322,共19页
A group optimal penetration strategy in complex attack and defense confrontation situation is proposed in this paper to solve the coordinated penetration decision-making problem of endo-atmospheric gliding simultaneou... A group optimal penetration strategy in complex attack and defense confrontation situation is proposed in this paper to solve the coordinated penetration decision-making problem of endo-atmospheric gliding simultaneous multi-missile penetration of interceptors.First,the problem of large search space of multi-missile coordinated penetration maneuvers is fully considered,and the flight corridor of multi-missile coordinated penetration is designed to constrain search space of multi-agent coordinated strategy,comprehensively considering path constraints and anticollision constraints of gliding multi-missile flight.Then,a multi-missile hierarchical coordinated decision-making mechanism based on confrontation situation is proposed,and the swarm penetration strategy is optimized with the goal of maximizing swarm penetration effectiveness.The upper layer plans the swarm penetration formation according to confrontation situation,and generates the swarm coordinated penetration trajectory based on Multi-Agent Deep Deterministic Policy Gradient(MADDPG)method.The lower layer interpolates and smooths penetration trajectory,and generates the penetration guidance command based on Soft Actor-Critic and Extended Proportional Guidance(SAC-EPG)method.Simulation results verify that the proposed multi-missile cooperative penetration method based on hierarchical reinforcement learning converges faster than the penetration method based on MADDPG,and can quickly learn multi-missile cooperative penetration skills.In addition,multi-missile coordination can give full play to the group's detection and maneuverability,and occupy favorable penetration time and space through coordinated ballistic maneuvers.Thus the success rate of group penetration can be improved. 展开更多
关键词 Hypersonic glide vehicle Multi-missile coordinated penetration Penetration corridor design Cooperative guidance Multi-agent hierarchical reinforcement learning hierarchical decisionmaking architecture
原文传递
Identification of Visibility Level for Enhanced Road Safety under Different Visibility Conditions:A Hierarchical Clustering-Based Learning Model
4
作者 Asmat Ullah Yar Muhammad +4 位作者 Bakht Zada Korhan Cengiz Nikola Ivkovic Mario Konecki Abid Yahya 《Computers, Materials & Continua》 2025年第11期3767-3786,共20页
Low visibility conditions,particularly those caused by fog,significantly affect road safety and reduce drivers’ability to see ahead clearly.The conventional approaches used to address this problem primarily rely on i... Low visibility conditions,particularly those caused by fog,significantly affect road safety and reduce drivers’ability to see ahead clearly.The conventional approaches used to address this problem primarily rely on instrument-based and fixed-threshold-based theoretical frameworks,which face challenges in adaptability and demonstrate lower performance under varying environmental conditions.To overcome these challenges,we propose a real-time visibility estimation model that leverages roadside CCTV cameras to monitor and identify visibility levels under different weather conditions.The proposedmethod begins by identifying specific regions of interest(ROI)in the CCTVimages and focuses on extracting specific features such as the number of lines and contours detected within these regions.These features are then provided as an input to the proposed hierarchical clusteringmodel,which classifies them into different visibility levels without the need for predefined rules and threshold values.In the proposed approach,we used two different distance similaritymetrics,namely dynamic time warping(DTW)and Euclidean distance,alongside the proposed hierarchical clustering model and noted its performance in terms of numerous evaluation measures.The proposed model achieved an average accuracy of 97.81%,precision of 91.31%,recall of 91.25%,and F1-score of 91.27% using theDTWdistancemetric.We also conducted experiments for other deep learning(DL)-based models used in the literature and compared their performances with the proposed model.The experimental results demonstrate that the proposedmodel ismore adaptable and consistent compared to themethods used in the literature.The proposedmethod provides drivers real-time and accurate visibility information and enhances road safety during low visibility conditions. 展开更多
关键词 CCTV images road safety and security visibility level estimation hierarchical clustering learning feature extraction safe and secure transportation
在线阅读 下载PDF
Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification 被引量:1
5
作者 Chenghao Liu Fei Zhu +1 位作者 Quan Liu Yuchen Fu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第10期1686-1696,共11页
In reinforcement learning an agent may explore ineffectively when dealing with sparse reward tasks where finding a reward point is difficult.To solve the problem,we propose an algorithm called hierarchical deep reinfo... In reinforcement learning an agent may explore ineffectively when dealing with sparse reward tasks where finding a reward point is difficult.To solve the problem,we propose an algorithm called hierarchical deep reinforcement learning with automatic sub-goal identification via computer vision(HADS)which takes advantage of hierarchical reinforcement learning to alleviate the sparse reward problem and improve efficiency of exploration by utilizing a sub-goal mechanism.HADS uses a computer vision method to identify sub-goals automatically for hierarchical deep reinforcement learning.Due to the fact that not all sub-goal points are reachable,a mechanism is proposed to remove unreachable sub-goal points so as to further improve the performance of the algorithm.HADS involves contour recognition to identify sub-goals from the state image where some salient states in the state image may be recognized as sub-goals,while those that are not will be removed based on prior knowledge.Our experiments verified the effect of the algorithm. 展开更多
关键词 hierarchical control hierarchical reinforcement learning OPTION sparse reward sub-goal
在线阅读 下载PDF
Hierarchical reinforcement learning guidance with threat avoidance 被引量:1
6
作者 LI Bohao WU Yunjie LI Guofei 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2022年第5期1173-1185,共13页
The guidance strategy is an extremely critical factor in determining the striking effect of the missile operation.A novel guidance law is presented by exploiting the deep reinforcement learning(DRL)with the hierarchic... The guidance strategy is an extremely critical factor in determining the striking effect of the missile operation.A novel guidance law is presented by exploiting the deep reinforcement learning(DRL)with the hierarchical deep deterministic policy gradient(DDPG)algorithm.The reward functions are constructed to minimize the line-of-sight(LOS)angle rate and avoid the threat caused by the opposed obstacles.To attenuate the chattering of the acceleration,a hierarchical reinforcement learning structure and an improved reward function with action penalty are put forward.The simulation results validate that the missile under the proposed method can hit the target successfully and keep away from the threatened areas effectively. 展开更多
关键词 guidance law deep reinforcement learning(DRL) threat avoidance hierarchical reinforcement learning
在线阅读 下载PDF
Hierarchical Federated Learning: Architecture, Challenges, and Its Implementation in Vehicular Networks 被引量:1
7
作者 YAN Jintao CHEN Tan +3 位作者 XIE Bowen SUN Yuxuan ZHOU Sheng NIU Zhisheng 《ZTE Communications》 2023年第1期38-45,共8页
Federated learning(FL)is a distributed machine learning(ML)framework where several clients cooperatively train an ML model by exchanging the model parameters without directly sharing their local data.In FL,the limited... Federated learning(FL)is a distributed machine learning(ML)framework where several clients cooperatively train an ML model by exchanging the model parameters without directly sharing their local data.In FL,the limited number of participants for model aggregation and communication latency are two major bottlenecks.Hierarchical federated learning(HFL),with a cloud-edge-client hierarchy,can leverage the large coverage of cloud servers and the low transmission latency of edge servers.There are growing research interests in implementing FL in vehicular networks due to the requirements of timely ML training for intelligent vehicles.However,the limited number of participants in vehicular networks and vehicle mobility degrade the performance of FL training.In this context,HFL,which stands out for lower latency,wider coverage and more participants,is promising in vehicular networks.In this paper,we begin with the background and motivation of HFL and the feasibility of implementing HFL in vehicular networks.Then,the architecture of HFL is illustrated.Next,we clarify new issues in HFL and review several existing solutions.Furthermore,we introduce some typical use cases in vehicular networks as well as our initial efforts on implementing HFL in vehicular networks.Finally,we conclude with future research directions. 展开更多
关键词 hierarchical federated learning vehicular network MOBILITY convergence analysis
在线阅读 下载PDF
Hierarchical Reinforcement Learning Adversarial Algorithm Against Opponent with Fixed Offensive Strategy
8
作者 赵英策 张广浩 +1 位作者 邢正宇 李建勋 《Journal of Shanghai Jiaotong university(Science)》 EI 2024年第3期471-479,共9页
Based on option-critic algorithm,a new adversarial algorithm named deterministic policy network with option architecture is proposed to improve agent's performance against opponent with fixed offensive algorithm.A... Based on option-critic algorithm,a new adversarial algorithm named deterministic policy network with option architecture is proposed to improve agent's performance against opponent with fixed offensive algorithm.An option network is introduced in upper level design,which can generate activated signal from defensive and of-fensive strategies according to temporary situation.Then the lower level executive layer can figure out interactive action with guidance of activated signal,and the value of both activated signal and interactive action is evaluated by critic structure together.This method could release requirement of semi Markov decision process effectively and eventually simplified network structure by eliminating termination possibility layer.According to the result of experiment,it is proved that new algorithm switches strategy style between offensive and defensive ones neatly and acquires more reward from environment than classical deep deterministic policy gradient algorithm does. 展开更多
关键词 hierarchical reinforcement learning fixed offensive strategy option architecture deterministic gradi-entpolicy
原文传递
Hierarchical Federated Learning Architectures for the Metaverse
9
作者 GU Cheng LI Baochun 《ZTE Communications》 2024年第2期39-48,共10页
In the context of edge computing environments in general and the metaverse in particular,federated learning(FL)has emerged as a distributed machine learning paradigm that allows multiple users to collaborate on traini... In the context of edge computing environments in general and the metaverse in particular,federated learning(FL)has emerged as a distributed machine learning paradigm that allows multiple users to collaborate on training a shared machine learning model locally,eliminating the need for uploading raw data to a central server.It is perhaps the only training paradigm that preserves the privacy of user data,which is essential for computing environments as personal as the metaverse.However,the original FL architecture proposed is not scalable to a large number of user devices in the metaverse community.To mitigate this problem,hierarchical federated learning(HFL)has been introduced as a general distributed learning paradigm,inspiring a number of research works.In this paper,we present several types of HFL architectures,with a special focus on the three-layer client-edge-cloud HFL architecture,which is most pertinent to the metaverse due to its delay-sensitive nature.We also examine works that take advantage of the natural layered organization of three-layer client-edge-cloud HFL to tackle some of the most challenging problems in FL within the metaverse.Finally,we outline some future research directions of HFL in the metaverse. 展开更多
关键词 federated learning hierarchical federated learning metaverse
在线阅读 下载PDF
CADGen:Computer-aided design sequence construction with a guided codebook learning
10
作者 Shengdi Zhou Xiaoqiang Zan +1 位作者 Zhuqing Li Bin Zhou 《Digital Twins and Applications》 2024年第1期75-87,共13页
Computer-aided design(CAD)software continues to be a crucial tool in digital twin application and manufacturing,facilitating the design of various products.We present a novel CAD generation method,an agent that constr... Computer-aided design(CAD)software continues to be a crucial tool in digital twin application and manufacturing,facilitating the design of various products.We present a novel CAD generation method,an agent that constructs the CAD sequences containing the sketch-and-extrude modelling operations efficiently and with high quality.Starting from the sketch and extrusion operation sequences,we utilise the transformer encoder to encode them into different disentangled codebooks to represent their distribution properties while considering their correlations.Then,a combination of auto-regressive and non-autoregressive samplers is trained to sample the code for CAD sequence con-struction.Extensive experiments demonstrate that our model generates diverse and high-quality CAD models.We also show some cases of real digital twin applications and indicate that our generated model can be used as the data source for the digital twin platform,exhibiting designers'potential. 展开更多
关键词 CAD sequence construction code sample computer‐aided design digital twins hierarchical code learning
在线阅读 下载PDF
Autonomous navigation system for flapping wing aerial vehicle based on event-trigger planner
11
作者 Changhao CHEN Bifeng SONG +1 位作者 Qiang FU Jiaxing GAO 《Chinese Journal of Aeronautics》 2025年第12期282-298,共17页
Flapping Wing Aerial Vehicles(FWAVs)hold immense potential for applications such as search-and-rescue missions in complex terrains,environmental monitoring in hazardous areas,and exploration in confined spaces.However... Flapping Wing Aerial Vehicles(FWAVs)hold immense potential for applications such as search-and-rescue missions in complex terrains,environmental monitoring in hazardous areas,and exploration in confined spaces.However,their adoption is hindered by the challenges of autonomous navigation in unknown environments,exacerbated by their limited onboard computational resources and demanding flight dynamics.This work addresses these challenges by presenting a lightweight,vision-based autonomous navigation system weighing 26.0 g,enabling FWAVs to achieve obstacle-avoidance flight at a speed of 9.0 m/s.Central to this system is a novel end-toend Bi-level Cooperative Policy(BCP)that significantly improves flight efficiency and safety.BCP employs lightweight neural networks for real-time performance and leverages Hierarchical Reinforcement Learning(HRL)for robust and efficient training.Quantitative evaluations show that BCP achieves up to 6.5%shorter path lengths,11.2%faster task completion time,and improved explainability compared to state-of-the-art reinforcement learning algorithms.Additionally,BCP demonstrates 35.7%more efficient and stable training,reducing computational overhead while maintaining high performance.The system design incorporates optimized lightweight components,including a 4.0 g customized stereo camera,a 6.0 g 3D-printed camera mount,and a 16.0 g onboard computer,all tailored to FWAV applications.Real-flight experiments validate the sim-toreal transferability of the proposed navigation system,demonstrating its readiness for real-world deployment in challenging scenarios.This research advances the practicality of FWAVs,paving the way for their broader adoption in critical missions where compact,agile aerial robots are indispensable. 展开更多
关键词 Autonomous navigation FWAV hierarchical reinforcement learning STEREO Unknown environments
原文传递
Latent Landmark Graph for Efficient Explorationexploitation Balance in Hierarchical Reinforcement Learning
12
作者 Qingyang Zhang Hongming Zhang +1 位作者 Dengpeng Xing Bo Xu 《Machine Intelligence Research》 2025年第2期267-288,共22页
Goal-conditioned hierarchical reinforcement learning(GCHRL)decomposes the desired goal into subgoals and conducts exploration and exploitation in the subgoal space.Its effectiveness heavily relies on subgoal represent... Goal-conditioned hierarchical reinforcement learning(GCHRL)decomposes the desired goal into subgoals and conducts exploration and exploitation in the subgoal space.Its effectiveness heavily relies on subgoal representation and selection.However,existing works do not consider distinct information across hierarchical time scales when learning subgoal representations and lack a subgoal selection strategy that balances exploration and exploitation.In this paper,we propose a novel method for efficient exploration-exploitation balance in HIerarchical reinforcement learning by dynamically constructing Latent Landmark graphs(HILL).HILL transforms the reward maximization problem of GCHRL into the shortest path planning on graphs.To effectively consider the hierarchical time-scale information,HILL adopts a contrastive representation learning objective to learn informative latent representations.Based on these representations,HILL dynamically constructs latent landmark graphs and selects subgoals using two measures to balance exploration and exploitation.We implement two variants:HILL-hf generates graphs periodically,while HILL-lf generates graphs adaptively.Empirical results on continuous control tasks with sparse rewards demonstrate that both variants outperform state-of-the-art baselines in sample efficiency and asymptotic performance,with HILL-lf further reducing training time by 40%compared to HILL-hf. 展开更多
关键词 hierarchical reinforcement learning representation learning latent landmark graph contrastive learning exploration and exploitation.
原文传递
Analysis of deep learning under adversarial attacks in hierarchical federated learning
13
作者 Duaa S.Alqattan Vaclav Snasel +1 位作者 Rajiv Ranjan Varun Ojha 《High-Confidence Computing》 2025年第4期119-141,共23页
Hierarchical Federated Learning(HFL)extends traditional Federated Learning(FL)by introducing multi-level aggregation in which model updates pass through clients,edge servers,and a global server.While this hierarchical... Hierarchical Federated Learning(HFL)extends traditional Federated Learning(FL)by introducing multi-level aggregation in which model updates pass through clients,edge servers,and a global server.While this hierarchical structure enhances scalability,it also increases vulnerability to adversarial attacks—such as data poisoning and model poisoning—that disrupt learning by introducing discrepancies at the edge server level.These discrepancies propagate through aggregation,affecting model consistency and overall integrity.Existing studies on adversarial behaviour in FL primarily rely on single-metric approaches—such as cosine similarity or Euclidean distance—to assess model discrepancies and filter out anomalous updates.However,these methods fail to capture the diverse ways adversarial attacks influence model updates,particularly in highly heterogeneous data environments and hierarchical structures.Attackers can exploit the limitations of single-metric defences by crafting updates that seem benign under one metric while remaining anomalous under another.Moreover,prior studies have not systematically analysed how model discrepancies evolve over time,vary across regions,or affect clustering structures in HFL architectures.To address these limitations,we propose the Model Discrepancy Score(MDS),a multi-metric framework that integrates Dissimilarity,Distance,Uncorrelation,and Divergence to provide a comprehensive analysis of how adversarial activity affects model discrepancies.Through temporal,spatial,and clustering analyses,we examine how attacks affect model discrepancies at the edge server level in 3LHFL and 4LHFL architectures and evaluate MDS’s ability to distinguish between benign and malicious servers.Our results show that while 4LHFL effectively mitigates discrepancies in regional attack scenarios,it struggles with distributed attacks due to additional aggregation layers that obscure distinguishable discrepancy patterns over time,across regions,and within clustering structures.Factors influencing detection include data heterogeneity,attack sophistication,and hierarchical aggregation depth.These findings highlight the limitations of single-metric approaches and emphasize the need for multi-metric strategies such as MDS to enhance HFL security. 展开更多
关键词 hierarchical federated learning Model discrepancy Targeted label flipping Untargeted label flipping Client-side sign flipping Server-side sign flipping
在线阅读 下载PDF
A trajectory planning and tracking method based on deep hierarchical reinforcement learning
14
作者 Jiajie Zhang Bao-Lin Ye +2 位作者 Xin Wang Lingxi Li Bo Song 《Journal of Intelligent and Connected Vehicles》 2025年第2期20-28,共9页
To improve the driving efficiency of unmanned vehicles in a complex urban traffic flow environment and the safety and passenger comfort of vehicles when changing lanes,we propose a hierarchical reinforcement learning(... To improve the driving efficiency of unmanned vehicles in a complex urban traffic flow environment and the safety and passenger comfort of vehicles when changing lanes,we propose a hierarchical reinforcement learning(HRL)-based vehicle trajectory planning and tracking method.First,we present a hierarchical control framework for vehicle trajectory tracking that is based on deep reinforcement learning(DRL)and model predictive control(MPC).We design an upper-level decision model based on the trust region policy optimization algorithm integrated with long short-term memory to obtain more accurate strategies.Second,to improve stability and passenger comfort,we constructed a lower controller that combines the Bezier curve fitting method and an MPC controller.Finally,the proposed method was simulated via the car learning to act(CARLA)simulator,which is based on an unreal engine.Random urban traffic-flow test scenarios were used to simulate a real urban road-traffic environment.The simulation results illustrate that the proposed method can complete the vehicle trajectory planning and tracking task well.Compared with the existing RL methods,our proposed method has the lowest collision rate of 1.5%and achieves an average speed improvement of 7.04%.Moreover,our proposed method has better comfort performance and lower fuel consumption during the driving process. 展开更多
关键词 deep reinforcement learning(DRL) trust region policy optimization(TRPO) hierarchical reinforcement learning(HRL) model predictive control(MPC) trajectory tracking
在线阅读 下载PDF
Hierarchical federated transfer learning in digital twin-based vehicular networks
15
作者 Qasim Zia Saide Zhu +2 位作者 Haoxin Wang Zafar Iqbal Yingshu Li 《High-Confidence Computing》 2025年第4期8-18,共11页
In recent research on the Digital Twin-based Vehicular Ad hoc Network(DT-VANET),Federated Learning(FL)has shown its ability to provide data privacy.However,Federated learning struggles to adequately train a global mod... In recent research on the Digital Twin-based Vehicular Ad hoc Network(DT-VANET),Federated Learning(FL)has shown its ability to provide data privacy.However,Federated learning struggles to adequately train a global model when confronted with data heterogeneity and data sparsity among vehicles,which ensure suboptimal accuracy in making precise predictions for different vehicle types.To address these challenges,this paper combines Federated Transfer Learning(FTL)to conduct vehicle clustering related to types of vehicles and proposes a novel Hierarchical Federated Transfer Learning(HFTL).We construct a framework for DT-VANET,along with two algorithms designed for cloud server model updates and intra-cluster federated transfer learning,to improve the accuracy of the global model.In addition,we developed a data quality score-based mechanism to prevent the global model from being affected by malicious vehicles.Lastly,detailed experiments on real-world datasets are conducted,considering different performance metrics that verify the effectiveness and efficiency of our algorithm. 展开更多
关键词 Vehicular ad-hoc network hierarchical federated transfer learning Vehicular digital twin Autonomous vehicle Digital twin-based vehicular networks
在线阅读 下载PDF
Autonomous Overtaking for Intelligent Vehicles Considering Social Preference Based on Hierarchical Reinforcement Learning 被引量:6
16
作者 Hongliang Lu Chao Lu +2 位作者 Yang Yu Guangming Xiong Jianwei Gong 《Automotive Innovation》 EI CSCD 2022年第2期195-208,共14页
As intelligent vehicles usually have complex overtaking process,a safe and efficient automated overtaking system(AOS)is vital to avoid accidents caused by wrong operation of drivers.Existing AOSs rarely consider longi... As intelligent vehicles usually have complex overtaking process,a safe and efficient automated overtaking system(AOS)is vital to avoid accidents caused by wrong operation of drivers.Existing AOSs rarely consider longitudinal reactions of the overtaken vehicle(OV)during overtaking.This paper proposed a novel AOS based on hierarchical reinforcement learning,where the longitudinal reaction is given by a data-driven social preference estimation.This AOS incorporates two modules that can function in different overtaking phases.The first module based on semi-Markov decision process and motion primitives is built for motion planning and control.The second module based on Markov decision process is designed to enable vehicles to make proper decisions according to the social preference of OV.Based on realistic overtaking data,the proposed AOS and its modules are verified experimentally.The results of the tests show that the proposed AOS can realize safe and effective overtaking in scenes built by realistic data,and has the ability to flexibly adjust lateral driving behavior and lane changing position when the OVs have different social preferences. 展开更多
关键词 Automated overtaking system Semi-Markov decision process hierarchical reinforcement learning Social preference
原文传递
Autonomic discovery of subgoals in hierarchical reinforcement learning 被引量:1
17
作者 XIAO Ding LI Yi-tong SHI Chuan 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2014年第5期94-104,共11页
Option is a promising method to discover the hierarchical structure in reinforcement learning (RL) for learning acceleration. The key to option discovery is about how an agent can find useful subgoals autonomically ... Option is a promising method to discover the hierarchical structure in reinforcement learning (RL) for learning acceleration. The key to option discovery is about how an agent can find useful subgoals autonomically among the passing trails. By analyzing the agent's actions in the trails, useful heuristics can be found. Not only does the agent pass subgoals more frequently, but also its effective actions are restricted in subgoals. As a consequence, the subgoals can be deemed as the most matching action-restricted states in the paths. In the grid-world environment, the concept of the unique-direction value reflecting the action-restricted property was introduced to find the most matching action-restricted states. The unique-direction-value (UDV) approach is chosen to form options offline and online autonomically. Experiments show that the approach can find subgoals correctly. Thus the Q-learning with options found on both offline and online process can accelerate learning significantly. 展开更多
关键词 hierarchical reinforcement learning OPTION Q-learning SUBGOAL UDV
原文传递
Two-level hierarchical feature learning for image classification 被引量:4
18
作者 Guang-hui SONG Xiao-gang JIN +1 位作者 Gen-lang CHEN Yan NIE 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2016年第9期897-906,共10页
In some image classification tasks, similarities among different categories are different and the samples are usually misclassified as highly similar categories. To distinguish highly similar categories, more specific... In some image classification tasks, similarities among different categories are different and the samples are usually misclassified as highly similar categories. To distinguish highly similar categories, more specific features are required so that the classifier can improve the classification performance. In this paper, we propose a novel two-level hierarchical feature learning framework based on the deep convolutional neural network(CNN), which is simple and effective. First, the deep feature extractors of different levels are trained using the transfer learning method that fine-tunes the pre-trained deep CNN model toward the new target dataset. Second, the general feature extracted from all the categories and the specific feature extracted from highly similar categories are fused into a feature vector. Then the final feature representation is fed into a linear classifier. Finally, experiments using the Caltech-256, Oxford Flower-102, and Tasmania Coral Point Count(CPC) datasets demonstrate that the expression ability of the deep features resulting from two-level hierarchical feature learning is powerful. Our proposed method effectively increases the classification accuracy in comparison with flat multiple classification methods. 展开更多
关键词 Transfer learning Feature learning Deep convolutional neural network hierarchical classification Spectral clustering
原文传递
Method for the classification of tea diseases via weighted sampling and hierarchical classification learning
19
作者 Rujia Li Weibo Qin +5 位作者 Yiting He Yadong Li Rongbiao Ji Yehui Wu Jiaojiao Chen Jianping Yang 《International Journal of Agricultural and Biological Engineering》 SCIE 2024年第3期211-221,共11页
This study proposed a weighted sampling hierarchical classification learning method based on an efficient backbone network model to address the problems of high costs,low accuracy,and time-consuming traditional tea di... This study proposed a weighted sampling hierarchical classification learning method based on an efficient backbone network model to address the problems of high costs,low accuracy,and time-consuming traditional tea disease recognition methods.This method enhances the feature extraction ability by conducting hierarchical classification learning based on the EfficientNet model,effectively alleviating the impact of high similarity between tea diseases on the model’s classification performance.To better solve the problem of few and unevenly distributed tea disease samples,this study introduced a weighted sampling scheme to optimize data processing,which not only alleviates the overfitting effect caused by too few sample data but also balances the probability of extracting imbalanced classification data.The experimental results show that the proposed method was significant in identifying both healthy tea leaves and four common leaf diseases of tea(tea algal spot disease,tea white spot disease,tea anthracnose disease,and tea leaf blight disease).After applying the“weighted sampling hierarchical classification learning method”to train 7 different efficient backbone networks,most of their accuracies have improved.The EfficientNet-B1 model proposed in this study achieved an accuracy rate of 99.21%after adopting this learning method,which is higher than EfficientNet-b2(98.82%)and MobileNet-V3(98.43%).In addition,to better apply the results of identifying tea diseases,this study developed a mini-program that operates on WeChat.Users can quickly obtain accurate identification results and corresponding disease descriptions and prevention methods through simple operations.This intelligent tool for identifying tea diseases can serve as an auxiliary tool for farmers,consumers,and related scientific researchers and has certain practical value. 展开更多
关键词 tea diseases hierarchical classification learning weighted sampling classification method EfficientNet miniprogram
原文传递
Graph Pointer Network Based Hierarchical Curriculum Reinforcement Learning Method Solving Shuttle Tankers Scheduling Problem
20
作者 Xiaoyong Gao Yixu Yang +4 位作者 Diao Peng Shanghe Li Chaodong Tan Feifei Li Tao Chen 《Complex System Modeling and Simulation》 2024年第4期339-352,共14页
Shuttle tankers scheduling is an important task in offshore oil and gas transportation process,which involves operating time window fulfillment,optimal transportation planning,and proper inventory management.However,c... Shuttle tankers scheduling is an important task in offshore oil and gas transportation process,which involves operating time window fulfillment,optimal transportation planning,and proper inventory management.However,conventional approaches like Mixed lnteger Linear Programming(MlLP)or meta heuristic algorithms often fail in long running time.In this paper,a Graph Pointer Network(GPN)based Hierarchical Curriculum Reinforcement Learning(HCRl)method is proposed to solve Shuttle Tankers Scheduling Problem(STSP)The model is trained to divide STSP into voyage and operation stages and generate routing and inventory management decisions sequentially.An asynchronous training strategy is developed to address the coupling between stages.Comparison experiments demonstrate that the proposed HCRL method achieves 12%shortel tour lengths on average compared to heuristic algorithms.Additional experiments validate its generalizability to unseen instances and scalability to larger instances. 展开更多
关键词 graph pointer network hierarchical reinforcement learning curriculum learning shuttle tanker scheduling problem
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部