期刊文献+
共找到11,815篇文章
< 1 2 250 >
每页显示 20 50 100
Unraveling nonlinear relationship of built environment on pre-sale and second-hand housing prices using multi-source big data and machine learning
1
作者 Qian Zeng Hao Wu +3 位作者 Luyao Zhou Xue Gao Ningyuan Fei Bart Julien Dewancker 《Frontiers of Architectural Research》 2025年第6期1636-1653,共18页
Pre-sale and second-hand housing transaction modes dominate China’s real estate market. However, many existing studies tend to treat the housing market as a homogeneous entity, overlooking the heterogeneity in core i... Pre-sale and second-hand housing transaction modes dominate China’s real estate market. However, many existing studies tend to treat the housing market as a homogeneous entity, overlooking the heterogeneity in core influencing factors across different transaction types. Thoroughly understanding the factors affecting various housing types can assist policymakers in formulating differentiated regulatory decisions through environmental intervention. Therefore, this study utilized multi-source big data and compared the performance of multiple machine learning models to evaluate the relative importance and nonlinear effects of buildinglevel, neighborhood-level, and street-level built environment factors on pre-sale and secondhand housing prices. The empirical study of Chengdu, China revealed that distance to city center was the most significant explanatory factor influencing pre-sale and second-hand housing prices among all factors. Significant differences existed between neighborhood-level and street-level built environment factors’ nonlinear and threshold effects on pre-sale and second-hand housing prices. Notably, subway accessibility showed a U-shaped impact on pre-sale housing prices. To the best of our knowledge, our study is one of the early studies systematically investigating the influencing differences between pre-sale housing prices and second-hand housing prices, providing robust evidence for regulating housing prices through environmental interventions and offering critical references for policymakers and market participants. 展开更多
关键词 Real estate market Machine learning Random forest Semantic segmentation Street view image
原文传递
The Educational Implications of Wang Yangming’s Ecological View of ‘Benevolence as the Unity of Heaven and Earth’—Based upon ‘Inquiry on the Great Learning’
2
作者 Yinuo Zhou Tengteng Zhuang 《教育技术与创新》 2025年第3期67-73,共7页
The relationship between heaven and humanity is one of the fundamental philosophical foundations of ecological ethics in ancient Chinese Confucian thought.As a master of Confucian philosophy of mind,Wang Yangming inte... The relationship between heaven and humanity is one of the fundamental philosophical foundations of ecological ethics in ancient Chinese Confucian thought.As a master of Confucian philosophy of mind,Wang Yangming integrated the traditional Confucian discourse on the relationship between heaven and humanity into the principles of the philosophy of mind.Building on the traditional doctrine of benevolence centered on moral concern,he further developed an ecological view of‘benevolence as the unity of heaven and earth’.In his work Inquiry on the Great Learning,Wang Yangming systematically elaborated on this notion,emphasizing the philosophical expression of the relationship between humans and nature within an ethical framework and outlining the new implications of traditional Confucian ecological thought.This paper aims to analyze Wang Yangming’s ecological view of‘benevolence as the unity of heaven and earth’by examining the ecological ideas in his Inquiry on the Great Learning.On this basis,it seeks to refine the valuable achievements of traditional Chinese ecological civilization thought and strengthen the theoretical foundation of contemporary ecological ideas with Chinese characteristics. 展开更多
关键词 Wang Yangming benevolence as the unity of heaven and earth ecological view educational implications Inquiry on the Great learning
在线阅读 下载PDF
Towards Collaborative Robotics in Top View Surveillance:A Framework for Multiple Object Tracking by Detection Using Deep Learning 被引量:9
3
作者 Imran Ahmed Sadia Din +2 位作者 Gwanggil Jeon Francesco Piccialli Giancarlo Fortino 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第7期1253-1270,共18页
Collaborative Robotics is one of the high-interest research topics in the area of academia and industry.It has been progressively utilized in numerous applications,particularly in intelligent surveillance systems.It a... Collaborative Robotics is one of the high-interest research topics in the area of academia and industry.It has been progressively utilized in numerous applications,particularly in intelligent surveillance systems.It allows the deployment of smart cameras or optical sensors with computer vision techniques,which may serve in several object detection and tracking tasks.These tasks have been considered challenging and high-level perceptual problems,frequently dominated by relative information about the environment,where main concerns such as occlusion,illumination,background,object deformation,and object class variations are commonplace.In order to show the importance of top view surveillance,a collaborative robotics framework has been presented.It can assist in the detection and tracking of multiple objects in top view surveillance.The framework consists of a smart robotic camera embedded with the visual processing unit.The existing pre-trained deep learning models named SSD and YOLO has been adopted for object detection and localization.The detection models are further combined with different tracking algorithms,including GOTURN,MEDIANFLOW,TLD,KCF,MIL,and BOOSTING.These algorithms,along with detection models,help to track and predict the trajectories of detected objects.The pre-trained models are employed;therefore,the generalization performance is also investigated through testing the models on various sequences of top view data set.The detection models achieved maximum True Detection Rate 93%to 90%with a maximum 0.6%False Detection Rate.The tracking results of different algorithms are nearly identical,with tracking accuracy ranging from 90%to 94%.Furthermore,a discussion has been carried out on output results along with future guidelines. 展开更多
关键词 Collaborative robotics deep learning object detection and tracking top view video surveillance
在线阅读 下载PDF
Distributed Byzantine-Resilient Learning of Multi-UAV Systems via Filter-Based Centerpoint Aggregation Rules
4
作者 Yukang Cui Linzhen Cheng +1 位作者 Michael Basin Zongze Wu 《IEEE/CAA Journal of Automatica Sinica》 2025年第5期1056-1058,共3页
Dear Editor,Through distributed machine learning,multi-UAV systems can achieve global optimization goals without a centralized server,such as optimal target tracking,by leveraging local calculation and communication w... Dear Editor,Through distributed machine learning,multi-UAV systems can achieve global optimization goals without a centralized server,such as optimal target tracking,by leveraging local calculation and communication with neighbors.In this work,we implement the stochastic gradient descent algorithm(SGD)distributedly to optimize tracking errors based on local state and aggregation of the neighbors'estimation.However,Byzantine agents can mislead neighbors,causing deviations from optimal tracking.We prove that the swarm achieves resilient convergence if aggregated results lie within the normal neighbors'convex hull,which can be guaranteed by the introduced centerpoint-based aggregation rule.In the given simulated scenarios,distributed learning using average,geometric median(GM),and coordinate-wise median(CM)based aggregation rules fail to track the target.Compared to solely using the centerpoint aggregation method,our approach,which combines a pre-filter with the centroid aggregation rule,significantly enhances resilience against Byzantine attacks,achieving faster convergence and smaller tracking errors. 展开更多
关键词 global optimization goals multi UAV systems filter based centerpoint aggregation distributed learning optimal target trackingby stochastic gradient descent algorithm sgd distributedly optimize tracking distributed machine learningmulti uav
在线阅读 下载PDF
Strengthening human papillomavirus vaccination programs through multi-country peer learning:lessons from the CHIC initiative
5
作者 Christopher Morgan Mary Carol Jennings +8 位作者 Dur-e-Nayab Waheed Nicolas Theopold Anissa Sidibe Ana Bolio Elaine Charurat Felix Ricardo Burdier Emilie Karafillakis Shana Kagan Alex Vorsters 《Cancer Biology & Medicine》 2025年第9期997-1001,共5页
Introduction Human papillomavirus(HPV)vaccination is a cornerstone of cervical cancer prevention,particularly in low-and middle-income countries(LMICs),where the burden of disease remains high~1.The World Health Organ... Introduction Human papillomavirus(HPV)vaccination is a cornerstone of cervical cancer prevention,particularly in low-and middle-income countries(LMICs),where the burden of disease remains high~1.The World Health Organization(WHO)HPV Vaccine Introduction Clearing House reported that 147 countries(of 194 reporting)had fully introduced the HPV vaccine into their national schedules as of 20242.After COVID-19 pandemic disruptions,global coverage is again increasing. 展开更多
关键词 WHO HPV vaccine introduction clearing house multi country peer learning cervical cancer prevention CHIC initiative global coverage human papillomavirus vaccination human papillomavirus hpv vaccination low middle income countries
暂未订购
AI-Powered Threat Detection in Online Communities: A Multi-Modal Deep Learning Approach
6
作者 Ravi Teja Potla 《Journal of Computer and Communications》 2025年第2期155-171,共17页
The fast increase of online communities has brought about an increase in cyber threats inclusive of cyberbullying, hate speech, misinformation, and online harassment, making content moderation a pressing necessity. Tr... The fast increase of online communities has brought about an increase in cyber threats inclusive of cyberbullying, hate speech, misinformation, and online harassment, making content moderation a pressing necessity. Traditional single-modal AI-based detection systems, which analyze both text, photos, or movies in isolation, have established useless at taking pictures multi-modal threats, in which malicious actors spread dangerous content throughout a couple of formats. To cope with these demanding situations, we advise a multi-modal deep mastering framework that integrates Natural Language Processing (NLP), Convolutional Neural Networks (CNNs), and Long Short-Term Memory (LSTM) networks to become aware of and mitigate online threats effectively. Our proposed model combines BERT for text class, ResNet50 for photograph processing, and a hybrid LSTM-3-d CNN community for video content material analysis. We constructed a large-scale dataset comprising 500,000 textual posts, 200,000 offensive images, and 50,000 annotated motion pictures from more than one platform, which includes Twitter, Reddit, YouTube, and online gaming forums. The system became carefully evaluated using trendy gadget mastering metrics which include accuracy, precision, remember, F1-score, and ROC-AUC curves. Experimental outcomes demonstrate that our multi-modal method extensively outperforms single-modal AI classifiers, achieving an accuracy of 92.3%, precision of 91.2%, do not forget of 90.1%, and an AUC rating of 0.95. The findings validate the necessity of integrating multi-modal AI for actual-time, high-accuracy online chance detection and moderation. Future paintings will have consciousness on improving hostile robustness, enhancing scalability for real-world deployment, and addressing ethical worries associated with AI-driven content moderation. 展开更多
关键词 multi-Model AI Deep learning Natural Language Processing (NLP) Explainable AI (XI) Federated learning Cyber Threat Detection LSTM CNNS
在线阅读 下载PDF
Adaptive topology learning of camera network across non-overlapping views 被引量:1
7
作者 杨彪 林国余 张为公 《Journal of Southeast University(English Edition)》 EI CAS 2015年第1期61-66,共6页
An adaptive topology learning approach is proposed to learn the topology of a practical camera network in an unsupervised way. The nodes are modeled by the Gaussian mixture model. The connectivity between nodes is jud... An adaptive topology learning approach is proposed to learn the topology of a practical camera network in an unsupervised way. The nodes are modeled by the Gaussian mixture model. The connectivity between nodes is judged by their cross-correlation function, which is also used to calculate their transition time distribution. The mutual information of the connected node pair is employed for transition probability calculation. A false link eliminating approach is proposed, along with a topology updating strategy to improve the learned topology. A real monitoring system with five disjoint cameras is built for experiments. Comparative results with traditional methods show that the proposed method is more accurate in topology learning and is more robust to environmental changes. 展开更多
关键词 non-overlapping views mutual information Gaussian mixture model adaptive topology learning cross-correlation function
在线阅读 下载PDF
基于Q-Learning反馈机制的短距离无线通信网络多信道调度方法
8
作者 李忠 严莉 《计算机与网络》 2025年第5期470-479,共10页
由于传统信道调度方法受传统固定规则影响,导致出现信道资源利用率低下、数据通信不稳定等问题。为解决这一问题,提出基于Q-Learning反馈机制的短距离无线通信网络多信道调度方法。深入核心网系统架构与无线接入网系统架构的拓扑架构与... 由于传统信道调度方法受传统固定规则影响,导致出现信道资源利用率低下、数据通信不稳定等问题。为解决这一问题,提出基于Q-Learning反馈机制的短距离无线通信网络多信道调度方法。深入核心网系统架构与无线接入网系统架构的拓扑架构与底层逻辑,分析短距离无线通信网络架构;基于Dijkstra算法,结合短距离无线通信网络通信节点无向图进行网络信道节点优化部署;计算多信道状态特征参数,构建信道状态预估模型,预估短距离无线通信网络多信道状态;创新性地基于Q-Learning反馈机制,利用Q-Learning算法的强化学习能力,将强化学习过程视为马尔可夫决策过程,实现短距离无线通信网络多信道调度。实验结果表明:利用设计方法获取的平均丢包率最大值为0.03、网络吞吐量最大值为4.5 Mb/s,能够在维持较低丢包率的同时,保持较高的吞吐量,具有较高的信道资源利用效率。在低流量负载区,通信延迟均低于0.4 s、在高流量负载区通信延迟最高为0.4 s,最低值为0.26 s,可以有效实现通信数据高效、稳定传输。 展开更多
关键词 Q-learning反馈机制 短距离 无线通信网络 多信道调度 信道状态 马尔可夫决策
在线阅读 下载PDF
Multi-tasking to Address Diversity in Language Learning
9
作者 雷琨 《海外英语》 2014年第21期98-99,103,共3页
With focus now placed on the learner, more attention is given to his learning style, multiple intelligence and developing learning strategies to enable him to make sense of and use of the target language appropriately... With focus now placed on the learner, more attention is given to his learning style, multiple intelligence and developing learning strategies to enable him to make sense of and use of the target language appropriately in varied contexts and with different uses of the language. To attain this, the teacher is tasked with designing, monitoring and processing language learning activities for students to carry out and in the process learn by doing and reflecting on the learning process they went through as they interacted socially with each other. This paper describes a task named"The Fishbowl Technique"and found to be effective in large ESL classes in the secondary level in the Philippines. 展开更多
关键词 multi-tasking DIVERSITY learning STYLE the fishbow
在线阅读 下载PDF
Length matters:Scalable fast encrypted internet traffic service classification based on multiple protocol data unit length sequence with composite deep learning 被引量:4
10
作者 Zihan Chen Guang Cheng +3 位作者 Ziheng Xu Shuyi Guo Yuyang Zhou Yuyu Zhao 《Digital Communications and Networks》 SCIE CSCD 2022年第3期289-302,共14页
As an essential function of encrypted Internet traffic analysis,encrypted traffic service classification can support both coarse-grained network service traffic management and security supervision.However,the traditio... As an essential function of encrypted Internet traffic analysis,encrypted traffic service classification can support both coarse-grained network service traffic management and security supervision.However,the traditional plaintext-based Deep Packet Inspection(DPI)method cannot be applied to such a classification.Moreover,machine learning-based existing methods encounter two problems during feature selection:complex feature overcost processing and Transport Layer Security(TLS)version discrepancy.In this paper,we consider differences between encryption network protocol stacks and propose a composite deep learning-based method in multiprotocol environments using a sliding multiple Protocol Data Unit(multiPDU)length sequence as features by fully utilizing the Markov property in a multiPDU length sequence and maintaining suitability with a TLS-1.3 environment.Control experiments show that both Length-Sensitive(LS)composite deep learning model using a capsule neural network and LS-long short time memory achieve satisfactory effectiveness in F1-score and performance.Owing to faster feature extraction,our method is suitable for actual network environments and superior to state-of-the-art methods. 展开更多
关键词 Encrypted internet traffic Encrypted traffic service classification multi PDU length sequence Length sensitive composite deep learning TLS-1.3
在线阅读 下载PDF
Rapid visual screening of soft-story buildings from street view images using deep learning classification 被引量:2
11
作者 Qian Yu Chaofeng Wang +4 位作者 Frank McKenna Stella XYu Ertugrul Taciroglu Barbaros Cetiner Kincho HLaw 《Earthquake Engineering and Engineering Vibration》 SCIE EI CSCD 2020年第4期827-838,共12页
Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt ver... Rapid and accurate identification of potential structural deficiencies is a crucial task in evaluating seismic vulnerability of large building inventories in a region. In the case of multi-story structures, abrupt vertical variations of story stiffness are known to significantly increase the likelihood of collapse during moderate or severe earthquakes. Identifying and retrofitting buildings with such irregularities—generally termed as soft-story buildings—is, therefore, vital in earthquake preparedness and loss mitigation efforts. Soft-story building identification through conventional means is a labor-intensive and time-consuming process. In this study, an automated procedure was devised based on deep learning techniques for identifying soft-story buildings from street-view images at a regional scale. A database containing a large number of building images and a semi-automated image labeling approach that effectively annotates new database entries was developed for developing the deep learning model. Extensive computational experiments were carried out to examine the effectiveness of the proposed procedure, and to gain insights into automated soft-story building identification. 展开更多
关键词 soft-story building deep learning CNN rapid visual screening street view image
在线阅读 下载PDF
Exploring Deep Reinforcement Learning with Multi Q-Learning 被引量:27
12
作者 Ethan Duryea Michael Ganger Wei Hu 《Intelligent Control and Automation》 2016年第4期129-144,共16页
Q-learning is a popular temporal-difference reinforcement learning algorithm which often explicitly stores state values using lookup tables. This implementation has been proven to converge to the optimal solution, but... Q-learning is a popular temporal-difference reinforcement learning algorithm which often explicitly stores state values using lookup tables. This implementation has been proven to converge to the optimal solution, but it is often beneficial to use a function-approximation system, such as deep neural networks, to estimate state values. It has been previously observed that Q-learning can be unstable when using value function approximation or when operating in a stochastic environment. This instability can adversely affect the algorithm’s ability to maximize its returns. In this paper, we present a new algorithm called Multi Q-learning to attempt to overcome the instability seen in Q-learning. We test our algorithm on a 4 × 4 grid-world with different stochastic reward functions using various deep neural networks and convolutional networks. Our results show that in most cases, Multi Q-learning outperforms Q-learning, achieving average returns up to 2.5 times higher than Q-learning and having a standard deviation of state values as low as 0.58. 展开更多
关键词 Reinforcement learning Deep learning multi Q-learning
在线阅读 下载PDF
A Robust Approach for Multi Classification-Based Intrusion Detection through Stacking Deep Learning Models 被引量:1
13
作者 Samia Allaoua Chelloug 《Computers, Materials & Continua》 SCIE EI 2024年第6期4845-4861,共17页
Intrusion detection is a predominant task that monitors and protects the network infrastructure.Therefore,many datasets have been published and investigated by researchers to analyze and understand the problem of intr... Intrusion detection is a predominant task that monitors and protects the network infrastructure.Therefore,many datasets have been published and investigated by researchers to analyze and understand the problem of intrusion prediction and detection.In particular,the Network Security Laboratory-Knowledge Discovery in Databases(NSL-KDD)is an extensively used benchmark dataset for evaluating intrusion detection systems(IDSs)as it incorporates various network traffic attacks.It is worth mentioning that a large number of studies have tackled the problem of intrusion detection using machine learning models,but the performance of these models often decreases when evaluated on new attacks.This has led to the utilization of deep learning techniques,which have showcased significant potential for processing large datasets and therefore improving detection accuracy.For that reason,this paper focuses on the role of stacking deep learning models,including convolution neural network(CNN)and deep neural network(DNN)for improving the intrusion detection rate of the NSL-KDD dataset.Each base model is trained on the NSL-KDD dataset to extract significant features.Once the base models have been trained,the stacking process proceeds to the second stage,where a simple meta-model has been trained on the predictions generated from the proposed base models.The combination of the predictions allows the meta-model to distinguish different classes of attacks and increase the detection rate.Our experimental evaluations using the NSL-KDD dataset have shown the efficacy of stacking deep learning models for intrusion detection.The performance of the ensemble of base models,combined with the meta-model,exceeds the performance of individual models.Our stacking model has attained an accuracy of 99%and an average F1-score of 93%for the multi-classification scenario.Besides,the training time of the proposed ensemble model is lower than the training time of benchmark techniques,demonstrating its efficiency and robustness. 展开更多
关键词 Intrusion detection multi classification deep learning STACKING NSL-KDD
在线阅读 下载PDF
Multi-Agent Deep Reinforcement Learning for Cross-Layer Scheduling in Mobile Ad-Hoc Networks 被引量:1
14
作者 Xinxing Zheng Yu Zhao +1 位作者 Joohyun Lee Wei Chen 《China Communications》 SCIE CSCD 2023年第8期78-88,共11页
Due to the fading characteristics of wireless channels and the burstiness of data traffic,how to deal with congestion in Ad-hoc networks with effective algorithms is still open and challenging.In this paper,we focus o... Due to the fading characteristics of wireless channels and the burstiness of data traffic,how to deal with congestion in Ad-hoc networks with effective algorithms is still open and challenging.In this paper,we focus on enabling congestion control to minimize network transmission delays through flexible power control.To effectively solve the congestion problem,we propose a distributed cross-layer scheduling algorithm,which is empowered by graph-based multi-agent deep reinforcement learning.The transmit power is adaptively adjusted in real-time by our algorithm based only on local information(i.e.,channel state information and queue length)and local communication(i.e.,information exchanged with neighbors).Moreover,the training complexity of the algorithm is low due to the regional cooperation based on the graph attention network.In the evaluation,we show that our algorithm can reduce the transmission delay of data flow under severe signal interference and drastically changing channel states,and demonstrate the adaptability and stability in different topologies.The method is general and can be extended to various types of topologies. 展开更多
关键词 Ad-hoc network cross-layer scheduling multi agent deep reinforcement learning interference elimination power control queue scheduling actorcritic methods markov decision process
在线阅读 下载PDF
A Distributed Cooperative Dynamic Task Planning Algorithm for Multiple Satellites Based on Multi-agent Hybrid Learning 被引量:16
15
作者 WANG Chong LI Jun JING Ning WANG Jun CHEN Hao 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2011年第4期493-505,共13页
Traditionally, heuristic re-planning algorithms are used to tackle the problem of dynamic task planning for multiple satellites. However, the traditional heuristic strategies depend on the concrete tasks, which often ... Traditionally, heuristic re-planning algorithms are used to tackle the problem of dynamic task planning for multiple satellites. However, the traditional heuristic strategies depend on the concrete tasks, which often affect the result’s optimality. Noticing that the historical information of cooperative task planning will impact the latter planning results, we propose a hybrid learning algorithm for dynamic multi-satellite task planning, which is based on the multi-agent reinforcement learning of policy iteration and the transfer learning. The reinforcement learning strategy of each satellite is described with neural networks. The policy neural network individuals with the best topological structure and weights are found by applying co-evolutionary search iteratively. To avoid the failure of the historical learning caused by the randomly occurring observation requests, a novel approach is proposed to balance the quality and efficiency of the task planning, which converts the historical learning strategy to the current initial learning strategy by applying the transfer learning algorithm. The simulations and analysis show the feasibility and adaptability of the proposed approach especially for the situation with randomly occurring observation requests. 展开更多
关键词 multiple satellites dynamic task planning problem multi-agent systems reinforcement learning neuroevolution of augmenting topologies transfer learning
原文传递
A Distributed Algorithm for Parallel Multi-task Allocation Based on Profit Sharing Learning 被引量:7
16
作者 SU Zhao-Pin JIANG Jian-Guo +1 位作者 LIANG Chang-Yong ZHANG Guo-Fu 《自动化学报》 EI CSCD 北大核心 2011年第7期865-872,共8页
经由联盟形成的任务分配是在多代理人系统(妈) 的几应用程序域的基本研究挑战,例如资源分配,灾难反应管理等等。怎么以一种分布式的方式分配许多未解决的任务到一些代理人,主要处理。在这篇论文,我们在自我组织、自我学习的代理人... 经由联盟形成的任务分配是在多代理人系统(妈) 的几应用程序域的基本研究挑战,例如资源分配,灾难反应管理等等。怎么以一种分布式的方式分配许多未解决的任务到一些代理人,主要处理。在这篇论文,我们在自我组织、自我学习的代理人之中建议一个分布式的平行多工分配算法。处理状况,我们在二维的房间地理上驱散代理人和任务,然后介绍为寻找它的任务由的一个单个代理人的分享学习的利润(PSL ) 不断自我学习。我们也在代理人之中为通讯和协商介绍策略分配真实工作量到每个 tasked 代理人。最后,评估建议算法的有效性,我们把它与 Shehory 和 Krau 被许多研究人员在最近的年里讨论的分布式的任务分配算法作比较。试验性的结果证明建议算法罐头快速为每项任务形成一个解决的联盟。而且,建议算法罐头明确地告诉我们每个 tasked 代理人的真实工作量,并且能因此为实际控制任务提供一本特定、重要的参考书。 展开更多
关键词 自动化系统 自动化技术 ICA 数据处理
在线阅读 下载PDF
Collaborative multi-agent reinforcement learning based on experience propagation 被引量:5
17
作者 Min Fang Frans C.A. Groen 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2013年第4期683-689,共7页
For multi-agent reinforcement learning in Markov games, knowledge extraction and sharing are key research problems. State list extracting means to calculate the optimal shared state path from state trajectories with c... For multi-agent reinforcement learning in Markov games, knowledge extraction and sharing are key research problems. State list extracting means to calculate the optimal shared state path from state trajectories with cycles. A state list extracting algorithm checks cyclic state lists of a current state in the state trajectory, condensing the optimal action set of the current state. By reinforcing the optimal action selected, the action policy of cyclic states is optimized gradually. The state list extracting is repeatedly learned and used as the experience knowledge which is shared by teams. Agents speed up the rate of convergence by experience sharing. Competition games of preys and predators are used for the experiments. The results of experiments prove that the proposed algorithms overcome the lack of experience in the initial stage, speed up learning and improve the performance. 展开更多
关键词 multi-AGENT Q learning state list extracting experience sharing.
在线阅读 下载PDF
Fault diagnosis model based on multi-manifold learning and PSO-SVM for machinery 被引量:6
18
作者 Wang Hongjun Xu Xiaoli Rosen B G 《仪器仪表学报》 EI CAS CSCD 北大核心 2014年第S2期210-214,共5页
Fault diagnosis technology plays an important role in the industries due to the emergency fault of a machine could bring the heavy lost for the people and the company. A fault diagnosis model based on multi-manifold l... Fault diagnosis technology plays an important role in the industries due to the emergency fault of a machine could bring the heavy lost for the people and the company. A fault diagnosis model based on multi-manifold learning and particle swarm optimization support vector machine(PSO-SVM) is studied. This fault diagnosis model is used for a rolling bearing experimental of three kinds faults. The results are verified that this model based on multi-manifold learning and PSO-SVM is good at the fault sensitive features acquisition with effective accuracy. 展开更多
关键词 FAULT diagnosis multi-manifold learning particle SWARM optimization support vector machine
在线阅读 下载PDF
面向物流机器人的改进Q-Learning动态避障算法研究 被引量:2
19
作者 王力 赵全海 黄石磊 《计算机测量与控制》 2025年第3期267-274,共8页
为提升物流机器人(AMR)在复杂环境中的自主导航与避障能力,改善传统Q-Learning算法在动态环境中的收敛速度慢、路径规划不够优化等问题;研究引入模糊退火算法对Q-Learning算法进行路径节点和搜索路径优化,删除多余节点和非必要转折;并... 为提升物流机器人(AMR)在复杂环境中的自主导航与避障能力,改善传统Q-Learning算法在动态环境中的收敛速度慢、路径规划不够优化等问题;研究引入模糊退火算法对Q-Learning算法进行路径节点和搜索路径优化,删除多余节点和非必要转折;并为平衡好Q-Learning算法的探索和利用问题,提出以贪婪法优化搜索策略,并借助改进动态窗口法对进行路径节点和平滑加速改进,实现局部路径规划,以提高改进Q-Learning算法在AMR动态避障中的搜索性能和效率;结果表明,改进Q-Learning算法能有效优化搜索路径,能较好避开动态障碍物和静态障碍物,与其他算法的距离差幅至少大于1 m;改进算法在局部路径中的避障轨迹更趋近于期望值,最大搜索时间不超过3 s,优于其他算法,且其在不同场景下的避障路径长度和运动时间减少幅度均超过10%,避障成功率超过90%;研究方法能满足智慧仓储、智能制造等工程领域对物流机器人高效、安全作业的需求。 展开更多
关键词 物流机器人 Q-learning算法 DWA 多目标规划 障碍物 避障
在线阅读 下载PDF
Multi-task Coalition Parallel Formation Strategy Based on Reinforcement Learning 被引量:6
20
作者 JIANG Jian-Guo SU Zhao-Pin +1 位作者 QI Mei-Bin ZHANG Guo-Fu 《自动化学报》 EI CSCD 北大核心 2008年第3期349-352,共4页
代理人联盟是代理人协作和合作的一种重要方式。形成一个联盟,代理人能提高他们的能力解决问题并且获得更多的实用程序。在这份报纸,新奇多工联盟平行形成策略被介绍,并且多工联盟形成的过程是一个 Markov 决定过程的结论理论上被证... 代理人联盟是代理人协作和合作的一种重要方式。形成一个联盟,代理人能提高他们的能力解决问题并且获得更多的实用程序。在这份报纸,新奇多工联盟平行形成策略被介绍,并且多工联盟形成的过程是一个 Markov 决定过程的结论理论上被证明。而且,学习的加强被用来解决多工联盟平行的代理人行为策略,和这个过程形成被描述。在多工面向的领域,策略罐头有效地并且平行形式多工联盟。 展开更多
关键词 强化学习 多任务合并 平行排列 马尔可夫决策过程
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部