期刊文献+
共找到292篇文章
< 1 2 15 >
每页显示 20 50 100
Multi-objective optimization of microwave power transmission system architecture with engineering consideration
1
作者 DONG Shiwei SHINOHARA Naoki 《中国空间科学技术(中英文)》 北大核心 2025年第4期114-122,共9页
In the last decade,space solar power satellites(SSPSs)have been conceived to support net-zero carbon emissions and have attracted considerable attention.Electric energy is transmitted to the ground via a microwave pow... In the last decade,space solar power satellites(SSPSs)have been conceived to support net-zero carbon emissions and have attracted considerable attention.Electric energy is transmitted to the ground via a microwave power beam,a technology known as microwave power transmission(MPT).Due to the vast transmission distance of tens of thousands of kilometers,the power transmitting antenna array must span up to 1 kilometer in diameter.At the same time,the size of the rectifying array on the ground should extend over a few kilometers.This makes the MPT system of SSPSs significantly larger than the existing aerospace engineering system.To design and operate a rational MPT system,comprehensive optimization is required.Taking the space MPT system engineering into consideration,a novel multi-objective optimization function is proposed and further analyzed.The multi-objective optimization problem is modeled mathematically.Beam collection efficiency(BCE)is the primary factor,followed by the thermal management capability.Some tapers,designed to solve the conflict between BCE and the thermal problem,are reviewed.In addition to these two factors,rectenna design complexity is included as a functional factor in the optimization objective.Weight coefficients are assigned to these factors to prioritize them.Radiating planar arrays with different aperture illumination fields are studied,and their performances are compared using the multi-objective optimization function.Transmitting array size,rectifying array size,transmission distance,and transmitted power remaine constant in various cases,ensuring fair comparisons.The analysis results show that the proposed optimization function is effective in optimizing and selecting the MPT system architecture.It is also noted that the multi-objective optimization function can be expanded to include other factors in the future. 展开更多
关键词 space solar power satellite(SSPS) microwave power transmission(MPT) multi-objective function beam collection efficiency(BCE) system engineering
在线阅读 下载PDF
Strength,Self-flowing,and Multi-objective Optimization of Cemented Paste Backfill Materials Base on RSM-DF
2
作者 LIU Chunkang WANG Hongjiang +2 位作者 WANG Hui SUN Jiaqi BAI Longjian 《Journal of Wuhan University of Technology(Materials Science)》 2025年第2期449-461,共13页
The multi-objective optimization of backfill effect based on response surface methodology and desirability function(RSM-DF)was conducted.Firstly,the test results show that the uniaxial compressive strength(UCS)increas... The multi-objective optimization of backfill effect based on response surface methodology and desirability function(RSM-DF)was conducted.Firstly,the test results show that the uniaxial compressive strength(UCS)increases with cement sand ratio(CSR),slurry concentration(SC),and curing age(CA),while flow resistance(FR)increases with SC and backfill flow rate(BFR),and decreases with CSR.Then the regression models of UCS and FR as response values were established through RSM.Multi-factor interaction found that CSR-CA impacted UCS most,while SC-BFR impacted FR most.By introducing the desirability function,the optimal backfill parameters were obtained based on RSM-DF(CSR is 1:6.25,SC is 69%,CA is 11.5 d,and BFR is 90 m^(3)/h),showing close results of Design Expert and high reliability for optimization.For a copper mine in China,RSM-DF optimization will reduce cement consumption by 4758 t per year,increase tailings consumption by about 6700 t,and reduce CO_(2)emission by about 4758 t.Thus,RSM-DF provides a new approach for backfill parameters optimization,which has important theoretical and practical values. 展开更多
关键词 cemented paste backfill response surface methodology desirability function multi-objective optimization
原文传递
Variable reward function-driven strategies for impulsive orbital attack-defense games under multiple constraints and victory conditions
3
作者 Liran Zhao Sihan Xu +1 位作者 Qinbo Sun Zhaohui Dang 《Defence Technology(防务技术)》 2025年第9期159-183,共25页
This paper investigates impulsive orbital attack-defense(AD)games under multiple constraints and victory conditions,involving three spacecraft:attacker,target,and defender.In the AD scenario,the attacker aims to breac... This paper investigates impulsive orbital attack-defense(AD)games under multiple constraints and victory conditions,involving three spacecraft:attacker,target,and defender.In the AD scenario,the attacker aims to breach the defender's interception to rendezvous with the target,while the defender seeks to protect the target by blocking or actively pursuing the attacker.Four different maneuvering constraints and five potential game outcomes are incorporated to more accurately model AD game problems and increase complexity,thereby reducing the effectiveness of traditional methods such as differential games and game-tree searches.To address these challenges,this study proposes a multiagent deep reinforcement learning solution with variable reward functions.Two attack strategies,Direct attack(DA)and Bypass attack(BA),are developed for the attacker,each focusing on different mission priorities.Similarly,two defense strategies,Direct interdiction(DI)and Collinear interdiction(CI),are designed for the defender,each optimizing specific defensive actions through tailored reward functions.Each reward function incorporates both process rewards(e.g.,distance and angle)and outcome rewards,derived from physical principles and validated via geometric analysis.Extensive simulations of four strategy confrontations demonstrate average defensive success rates of 75%for DI vs.DA,40%for DI vs.BA,80%for CI vs.DA,and 70%for CI vs.BA.Results indicate that CI outperforms DI for defenders,while BA outperforms DA for attackers.Moreover,defenders achieve their objectives more effectively under identical maneuvering capabilities.Trajectory evolution analyses further illustrate the effectiveness of the proposed variable reward function-driven strategies.These strategies and analyses offer valuable guidance for practical orbital defense scenarios and lay a foundation for future multi-agent game research. 展开更多
关键词 Orbital attack-defense game Impulsive maneuver Multi-agent deep reinforcement learning reward function design
在线阅读 下载PDF
Multi-objective optimization of rolling schedule based on cost function for tandem cold mill 被引量:4
4
作者 陈树宗 张欣 +3 位作者 彭良贵 张殿华 孙杰 刘印忠 《Journal of Central South University》 SCIE EI CAS 2014年第5期1733-1740,共8页
In terms of tandem cold mill productivity and product quality, a multi-objective optimization model of rolling schedule based on cost fimction was proposed to determine the stand reductions, inter-stand tensions and r... In terms of tandem cold mill productivity and product quality, a multi-objective optimization model of rolling schedule based on cost fimction was proposed to determine the stand reductions, inter-stand tensions and rolling speeds for a specified product. The proposed schedule optimization model consists of several single cost fi.mctions, which take rolling force, motor power, inter-stand tension and stand reduction into consideration. The cost function, which can evaluate how far the rolling parameters are from the ideal values, was minimized using the Nelder-Mead simplex method. The proposed rolling schedule optimization method has been applied successfully to the 5-stand tandem cold mill in Tangsteel, and the results from a case study show that the proposed method is superior to those based on empirical formulae. 展开更多
关键词 tandem cold mill multi-object optimization rolling schedule cost function simplex algorithm
在线阅读 下载PDF
Reward Function Design Method for Long Episode Pursuit Tasks Under Polar Coordinate in Multi-Agent Reinforcement Learning
5
作者 DONG Yubo CUI Tao +3 位作者 ZHOU Yufan SONG Xun ZHU Yue DONG Peng 《Journal of Shanghai Jiaotong university(Science)》 EI 2024年第4期646-655,共10页
Multi-agent reinforcement learning has recently been applied to solve pursuit problems.However,it suffers from a large number of time steps per training episode,thus always struggling to converge effectively,resulting... Multi-agent reinforcement learning has recently been applied to solve pursuit problems.However,it suffers from a large number of time steps per training episode,thus always struggling to converge effectively,resulting in low rewards and an inability for agents to learn strategies.This paper proposes a deep reinforcement learning(DRL)training method that employs an ensemble segmented multi-reward function design approach to address the convergence problem mentioned before.The ensemble reward function combines the advantages of two reward functions,which enhances the training effect of agents in long episode.Then,we eliminate the non-monotonic behavior in reward function introduced by the trigonometric functions in the traditional 2D polar coordinates observation representation.Experimental results demonstrate that this method outperforms the traditional single reward function mechanism in the pursuit scenario by enhancing agents’policy scores of the task.These ideas offer a solution to the convergence challenges faced by DRL models in long episode pursuit problems,leading to an improved model training performance. 展开更多
关键词 multi-agent reinforcement learning deep reinforcement learning(DRL) long episode reward function
原文传递
Multi-objective Function Optimization for Environmental Control of a Greenhouse Based on a RBF and NSGA-Ⅱ
6
作者 Zhou Xiu-li Liu Ming-wei +3 位作者 Wang Ling Xu Xiao-chuan Chen Gang Wang De-fu 《Journal of Northeast Agricultural University(English Edition)》 CAS 2021年第1期75-89,共15页
To better meet the needs of crop growth and achieve energy savings and efficiency enhancements,constructing a reliable environmental model to optimize greenhouse decision parameters is an important problem to be solve... To better meet the needs of crop growth and achieve energy savings and efficiency enhancements,constructing a reliable environmental model to optimize greenhouse decision parameters is an important problem to be solved.In this work,a radial-basis function(RBF)neural network was used to mine the potential changes of a greenhouse environment,a temperature error model was established,a multi-objective optimization function of energy consumption was constructed and the corresponding decision parameters were optimized by using a non-dominated sorting genetic algorithm with an elite strategy(NSGA-Ⅱ).The simulation results showed that RBF could clarify the nonlinear relationship among the greenhouse environment variables and decision parameters and the greenhouse temperature.The NSGA-Ⅱ could well search for the Pareto solution for the objective functions.The experimental results showed that after 40 min of combined control of sunshades and sprays,the temperature was reduced from 31℃to 25℃,and the power consumption was 0.5 MJ.Compared with tire three days of July 24,July 25 and July 26,2017,the energy consumption of the controlled production greenhouse was reduced by 37.5%,9.1%and 28.5%,respectively. 展开更多
关键词 greenhouse temperature multi-objective optimization radial-basis function(RBF) non-dominated sorting genetic algorithm with an elite strategy(NSGA-Ⅱ)
在线阅读 下载PDF
A Penalty Function Algorithm with Objective Parameters and Constraint Penalty Parameter for Multi-Objective Programming
7
作者 Zhiqing Meng Rui Shen Min Jiang 《American Journal of Operations Research》 2014年第6期331-339,共9页
In this paper, we present an algorithm to solve the inequality constrained multi-objective programming (MP) by using a penalty function with objective parameters and constraint penalty parameter. First, the penalty fu... In this paper, we present an algorithm to solve the inequality constrained multi-objective programming (MP) by using a penalty function with objective parameters and constraint penalty parameter. First, the penalty function with objective parameters and constraint penalty parameter for MP and the corresponding unconstraint penalty optimization problem (UPOP) is defined. Under some conditions, a Pareto efficient solution (or a weakly-efficient solution) to UPOP is proved to be a Pareto efficient solution (or a weakly-efficient solution) to MP. The penalty function is proved to be exact under a stable condition. Then, we design an algorithm to solve MP and prove its convergence. Finally, numerical examples show that the algorithm may help decision makers to find a satisfactory solution to MP. 展开更多
关键词 multi-objective Programming PENALTY function Objective PARAMETERS CONSTRAINT PENALTY Parameter PARETO Weakly-Efficient Solution
在线阅读 下载PDF
Optimality for Multi-Objective Programming Involving Arcwise Connected d-Type-I Functions
8
作者 Guolin Yu Min Wang 《American Journal of Operations Research》 2011年第4期243-248,共6页
This paper deals with the optimality conditions and dual theory of multi-objective programming problems involving generalized convexity. New classes of generalized type-I functions are introduced for arcwise connected... This paper deals with the optimality conditions and dual theory of multi-objective programming problems involving generalized convexity. New classes of generalized type-I functions are introduced for arcwise connected functions, and examples are given to show the existence of these functions. By utilizing the new concepts, several sufficient optimality conditions and Mond-Weir type duality results are proposed for non-differentiable multi-objective programming problem. 展开更多
关键词 multi-objective Programming Pareto Efficient Solution Arcwise Connected d-Type-I functionS OPTIMALITY Conditions Duality
在线阅读 下载PDF
Multi-Objective Redundancy Optimization of Continuous-Point Robot Milling Path in Shipbuilding 被引量:3
9
作者 Jianjun Yao Chen Qian +1 位作者 Yikun Zhang Geyang Yu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第2期1283-1303,共21页
The 6-DOF manipulator provides a new option for traditional shipbuilding for its advantages of vast working space,low power consumption,and excellent flexibility.However,the rotation of the end effector along the tool... The 6-DOF manipulator provides a new option for traditional shipbuilding for its advantages of vast working space,low power consumption,and excellent flexibility.However,the rotation of the end effector along the tool axis is functionally redundant when using a robotic arm for five-axis machining.In the process of ship construction,the performance of the parts’protective coating needs to bemachined tomeet the Performance Standard of Protective Coatings(PSPC).The arbitrary redundancy configuration in path planning will result in drastic fluctuations in the robot joint angle,greatly reducing machining quality and efficiency.There have been some studies on singleobjective optimization of redundant variables,However,the quality and efficiency of milling are not affected by a single factor,it is usually influenced by several factors,such as the manipulator stiffness,the joint motion smoothness,and the energy consumption.To solve this problem,this paper proposed a new path optimization method for the industrial robot when it is used for five-axis machining.The path smoothness performance index and the energy consumption index are established based on the joint acceleration and the joint velocity,respectively.The path planning issue is formulated as a constrained multi-objective optimization problem by taking into account the constraints of joint limits and singularity avoidance.Then,the path is split into multiple segments for optimization to avoid the slow convergence rate caused by the high dimension.An algorithm combining the non-dominated sorting genetic algorithm(NSGA-II)and the differential evolution(DE)algorithm is employed to solve the above optimization problem.The simulations validate the effectiveness of the algorithm,showing the improvement of smoothness and the reduction of energy consumption. 展开更多
关键词 SHIPBUILDING robot milling functional redundancy path optimization multi-objective
在线阅读 下载PDF
A vague-set-based fuzzy multi-objective decision making model for bidding purchase 被引量:4
10
作者 WANG Zhou-jing QIAN Edward Y. 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2007年第4期644-650,共7页
A vague-set-based fuzzy multi-objective decision making model is developed for evaluating bidding plans in a bid- ding purchase process. A group of decision-makers (DMs) first independently assess bidding plans accord... A vague-set-based fuzzy multi-objective decision making model is developed for evaluating bidding plans in a bid- ding purchase process. A group of decision-makers (DMs) first independently assess bidding plans according to their experience and preferences, and these assessments may be expressed as linguistic terms, which are then converted to fuzzy numbers. The resulting decision matrices are then transformed to objective membership grade matrices. The lower bound of satisfaction and upper bound of dissatisfaction are used to determine each bidding plan’s supporting, opposing, and neutral objective sets, which together determine the vague value of a bidding plan. Finally, a score function is employed to rank all bidding plans. A new score function based on vague sets is introduced in the model and a novel method is presented for calculating the lower bound of sat- isfaction and upper bound of dissatisfaction. In a vague-set-based fuzzy multi-objective decision making model, different valua- tions for upper and lower bounds of satisfaction usually lead to distinct ranking results. Therefore, it is crucial to effectively contain DMs’ arbitrariness and subjectivity when these values are determined. 展开更多
关键词 Fuzzy multi-objective decision making model Vague set Score function Lower bound of satisfaction Upper bound of dissatisfaction
在线阅读 下载PDF
Overview of multi-objective optimization methods 被引量:2
11
作者 LeiXiujuan ShiZhongke 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2004年第2期142-146,共5页
To assist readers to have a comprehensive understanding, the classical and intelligent methods roundly based on precursory research achievements are summarized in this paper. First, basic conception and description ab... To assist readers to have a comprehensive understanding, the classical and intelligent methods roundly based on precursory research achievements are summarized in this paper. First, basic conception and description about multi-objective (MO) optimization are introduced. Then some definitions and related terminologies are given. Furthermore several MO optimization methods including classical and current intelligent methods are discussed one by one succinctly. Finally evaluations on advantages and disadvantages about these methods are made at the end of the paper. 展开更多
关键词 multi-objective optimization objective function Pareto optimality genetic algorithms simulated annealing fuzzy logical.
在线阅读 下载PDF
Multi-objective forest harvesting under sustainable and economic principles 被引量:1
12
作者 Talles Hudson Souza Lacerda Luciano Cavalcante de Jesus Franca +5 位作者 Isáira Leite e Lopes Sammilly Lorrayne Souza Lacerda Evandro OrfanóFigueiredo Bruno Henrique Groenner Barbosa Carolina Souza Jarochinski e Silva Lucas Rezende Gomide 《Journal of Forestry Research》 SCIE CAS CSCD 2023年第5期1379-1394,共16页
Selective logging is well-recognized as an effective practice in sustainable forest management.However,the ecological efficiency or resilience of the residual stand is often in doubt.Recovery time depends on operation... Selective logging is well-recognized as an effective practice in sustainable forest management.However,the ecological efficiency or resilience of the residual stand is often in doubt.Recovery time depends on operational variables,diversity,and forest structure.Selective logging is excellent but is open to changes.This may be resolved by mathematical programming and this study integrates the economic-ecological aspects in multi-objective function by applying two evolutionary algorithms.The function maximizes remaining stand diversity,merchantable logs,and the inverse of distance between trees for harvesting and log landings points.The Brazilian rainforest database(566 trees)was used to simulate our 216-ha model.The log landing design has a maximum volume limit of 500 m3.The nondominated sorting genetic algorithm was applied to solve the main optimization problem.In parallel,a sub-problem(p-facility allocation)was solved for landing allocation by a genetic algorithm.Pareto frontier analysis was applied to distinguish the gradientsα-economic,β-ecological,andγ-equilibrium.As expected,the solutions have high diameter changes in the residual stand(average removal of approximately 16 m^(3) ha^(-1)).All solutions showed a grouping of trees selected for harvesting,although there was no formation of large clearings(percentage of canopy removal<7%,with an average of 2.5 ind ha^(-1)).There were no differences in floristic composition by preferentially selecting species with greater frequency in the initial stand for harvesting.This implies a lower impact on the demographic rates of the remaining stand.The methodology should support projects of reduced impact logging by using spatial-diversity information to guide better practices in tropical forests. 展开更多
关键词 Amazon rainforest management Computational intelligence multi-objective functions Evolutionary computing
在线阅读 下载PDF
Optimal Kernel-based Extreme Learning and Multi-objective Function-aided Task Scheduling for Solving Load Balancing Problems in Cloud Environment
13
作者 Ravi Gugulothu Vijaya Saradhi Thommandru Suneetha Bulla 《Journal of Systems Science and Systems Engineering》 2025年第4期385-409,共25页
Workload balancing in cloud computing is not yet resolved,particularly considering Infrastructure as a Service(IaaS)in the cloud network.The problem of being underloaded or overloaded should not occur at the time of t... Workload balancing in cloud computing is not yet resolved,particularly considering Infrastructure as a Service(IaaS)in the cloud network.The problem of being underloaded or overloaded should not occur at the time of the server or host accessing the cloud which may lead to create system crash problem.Thus,to resolve these existing problems,an efficient task scheduling algorithm is required for distributing the tasks over the entire feasible resources,which is termed load balancing.The load balancing approach assures that the entire Virtual Machines(VMs)are utilized appropriately.So,it is highly essential to develop a load-balancing model in a cloud environment based on machine learning and optimization strategies.Here,the computing and networking data is utilized for the analysis to observe the traffic as well as performance patterns.The acquired data is offered to the machine learning decision to select the right server by predicting the performance effectively by employing an Optimal Kernel-based Extreme Learning Machine(OK-ELM)and their parameter is tuned by the developed hybrid approach Population Size-based Mud Ring Tunicate Swarm Algorithm(PS-MRTSA).Further,effective scheduling is performed to resolve the load balancing issues by employing the developed model MR-TSA.Here,the developed approach effectively resolves the multi-objective constraints such as Response time,Resource cost,and energy consumption.Thus,the recommended load balancing model securesan enhanced performance rate than the traditional approaches over several experimental analyses. 展开更多
关键词 Cloud environment load balancing problem optimal kernel-based extreme learning machine population size-based mud ring tunicate swarm algorithm multi-objective function
原文传递
A Study on the Multi-Objective Optimization Method of Brackets in Ship Structures
14
作者 LIU Fan HU Yu-meng +2 位作者 FENG Guo-qing ZHAO Wei-dong ZHANG Ming 《China Ocean Engineering》 SCIE EI CSCD 2022年第2期208-222,共15页
The shape and size optimization of brackets in hull structures was conducted to achieve the simultaneous reduction of mass and high stress,where the parametric finite element model was built based on Patran Command La... The shape and size optimization of brackets in hull structures was conducted to achieve the simultaneous reduction of mass and high stress,where the parametric finite element model was built based on Patran Command Language codes.The optimization procedure was executed on Isight platform,on which the linear dimensionless method was introduced to establish the weighted multi-objective function.The extreme processing method was applied and proved effective to normalize the objectives.The bracket was optimized under the typical single loads and design waves,accompanied by the different proportions of weights in the objective function,in which the safety factor function was further established,including yielding,buckling,and fatigue strength,and the weight minimization and safety maximization of the bracket were obtained.The findings of this study illustrate that the dimensionless objectives share equal contributions to the multi-objective function,which enhances the role of weights in the optimization. 展开更多
关键词 BRACKETS parametric finite element model multi-objective optimization extreme processing method safety factor function weighted multi-objective function
在线阅读 下载PDF
Multi-objective robust controller synthesis for discrete-time systems with convex polytopic uncertain domain
15
作者 张彦虎 颜文俊 +1 位作者 卢建宁 赵光宙 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2005年第B08期87-93,共7页
Multi-objective robust state-feedback controller synthesis problems for linear discrete-time uncertain systems are addressed. Based on parameter-dependent Lyapunov functions, the Gl2 and GH2 norm expressed in terms of... Multi-objective robust state-feedback controller synthesis problems for linear discrete-time uncertain systems are addressed. Based on parameter-dependent Lyapunov functions, the Gl2 and GH2 norm expressed in terms of LMI (Linear Matrix Inequality) characterizations are further generalized to cope with the robust analysis for convex polytopic uncertain system. Robust state-feedback controller synthesis conditions are also derived for this class of uncertain systems. Using the above results, multi-objective state-feedback controller synthesis procedures which involve the LMI optimization technique are developed and less conservative than the existing one. An illustrative example verified the validity of the approach. 展开更多
关键词 Gl2 and GH2 performance multi-objective optimization Robust controller synthesis Parameter-dependent Lyapunov functions Convex polytopic uncertain system
在线阅读 下载PDF
Multi-objective Particle Swarm Optimization Algorithm Based on Performance and Reliability of Discrete System Resources Configuration
16
作者 周国财 高翔 《Journal of Donghua University(English Edition)》 EI CAS 2014年第6期850-852,共3页
Considering research on multi-objective optimization for reliability and performance suffering cost constraints in digital circuits,an improved multi-objective optimization algorithm based on performance and reliabili... Considering research on multi-objective optimization for reliability and performance suffering cost constraints in digital circuits,an improved multi-objective optimization algorithm based on performance and reliability was proposed to solve the problem of discrete system resources configuration in this paper. This algorithm used the particle-swarm optimization( PSO) to evaluate the tradeoffs configuration of the system resources between reliability and performance and proved the feasibility through the simulation.Finally, the information of resources configuration from optimization algorithm was used to effectively guide the system design so as to mitigate soft errors caused by single event effect( SEE). 展开更多
关键词 multi-objective optimization function module soft error triple modular redundancy(TMR)
在线阅读 下载PDF
基于深度强化学习的游戏智能引导算法 被引量:2
17
作者 白天 吕璐瑶 +1 位作者 李储 何加亮 《吉林大学学报(理学版)》 北大核心 2025年第1期91-98,共8页
针对传统游戏智能体算法存在模型输入维度大及训练时间长的问题,提出一种结合状态信息转换与奖励函数塑形技术的新型深度强化学习游戏智能引导算法.首先,利用Unity引擎提供的接口直接读取游戏后台信息,以有效压缩状态空间的维度,减少输... 针对传统游戏智能体算法存在模型输入维度大及训练时间长的问题,提出一种结合状态信息转换与奖励函数塑形技术的新型深度强化学习游戏智能引导算法.首先,利用Unity引擎提供的接口直接读取游戏后台信息,以有效压缩状态空间的维度,减少输入数据量;其次,通过精细化设计奖励机制,加速模型的收敛过程;最后,从主观定性和客观定量两方面对该算法模型与现有方法进行对比实验,实验结果表明,该算法不仅显著提高了模型的训练效率,还大幅度提高了智能体的性能. 展开更多
关键词 深度强化学习 游戏智能体 奖励函数塑形 近端策略优化算法
在线阅读 下载PDF
基于对抗强化学习的无人机逃离路径规划方法
18
作者 黄湘松 王梦宇 潘大鹏 《航空学报》 北大核心 2025年第17期292-307,共16页
在无人机技术迅速发展的背景下,如何应对其他无人机的恶意追捕成为了无人机安全防护中的重要课题。针对通过使用对抗强化学习算法,提升无人机在敌对环境中的适应性和生存能力这一问题,利用对抗强化学习框架,针对无人机逃逸过程中接收错... 在无人机技术迅速发展的背景下,如何应对其他无人机的恶意追捕成为了无人机安全防护中的重要课题。针对通过使用对抗强化学习算法,提升无人机在敌对环境中的适应性和生存能力这一问题,利用对抗强化学习框架,针对无人机逃逸过程中接收错误信息对决策产生干扰的问题进行了处理,以围捕者与逃逸者之间的对抗为基础,优化运输无人机的策略以应对围捕者的行为。针对传统的强化学习方法中的稀疏奖励问题,结合人工势场法提出逐步奖励策略机制,使得无人机可以更有效地适应围捕环境。结果表明,该算法相比于近端策略优化(PPO)算法,无人机的逃逸成功率提升了54.47%,同时运输时间减少了34.35%,显著提高了无人机的运输效率。结果为无人机的安全防护提供了新的技术方案,并探索了对抗强化学习在恶意追捕情境下的应用潜力。 展开更多
关键词 对抗训练 强化学习 逃逸路径规划 逃逸决策 奖励函数
原文传递
基于功能磁共振技术探讨首发抑郁症奖赏网络功能异常研究
19
作者 肖雪 田静 +6 位作者 孙绪 任渝棠 徐辉 余学 李明山 孙继飞 侯小兵 《精神医学杂志》 2025年第2期118-122,共5页
目的利用功能磁共振成像(fMRI)技术,探讨首发抑郁症(FED)患者纹状体亚区的脑功能改变情况。方法研究共纳入21例FED患者(FED组)和22名健康对照者(对照组)。两组均接受fMRI扫描,以纹状体亚区作为种子点,采用功能连接(FC)分析方法,比较两... 目的利用功能磁共振成像(fMRI)技术,探讨首发抑郁症(FED)患者纹状体亚区的脑功能改变情况。方法研究共纳入21例FED患者(FED组)和22名健康对照者(对照组)。两组均接受fMRI扫描,以纹状体亚区作为种子点,采用功能连接(FC)分析方法,比较两组脑网络的差异。同时,收集FED组17项汉密尔顿抑郁量表(HAMD-17)评分,分析异常脑区与临床症状之间的相关性。结果与对照组比较,FED组在左侧腹侧纹状体上部与左侧中央后回的FC减低(P<0.005),右侧腹侧纹状体上部与左侧尾状核的FC增高(P<0.005),左侧背侧尾状核与右侧颞中回的FC增高(P<0.005),右侧背侧尾侧壳核与右侧顶下小叶的FC减低(P<0.005),左侧背侧吻侧壳核与右侧顶下小叶的FC减低(P<0.005),腹侧吻侧壳核与右侧缘上回的FC减低(P<0.005)。相关性分析结果显示,左侧背侧吻侧壳核与右侧顶下小叶的FC值与HAMD-17评分呈正相关(P<0.05)。结论FED患者在纹状体亚区与默认网络及感觉运动皮层的功能连接上存在异常改变,且奖赏网络内部的功能连接也出现异常。本研究为理解FED的神经病理机制及靶向治疗提供了新的视角。 展开更多
关键词 首发抑郁症 功能磁共振成像 功能连接 奖赏网络
暂未订购
如何实现真正的智能?——关于智能体中事实性计算与价值性算计深度融合的思考
20
作者 刘伟 龙擎天 马楠 《电子科技大学学报(社科版)》 2025年第5期1-7,共7页
该文深入探讨了人工智能(AI)实现真正智能的关键路径,主张通过事实性计算与价值性算计的深度融合,推动AI从单纯的自动化工具向具备认知与道德能力的智能系统转变。当前AI在处理客观数据方面表现出色,但在理解和应对人类社会的复杂价值... 该文深入探讨了人工智能(AI)实现真正智能的关键路径,主张通过事实性计算与价值性算计的深度融合,推动AI从单纯的自动化工具向具备认知与道德能力的智能系统转变。当前AI在处理客观数据方面表现出色,但在理解和应对人类社会的复杂价值问题时存在明显不足,可能导致决策偏差和伦理困境。论文明确区分事实性事实与价值性事实,指出后者基于前者并影响事实认知;在强化学习中,提出奖惩函数设计需兼顾事实准确性与价值目标;在人机环境系统中,构建动态协同框架以融合事实的可计算性与价值的可判定性;强调通过具身认知等技术改进人机交互,推动AI从功能性模拟转向机制性模拟;在多人多智能体系统中,分析多事实与多价值的动态交互,突出通信与自组织的必要性。AI若要跨越工具理性,迈向价值理性,必须实现事实性与价值性的有机结合。 展开更多
关键词 人机交互 智能体 人工智能 价值与事实 奖惩函数
在线阅读 下载PDF
上一页 1 2 15 下一页 到第
使用帮助 返回顶部