Reinforcement learning(RL)has been widely studied as an efficient class of machine learning methods for adaptive optimal control under uncertainties.In recent years,the applications of RL in optimised decision-making ...Reinforcement learning(RL)has been widely studied as an efficient class of machine learning methods for adaptive optimal control under uncertainties.In recent years,the applications of RL in optimised decision-making and motion control of intelligent vehicles have received increasing attention.Due to the complex and dynamic operating environments of intelligent vehicles,it is necessary to improve the learning efficiency and generalisation ability of RL-based decision and control algorithms under different conditions.This survey systematically examines the theoretical foundations,algorithmic advancements and practical challenges of applying RL to intelligent vehicle systems operating in complex and dynamic environments.The major algorithm frameworks of RL are first introduced,and the recent advances in RL-based decision-making and control of intelligent vehicles are overviewed.In addition to self-learning decision and control approaches using state measurements,the developments of DRL methods for end-to-end driving control of intelligent vehicles are summarised.The open problems and directions for further research works are also discussed.展开更多
Information plays a crucial role in guiding behavioral decisions during public health emergencies. Individuals communicate to acquire relevant knowledge about an epidemic, which influences their decisions to adopt pro...Information plays a crucial role in guiding behavioral decisions during public health emergencies. Individuals communicate to acquire relevant knowledge about an epidemic, which influences their decisions to adopt protective measures.However, whether to disseminate specific information is also a behavioral decision. In light of this understanding, we develop a coupled information–vaccination–epidemic model to depict these co-evolutionary dynamics in a three-layer network. Negative information dissemination and vaccination are treated as separate decision-making processes. We then examine the combined effects of herd and risk motives on information dissemination and vaccination decisions through the lens of game theory. The microscopic Markov chain approach(MMCA) is used to describe the dynamic process and to derive the epidemic threshold. Simulation results indicate that increasing the cost of negative information dissemination and providing timely clarification can effectively control the epidemic. Furthermore, a phenomenon of diminishing marginal utility is observed as the cost of dissemination increases, suggesting that authorities do not need to overinvest in suppressing negative information. Conversely, reducing the cost of vaccination and increasing vaccine efficacy emerge as more effective strategies for outbreak control. In addition, we find that the scale of the epidemic is greater when the herd motive dominates behavioral decision-making. In conclusion, this study provides a new perspective for understanding the complexity of epidemic spreading by starting with the construction of different behavioral decisions.展开更多
In the rapidly evolving technological landscape,state-owned enterprises(SOEs)encounter significant challenges in sustaining their competitiveness through efficient R&D management.Integrated Product Development(IPD...In the rapidly evolving technological landscape,state-owned enterprises(SOEs)encounter significant challenges in sustaining their competitiveness through efficient R&D management.Integrated Product Development(IPD),with its emphasis on cross-functional teamwork,concurrent engineering,and data-driven decision-making,has been widely recognized for enhancing R&D efficiency and product quality.However,the unique characteristics of SOEs pose challenges to the effective implementation of IPD.The advancement of big data and artificial intelligence technologies offers new opportunities for optimizing IPD R&D management through data-driven decision-making models.This paper constructs and validates a data-driven decision-making model tailored to the IPD R&D management of SOEs.By integrating data mining,machine learning,and other advanced analytical techniques,the model serves as a scientific and efficient decision-making tool.It aids SOEs in optimizing R&D resource allocation,shortening product development cycles,reducing R&D costs,and improving product quality and innovation.Moreover,this study contributes to a deeper theoretical understanding of the value of data-driven decision-making in the context of IPD.展开更多
As the economy grows, environmental issues are becoming increasingly severe, making the promotion of green behavior more urgent. Information dissemination and policy regulation play crucial roles in influencing and am...As the economy grows, environmental issues are becoming increasingly severe, making the promotion of green behavior more urgent. Information dissemination and policy regulation play crucial roles in influencing and amplifying the spread of green behavior across society. To this end, a novel three-layer model in multilayer networks is proposed. In the novel model, the information layer describes green information spreading, the physical contact layer depicts green behavior propagation, and policy regulation is symbolized by an isolated node beneath the two layers. Then, we deduce the green behavior threshold for the three-layer model using the microscopic Markov chain approach. Moreover, subject to some individuals who are more likely to influence others or become green nodes and the limitations of the capacity of policy regulation, an optimal scheme is given that could optimize policy interventions to most effectively prompt green behavior.Subsequently, simulations are performed to validate the preciseness and theoretical results of the new model. It reveals that policy regulation can prompt the prevalence and outbreak of green behavior. Then, the green behavior is more likely to spread and be prevalent in the SF network than in the ER network. Additionally, optimal allocation is highly successful in facilitating the dissemination of green behavior. In practice, the optimal allocation strategy could prioritize interventions at critical nodes or regions, such as highly connected urban areas, where the impact of green behavior promotion would be most significant.展开更多
In tunnel construction,tunnel boring machine(TBM)tunnelling typically relies on manual experience with sub-optimal control parameters,which can easily lead to inefficiency and high costs.This study proposed an intelli...In tunnel construction,tunnel boring machine(TBM)tunnelling typically relies on manual experience with sub-optimal control parameters,which can easily lead to inefficiency and high costs.This study proposed an intelligent decision-making method for TBM tunnelling control parameters based on multiobjective optimization(MOO).First,the effective TBM operation dataset is obtained through data preprocessing of the Songhua River(YS)tunnel project in China.Next,the proposed method begins with developing machine learning models for predicting TBM tunnelling performance parameters(i.e.total thrust and cutterhead torque),rock mass classification,and hazard risks(i.e.tunnel collapse and shield jamming).Then,considering three optimal objectives,(i.e.,penetration rate,rock-breaking energy consumption,and cutterhead hob wear),the MOO framework and corresponding mathematical expression are established.The Pareto optimal front is solved using DE-NSGA-II algorithm.Finally,the optimal control parameters(i.e.,advance rate and cutterhead rotation speed)are obtained by the satisfactory solution determination criterion,which can balance construction safety and efficiency with satisfaction.Furthermore,the proposed method is validated through 50 cases of TBM tunnelling,showing promising potential of application.展开更多
Several models of multi-criteria decision-making(MCDM)have identified the optimal alternative electrical energy sources to supply certain load in an isolated region in Al-Minya City,Egypt.The load demand consists of w...Several models of multi-criteria decision-making(MCDM)have identified the optimal alternative electrical energy sources to supply certain load in an isolated region in Al-Minya City,Egypt.The load demand consists of water pumping system with a water desalination unit.Various options containing three different power sources:only DG,PV-B system,and hybrid PV-DG-B,two different sizes of reverse osmosis(RO)units;RO-250 and RO-500,two strategies of energy management;load following(LF)and cycle charging(CC),and two sizes of DG;5 and 10 kW were taken into account.Eight attributes,including operating cost,renewable fraction,initial cost,the cost of energy,excess energy,unmet load,breakeven grid extension distance,and the amount of CO_(2),were used during the evaluation process.To estimate these parameters,HOMER®software was employed to perform both the simulation and optimization process.Four different weight estimation methods were considered;no priority of criteria,based on a pairwise comparisons matrix of the criteria,CRITIC-method,and entropy-based method.The main findings(output results)confirmed that the optimal option for the case study was hybrid PV-DG-B with the following specification:5 kW DG,RO-500,and load following control strategy.Under this condition,the annual operating cost and initial costs were$5546 and$161022,respectively,whereas the cost of energy was 0.077$/kWh.The excess energy and unmet loads were 40998 and 2371 kWh,respectively.The breakeven grid extension distance and the amount of CO_(2) were 3.31 km and 5171 kg per year,respectively.Compared with DG only,the amount of CO_(2) has been sharply reduced by 113939 kg per year.展开更多
The randomness and uncertainty of renewable energy generation are expected to significantly change the optimal decision-making of trans-provincial electricity market subjects.Therefore,it is beneficial to optimize the...The randomness and uncertainty of renewable energy generation are expected to significantly change the optimal decision-making of trans-provincial electricity market subjects.Therefore,it is beneficial to optimize the interests of each of these subjects,considering the unpredictable risks of renewable energy under the renewable portfolio standards(RPS)and researching their effects on the optimal decision-making of transprovincial electricity market multi-subjects.First,we develop a trans-provincial trading market mechanism for renewable energy and clarify the electricity supply and demand relation and the green certificates supply and demand relation of trans-provincial electricitymarketmulti-subjects.Then,under the RPS,we construct a multi-subject game model of the power supply chain that recognizes the risks,and adopt the reverse induction method to discuss the optimum risk-taking judgment of each subject in the trans-provincial electricity market.Finally,we useMATLAB to verify the viability and efficacy of the proposed gamemodel,and obtain a certain reference value for the optimal decision-making of trans-provincial electricity market subjects.In summary,we consider the uncertainty risks of renewable energy under RPS,study the effects of the green certificate price and risk aversion coefficient in the RPS mechanism on the optimal decisionmaking of trans-provincial electricity market subjects,and obtain the changing trends of two different power products and those of different electricity market subjects under the influence of the green certificate price and risk aversion coefficient,which have a certain reference value for studying the factors affecting the optimal decision-making of trans-provincial electricity market subjects.展开更多
Combining the heuristic algorithm (HA) developed based on the specific knowledge of the cooperative multiple target attack (CMTA) tactics and the particle swarm optimization (PSO), a heuristic particle swarm opt...Combining the heuristic algorithm (HA) developed based on the specific knowledge of the cooperative multiple target attack (CMTA) tactics and the particle swarm optimization (PSO), a heuristic particle swarm optimization (HPSO) algorithm is proposed to solve the decision-making (DM) problem. HA facilitates to search the local optimum in the neighborhood of a solution, while the PSO algorithm tends to explore the search space for possible solutions. Combining the advantages of HA and PSO, HPSO algorithms can find out the global optimum quickly and efficiently. It obtains the DM solution by seeking for the optimal assignment of missiles of friendly fighter aircrafts (FAs) to hostile FAs. Simulation results show that the proposed algorithm is superior to the general PSO algorithm and two GA based algorithms in searching for the best solution to the DM problem.展开更多
A decision-making problem of missile-target assignment with a novel particle swarm optimization algorithm is proposed when it comes to a multiple target collaborative combat situation.The threat function is establishe...A decision-making problem of missile-target assignment with a novel particle swarm optimization algorithm is proposed when it comes to a multiple target collaborative combat situation.The threat function is established to describe air combat situation.Optimization function is used to find an optimal missile-target assignment.An improved particle swarm optimization algorithm is utilized to figure out the optimization function with less parameters,which is based on the adaptive random learning approach.According to the coordinated attack tactics,there are some adjustments to the assignment.Simulation example results show that it is an effective algorithm to handle with the decision-making problem of the missile-target assignment(MTA)in air combat.展开更多
Due to the numerous variables to take into account as well as the inherent ambiguity and uncertainty,evaluating educational institutions can be difficult.The concept of a possibility Pythagorean fuzzy hypersoft set(pP...Due to the numerous variables to take into account as well as the inherent ambiguity and uncertainty,evaluating educational institutions can be difficult.The concept of a possibility Pythagorean fuzzy hypersoft set(pPyFHSS)is more flexible in this regard than other theoretical fuzzy set-like models,even though some attempts have been made in the literature to address such uncertainties.This study investigates the elementary notions of pPyFHSS including its set-theoretic operations union,intersection,complement,OR-and AND-operations.Some results related to these operations are also modified for pPyFHSS.Additionally,the similarity measures between pPyFHSSs are formulated with the assistance of numerical examples and results.Lastly,an intelligent decision-assisted mechanism is developed with the proposal of a robust algorithm based on similarity measures for solving multi-attribute decision-making(MADM)problems.A case study that helps the decision-makers assess the best educational institution is discussed to validate the suggested system.The algorithmic results are compared with the most pertinent model to evaluate the adaptability of pPyFHSS,as it generalizes the classical possibility fuzzy set-like theoretical models.Similarly,while considering significant evaluating factors,the flexibility of pPyFHSS is observed through structural comparison.展开更多
Suicide risk constitutes a complex set of interacting demographic, clinical, psychobiological and environmental variables. Impulsivity is a long-known risk factor for suicide attempts. However, research based on clear...Suicide risk constitutes a complex set of interacting demographic, clinical, psychobiological and environmental variables. Impulsivity is a long-known risk factor for suicide attempts. However, research based on clearer conceptual refinement in this area is imperative. One emerging field of study is that of decision-making. Impulsivity involves a failure of higher-order control, including decision-making. Using standardized operational definitions that take into consideration relevant aspects of impulsivity, including state- and trait-components and a deeper understanding of the process of decision-making in the suicidal mind, we may come a step closer to understanding suicidality and winning the fight in this scourge of human suffering.展开更多
This paper explores the decision-making mechanism of the consuming behavior hidden behind the sudden popularity of the Oriental Selection Company in terms of the mental accounting theory.Firstly,according to the“Non-...This paper explores the decision-making mechanism of the consuming behavior hidden behind the sudden popularity of the Oriental Selection Company in terms of the mental accounting theory.Firstly,according to the“Non-alternative”characteristics of mental accounting,this paper expounds how the strategy of the bilingual live-streaming of Oriental Selection promoters stimulates consumers’desire to buy the advertised products and services whilst using the utility theory of mental accounting to analyze how Oriental Selection promoters improve consumers’acquisition utility and total utility.Secondly,we sum up the successful experiences of Oriental Selection:The live-streaming industry should apply the theory of mental accounting in effectively overcoming the shortcomings of the live-streamed marketing by stimulating consumers’desire and influencing their decision-making behavior through the streaming of content that triggers them to make purchases.This is achievable by abandoning the traditional ways of loudly urging consumers to buy goods.Finally,this paper puts forward some suggestions on how to use the mental accounting theory in promoting sustainable consumption and points out the prospects for Oriental Selection.展开更多
Recent research demonstrates the need for comprehensive frameworks to achieve an appropriate level of resilience(e.g.,energy,seismic)of the European building stock,through integrated retrofitting interventions.Differe...Recent research demonstrates the need for comprehensive frameworks to achieve an appropriate level of resilience(e.g.,energy,seismic)of the European building stock,through integrated retrofitting interventions.Different frameworks have been proposed to identify optimal interventions when several feasible alternatives are available,considering multiple decision variables of different nature,such as social,economic,or technical.Within these efforts and frameworks,less attention has been paid to the post-earthquake recovery time of buildings and communities,thus ignoring the significance of reaching a desired recovery state(e.g.,functional recovery)within a specified time frame.To overcome this limitation,this study estimates post-earthquake recovery times and uses them as one of the decision variables in multi-criteria identification of optimal retrofitting of an existing RC building.The case-study building is representative of the Italian school buildings constructed between the 1960s and 1970s and was analysed under two seismic hazard levels(moderate and high).Following the identification of the main structural deficiencies of the as-built structure through nonlinear static analyses,four seismic retrofit measures were selected.Then,the earthquake-induced downtime of each of the four retrofitted building configurations was assessed,analysing the different recovery times as a function of the seismic hazard level and the recovery state.A downtime-based metric,namely the expected annual downtime,was introduced as decision variable within an available multi-criteria decision-making framework to include the impact of downtime,rank the four retrofit measures and identify the preferable one.展开更多
This paper studies the global existence and large-time behaviors of weak solutions to the kinetic particle model coupled with the incompressible Navier-Stokes equations in IR3.First,we obtain the global weak solution ...This paper studies the global existence and large-time behaviors of weak solutions to the kinetic particle model coupled with the incompressible Navier-Stokes equations in IR3.First,we obtain the global weak solution using the characteristic and energy methods.Then,under the small assumption of the mass of the particle,we show that the solutions decay at the algebraic time-decay rate.Finally,it is also proved that the above rate is optimal.It should be remarked that if the particle in the coupled system vanishes(i.e.f=O),our works coincide with the classical results by Schonbek[32](J Amer Math Soc,1991,4:423-449),which can be regarded as a generalization from a single fuid model to the two-phase fluid one.展开更多
Public-private partnerships(PPPs)have been used by governments around the world to procure and construct infrastructural amenities.It relies on private sector expertise and funding to achieve this lofty objective.Howe...Public-private partnerships(PPPs)have been used by governments around the world to procure and construct infrastructural amenities.It relies on private sector expertise and funding to achieve this lofty objective.However,given the uncertainties of project management,transparency,accountability,and expropriation,this phenomenon has gained tremendous attention in recent years due to the important role it plays in curbing infrastructural deficits globally.Interestingly,the reasonable benefit distribution scheme in a PPP project is related to the behavior decisionmaking of the government and social capital,aswell as the performance of the project.In this paper,the government and social capital which are the key stakeholders of PPP projects were selected as the research objects.Based on the fuzzy expected value model and game theory,a hybrid method was adopted in this research taking into account the different risk preferences of both public entities and private parties under the fuzzy demand environment.To alleviate the problem of insufficient utilization of social capital in a PPP project,this paper seeks to grasp the relationship that exists between the benefit distribution of stakeholders,their behavioral decision-making,and project performance,given that they impact the performance of both public entities and private parties,as well as assist in maximizing the overall utility of the project.Furthermore,four game models were constructed in this study,while the expected value and opportunity-constrained programming model for optimal decision-making were derived using alternate perspectives of both centralized decision-making and decentralized decision-making.Afterward,the optimal behavioral decision-making of public entities and private parties in four scenarios was discussed and thereafter compared,which led to an ensuing discussion on the benefit distribution system under centralized decision-making.Lastly,based on an example case,the influence of different confidence levels,price,and fuzzy uncertainties of PPP projects on the equilibrium strategy results of both parties were discussed,giving credence to the effectiveness of the hybrid method.The results indicate that adjusting different confidence levels yields different equilibriumpoints,and therefore signposts that social capital has a fair perception of opportunities,as well as identifies reciprocal preferences.Nevertheless,we find that an increase in the cost coefficient of the government and social capital does not inhibit the effort of both parties.Our results also indicate that a reasonable benefit distribution of PPP projects can assist them in realizing optimum Pareto improvements over time.The results provide us with very useful strategies and recommendations to improve the overall performance of PPP projects in China.展开更多
We are concerned with the large-time behavior of 3D quasilinear hyperbolic equations with nonlinear damping.The main novelty of this paper is two-fold.First,we prove the optimal decay rates of the second and third ord...We are concerned with the large-time behavior of 3D quasilinear hyperbolic equations with nonlinear damping.The main novelty of this paper is two-fold.First,we prove the optimal decay rates of the second and third order spatial derivatives of the solution,which are the same as those of the heat equation,and in particular,are faster than ones of previous related works.Second,for well-chosen initial data,we also show that the lower optimal L^(2) convergence rate of the k(∈[0,3])-order spatial derivatives of the solution is(1+t)^(-(2+2k)/4).Therefore,our decay rates are optimal in this sense.The proofs are based on the Fourier splitting method,low-frequency and high-frequency decomposition,and delicate energy estimates.展开更多
Using the dynamic optimization theory, we described a decision-making model for farmer choosing land use when there are several different kinds of uses for land. To obtain an empirical model that could be easily appli...Using the dynamic optimization theory, we described a decision-making model for farmer choosing land use when there are several different kinds of uses for land. To obtain an empirical model that could be easily applied, decision rules for farmer with a single static expectation were given.展开更多
Based on the theory of consumer behavior,this paper analyzes the current situation of tourism shopping market in Kunming,and analyzes the decision-making behavior of tourists shopping in Kunming with the questionnaire...Based on the theory of consumer behavior,this paper analyzes the current situation of tourism shopping market in Kunming,and analyzes the decision-making behavior of tourists shopping in Kunming with the questionnaire survey,and clarifies the influencing factors of the decision-making behavior of visitors to Kunming.In the future,the influencing factors of Kunming tourists'shopping decision-making behavior are combined with the current situation of Kunming's tourism shopping market.The problems of cheating-induced shopping,the high price of shopping products,the low level of tourism shopping experience and the imperfect after-sales service are analyzed.Finally,the corresponding countermeasures and suggestions are proposed from four aspects:rectifying the tourism shopping market,establishing a sound price supervision mechanism,strengthening the tourism shopping experience,and improving after-sales service.展开更多
In the real situations of supply chain, there are different parts such as facilities, logistics warehouses and retail stores and they handle common kinds of products. In this research, these situations are focused on ...In the real situations of supply chain, there are different parts such as facilities, logistics warehouses and retail stores and they handle common kinds of products. In this research, these situations are focused on as the background of this research. They deal with the common quantities of their products, but due to their different environments, the optimal production quantity of one part can be unacceptable to another part and it may suffer a heavy loss. To avoid that kind of unacceptable situations, the common production quantities should be acceptable to all parts in one supply chain. Therefore, the motivation of this research is the necessity of the method to find the production quantities that make all decision makers acceptable is needed. However, it is difficult to find the production quantities that make all decision makers acceptable. Moreover, their acceptable ranges do not always have common ranges. In the decision making of car design, there are similar situations to this type of decision making. The performance of a car consists of purposes such as fuel efficiency, size and so on. Improving one purpose makes another worse and the relationship between these purposes is tradeoff. In these cases, Suriawase process is applied. This process consists of negotiations and reviews of the requirements of the purposes. In the step of negotiations, the requirements of the purposes are share among all decision makers and the solution that makes them as satisfied as possible. In the step of reviews of the requirements, they are reviewed based on the result of the negotiation if the result is unacceptable to some of decision makers. Therefore, through the iterations of the two steps, the solution that makes all decision makers satisfied is obtained. However, in the previous research, the effects that one decision maker reviews requirements in Suriawase process are quantified, but the mathematical model to modify the ranges of production quantities of all decision makers simultaneously is not shown. Therefore, in this research, based on Suriawase process, the mathematical model of multi-player multi-objective decision making is proposed. The mathematical model of multi-player multi-objective decision making by using linear physical programming (LPP) and robust optimization (RO) in the previous research is the basis of the methods of this research. LPP is one of the multi-objective optimization methods and RO is used to make the balance of the preference levels among decision makers. In LPP, the preference ranges of all objective functions are needed, so as the hypothesis of this research. In the research referred in this research, the method to control the effect of RO is not shown. If the effect of RO is too big, the average of the preference level becomes worse. The purpose of this research is to reproduce the mathematical model of multi-player multi-objective decision making based on Suriawase process and propose the method to control the effect of RO. In the proposed model, a set of the solutions of the negotiation problem is obtained and it is proved by the result of the numerical experiment. Therefore, the conclusion that the proposed model is available to obtain a set of the solutions of the negotiation problems in supply chain.展开更多
The high temperature deformation behaviors of α+β type titanium alloy TC11 (Ti-6.5Al-3.5Mo-1.5Zr-0.3Si) with coarse lamellar starting microstructure were investigated based on the hot compression tests in the tem...The high temperature deformation behaviors of α+β type titanium alloy TC11 (Ti-6.5Al-3.5Mo-1.5Zr-0.3Si) with coarse lamellar starting microstructure were investigated based on the hot compression tests in the temperature range of 950-1100 ℃ and the strain rate range of 0.001-10 s-1. The processing maps at different strains were then constructed based on the dynamic materials model, and the hot compression process parameters and deformation mechanism were optimized and analyzed, respectively. The results show that the processing maps exhibit two domains with a high efficiency of power dissipation and a flow instability domain with a less efficiency of power dissipation. The types of domains were characterized by convergence and divergence of the efficiency of power dissipation, respectively. The convergent domain in a+fl phase field is at the temperature of 950-990 ℃ and the strain rate of 0.001-0.01 s^-1, which correspond to a better hot compression process window of α+β phase field. The peak of efficiency of power dissipation in α+β phase field is at 950 ℃ and 0.001 s 1, which correspond to the best hot compression process parameters of α+β phase field. The convergent domain in β phase field is at the temperature of 1020-1080 ℃ and the strain rate of 0.001-0.1 s^-l, which correspond to a better hot compression process window of β phase field. The peak of efficiency of power dissipation in ℃ phase field occurs at 1050 ℃ over the strain rates from 0.001 s^-1 to 0.01 s^-1, which correspond to the best hot compression process parameters of ,8 phase field. The divergence domain occurs at the strain rates above 0.5 s^-1 and in all the tested temperature range, which correspond to flow instability that is manifested as flow localization and indicated by the flow softening phenomenon in stress-- strain curves. The deformation mechanisms of the optimized hot compression process windows in a+β and β phase fields are identified to be spheroidizing and dynamic recrystallizing controlled by self-diffusion mechanism, respectively. The microstructure observation of the deformed specimens in different domains matches very well with the optimized results.展开更多
基金supported by the National Natural Science Foundation of China under Grant T2521006,Grant 62403483,Grant 62533021 and Grant U24A20279.
文摘Reinforcement learning(RL)has been widely studied as an efficient class of machine learning methods for adaptive optimal control under uncertainties.In recent years,the applications of RL in optimised decision-making and motion control of intelligent vehicles have received increasing attention.Due to the complex and dynamic operating environments of intelligent vehicles,it is necessary to improve the learning efficiency and generalisation ability of RL-based decision and control algorithms under different conditions.This survey systematically examines the theoretical foundations,algorithmic advancements and practical challenges of applying RL to intelligent vehicle systems operating in complex and dynamic environments.The major algorithm frameworks of RL are first introduced,and the recent advances in RL-based decision-making and control of intelligent vehicles are overviewed.In addition to self-learning decision and control approaches using state measurements,the developments of DRL methods for end-to-end driving control of intelligent vehicles are summarised.The open problems and directions for further research works are also discussed.
基金Project supported by the National Natural Science Foundation of China (Grant No. 72174121)the Program for Professor of Special Appointment (Eastern Scholar) at Shanghai Institutions of Higher Learning, and the Soft Science Research Project of Shanghai (Grant No. 22692112600)。
文摘Information plays a crucial role in guiding behavioral decisions during public health emergencies. Individuals communicate to acquire relevant knowledge about an epidemic, which influences their decisions to adopt protective measures.However, whether to disseminate specific information is also a behavioral decision. In light of this understanding, we develop a coupled information–vaccination–epidemic model to depict these co-evolutionary dynamics in a three-layer network. Negative information dissemination and vaccination are treated as separate decision-making processes. We then examine the combined effects of herd and risk motives on information dissemination and vaccination decisions through the lens of game theory. The microscopic Markov chain approach(MMCA) is used to describe the dynamic process and to derive the epidemic threshold. Simulation results indicate that increasing the cost of negative information dissemination and providing timely clarification can effectively control the epidemic. Furthermore, a phenomenon of diminishing marginal utility is observed as the cost of dissemination increases, suggesting that authorities do not need to overinvest in suppressing negative information. Conversely, reducing the cost of vaccination and increasing vaccine efficacy emerge as more effective strategies for outbreak control. In addition, we find that the scale of the epidemic is greater when the herd motive dominates behavioral decision-making. In conclusion, this study provides a new perspective for understanding the complexity of epidemic spreading by starting with the construction of different behavioral decisions.
文摘In the rapidly evolving technological landscape,state-owned enterprises(SOEs)encounter significant challenges in sustaining their competitiveness through efficient R&D management.Integrated Product Development(IPD),with its emphasis on cross-functional teamwork,concurrent engineering,and data-driven decision-making,has been widely recognized for enhancing R&D efficiency and product quality.However,the unique characteristics of SOEs pose challenges to the effective implementation of IPD.The advancement of big data and artificial intelligence technologies offers new opportunities for optimizing IPD R&D management through data-driven decision-making models.This paper constructs and validates a data-driven decision-making model tailored to the IPD R&D management of SOEs.By integrating data mining,machine learning,and other advanced analytical techniques,the model serves as a scientific and efficient decision-making tool.It aids SOEs in optimizing R&D resource allocation,shortening product development cycles,reducing R&D costs,and improving product quality and innovation.Moreover,this study contributes to a deeper theoretical understanding of the value of data-driven decision-making in the context of IPD.
基金Project supported by the National Natural Science Foundation of China (Grant No. 62371253)the Postgraduate Research and Practice Innovation Program of Jiangsu Province, China (Grant No. KYCX24_1179)。
文摘As the economy grows, environmental issues are becoming increasingly severe, making the promotion of green behavior more urgent. Information dissemination and policy regulation play crucial roles in influencing and amplifying the spread of green behavior across society. To this end, a novel three-layer model in multilayer networks is proposed. In the novel model, the information layer describes green information spreading, the physical contact layer depicts green behavior propagation, and policy regulation is symbolized by an isolated node beneath the two layers. Then, we deduce the green behavior threshold for the three-layer model using the microscopic Markov chain approach. Moreover, subject to some individuals who are more likely to influence others or become green nodes and the limitations of the capacity of policy regulation, an optimal scheme is given that could optimize policy interventions to most effectively prompt green behavior.Subsequently, simulations are performed to validate the preciseness and theoretical results of the new model. It reveals that policy regulation can prompt the prevalence and outbreak of green behavior. Then, the green behavior is more likely to spread and be prevalent in the SF network than in the ER network. Additionally, optimal allocation is highly successful in facilitating the dissemination of green behavior. In practice, the optimal allocation strategy could prioritize interventions at critical nodes or regions, such as highly connected urban areas, where the impact of green behavior promotion would be most significant.
基金supported by the National Natural Science Foundation of China(Grant No.52179105)China Postdoctoral Science Foundation(Grant No.2024M762193)。
文摘In tunnel construction,tunnel boring machine(TBM)tunnelling typically relies on manual experience with sub-optimal control parameters,which can easily lead to inefficiency and high costs.This study proposed an intelligent decision-making method for TBM tunnelling control parameters based on multiobjective optimization(MOO).First,the effective TBM operation dataset is obtained through data preprocessing of the Songhua River(YS)tunnel project in China.Next,the proposed method begins with developing machine learning models for predicting TBM tunnelling performance parameters(i.e.total thrust and cutterhead torque),rock mass classification,and hazard risks(i.e.tunnel collapse and shield jamming).Then,considering three optimal objectives,(i.e.,penetration rate,rock-breaking energy consumption,and cutterhead hob wear),the MOO framework and corresponding mathematical expression are established.The Pareto optimal front is solved using DE-NSGA-II algorithm.Finally,the optimal control parameters(i.e.,advance rate and cutterhead rotation speed)are obtained by the satisfactory solution determination criterion,which can balance construction safety and efficiency with satisfaction.Furthermore,the proposed method is validated through 50 cases of TBM tunnelling,showing promising potential of application.
文摘Several models of multi-criteria decision-making(MCDM)have identified the optimal alternative electrical energy sources to supply certain load in an isolated region in Al-Minya City,Egypt.The load demand consists of water pumping system with a water desalination unit.Various options containing three different power sources:only DG,PV-B system,and hybrid PV-DG-B,two different sizes of reverse osmosis(RO)units;RO-250 and RO-500,two strategies of energy management;load following(LF)and cycle charging(CC),and two sizes of DG;5 and 10 kW were taken into account.Eight attributes,including operating cost,renewable fraction,initial cost,the cost of energy,excess energy,unmet load,breakeven grid extension distance,and the amount of CO_(2),were used during the evaluation process.To estimate these parameters,HOMER®software was employed to perform both the simulation and optimization process.Four different weight estimation methods were considered;no priority of criteria,based on a pairwise comparisons matrix of the criteria,CRITIC-method,and entropy-based method.The main findings(output results)confirmed that the optimal option for the case study was hybrid PV-DG-B with the following specification:5 kW DG,RO-500,and load following control strategy.Under this condition,the annual operating cost and initial costs were$5546 and$161022,respectively,whereas the cost of energy was 0.077$/kWh.The excess energy and unmet loads were 40998 and 2371 kWh,respectively.The breakeven grid extension distance and the amount of CO_(2) were 3.31 km and 5171 kg per year,respectively.Compared with DG only,the amount of CO_(2) has been sharply reduced by 113939 kg per year.
基金This work was supported by Project of Philosophy and Social Science Foundation of Shanghai,China(Grant No.2020BGL011).
文摘The randomness and uncertainty of renewable energy generation are expected to significantly change the optimal decision-making of trans-provincial electricity market subjects.Therefore,it is beneficial to optimize the interests of each of these subjects,considering the unpredictable risks of renewable energy under the renewable portfolio standards(RPS)and researching their effects on the optimal decision-making of transprovincial electricity market multi-subjects.First,we develop a trans-provincial trading market mechanism for renewable energy and clarify the electricity supply and demand relation and the green certificates supply and demand relation of trans-provincial electricitymarketmulti-subjects.Then,under the RPS,we construct a multi-subject game model of the power supply chain that recognizes the risks,and adopt the reverse induction method to discuss the optimum risk-taking judgment of each subject in the trans-provincial electricity market.Finally,we useMATLAB to verify the viability and efficacy of the proposed gamemodel,and obtain a certain reference value for the optimal decision-making of trans-provincial electricity market subjects.In summary,we consider the uncertainty risks of renewable energy under RPS,study the effects of the green certificate price and risk aversion coefficient in the RPS mechanism on the optimal decisionmaking of trans-provincial electricity market subjects,and obtain the changing trends of two different power products and those of different electricity market subjects under the influence of the green certificate price and risk aversion coefficient,which have a certain reference value for studying the factors affecting the optimal decision-making of trans-provincial electricity market subjects.
文摘Combining the heuristic algorithm (HA) developed based on the specific knowledge of the cooperative multiple target attack (CMTA) tactics and the particle swarm optimization (PSO), a heuristic particle swarm optimization (HPSO) algorithm is proposed to solve the decision-making (DM) problem. HA facilitates to search the local optimum in the neighborhood of a solution, while the PSO algorithm tends to explore the search space for possible solutions. Combining the advantages of HA and PSO, HPSO algorithms can find out the global optimum quickly and efficiently. It obtains the DM solution by seeking for the optimal assignment of missiles of friendly fighter aircrafts (FAs) to hostile FAs. Simulation results show that the proposed algorithm is superior to the general PSO algorithm and two GA based algorithms in searching for the best solution to the DM problem.
基金jointly granted by the Science and Technology on Avionics Integration Laboratory and the Aeronautical Science Foundation of China (No. 2016ZC15008)
文摘A decision-making problem of missile-target assignment with a novel particle swarm optimization algorithm is proposed when it comes to a multiple target collaborative combat situation.The threat function is established to describe air combat situation.Optimization function is used to find an optimal missile-target assignment.An improved particle swarm optimization algorithm is utilized to figure out the optimization function with less parameters,which is based on the adaptive random learning approach.According to the coordinated attack tactics,there are some adjustments to the assignment.Simulation example results show that it is an effective algorithm to handle with the decision-making problem of the missile-target assignment(MTA)in air combat.
基金supported by the Deanship of Graduate Studies and Scientific Research at Qassim University(QU-APC-2024-9/1).
文摘Due to the numerous variables to take into account as well as the inherent ambiguity and uncertainty,evaluating educational institutions can be difficult.The concept of a possibility Pythagorean fuzzy hypersoft set(pPyFHSS)is more flexible in this regard than other theoretical fuzzy set-like models,even though some attempts have been made in the literature to address such uncertainties.This study investigates the elementary notions of pPyFHSS including its set-theoretic operations union,intersection,complement,OR-and AND-operations.Some results related to these operations are also modified for pPyFHSS.Additionally,the similarity measures between pPyFHSSs are formulated with the assistance of numerical examples and results.Lastly,an intelligent decision-assisted mechanism is developed with the proposal of a robust algorithm based on similarity measures for solving multi-attribute decision-making(MADM)problems.A case study that helps the decision-makers assess the best educational institution is discussed to validate the suggested system.The algorithmic results are compared with the most pertinent model to evaluate the adaptability of pPyFHSS,as it generalizes the classical possibility fuzzy set-like theoretical models.Similarly,while considering significant evaluating factors,the flexibility of pPyFHSS is observed through structural comparison.
文摘Suicide risk constitutes a complex set of interacting demographic, clinical, psychobiological and environmental variables. Impulsivity is a long-known risk factor for suicide attempts. However, research based on clearer conceptual refinement in this area is imperative. One emerging field of study is that of decision-making. Impulsivity involves a failure of higher-order control, including decision-making. Using standardized operational definitions that take into consideration relevant aspects of impulsivity, including state- and trait-components and a deeper understanding of the process of decision-making in the suicidal mind, we may come a step closer to understanding suicidality and winning the fight in this scourge of human suffering.
文摘This paper explores the decision-making mechanism of the consuming behavior hidden behind the sudden popularity of the Oriental Selection Company in terms of the mental accounting theory.Firstly,according to the“Non-alternative”characteristics of mental accounting,this paper expounds how the strategy of the bilingual live-streaming of Oriental Selection promoters stimulates consumers’desire to buy the advertised products and services whilst using the utility theory of mental accounting to analyze how Oriental Selection promoters improve consumers’acquisition utility and total utility.Secondly,we sum up the successful experiences of Oriental Selection:The live-streaming industry should apply the theory of mental accounting in effectively overcoming the shortcomings of the live-streamed marketing by stimulating consumers’desire and influencing their decision-making behavior through the streaming of content that triggers them to make purchases.This is achievable by abandoning the traditional ways of loudly urging consumers to buy goods.Finally,this paper puts forward some suggestions on how to use the mental accounting theory in promoting sustainable consumption and points out the prospects for Oriental Selection.
基金funded by the Italian Civil Protection Department and“PriorBuilt-Prioritisation of the Italian regions for seismic and energy performance upgrading of the existing buildings”funded by ReLUIS.Additionally,it was developed as part of the activities of CONSTRUCT–Instituto de I&D em Estruturas e Construções(UID/04708),CERIS(UIDB/04625)+1 种基金the project SERENE(2022.08138.PTDC)all funded by Fundação para a Ciência e a Tecnologia,I.P./MCTES(PIDDAC).
文摘Recent research demonstrates the need for comprehensive frameworks to achieve an appropriate level of resilience(e.g.,energy,seismic)of the European building stock,through integrated retrofitting interventions.Different frameworks have been proposed to identify optimal interventions when several feasible alternatives are available,considering multiple decision variables of different nature,such as social,economic,or technical.Within these efforts and frameworks,less attention has been paid to the post-earthquake recovery time of buildings and communities,thus ignoring the significance of reaching a desired recovery state(e.g.,functional recovery)within a specified time frame.To overcome this limitation,this study estimates post-earthquake recovery times and uses them as one of the decision variables in multi-criteria identification of optimal retrofitting of an existing RC building.The case-study building is representative of the Italian school buildings constructed between the 1960s and 1970s and was analysed under two seismic hazard levels(moderate and high).Following the identification of the main structural deficiencies of the as-built structure through nonlinear static analyses,four seismic retrofit measures were selected.Then,the earthquake-induced downtime of each of the four retrofitted building configurations was assessed,analysing the different recovery times as a function of the seismic hazard level and the recovery state.A downtime-based metric,namely the expected annual downtime,was introduced as decision variable within an available multi-criteria decision-making framework to include the impact of downtime,rank the four retrofit measures and identify the preferable one.
基金supported by the Anhui Provincial Natural Science Foundation(2408085QA031)the third author's work was supported by the National Natural Science Foundation of China(12001033).
文摘This paper studies the global existence and large-time behaviors of weak solutions to the kinetic particle model coupled with the incompressible Navier-Stokes equations in IR3.First,we obtain the global weak solution using the characteristic and energy methods.Then,under the small assumption of the mass of the particle,we show that the solutions decay at the algebraic time-decay rate.Finally,it is also proved that the above rate is optimal.It should be remarked that if the particle in the coupled system vanishes(i.e.f=O),our works coincide with the classical results by Schonbek[32](J Amer Math Soc,1991,4:423-449),which can be regarded as a generalization from a single fuid model to the two-phase fluid one.
基金supported by the National Natural Science Foundation of China(No.62141302)the Humanities Social Science Programming Project of the Ministry of Education of China(No.20YJA630059)+2 种基金the Natural Science Foundation of Jiangxi Province of China(No.20212BAB201011)the China Postdoctoral Science Foundation(No.2019M662265)the Research Project of Economic and Social Development in Liaoning Province of China(No.2022lslybkt-053).
文摘Public-private partnerships(PPPs)have been used by governments around the world to procure and construct infrastructural amenities.It relies on private sector expertise and funding to achieve this lofty objective.However,given the uncertainties of project management,transparency,accountability,and expropriation,this phenomenon has gained tremendous attention in recent years due to the important role it plays in curbing infrastructural deficits globally.Interestingly,the reasonable benefit distribution scheme in a PPP project is related to the behavior decisionmaking of the government and social capital,aswell as the performance of the project.In this paper,the government and social capital which are the key stakeholders of PPP projects were selected as the research objects.Based on the fuzzy expected value model and game theory,a hybrid method was adopted in this research taking into account the different risk preferences of both public entities and private parties under the fuzzy demand environment.To alleviate the problem of insufficient utilization of social capital in a PPP project,this paper seeks to grasp the relationship that exists between the benefit distribution of stakeholders,their behavioral decision-making,and project performance,given that they impact the performance of both public entities and private parties,as well as assist in maximizing the overall utility of the project.Furthermore,four game models were constructed in this study,while the expected value and opportunity-constrained programming model for optimal decision-making were derived using alternate perspectives of both centralized decision-making and decentralized decision-making.Afterward,the optimal behavioral decision-making of public entities and private parties in four scenarios was discussed and thereafter compared,which led to an ensuing discussion on the benefit distribution system under centralized decision-making.Lastly,based on an example case,the influence of different confidence levels,price,and fuzzy uncertainties of PPP projects on the equilibrium strategy results of both parties were discussed,giving credence to the effectiveness of the hybrid method.The results indicate that adjusting different confidence levels yields different equilibriumpoints,and therefore signposts that social capital has a fair perception of opportunities,as well as identifies reciprocal preferences.Nevertheless,we find that an increase in the cost coefficient of the government and social capital does not inhibit the effort of both parties.Our results also indicate that a reasonable benefit distribution of PPP projects can assist them in realizing optimum Pareto improvements over time.The results provide us with very useful strategies and recommendations to improve the overall performance of PPP projects in China.
基金partially supported by the National Nature Science Foundation of China(12271114)the Guangxi Natural Science Foundation(2023JJD110009,2019JJG110003,2019AC20214)+2 种基金the Innovation Project of Guangxi Graduate Education(JGY2023061)the Key Laboratory of Mathematical Model and Application(Guangxi Normal University)the Education Department of Guangxi Zhuang Autonomous Region。
文摘We are concerned with the large-time behavior of 3D quasilinear hyperbolic equations with nonlinear damping.The main novelty of this paper is two-fold.First,we prove the optimal decay rates of the second and third order spatial derivatives of the solution,which are the same as those of the heat equation,and in particular,are faster than ones of previous related works.Second,for well-chosen initial data,we also show that the lower optimal L^(2) convergence rate of the k(∈[0,3])-order spatial derivatives of the solution is(1+t)^(-(2+2k)/4).Therefore,our decay rates are optimal in this sense.The proofs are based on the Fourier splitting method,low-frequency and high-frequency decomposition,and delicate energy estimates.
文摘Using the dynamic optimization theory, we described a decision-making model for farmer choosing land use when there are several different kinds of uses for land. To obtain an empirical model that could be easily applied, decision rules for farmer with a single static expectation were given.
文摘Based on the theory of consumer behavior,this paper analyzes the current situation of tourism shopping market in Kunming,and analyzes the decision-making behavior of tourists shopping in Kunming with the questionnaire survey,and clarifies the influencing factors of the decision-making behavior of visitors to Kunming.In the future,the influencing factors of Kunming tourists'shopping decision-making behavior are combined with the current situation of Kunming's tourism shopping market.The problems of cheating-induced shopping,the high price of shopping products,the low level of tourism shopping experience and the imperfect after-sales service are analyzed.Finally,the corresponding countermeasures and suggestions are proposed from four aspects:rectifying the tourism shopping market,establishing a sound price supervision mechanism,strengthening the tourism shopping experience,and improving after-sales service.
文摘In the real situations of supply chain, there are different parts such as facilities, logistics warehouses and retail stores and they handle common kinds of products. In this research, these situations are focused on as the background of this research. They deal with the common quantities of their products, but due to their different environments, the optimal production quantity of one part can be unacceptable to another part and it may suffer a heavy loss. To avoid that kind of unacceptable situations, the common production quantities should be acceptable to all parts in one supply chain. Therefore, the motivation of this research is the necessity of the method to find the production quantities that make all decision makers acceptable is needed. However, it is difficult to find the production quantities that make all decision makers acceptable. Moreover, their acceptable ranges do not always have common ranges. In the decision making of car design, there are similar situations to this type of decision making. The performance of a car consists of purposes such as fuel efficiency, size and so on. Improving one purpose makes another worse and the relationship between these purposes is tradeoff. In these cases, Suriawase process is applied. This process consists of negotiations and reviews of the requirements of the purposes. In the step of negotiations, the requirements of the purposes are share among all decision makers and the solution that makes them as satisfied as possible. In the step of reviews of the requirements, they are reviewed based on the result of the negotiation if the result is unacceptable to some of decision makers. Therefore, through the iterations of the two steps, the solution that makes all decision makers satisfied is obtained. However, in the previous research, the effects that one decision maker reviews requirements in Suriawase process are quantified, but the mathematical model to modify the ranges of production quantities of all decision makers simultaneously is not shown. Therefore, in this research, based on Suriawase process, the mathematical model of multi-player multi-objective decision making is proposed. The mathematical model of multi-player multi-objective decision making by using linear physical programming (LPP) and robust optimization (RO) in the previous research is the basis of the methods of this research. LPP is one of the multi-objective optimization methods and RO is used to make the balance of the preference levels among decision makers. In LPP, the preference ranges of all objective functions are needed, so as the hypothesis of this research. In the research referred in this research, the method to control the effect of RO is not shown. If the effect of RO is too big, the average of the preference level becomes worse. The purpose of this research is to reproduce the mathematical model of multi-player multi-objective decision making based on Suriawase process and propose the method to control the effect of RO. In the proposed model, a set of the solutions of the negotiation problem is obtained and it is proved by the result of the numerical experiment. Therefore, the conclusion that the proposed model is available to obtain a set of the solutions of the negotiation problems in supply chain.
基金Project (51005112) supported by the National Natural Science Foundation of ChinaProject (2010ZF56019) supported by the Aviation Science Foundation of China+1 种基金Project (GJJ11156) supported by the Education Commission of Jiangxi Province, ChinaProject(GF200901008) supported by the Open Fund of National Defense Key Disciplines Laboratory of Light Alloy Processing Science and Technology, China
文摘The high temperature deformation behaviors of α+β type titanium alloy TC11 (Ti-6.5Al-3.5Mo-1.5Zr-0.3Si) with coarse lamellar starting microstructure were investigated based on the hot compression tests in the temperature range of 950-1100 ℃ and the strain rate range of 0.001-10 s-1. The processing maps at different strains were then constructed based on the dynamic materials model, and the hot compression process parameters and deformation mechanism were optimized and analyzed, respectively. The results show that the processing maps exhibit two domains with a high efficiency of power dissipation and a flow instability domain with a less efficiency of power dissipation. The types of domains were characterized by convergence and divergence of the efficiency of power dissipation, respectively. The convergent domain in a+fl phase field is at the temperature of 950-990 ℃ and the strain rate of 0.001-0.01 s^-1, which correspond to a better hot compression process window of α+β phase field. The peak of efficiency of power dissipation in α+β phase field is at 950 ℃ and 0.001 s 1, which correspond to the best hot compression process parameters of α+β phase field. The convergent domain in β phase field is at the temperature of 1020-1080 ℃ and the strain rate of 0.001-0.1 s^-l, which correspond to a better hot compression process window of β phase field. The peak of efficiency of power dissipation in ℃ phase field occurs at 1050 ℃ over the strain rates from 0.001 s^-1 to 0.01 s^-1, which correspond to the best hot compression process parameters of ,8 phase field. The divergence domain occurs at the strain rates above 0.5 s^-1 and in all the tested temperature range, which correspond to flow instability that is manifested as flow localization and indicated by the flow softening phenomenon in stress-- strain curves. The deformation mechanisms of the optimized hot compression process windows in a+β and β phase fields are identified to be spheroidizing and dynamic recrystallizing controlled by self-diffusion mechanism, respectively. The microstructure observation of the deformed specimens in different domains matches very well with the optimized results.