期刊文献+
共找到27,163篇文章
< 1 2 250 >
每页显示 20 50 100
基于Q-learning的专家权重优化与多级共识反馈决策
1
作者 杜秀丽 程伟龙 +2 位作者 高星 潘成胜 吕亚娜 《计算机应用研究》 北大核心 2026年第2期420-426,共7页
针对动态复杂多属性决策环境下大规模异构专家群体共识达成效率低、权重分配不精准的问题,提出一种基于Q-learning的权重优化与多级共识反馈方法,旨在提升共识水平与决策质量。该方法通过将专家权重动态调整建模为马尔可夫决策过程,利用... 针对动态复杂多属性决策环境下大规模异构专家群体共识达成效率低、权重分配不精准的问题,提出一种基于Q-learning的权重优化与多级共识反馈方法,旨在提升共识水平与决策质量。该方法通过将专家权重动态调整建模为马尔可夫决策过程,利用Q-learning实现权重自适应优化,并设计涵盖属性、方案、专家与群体四个层级的多级共识反馈机制,从而精准识别并协调不同来源的分歧。实验结果表明,该方法能够显著降低共识达成所需迭代次数,提升权重分配与专家专业度的匹配精度,并获得更可靠的方案排序结果,验证了其在大规模异构专家群体中的鲁棒性与计算效率。研究表明,所提方法为复杂多属性群体决策问题提供了有效的共识建模与决策支持工具。 展开更多
关键词 群体决策 q-learning 多层共识反馈 动态权重调整
在线阅读 下载PDF
基于Q-Learning长尾延迟优化的SSD-SMR写缓存策略研究
2
作者 刘健 章步镐 +4 位作者 方匡弛 刘宣锋 孙国道 梁荣华 梁浩然 《计算机工程》 北大核心 2026年第3期287-298,共12页
随着全球数据规模的不断增大,如何以低成本的方式有效提升数据的访问性能是存储系统面临的一项重要挑战,使用低延迟、高带宽的固态硬盘(SSD)和低成本、高存储密度的叠瓦式磁盘(SMR)来构建缓存系统,成为一种有效的解决方案。但是,SMR固... 随着全球数据规模的不断增大,如何以低成本的方式有效提升数据的访问性能是存储系统面临的一项重要挑战,使用低延迟、高带宽的固态硬盘(SSD)和低成本、高存储密度的叠瓦式磁盘(SMR)来构建缓存系统,成为一种有效的解决方案。但是,SMR固有的机械运动和多磁道堆叠的特性导致其写性能较差,SSD中的脏数据频繁写回SMR所导致的大量读-合并-写(RMW)操作可能会引起严重的长尾延迟现象。为此,基于SSD-SMR混合存储架构提出一种结合强化学习Q-Learning算法的缓存替换优化策略。通过学习SMR设备的I/O负载状况与延迟之间的经验知识来控制对SMR的写入,当SMR负载较大时,通过控制缓存中脏数据的逐出来减少SMR因写回而产生的大量RMW操作,从而优化系统在不同负载下的尾部延迟开销。将Q-Learning算法与基于数据流行度的缓存算法LRU以及SMR感知的缓存算法SAC进行结合,使用真实企业Trace和YCSB生成的模拟Trace进行测试,实验结果表明,所提方法能够有效提升现有缓存算法的性能,可以降低57.06%的平均延迟和87.49%的尾部延迟。 展开更多
关键词 q-learning算法 I/O负载 长尾延迟 缓存替换算法 混合存储
在线阅读 下载PDF
基于Q-Learning的多模态自适应光伏功率优化组合预测
3
作者 隗知初 杨苹 +3 位作者 周钱雨凡 陈文皓 万思洋 崔嘉雁 《电力工程技术》 北大核心 2026年第1期115-124,163,共11页
针对光伏功率序列波动性强、随机性高的问题,文中提出一种基于Q-Learning的多模态自适应光伏功率优化组合预测模型。首先,采用鲸鱼优化算法的变分模态分解方法,将原始光伏功率序列分解成不同子模态,并通过集成特征筛选模型,确定各子模... 针对光伏功率序列波动性强、随机性高的问题,文中提出一种基于Q-Learning的多模态自适应光伏功率优化组合预测模型。首先,采用鲸鱼优化算法的变分模态分解方法,将原始光伏功率序列分解成不同子模态,并通过集成特征筛选模型,确定各子模态序列最敏感的气象因素。然后,构建反向传播神经网络、双向长短期记忆网络、门控循环单元网络和时间卷积网络4种基础预测模型。考虑到不同模型对不同频率特征的子序列预测能力不同,利用Q-Learning算法自适应选择各模态对应的最优基础模型组合方式。最后,将不同子模态的预测结果叠加重构,得到最终预测结果,并利用高分辨率光伏气象功率数据集进行验证。结果证明,文中所提出的基于Q-Learning的多模态自适应光伏功率优化组合预测模型,相较于单一模型的预测误差平均绝对误差下降了16.18%,均方误差下降了17.00%。 展开更多
关键词 鲸鱼优化算法 变分模态分解 q-learning 功率预测 组合模型 光伏发电
在线阅读 下载PDF
基于随机森林与Q-learning融合的多元电力数据存储优化决策方法
4
作者 叶学顺 贾东梨 +2 位作者 周俊 唐英 贾梓豪 《科学技术与工程》 北大核心 2026年第3期1065-1074,共10页
大规模和多样的电力数据存储面临效率低和内存容量不足的瓶颈问题。数据索引和数据压缩等传统数据存储优化方法各有优劣势,如何有效应用于电力数据存储是目前研究的难点。为了解决这个问题,提出了一种融合随机森林和Q-learning的多元电... 大规模和多样的电力数据存储面临效率低和内存容量不足的瓶颈问题。数据索引和数据压缩等传统数据存储优化方法各有优劣势,如何有效应用于电力数据存储是目前研究的难点。为了解决这个问题,提出了一种融合随机森林和Q-learning的多元电力数据存储优化决策方法。该方法中的关键技术包括:首先提出了基于改进随机森林算法的存储优化策略决策模型,引入信息增益方法,综合评价数据存储时对数据库的数据访问频率、查询时间、存储速度以及数据冗余率等因素影响,做出数据直接存储、数据索引存储和数据压缩存储的存储优化方法策略决策;其次提出了基于改进Q-learning算法的数据存储算法决策模型,引入多尺度学习机制、优先经验放回机制和正负向奖励机制,决策数据索引存储时适用的索引算法以及数据压缩存储时适用的数据压缩算法。本方法有效融合了数据索引与数据压缩的技术优势,大幅提升数据存储效率并节约存储空间,为大规模多元电力数据管理提供新的解决方案。 展开更多
关键词 随机森林算法 q-learning算法 数据存储优化方法 数据索引算法 数据压缩算法
在线阅读 下载PDF
A Kind of Fast Iterative Methods With the Application Based on Diagonal Matrix Splitting 被引量:1
5
作者 XU Qiuyan 《宁夏大学学报(自然科学版中英文)》 2026年第1期1-13,共13页
The fast solution of linear equations has always been one of the hot spots in scientific computing.A kind of the diagonal matrix splitting iteration methods are provided,which is different from the classical matrix sp... The fast solution of linear equations has always been one of the hot spots in scientific computing.A kind of the diagonal matrix splitting iteration methods are provided,which is different from the classical matrix splitting methods.Taking the decomposition of the diagonal elements for coefficient matrix as the key point,some new preconditioners are constructed.Taking the tri-diagonal coefficient matrix as an example,the convergence domains and optimal relaxation factor of the new method are analyzed theoretically.The presented new iteration methods are applied to solve linear algebraic equations,even 2D and 3D diffusion problems with the fully implicit discretization.The results of numerical experiments are matched with the theoretical analysis,and show that the iteration numbers are reduced greatly.The superiorities of presented iteration methods exceed some classical iteration methods dramatically. 展开更多
关键词 iterATION matrix splitting diffusion equation CONVERGENCE optimal relaxation factor
在线阅读 下载PDF
A New Inversion-free Iterative Method for Solving the Nonlinear Matrix Equation and Its Application in Optimal Control
6
作者 GAO Xiangyu XIE Weiwei ZHANG Lina 《应用数学》 北大核心 2026年第1期143-150,共8页
In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to ... In this paper,we consider the maximal positive definite solution of the nonlinear matrix equation.By using the idea of Algorithm 2.1 in ZHANG(2013),a new inversion-free method with a stepsize parameter is proposed to obtain the maximal positive definite solution of nonlinear matrix equation X+A^(*)X|^(-α)A=Q with the case 0<α≤1.Based on this method,a new iterative algorithm is developed,and its convergence proof is given.Finally,two numerical examples are provided to show the effectiveness of the proposed method. 展开更多
关键词 Nonlinear matrix equation Maximal positive definite solution Inversion-free iterative method Optimal control
在线阅读 下载PDF
FAIR-DQL:Fairness-Aware Deep Q-Learning for Enhanced Resource Allocation and RIS Optimization in High-Altitude Platform Networks
7
作者 Muhammad Ejaz Muhammad Asim +1 位作者 Mudasir Ahmad Wani Kashish Ara Shakil 《Computers, Materials & Continua》 2026年第3期758-779,共22页
The integration of High-Altitude Platform Stations(HAPS)with Reconfigurable Intelligent Surfaces(RIS)represents a critical advancement for next-generation wireless networks,offering unprecedented opportunities for ubi... The integration of High-Altitude Platform Stations(HAPS)with Reconfigurable Intelligent Surfaces(RIS)represents a critical advancement for next-generation wireless networks,offering unprecedented opportunities for ubiquitous connectivity.However,existing research reveals significant gaps in dynamic resource allocation,joint optimization,and equitable service provisioning under varying channel conditions,limiting practical deployment of these technologies.This paper addresses these challenges by proposing a novel Fairness-Aware Deep Q-Learning(FAIRDQL)framework for joint resource management and phase configuration in HAPS-RIS systems.Our methodology employs a comprehensive three-tier algorithmic architecture integrating adaptive power control,priority-based user scheduling,and dynamic learning mechanisms.The FAIR-DQL approach utilizes advanced reinforcement learning with experience replay and fairness-aware reward functions to balance competing objectives while adapting to dynamic environments.Key findings demonstrate substantial improvements:9.15 dB SINR gain,12.5 bps/Hz capacity,78%power efficiency,and 0.82 fairness index.The framework achieves rapid 40-episode convergence with consistent delay performance.These contributions establish new benchmarks for fairness-aware resource allocation in aerial communications,enabling practical HAPS-RIS deployments in rural connectivity,emergency communications,and urban networks. 展开更多
关键词 Wireless communication high-altitude platform station reconfigurable intelligent surfaces deep q-learning
在线阅读 下载PDF
Dynamic Integration of Q-Learning and A-APF for Efficient Path Planning in Complex Underground Mining Environments
8
作者 Chang Su Liangliang Zhao Dongbing Xiang 《Computers, Materials & Continua》 2026年第2期1017-1040,共24页
To address low learning efficiency and inadequate path safety in spraying robot navigation within complex obstacle-rich environments—with dense,dynamic,unpredictable obstacles challenging conventional methods—this p... To address low learning efficiency and inadequate path safety in spraying robot navigation within complex obstacle-rich environments—with dense,dynamic,unpredictable obstacles challenging conventional methods—this paper proposes a hybrid algorithm integrating Q-learning and improved A*-Artificial Potential Field(A-APF).Centered on theQ-learning framework,the algorithmleverages safety-oriented guidance generated byA-APF and employs a dynamic coordination mechanism that adaptively balances exploration and exploitation.The proposed system comprises four core modules:(1)an environment modeling module that constructs grid-based obstacle maps;(2)an A-APF module that combines heuristic search from A*algorithm with repulsive force strategies from APF to generate guidance;(3)a Q-learning module that learns optimal state-action values(Q-values)through spraying robot-environment interaction and a reward function emphasizing path optimality and safety;and(4)a dynamic optimization module that ensures adaptive cooperation between Q-learning and A-APF through exploration rate control and environment-aware constraints.Simulation results demonstrate that the proposed method significantly enhances path safety in complex underground mining environments.Quantitative results indicate that,compared to the traditional Q-learning algorithm,the proposed method shortens training time by 42.95% and achieves a reduction in training failures from 78 to just 3.Compared to the static fusion algorithm,it further reduces both training time(by 10.78%)and training failures(by 50%),thereby improving overall training efficiency. 展开更多
关键词 q-learning A*algorithm artificial potential field path planning hybrid algorithm
在线阅读 下载PDF
A Hybrid Approach to Software Testing Efficiency:Stacked Ensembles and Deep Q-Learning for Test Case Prioritization and Ranking
9
作者 Anis Zarrad Thomas Armstrong Jaber Jemai 《Computers, Materials & Continua》 2026年第3期1726-1746,共21页
Test case prioritization and ranking play a crucial role in software testing by improving fault detection efficiency and ensuring software reliability.While prioritization selects the most relevant test cases for opti... Test case prioritization and ranking play a crucial role in software testing by improving fault detection efficiency and ensuring software reliability.While prioritization selects the most relevant test cases for optimal coverage,ranking further refines their execution order to detect critical faults earlier.This study investigates machine learning techniques to enhance both prioritization and ranking,contributing to more effective and efficient testing processes.We first employ advanced feature engineering alongside ensemble models,including Gradient Boosted,Support Vector Machines,Random Forests,and Naive Bayes classifiers to optimize test case prioritization,achieving an accuracy score of 0.98847 and significantly improving the Average Percentage of Fault Detection(APFD).Subsequently,we introduce a deep Q-learning framework combined with a Genetic Algorithm(GA)to refine test case ranking within priority levels.This approach achieves a rank accuracy of 0.9172,demonstrating robust performance despite the increasing computational demands of specialized variation operators.Our findings highlight the effectiveness of stacked ensemble learning and reinforcement learning in optimizing test case prioritization and ranking.This integrated approach improves testing efficiency,reduces late-stage defects,and improves overall software stability.The study provides valuable information for AI-driven testing frameworks,paving the way for more intelligent and adaptive software quality assurance methodologies. 展开更多
关键词 Software testing test case prioritization test case ranking machine learning reinforcement learning deep q-learning
在线阅读 下载PDF
Multi-antiderivative transformation alternating iterative deep learning method for solving anisotropic scattering neutron transport equations
10
作者 Dong Liu Bin Zhang +4 位作者 Qi Luo Heng Zhang Yong Jiang Xian-Tao Cui Chen Zhao 《Nuclear Science and Techniques》 2026年第5期375-393,共19页
Deep learning methods have achieved significant progress in solving partial differential equations.However,when applied to the widely used anisotropic scattering neutron transport equations in reactor engineering,thes... Deep learning methods have achieved significant progress in solving partial differential equations.However,when applied to the widely used anisotropic scattering neutron transport equations in reactor engineering,these encounter significant challenges.To address this issue,this study introduces a multi-antiderivative transformation alternating iterative deep learning method(M-AIM).This method transforms the integral terms of the scattering and fission sources in the transport equation into multiple antiderivative functions corresponding to the integrand,converts the differential-integral form of the transport equation into an exact differential equation,and establishes the necessary constraints for a unique solution.The M-AIM uses multiple deep neural networks to map the unknown angular flux density of transport equations and represents various forms of antiderivative functions.It constructs the corresponding weighted loss functions.By alternating iterative training with deep learning methods applied to these neural networks,the loss is reduced gradually.When the loss decreases to a preset minimum,the neural network approaches a numerical solution for both angular flux density and antiderivative functions.This paper presents a numerical verification of geometries such as flat plates and spheres.It verifies the validity of the theoretical framework and associated methods.The study contributes to the development of novel technical approaches for applying deep learning to solve anisotropic scattering neutron transport equations in reactor engineering. 展开更多
关键词 Neutron transport equations Anisotropic scattering Multi-antiderivative Alternating iteration Deep learning
在线阅读 下载PDF
Dual-Mode Data-Driven Iterative Learning Control:Applications in Precision Manufacturing and Intelligent Transportation Systems
11
作者 Lei Wang Menghan Wei +3 位作者 Ziwei Huangfu Shunjie Zhu Xuejian Ge Zhengquan Li 《Computers, Materials & Continua》 2026年第2期153-184,共32页
Iterative Learning Control(ILC)provides an effective framework for optimizing repetitive tasks,making it particularly suitable for high-precision applications in both precision manufacturing and intelligent transporta... Iterative Learning Control(ILC)provides an effective framework for optimizing repetitive tasks,making it particularly suitable for high-precision applications in both precision manufacturing and intelligent transportation systems(ITS).This paper presents a systematic review of ILC's developmental progress,current methodologies,and practical implementations across these two critical domains.The review first analyzes the key technical challenges encountered when integrating ILC into precision manufacturing workflows.Through case studies,it evaluates demonstrated improvements in positioning accuracy,surface finish quality,and production throughput.Furthermore,the study examines ILC’s applications in ITS,with particular focus on vehicular motion control applications including autonomous vehicle trajectory tracking,platoon coordination,and traffic signal timing optimization,where its data-driven characteristics enhance adaptability to dynamic environments.Finally,the paper proposes targeted future research directions that are essential for fully realizing ILC’s potential in advancing these interconnected yet distinct fields. 展开更多
关键词 iterative learning control systematic review precisionmanufacturing intelligent transportation systems
在线阅读 下载PDF
Unpacking the Role of Grammarly in Iterative Continuation Tasks to Develop L2 Grammar Learning Strategies,Grit,and Competence
12
作者 Jianling Zhan Chuyi Zhou 《Chinese Journal of Applied Linguistics》 2026年第1期112-132,161,共22页
The iterative continuation task(ICT)requires English as a foreign language(EFL)learners to read a segment and write a continuation that aligns with the preceding segment of an English novel with successive turns,offer... The iterative continuation task(ICT)requires English as a foreign language(EFL)learners to read a segment and write a continuation that aligns with the preceding segment of an English novel with successive turns,offering exposure to diverse grammatical structures and opportunities for contextualized usage.Given the importance of integrating technology into second language(L2)writing and the critical role that grammar plays in L2 writing development,automated written corrective feedback provided by Grammarly has gained significant attention.This study investigates the impact of Grammarly on grammar learning strategies,grammar grit,and grammar competence among EFL college students engaged in ICT.This study employed a mixed-methods sequential exploratory design;56 participants were divided into an experimental group(n=28),receiving Grammarly feedback for ICT,and a control group(n=28),completing ICT without Grammarly feedback.Quantitative results revealed that both groups showed improvements in L2 grammar learning strategies,grit and competence.For the experimental group,significant differences were observed across all variables of L2 grammar learning strategies,grit,and competence between pre-and post-tests.For the control group,significant differences were only observed in the affective dimension of grammar learning strategies,Consistency of Interest(COI)of grammar grit,and grammar competence.However,the control group presented a significantly higher improvement in grammar competence.Qualitative analysis showed both positive and negative perceptions of Grammarly.The pedagogical implications of integrating Grammarly and ICT for L2 grammar development are discussed. 展开更多
关键词 grammar learning strategies grammar grit grammar competence iterative continuation tasks Grammarly
在线阅读 下载PDF
CAPGen: An MLLM-Based Framework Integrated with Iterative Optimization Mechanism for Cultural Artifacts Poster Generation
13
作者 Qianqian Hu Chuhan Li +1 位作者 Mohan Zhang Fang Liu 《Computers, Materials & Continua》 2026年第1期494-510,共17页
Due to the digital transformation tendency among cultural institutions and the substantial influence of the social media platform,the demands of visual communication keep increasing for promoting traditional cultural ... Due to the digital transformation tendency among cultural institutions and the substantial influence of the social media platform,the demands of visual communication keep increasing for promoting traditional cultural artifacts online.As an effective medium,posters serve to attract public attention and facilitate broader engagement with cultural artifacts.However,existing poster generation methods mainly rely on fixed templates and manual design,which limits their scalability and adaptability to the diverse visual and semantic features of the artifacts.Therefore,we propose CAPGen,an automated aesthetic Cultural Artifacts Poster Generation framework built on a Multimodal Large Language Model(MLLM)with integrated iterative optimization.During our research,we collaborated with designers to define principles of graphic design for cultural artifact posters,to guide the MLLM in generating layout parameters.Later,we generated these parameters into posters.Finally,we refined the posters using an MLLM integrated with a multi-round iterative optimization mechanism.Qualitative results show that CAPGen consistently outperforms baseline methods in both visual quality and aesthetic performance.Furthermore,ablation studies indicate that the prompt,iterative optimization mechanism,and design principles significantly enhance the effectiveness of poster generation. 展开更多
关键词 Aesthetic poster generation prompt engineering multimodal large language models iterative optimization design principles
在线阅读 下载PDF
基于深度Q-learning算法的智能电网管控模型研究
14
作者 王筠 李志鹏 +2 位作者 项旭 张军堂 石雷波 《自动化技术与应用》 2026年第2期54-57,142,共5页
设计基于深度Q-learning算法的智能电网管控模型,将可验证声明(verifiable credential, VC)和分布式数字身份(decentralized identity, DID)作为应用程序身份凭证与软件定义网络(software-defined networking, SDN)控制器,结合动态信任... 设计基于深度Q-learning算法的智能电网管控模型,将可验证声明(verifiable credential, VC)和分布式数字身份(decentralized identity, DID)作为应用程序身份凭证与软件定义网络(software-defined networking, SDN)控制器,结合动态信任评估算法与基于属性的访问控制策略,构建基于区块链的智能电网分布式SDN管控模型。在资源分配、网络拓扑动态变化以及安全威胁不断演变的情况下,实施基于区块链的分布式SDN网络的优化。实验测试结果表明,设计方法在通过深度Q-learning优化模型后累积奖励明显大幅增加,在多种安全性能方面表现出色,能够清除恶意域,确保网络环境的安全。 展开更多
关键词 SDN控制器 分布式SDN网络 深度q-learning算法 区块链 智能电网管控模型
在线阅读 下载PDF
Global context-aware multi-scale feature iterative refinement for aviation-road traffic semantic segmentation
15
作者 Mengyue ZHANG Shichun YANG +1 位作者 Xinjie FENG Yaoguang CAO 《Chinese Journal of Aeronautics》 2026年第2期429-441,共13页
Semantic segmentation for mixed scenes of aerial remote sensing and road traffic is one of the key technologies for visual perception of flying cars.The State-of-the-Art(SOTA)semantic segmentation methods have made re... Semantic segmentation for mixed scenes of aerial remote sensing and road traffic is one of the key technologies for visual perception of flying cars.The State-of-the-Art(SOTA)semantic segmentation methods have made remarkable achievements in both fine-grained segmentation and real-time performance.However,when faced with the huge differences in scale and semantic categories brought about by the mixed scenes of aerial remote sensing and road traffic,they still face great challenges and there is little related research.Addressing the above issue,this paper proposes a semantic segmentation model specifically for mixed datasets of aerial remote sensing and road traffic scenes.First,a novel decoding-recoding multi-scale feature iterative refinement structure is proposed,which utilizes the re-integration and continuous enhancement of multi-scale information to effectively deal with the huge scale differences between cross-domain scenes,while using a fully convolutional structure to ensure the lightweight and real-time requirements.Second,a welldesigned cross-window attention mechanism combined with a global information integration decoding block forms an enhanced global context perception,which can effectively capture the long-range dependencies and multi-scale global context information of different scenes,thereby achieving fine-grained semantic segmentation.The proposed method is tested on a large-scale mixed dataset of aerial remote sensing and road traffic scenes.The results confirm that it can effectively deal with the problem of large-scale differences in cross-domain scenes.Its segmentation accuracy surpasses that of the SOTA methods,which meets the real-time requirements. 展开更多
关键词 Aviation-road traffic Flying cars Global context-aware Multi-scale feature iterative refinement Semantic segmentation
原文传递
An iterative regularized inversion method of fracture width and height using cross-well optical fiber strain
16
作者 CHEN Ming WANG Ziang +2 位作者 GUO Tiankui LIU Yongzan CHEN Zuorong 《Petroleum Exploration and Development》 2026年第1期235-248,共14页
The forward model of optical fiber strain induced by fractures,together with the associated model resolution matrix,is used to demonstrate the interpretability of fracture parameters once the fracture intersects the f... The forward model of optical fiber strain induced by fractures,together with the associated model resolution matrix,is used to demonstrate the interpretability of fracture parameters once the fracture intersects the fiber.A regularized inversion framework for fracture parameters is established to evaluate the influence of measured data quality on the accuracy of iterative regularized inversion.An interpretation approach for both fracture width and height is proposed,and the synthetic forward data with measurement error and field examples are employed to validate the accuracy of the simultaneous inversion of fracture width and height.The results indicate that,after the fracture contacts the fiber,the strain response is strongly sensitive only to the fracture parameters at the intersection location,whereas the interpretability of parameters at other locations remains limited.The iterative regularized inversion method effectively suppresses the impact of measurement error and exhibits high computational efficiency,showing clear advantages for inversion applications.When incorporating the first-order regularization with a Neumann boundary constraint on the tip width,the inverted fracture-width distribution becomes highly sensitive to fracture height;thus,combined with a bisection strategy,simultaneous inversion of fracture width and height can be achieved.Examination using the model resolution matrix,noisy synthetic data,and field data confirms that the iterative regularized inversion model for fracture width and height provides high interpretive accuracy and can be applied to the calculation and analysis of fracture width,fracture height,net pressure and other parameters. 展开更多
关键词 optical fiber strain fracture diagnosis forward model model resolution iterative regularized inversion computational efficiency fracture parameter interpretation
在线阅读 下载PDF
玻尔兹曼优化Q-learning的高速铁路越区切换控制算法 被引量:4
17
作者 陈永 康婕 《控制理论与应用》 北大核心 2025年第4期688-694,共7页
针对5G-R高速铁路越区切换使用固定切换阈值,且忽略了同频干扰、乒乓切换等的影响,导致越区切换成功率低的问题,提出了一种玻尔兹曼优化Q-learning的越区切换控制算法.首先,设计了以列车位置–动作为索引的Q表,并综合考虑乒乓切换、误... 针对5G-R高速铁路越区切换使用固定切换阈值,且忽略了同频干扰、乒乓切换等的影响,导致越区切换成功率低的问题,提出了一种玻尔兹曼优化Q-learning的越区切换控制算法.首先,设计了以列车位置–动作为索引的Q表,并综合考虑乒乓切换、误码率等构建Q-learning算法回报函数;然后,提出玻尔兹曼搜索策略优化动作选择,以提高切换算法收敛性能;最后,综合考虑基站同频干扰的影响进行Q表更新,得到切换判决参数,从而控制切换执行.仿真结果表明:改进算法在不同运行速度和不同运行场景下,较传统算法能有效提高切换成功率,且满足无线通信服务质量QoS的要求. 展开更多
关键词 越区切换 5G-R q-learning算法 玻尔兹曼优化策略
在线阅读 下载PDF
多代理Nash Q-Learning模型行动选择策略研究
18
作者 韩松 李璨 《中国管理科学》 北大核心 2025年第12期110-120,共11页
多代理Q-Learning模型的行动选择策略优化是复杂经济学博弈模拟过程中亟待解决的问题之一。本文将强制ε-greedy行动选择策略引入多代理Nash Q-Learning模型中,通过博弈实验对比该行动选择策略与经典ε-greedy策略的效果,探究该行动选... 多代理Q-Learning模型的行动选择策略优化是复杂经济学博弈模拟过程中亟待解决的问题之一。本文将强制ε-greedy行动选择策略引入多代理Nash Q-Learning模型中,通过博弈实验对比该行动选择策略与经典ε-greedy策略的效果,探究该行动选择策略对算法计算速度和收敛情况的影响;同时,根据实验结果进行了算法真实性理论验证,并基于多代理模型的性质给出强制ε-greedy的普适性推论。模拟结果表明,强制ε-greedy适用于更复杂、涉及状态行动更多、回合更多的博弈,此时能有效提升多代理Q-Learning算法运行性能,但由于其本质是初期增加对行动的探索,这会消耗一些回合,导致均衡收敛率下降。因此,强制ε-greedy带来的性能提升与损失的均衡收敛率是使用者在应用该策略时需要权衡的问题。 展开更多
关键词 Nash q-learning 强制ε-greedy 行动选择
原文传递
基于改进Q-learning算法的XGBoost模型智能预测页岩断裂韧性
19
作者 张艳 王宗勇 +3 位作者 张豪 吴建成 祝春波 吴高平 《长江大学学报(自然科学版)》 2025年第5期58-65,共8页
岩石的断裂韧性是影响裂缝扩展及延伸的重要因素,同时也是储层可压性评价的关键参数。但目前断裂韧性直接测试较为复杂,且现有的断裂韧性预测方法多基于断裂韧性与其他物理参数之间的拟合关系,难以形成整个井段的连续剖面。通过室内断... 岩石的断裂韧性是影响裂缝扩展及延伸的重要因素,同时也是储层可压性评价的关键参数。但目前断裂韧性直接测试较为复杂,且现有的断裂韧性预测方法多基于断裂韧性与其他物理参数之间的拟合关系,难以形成整个井段的连续剖面。通过室内断裂韧性实验,分析了页岩断裂韧性与其他物理力学参数之间的关系,建立了断裂韧性拟合公式,同时采用XGBoost模型,利用地球物理测井数据,通过改进的Q-learning算法优化XGBoost模型超参数,实现了岩石断裂韧性的预测。研究结果表明,Ⅰ型断裂韧性与抗拉强度、声波速度相关性较高,与密度相关性较低,与纵波速度、横波速度、抗拉强度、岩石密度均成正相关。基于改进的Q-learning优化断裂韧性智能预测的XGBoost模型预测准确性较高,预测断裂韧性与拟合断裂韧性相关度高达0.981,所提出的岩石断裂韧性预测模型是可靠的,可为压裂工程设计提供参考。 展开更多
关键词 断裂韧性 测井数据 智能算法 q-learning XGBoost 压裂设计
在线阅读 下载PDF
无监督环境下改进Q-learning算法在网络异常诊断中的应用
20
作者 梁西陈 《六盘水师范学院学报》 2025年第3期89-97,共9页
针对无监督环境下传统网络异常诊断算法存在异常点定位和异常数据分类准确率低等不足,通过设计一种基于改进Q-learning算法的无线网络异常诊断方法:首先基于ADU(Asynchronous Data Unit异步数据单元)单元采集无线网络的数据流,并提取数... 针对无监督环境下传统网络异常诊断算法存在异常点定位和异常数据分类准确率低等不足,通过设计一种基于改进Q-learning算法的无线网络异常诊断方法:首先基于ADU(Asynchronous Data Unit异步数据单元)单元采集无线网络的数据流,并提取数据包特征;然后构建Q-learning算法模型探索状态值和奖励值的平衡点,利用SA(Simulated Annealing模拟退火)算法从全局视角对下一时刻状态进行精确识别;最后确定训练样本的联合分布概率,提升输出值的逼近性能以达到平衡探索与代价之间的均衡。测试结果显示:改进Q-learning算法的网络异常定位准确率均值达99.4%,在不同类型网络异常的分类精度和分类效率等方面,也优于三种传统网络异常诊断方法。 展开更多
关键词 无监督 改进q-learning ADU单元 状态值 联合分布概率
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部