张量转置(tensor transposition)作为基础张量运算原语,广泛应用于信号处理、科学计算以及深度学习等各种领域,在张量数据密集型应用及高性能计算中具有重要作用。随着能效指标在高性能计算系统中的重要性日益凸显,基于数字信号处理器(d...张量转置(tensor transposition)作为基础张量运算原语,广泛应用于信号处理、科学计算以及深度学习等各种领域,在张量数据密集型应用及高性能计算中具有重要作用。随着能效指标在高性能计算系统中的重要性日益凸显,基于数字信号处理器(digital signal processors,DSPs)的加速器已被集成至通用计算系统。然而,传统面向多核CPU和GPU的张量转置库因架构差异无法充分适配DSP架构。一方面,DSP架构的向量化计算潜力尚未得到充分挖掘;另一方面,其复杂的片上存储体系与多层次共享内存结构为张量并行程序设计带来了显著挑战。针对国产多核DSP的架构特点,提出ftmTT算法,并设计实现了一个面向多核DSP架构的通用张量转置库。ftmTT算法通过设计适配DSP架构的高效内存访问模式充分挖掘其并行化和向量化潜力,其核心创新包括:1)采用分块策略将高维张量转置转化为多核DSP平台所提供的矩阵转置内核操作;2)提出基于DMA点对点传输的张量数据块访存合并方案来降低数据搬运开销;3)通过双缓冲设计异步重叠转置计算与DMA传输实现计算通信隐藏,最终面向多核DSP实现高性能并行张量转置。在国产多核DSP平台FT-M7032的实验表明,ftmTT张量转置算法取得了最高达理论带宽75.96%的性能,达到FT-M7032平台STREAM带宽99.23%的性能。展开更多
Let K_(j)/Q,1≤j≤ν,ν≥2 be quadratic fields with pairwise coprime discriminants Dj,and let τ_(kj)^(K_(j))(n)be the divisor function associated to Dedekind zeta function SK_(j)(s).In this paper,we consider a multid...Let K_(j)/Q,1≤j≤ν,ν≥2 be quadratic fields with pairwise coprime discriminants Dj,and let τ_(kj)^(K_(j))(n)be the divisor function associated to Dedekind zeta function SK_(j)(s).In this paper,we consider a multidimensional general divisor problem related to the τ_(kj)^(K_(j))(n)involving several number fields over square integers,by establishing the corresponding asymptotic formula.As an application,we also obtain the asymptotic formula of variance of these coefi icients.展开更多
This paper is concerned with a class of nonlinear fractional differential equations with a disturbance parameter in the integral boundary conditions on the infinite interval.By using Guo-Krasnoselskii fixed point theo...This paper is concerned with a class of nonlinear fractional differential equations with a disturbance parameter in the integral boundary conditions on the infinite interval.By using Guo-Krasnoselskii fixed point theorem,fixed point index theory and the analytic technique,we give the bifurcation point of the parameter which divides the range of parameter for the existence of at least two,one and no positive solutions for the problem.And,by using a fixed point theorem of generalized concave operator and cone theory,we establish the maximum parameter interval for the existence of the unique positive solution for the problem and show that such a positive solution continuously depends on the parameter.In the end,some examples are given to illustrate our main results.展开更多
With the development of technology,diffusion model-based solvers have shown significant promise in solving Combinatorial Optimization(CO)problems,particularly in tackling Non-deterministic Polynomial-time hard(NP-hard...With the development of technology,diffusion model-based solvers have shown significant promise in solving Combinatorial Optimization(CO)problems,particularly in tackling Non-deterministic Polynomial-time hard(NP-hard)problems such as the Traveling Salesman Problem(TSP).However,existing diffusion model-based solvers typically employ a fixed,uniform noise schedule(e.g.,linear or cosine annealing)across all training instances,failing to fully account for the unique characteristics of each problem instance.To address this challenge,we present GraphGuided Diffusion Solvers(GGDS),an enhanced method for improving graph-based diffusion models.GGDS leverages Graph Neural Networks(GNNs)to capture graph structural information embedded in node coordinates and adjacency matrices,dynamically adjusting the noise levels in the diffusion model.This study investigates the TSP by examining two distinct time-step noise generation strategies:cosine annealing and a Neural Network(NN)-based approach.We evaluate their performance across different problem scales,particularly after integrating graph structural information.Experimental results indicate that GGDS outperforms previous methods with average performance improvements of 18.7%,6.3%,and 88.7%on TSP-500,TSP-100,and TSP-50,respectively.Specifically,GGDS demonstrates superior performance on TSP-500 and TSP-50,while its performance on TSP-100 is either comparable to or slightly better than that of previous methods,depending on the chosen noise schedule and decoding strategy.展开更多
This study examines the mediating role of positive psychological capital and the moderating role of ethnicity in the relationship between mindfulness and internalizing/externalizing problems among adolescents.The stud...This study examines the mediating role of positive psychological capital and the moderating role of ethnicity in the relationship between mindfulness and internalizing/externalizing problems among adolescents.The study sample comprized Chinese adolescents(N=637 ethnic minority;females=40.97%,meam age=12.68,SD=0.49 years;N=636 Han;females=49.06%,mean age=12.71,SD=0.47 years).The participants completed the Child and Adolescent Mindfulness Measure,the Positive Psycap Questionnaire,and the Youth Self-Report.Results from the moderated mediation analysis showed mindfulness was negatively associated with both internalizing and externalizing problems.Ethnicity moderated the relationship between mindfulness and internalizing problems to be stronger for Han adolescents compared to ethnic minority adolescents.Psychological capital mediated the relationship between mindfulness and internalizing problems in both groups,with a negative direction.Findings support the Conservation of Resources theory and highlight mindfulness as a personal resource fostering adolescent well-being in multicultural contexts.展开更多
This paper is concerned with the following nonlinear Steklov problemΔu=0 in D,∂vu=λf(u)on∂D,where D is the unit disk in the plane,∂v denotes the unit outward normal derivative.For each k∈N,under some natural condit...This paper is concerned with the following nonlinear Steklov problemΔu=0 in D,∂vu=λf(u)on∂D,where D is the unit disk in the plane,∂v denotes the unit outward normal derivative.For each k∈N,under some natural conditions on f,using the Crandall-Rabinowitz bifurcation theorem,we obtain a bifurcation curve emanating from(k,0).Furthermore,we also analyze the local structure of bifurcation curves and stability of solutions on them.Specifically,our results indicate the bifurcation is critical for each k and is subcritical(supercritical)if f'''(0)>0(f'''(0)<0).展开更多
In educational settings,instructors often lead students through hands-on software projects,sometimes engaging two different schools or departments.How can such collaborations be made more efficient,and how can student...In educational settings,instructors often lead students through hands-on software projects,sometimes engaging two different schools or departments.How can such collaborations be made more efficient,and how can students truly experience the importance of teamwork and the impact of organizational structure on project complexity?To answer these questions,we introduce the requirement-driven organization structure(R-DOS)approach,which tightly couples software requirements with the actual development process.By extending problem-frames modeling and focusing on requirements,R-DOS allows educators and students to(1)diagnose structural flaws early,(2)prescribe role-level and communication fixes,and(3)observe-in real time-how poor structure can derail a project while good structure accelerates learning and delivery.展开更多
Generalised reduced masses with a set of equations governing the three relative motions between two of 3-bodies in their gravitational field are established,of which the dynamic characteristics of 3-body dynamics,fund...Generalised reduced masses with a set of equations governing the three relative motions between two of 3-bodies in their gravitational field are established,of which the dynamic characteristics of 3-body dynamics,fundamental bases of this paper,are revealed.Based on these findings,an equivalent system is developed,which is a 2-body system with its total mass,constant angular momentum,kinetic and potential energies same as the total ones of three relative motions,so that it can be solved using the well-known theory of the 2-body system.From the solution of an equivalent system with the revealed characteristics of three relative motions,the general theoretical solutions of the 3-body system are obtained in the curve-integration forms along the orbits in the imaged radial motion space.The possible periodical orbits with generalised Kepler’s law are presented.Following the description and mathematical demonstrations of the proposed methods,the examples including Euler’s/Lagrange’s problems,and a reported numerical one are solved to validate the proposed methods.The methods derived from the 3-body system are extended to N-body problems.展开更多
Sensitivity of observational data is important in the study of Glacial Isostatic Adjustment(GIA).However,depending on whether sensitivity is used for the Inverse Problem or the Forward Problem,the final formulation an...Sensitivity of observational data is important in the study of Glacial Isostatic Adjustment(GIA).However,depending on whether sensitivity is used for the Inverse Problem or the Forward Problem,the final formulation and display of the sensitivity kernel will be different.Unfortunately,in the past,both perspectives give the same name to their quantity computed/displayed,and that has caused some confusion.To distinguish between the two,their perspective should be added to the names.This paper focuses only on the perspective of the Forward Problem where the input parameters are known.The Perturbation method has been successfully used in the computation of the sensitivity kernels of observations on 1D and 3D viscosity variations from the Forward perspective.One aim of this paper is to review and clarify the physics of the Perturbation method and bring out some important aspects of this method that have been misunderstood or neglected.Another aim is to present sensitivity kernels from the Perturbation method using 3D(both radially and laterally heterogeneous)Earth models with realistic ice history.These new results are now suitable for future comparison with those from new methods using the Forward perspective.Finally,the sensitivity computations for realistic ice histories on a 3D Earth is reviewed and used to search for optimal locations of new GIA observations.展开更多
Cubic-shaped magnetic particles subjected to a dimensionless uniaxial anisotropy(Q=0.1)aligned with one of the crystallographic axes provide an ideal system for investigating magnetic equilibrium states.In this system...Cubic-shaped magnetic particles subjected to a dimensionless uniaxial anisotropy(Q=0.1)aligned with one of the crystallographic axes provide an ideal system for investigating magnetic equilibrium states.In this system,three fundamental magnetization configurations are identified:(i)the flower state,(ii)the twisted flower state,and(iii)the vortex state.This problem corresponds to standard problem No.3 proposed by the NIST Micromagnetics Modeling Group,widely adopted as a benchmark for validating computational micromagnetics methods.In this work,we approach the problem using a computational method based on direct dipolar interactions,in contrast to conventional techniques that typically compute the demagnetizing field via finite difference-based fast Fourier transform(FFT)methods,tensor grid approaches,or finite element formulations.Our results are compared with established literature data,focusing on the dimensionless parameterλ=L/l_(ex),where L is the cube edge length and l_(ex)is the exchange length of the material.To analyze equilibrium state transitions,we systematically varied the size L as a function of the simulation cell number N and intercellular spacing a,determining the criticalλvalue associated with configuration changes.Our simulations reveal that the transition between the twisted flower and vortex states occurs atλ≈8.45,consistent with values reported in the literature,validating our code(Grupo de Física da Matéeria Condensada-UFJF),and shows that this standard problem can be resolved using only interaction dipolar of a direct way without the need for sophisticated additional calculations.展开更多
文摘张量转置(tensor transposition)作为基础张量运算原语,广泛应用于信号处理、科学计算以及深度学习等各种领域,在张量数据密集型应用及高性能计算中具有重要作用。随着能效指标在高性能计算系统中的重要性日益凸显,基于数字信号处理器(digital signal processors,DSPs)的加速器已被集成至通用计算系统。然而,传统面向多核CPU和GPU的张量转置库因架构差异无法充分适配DSP架构。一方面,DSP架构的向量化计算潜力尚未得到充分挖掘;另一方面,其复杂的片上存储体系与多层次共享内存结构为张量并行程序设计带来了显著挑战。针对国产多核DSP的架构特点,提出ftmTT算法,并设计实现了一个面向多核DSP架构的通用张量转置库。ftmTT算法通过设计适配DSP架构的高效内存访问模式充分挖掘其并行化和向量化潜力,其核心创新包括:1)采用分块策略将高维张量转置转化为多核DSP平台所提供的矩阵转置内核操作;2)提出基于DMA点对点传输的张量数据块访存合并方案来降低数据搬运开销;3)通过双缓冲设计异步重叠转置计算与DMA传输实现计算通信隐藏,最终面向多核DSP实现高性能并行张量转置。在国产多核DSP平台FT-M7032的实验表明,ftmTT张量转置算法取得了最高达理论带宽75.96%的性能,达到FT-M7032平台STREAM带宽99.23%的性能。
基金Supported in part by NSFC(Nos.12401011,12201214)National Key Research and Development Program of China(No.2021YFA1000700)+3 种基金Shaanxi Fundamental Science Research Project for Mathematics and Physics(No.23JSQ053)Science and Technology Program for Youth New Star of Shaanxi Province(No.2025ZC-KJXX-29)Natural Science Basic Research Program of Shaanxi Province(No.2025JC-YBQN-091)Scientific Research Foundation for Young Talents of WNU(No.2024XJ-QNRC-01)。
文摘Let K_(j)/Q,1≤j≤ν,ν≥2 be quadratic fields with pairwise coprime discriminants Dj,and let τ_(kj)^(K_(j))(n)be the divisor function associated to Dedekind zeta function SK_(j)(s).In this paper,we consider a multidimensional general divisor problem related to the τ_(kj)^(K_(j))(n)involving several number fields over square integers,by establishing the corresponding asymptotic formula.As an application,we also obtain the asymptotic formula of variance of these coefi icients.
基金Supported by the National Natural Science Foundation of China(11361047)Fundamental Research Program of Shanxi Province(20210302124529)。
文摘This paper is concerned with a class of nonlinear fractional differential equations with a disturbance parameter in the integral boundary conditions on the infinite interval.By using Guo-Krasnoselskii fixed point theorem,fixed point index theory and the analytic technique,we give the bifurcation point of the parameter which divides the range of parameter for the existence of at least two,one and no positive solutions for the problem.And,by using a fixed point theorem of generalized concave operator and cone theory,we establish the maximum parameter interval for the existence of the unique positive solution for the problem and show that such a positive solution continuously depends on the parameter.In the end,some examples are given to illustrate our main results.
基金supported by the National Science and Technology Council,Taiwan,under grant no.NSTC 114-2221-E-197-005-MY3.
文摘With the development of technology,diffusion model-based solvers have shown significant promise in solving Combinatorial Optimization(CO)problems,particularly in tackling Non-deterministic Polynomial-time hard(NP-hard)problems such as the Traveling Salesman Problem(TSP).However,existing diffusion model-based solvers typically employ a fixed,uniform noise schedule(e.g.,linear or cosine annealing)across all training instances,failing to fully account for the unique characteristics of each problem instance.To address this challenge,we present GraphGuided Diffusion Solvers(GGDS),an enhanced method for improving graph-based diffusion models.GGDS leverages Graph Neural Networks(GNNs)to capture graph structural information embedded in node coordinates and adjacency matrices,dynamically adjusting the noise levels in the diffusion model.This study investigates the TSP by examining two distinct time-step noise generation strategies:cosine annealing and a Neural Network(NN)-based approach.We evaluate their performance across different problem scales,particularly after integrating graph structural information.Experimental results indicate that GGDS outperforms previous methods with average performance improvements of 18.7%,6.3%,and 88.7%on TSP-500,TSP-100,and TSP-50,respectively.Specifically,GGDS demonstrates superior performance on TSP-500 and TSP-50,while its performance on TSP-100 is either comparable to or slightly better than that of previous methods,depending on the chosen noise schedule and decoding strategy.
基金supported by the Guizhou Provincial Science and Technology Projects[Basic Science of Guizhou-[2024]Youth 309,Guizhou Platform Talents[2021]1350-046]Zunyi Science and Technology Cooperation[HZ(2024)311]+3 种基金Funding of the Chinese Academy of Social Sciences(2024SYZH005)Peking University Longitudinal Scientific Research Technical Service Project(G-252)Guizhou Provincial Graduate Student Research Fund Project(2024YJSKYJJ339)Zunyi Medical University Graduate Research Fund Project(ZYK206).
文摘This study examines the mediating role of positive psychological capital and the moderating role of ethnicity in the relationship between mindfulness and internalizing/externalizing problems among adolescents.The study sample comprized Chinese adolescents(N=637 ethnic minority;females=40.97%,meam age=12.68,SD=0.49 years;N=636 Han;females=49.06%,mean age=12.71,SD=0.47 years).The participants completed the Child and Adolescent Mindfulness Measure,the Positive Psycap Questionnaire,and the Youth Self-Report.Results from the moderated mediation analysis showed mindfulness was negatively associated with both internalizing and externalizing problems.Ethnicity moderated the relationship between mindfulness and internalizing problems to be stronger for Han adolescents compared to ethnic minority adolescents.Psychological capital mediated the relationship between mindfulness and internalizing problems in both groups,with a negative direction.Findings support the Conservation of Resources theory and highlight mindfulness as a personal resource fostering adolescent well-being in multicultural contexts.
基金Supported by the National Natural Science Foundation of China(Grant No.12371110).
文摘This paper is concerned with the following nonlinear Steklov problemΔu=0 in D,∂vu=λf(u)on∂D,where D is the unit disk in the plane,∂v denotes the unit outward normal derivative.For each k∈N,under some natural conditions on f,using the Crandall-Rabinowitz bifurcation theorem,we obtain a bifurcation curve emanating from(k,0).Furthermore,we also analyze the local structure of bifurcation curves and stability of solutions on them.Specifically,our results indicate the bifurcation is critical for each k and is subcritical(supercritical)if f'''(0)>0(f'''(0)<0).
基金supported by the National Natural Science Foundation of China(No.62362006)Guangxi Science and Technology Project(Key Research&Development)(No.GuiKeAB24010343)+1 种基金Guangxi“Bagui Scholar”Teams for Innovation and Research,Innovation Project of Guangxi Graduate Education(No.YCSW2025193)Guangxi Collaborative Innovation Center of Multi-source Information Integration and Intelligent Processing.
文摘In educational settings,instructors often lead students through hands-on software projects,sometimes engaging two different schools or departments.How can such collaborations be made more efficient,and how can students truly experience the importance of teamwork and the impact of organizational structure on project complexity?To answer these questions,we introduce the requirement-driven organization structure(R-DOS)approach,which tightly couples software requirements with the actual development process.By extending problem-frames modeling and focusing on requirements,R-DOS allows educators and students to(1)diagnose structural flaws early,(2)prescribe role-level and communication fixes,and(3)observe-in real time-how poor structure can derail a project while good structure accelerates learning and delivery.
文摘Generalised reduced masses with a set of equations governing the three relative motions between two of 3-bodies in their gravitational field are established,of which the dynamic characteristics of 3-body dynamics,fundamental bases of this paper,are revealed.Based on these findings,an equivalent system is developed,which is a 2-body system with its total mass,constant angular momentum,kinetic and potential energies same as the total ones of three relative motions,so that it can be solved using the well-known theory of the 2-body system.From the solution of an equivalent system with the revealed characteristics of three relative motions,the general theoretical solutions of the 3-body system are obtained in the curve-integration forms along the orbits in the imaged radial motion space.The possible periodical orbits with generalised Kepler’s law are presented.Following the description and mathematical demonstrations of the proposed methods,the examples including Euler’s/Lagrange’s problems,and a reported numerical one are solved to validate the proposed methods.The methods derived from the 3-body system are extended to N-body problems.
文摘Sensitivity of observational data is important in the study of Glacial Isostatic Adjustment(GIA).However,depending on whether sensitivity is used for the Inverse Problem or the Forward Problem,the final formulation and display of the sensitivity kernel will be different.Unfortunately,in the past,both perspectives give the same name to their quantity computed/displayed,and that has caused some confusion.To distinguish between the two,their perspective should be added to the names.This paper focuses only on the perspective of the Forward Problem where the input parameters are known.The Perturbation method has been successfully used in the computation of the sensitivity kernels of observations on 1D and 3D viscosity variations from the Forward perspective.One aim of this paper is to review and clarify the physics of the Perturbation method and bring out some important aspects of this method that have been misunderstood or neglected.Another aim is to present sensitivity kernels from the Perturbation method using 3D(both radially and laterally heterogeneous)Earth models with realistic ice history.These new results are now suitable for future comparison with those from new methods using the Forward perspective.Finally,the sensitivity computations for realistic ice histories on a 3D Earth is reviewed and used to search for optimal locations of new GIA observations.
基金CAPES,CNPq,and FAPEMIG(Brazilian Agencies)for their financial support。
文摘Cubic-shaped magnetic particles subjected to a dimensionless uniaxial anisotropy(Q=0.1)aligned with one of the crystallographic axes provide an ideal system for investigating magnetic equilibrium states.In this system,three fundamental magnetization configurations are identified:(i)the flower state,(ii)the twisted flower state,and(iii)the vortex state.This problem corresponds to standard problem No.3 proposed by the NIST Micromagnetics Modeling Group,widely adopted as a benchmark for validating computational micromagnetics methods.In this work,we approach the problem using a computational method based on direct dipolar interactions,in contrast to conventional techniques that typically compute the demagnetizing field via finite difference-based fast Fourier transform(FFT)methods,tensor grid approaches,or finite element formulations.Our results are compared with established literature data,focusing on the dimensionless parameterλ=L/l_(ex),where L is the cube edge length and l_(ex)is the exchange length of the material.To analyze equilibrium state transitions,we systematically varied the size L as a function of the simulation cell number N and intercellular spacing a,determining the criticalλvalue associated with configuration changes.Our simulations reveal that the transition between the twisted flower and vortex states occurs atλ≈8.45,consistent with values reported in the literature,validating our code(Grupo de Física da Matéeria Condensada-UFJF),and shows that this standard problem can be resolved using only interaction dipolar of a direct way without the need for sophisticated additional calculations.