随着光伏发电在全球能源体系中占比不断提升,超短期光伏发电量预测对电力系统调度与安全运行至关重要。然而,光伏发电量受多因素影响,具有显著随机性与波动性。为此,提出了一种基于TCN-BiLSTM-Attention模型的超短期光伏发电量预测方法...随着光伏发电在全球能源体系中占比不断提升,超短期光伏发电量预测对电力系统调度与安全运行至关重要。然而,光伏发电量受多因素影响,具有显著随机性与波动性。为此,提出了一种基于TCN-BiLSTM-Attention模型的超短期光伏发电量预测方法。首先通过皮尔逊相关分析筛选关键特征,并利用孤立森林算法检测异常值,结合线性插值法和标准化完成数据预处理。随后,通过时间卷积网络(Temporal Convolutional Network,TCN)提取时序特征,再利用双向长短期记忆网络(Bidirectional Long Short-Term Memory,BiLSTM)网络捕获前后向时间依赖关系,并在输出端引入注意力机制聚焦关键时间步特征。最后,在Desert Knowledge Australia Solar Centre(DKASC)数据集上的对比实验表明,与传统LSTM、BiLSTM模型相比,提出的TCN-BiLSTM-Attention模型在预测精度、稳定性等方面均表现出一定优势。展开更多
To enhance speech emotion recognition capability,this study constructs a speech emotion recognition model integrating the adaptive acoustic mixup(AAM)and improved coordinate and shuffle attention(ICASA)methods.The AAM...To enhance speech emotion recognition capability,this study constructs a speech emotion recognition model integrating the adaptive acoustic mixup(AAM)and improved coordinate and shuffle attention(ICASA)methods.The AAM method optimizes data augmentation by combining a sample selection strategy and dynamic interpolation coefficients,thus enabling information fusion of speech data with different emotions at the acoustic level.The ICASA method enhances feature extraction capability through dynamic fusion of the improved coordinate attention(ICA)and shuffle attention(SA)techniques.The ICA technique reduces computational overhead by employing depth-separable convolution and an h-swish activation function and captures long-range dependencies of multi-scale time-frequency features using the attention weights.The SA technique promotes feature interaction through channel shuffling,which helps the model learn richer and more discriminative emotional features.Experimental results demonstrate that,compared to the baseline model,the proposed model improves the weighted accuracy by 5.42%and 4.54%,and the unweighted accuracy by 3.37%and 3.85%on the IEMOCAP and RAVDESS datasets,respectively.These improvements were confirmed to be statistically significant by independent samples t-tests,further supporting the practical reliability and applicability of the proposed model in real-world emotion-aware speech systems.展开更多
Brain tumors require precise segmentation for diagnosis and treatment plans due to their complex morphology and heterogeneous characteristics.While MRI-based automatic brain tumor segmentation technology reduces the b...Brain tumors require precise segmentation for diagnosis and treatment plans due to their complex morphology and heterogeneous characteristics.While MRI-based automatic brain tumor segmentation technology reduces the burden on medical staff and provides quantitative information,existing methodologies and recent models still struggle to accurately capture and classify the fine boundaries and diverse morphologies of tumors.In order to address these challenges and maximize the performance of brain tumor segmentation,this research introduces a novel SwinUNETR-based model by integrating a new decoder block,the Hierarchical Channel-wise Attention Decoder(HCAD),into a powerful SwinUNETR encoder.The HCAD decoder block utilizes hierarchical features and channelspecific attention mechanisms to further fuse information at different scales transmitted from the encoder and preserve spatial details throughout the reconstruction phase.Rigorous evaluations on the recent BraTS GLI datasets demonstrate that the proposed SwinHCAD model achieved superior and improved segmentation accuracy on both the Dice score and HD95 metrics across all tumor subregions(WT,TC,and ET)compared to baseline models.In particular,the rationale and contribution of the model design were clarified through ablation studies to verify the effectiveness of the proposed HCAD decoder block.The results of this study are expected to greatly contribute to enhancing the efficiency of clinical diagnosis and treatment planning by increasing the precision of automated brain tumor segmentation.展开更多
Modern business information systems face significant challenges in managing heterogeneous data sources,integrating disparate systems,and providing real-time decision support in complex enterprise environments.Contempo...Modern business information systems face significant challenges in managing heterogeneous data sources,integrating disparate systems,and providing real-time decision support in complex enterprise environments.Contemporary enterprises typically operate 200+interconnected systems,with research indicating that 52% of organizations manage three or more enterprise content management systems,creating information silos that reduce operational efficiency by up to 35%.While attention mechanisms have demonstrated remarkable success in natural language processing and computer vision,their systematic application to business information systems remains largely unexplored.This paper presents the theoretical foundation for a Hierarchical Attention-Based Business Information System(HABIS)framework that applies multi-level attention mechanisms to enterprise environments.We provide a comprehensive mathematical formulation of the framework,analyze its computational complexity,and present a proof-of-concept implementation with simulation-based validation that demonstrates a 42% reduction in crosssystem query latency compared to legacy ERP modules and 70% improvement in prediction accuracy over baseline methods.The theoretical framework introduces four hierarchical attention levels:system-level attention for dynamic weighting of business systems,process-level attention for business process prioritization,data-level attention for critical information selection,and temporal attention for time-sensitive pattern recognition.Our complexity analysis demonstrates that the framework achieves O(n log n)computational complexity for attention computation,making it scalable to large enterprise environments including retail supply chains with 200+system-scale deployments.The proof-of-concept implementation validates the theoretical framework’s feasibility withMSE loss of 0.439 and response times of 0.000120 s per query,demonstrating its potential for addressing key challenges in business information systems.This work establishes a foundation for future empirical research and practical implementation of attention-driven enterprise systems.展开更多
[目的/意义]针对温室温湿度预测中多传感器数据融合可靠性低、传统模型忽略温湿度动态耦合,以及参数调优依赖人工经验等问题。[方法]首先,对传统卡尔曼(Kalman)滤波算法实施改进,通过动态调整过程噪声协方差和观测噪声协方差,结合新息...[目的/意义]针对温室温湿度预测中多传感器数据融合可靠性低、传统模型忽略温湿度动态耦合,以及参数调优依赖人工经验等问题。[方法]首先,对传统卡尔曼(Kalman)滤波算法实施改进,通过动态调整过程噪声协方差和观测噪声协方差,结合新息方差动态分配多传感器权重。其次,针对温湿度的强耦合性及其协同控制的需求,构建多输出长短期记忆-注意力机制(Long Short-Term Memory-Attention,LSTM-Attention)模型,以温湿度协同预测为目标,引入注意力机制自适应加权关键环境因子,并采用灰狼优化算法(Grey Wolf Optimizer,GWO)自动对超参数进行寻优。[结果和讨论]提出的自适应卡尔曼滤波算法在多点温湿度融合中的平均绝对偏差分别为1.59℃和8.64%,比传统卡尔曼滤波算法分别降低1.24%、8.57%。以该算法融合结果作为模型训练集,模型在温湿度预测中决定系数R2分别达到98.2%和99.3%,比传统Kalman提升4.7%和4.3%。GWO-LSTM-Atten⁃tion模型的温湿度预测均方根误差分别为0.7768℃和2.0564%,比LSTM、LSTM-Attention时间序列预测模型分别降低15.6%、6.6%,湿度分别降低29.2%、5.7%。[结论]提出的自适应卡尔曼融合算法能够有效抑制异常值影响,可在非平稳环境变化下实现多传感器数据可靠融合。在温室多环境因子预测中,GWO-LSTM-Attention模型温湿度预测值在未来可作为控制温室环境的重要参考,进而实现对温室环境的实时调控。展开更多
Graph Federated Learning(GFL)has shown great potential in privacy protection and distributed intelligence through distributed collaborative training of graph-structured data without sharing raw information.However,exi...Graph Federated Learning(GFL)has shown great potential in privacy protection and distributed intelligence through distributed collaborative training of graph-structured data without sharing raw information.However,existing GFL approaches often lack the capability for comprehensive feature extraction and adaptive optimization,particularly in non-independent and identically distributed(NON-IID)scenarios where balancing global structural understanding and local node-level detail remains a challenge.To this end,this paper proposes a novel framework called GFL-SAR(Graph Federated Collaborative Learning Framework Based on Structural Amplification and Attention Refinement),which enhances the representation learning capability of graph data through a dual-branch collaborative design.Specifically,we propose the Structural Insight Amplifier(SIA),which utilizes an improved Graph Convolutional Network(GCN)to strengthen structural awareness and improve modeling of topological patterns.In parallel,we propose the Attentive Relational Refiner(ARR),which employs an enhanced Graph Attention Network(GAT)to perform fine-grained modeling of node relationships and neighborhood features,thereby improving the expressiveness of local interactions and preserving critical contextual information.GFL-SAR effectively integrates multi-scale features from every branch via feature fusion and federated optimization,thereby addressing existing GFL limitations in structural modeling and feature representation.Experiments on standard benchmark datasets including Cora,Citeseer,Polblogs,and Cora_ML demonstrate that GFL-SAR achieves superior performance in classification accuracy,convergence speed,and robustness compared to existing methods,confirming its effectiveness and generalizability in GFL tasks.展开更多
Deep learning attentionmechanisms have achieved remarkable progress in computer vision,but still face limitations when handling images with ambiguous boundaries and uncertain feature representations.Conventional atten...Deep learning attentionmechanisms have achieved remarkable progress in computer vision,but still face limitations when handling images with ambiguous boundaries and uncertain feature representations.Conventional attention modules such as SE-Net,CBAM,ECA-Net,and CA adopt a deterministic paradigm,assigning fixed scalar weights to features without modeling ambiguity or confidence.To overcome these limitations,this paper proposes the Fuzzy Attention Network Layer(FANL),which integrates intuitionistic fuzzy set theory with convolutional neural networks to explicitly represent feature uncertainty through membership(μ),non-membership(ν),and hesitation(π)degrees.FANLconsists of four coremodules:(1)feature dimensionality reduction via global pooling,(2)fuzzymodeling using learnable clustering centers,(3)adaptive attention generation through weighted fusion of fuzzy components,and(4)feature refinement through residual connections.A cross-layer guidance mechanism is further introduced to enhance hierarchical feature propagation,allowing high-level semantic features to incorporate fine-grained texture information from shallow layers.Comprehensive experiments on three benchmark datasets—PathMNIST-30000,full PathMNIST,and Blood MNIST—demonstrate the effectiveness and generalizability of FANL.The model achieves 84.41±0.56%accuracy and a 1.69%improvement over the baseline CNN while maintaining lightweight computational complexity.Ablation studies show that removing any component causes a 1.7%–2.0%performance drop,validating the synergistic contribution of each module.Furthermore,FANL provides superior uncertainty calibration(ECE=0.0452)and interpretable selective prediction under uncertainty.Overall,FANL presents an efficient and uncertaintyaware attention framework that improves both accuracy and reliability,offering a promising direction for robust visual recognition under ambiguous or noisy conditions.展开更多
文摘随着光伏发电在全球能源体系中占比不断提升,超短期光伏发电量预测对电力系统调度与安全运行至关重要。然而,光伏发电量受多因素影响,具有显著随机性与波动性。为此,提出了一种基于TCN-BiLSTM-Attention模型的超短期光伏发电量预测方法。首先通过皮尔逊相关分析筛选关键特征,并利用孤立森林算法检测异常值,结合线性插值法和标准化完成数据预处理。随后,通过时间卷积网络(Temporal Convolutional Network,TCN)提取时序特征,再利用双向长短期记忆网络(Bidirectional Long Short-Term Memory,BiLSTM)网络捕获前后向时间依赖关系,并在输出端引入注意力机制聚焦关键时间步特征。最后,在Desert Knowledge Australia Solar Centre(DKASC)数据集上的对比实验表明,与传统LSTM、BiLSTM模型相比,提出的TCN-BiLSTM-Attention模型在预测精度、稳定性等方面均表现出一定优势。
基金supported by the National Natural Science Foundation of China under Grant No.12204062the Natural Science Foundation of Shandong Province under Grant No.ZR2022MF330。
文摘To enhance speech emotion recognition capability,this study constructs a speech emotion recognition model integrating the adaptive acoustic mixup(AAM)and improved coordinate and shuffle attention(ICASA)methods.The AAM method optimizes data augmentation by combining a sample selection strategy and dynamic interpolation coefficients,thus enabling information fusion of speech data with different emotions at the acoustic level.The ICASA method enhances feature extraction capability through dynamic fusion of the improved coordinate attention(ICA)and shuffle attention(SA)techniques.The ICA technique reduces computational overhead by employing depth-separable convolution and an h-swish activation function and captures long-range dependencies of multi-scale time-frequency features using the attention weights.The SA technique promotes feature interaction through channel shuffling,which helps the model learn richer and more discriminative emotional features.Experimental results demonstrate that,compared to the baseline model,the proposed model improves the weighted accuracy by 5.42%and 4.54%,and the unweighted accuracy by 3.37%and 3.85%on the IEMOCAP and RAVDESS datasets,respectively.These improvements were confirmed to be statistically significant by independent samples t-tests,further supporting the practical reliability and applicability of the proposed model in real-world emotion-aware speech systems.
基金supported by Institute of Information&Communications Technology Planning&Evaluation(IITP)under the Metaverse Support Program to Nurture the Best Talents(IITP-2024-RS-2023-00254529)grant funded by the Korea government(MSIT).
文摘Brain tumors require precise segmentation for diagnosis and treatment plans due to their complex morphology and heterogeneous characteristics.While MRI-based automatic brain tumor segmentation technology reduces the burden on medical staff and provides quantitative information,existing methodologies and recent models still struggle to accurately capture and classify the fine boundaries and diverse morphologies of tumors.In order to address these challenges and maximize the performance of brain tumor segmentation,this research introduces a novel SwinUNETR-based model by integrating a new decoder block,the Hierarchical Channel-wise Attention Decoder(HCAD),into a powerful SwinUNETR encoder.The HCAD decoder block utilizes hierarchical features and channelspecific attention mechanisms to further fuse information at different scales transmitted from the encoder and preserve spatial details throughout the reconstruction phase.Rigorous evaluations on the recent BraTS GLI datasets demonstrate that the proposed SwinHCAD model achieved superior and improved segmentation accuracy on both the Dice score and HD95 metrics across all tumor subregions(WT,TC,and ET)compared to baseline models.In particular,the rationale and contribution of the model design were clarified through ablation studies to verify the effectiveness of the proposed HCAD decoder block.The results of this study are expected to greatly contribute to enhancing the efficiency of clinical diagnosis and treatment planning by increasing the precision of automated brain tumor segmentation.
文摘Modern business information systems face significant challenges in managing heterogeneous data sources,integrating disparate systems,and providing real-time decision support in complex enterprise environments.Contemporary enterprises typically operate 200+interconnected systems,with research indicating that 52% of organizations manage three or more enterprise content management systems,creating information silos that reduce operational efficiency by up to 35%.While attention mechanisms have demonstrated remarkable success in natural language processing and computer vision,their systematic application to business information systems remains largely unexplored.This paper presents the theoretical foundation for a Hierarchical Attention-Based Business Information System(HABIS)framework that applies multi-level attention mechanisms to enterprise environments.We provide a comprehensive mathematical formulation of the framework,analyze its computational complexity,and present a proof-of-concept implementation with simulation-based validation that demonstrates a 42% reduction in crosssystem query latency compared to legacy ERP modules and 70% improvement in prediction accuracy over baseline methods.The theoretical framework introduces four hierarchical attention levels:system-level attention for dynamic weighting of business systems,process-level attention for business process prioritization,data-level attention for critical information selection,and temporal attention for time-sensitive pattern recognition.Our complexity analysis demonstrates that the framework achieves O(n log n)computational complexity for attention computation,making it scalable to large enterprise environments including retail supply chains with 200+system-scale deployments.The proof-of-concept implementation validates the theoretical framework’s feasibility withMSE loss of 0.439 and response times of 0.000120 s per query,demonstrating its potential for addressing key challenges in business information systems.This work establishes a foundation for future empirical research and practical implementation of attention-driven enterprise systems.
文摘[目的/意义]针对温室温湿度预测中多传感器数据融合可靠性低、传统模型忽略温湿度动态耦合,以及参数调优依赖人工经验等问题。[方法]首先,对传统卡尔曼(Kalman)滤波算法实施改进,通过动态调整过程噪声协方差和观测噪声协方差,结合新息方差动态分配多传感器权重。其次,针对温湿度的强耦合性及其协同控制的需求,构建多输出长短期记忆-注意力机制(Long Short-Term Memory-Attention,LSTM-Attention)模型,以温湿度协同预测为目标,引入注意力机制自适应加权关键环境因子,并采用灰狼优化算法(Grey Wolf Optimizer,GWO)自动对超参数进行寻优。[结果和讨论]提出的自适应卡尔曼滤波算法在多点温湿度融合中的平均绝对偏差分别为1.59℃和8.64%,比传统卡尔曼滤波算法分别降低1.24%、8.57%。以该算法融合结果作为模型训练集,模型在温湿度预测中决定系数R2分别达到98.2%和99.3%,比传统Kalman提升4.7%和4.3%。GWO-LSTM-Atten⁃tion模型的温湿度预测均方根误差分别为0.7768℃和2.0564%,比LSTM、LSTM-Attention时间序列预测模型分别降低15.6%、6.6%,湿度分别降低29.2%、5.7%。[结论]提出的自适应卡尔曼融合算法能够有效抑制异常值影响,可在非平稳环境变化下实现多传感器数据可靠融合。在温室多环境因子预测中,GWO-LSTM-Attention模型温湿度预测值在未来可作为控制温室环境的重要参考,进而实现对温室环境的实时调控。
基金supported by National Natural Science Foundation of China(62466045)Inner Mongolia Natural Science Foundation Project(2021LHMS06003)Inner Mongolia University Basic Research Business Fee Project(114).
文摘Graph Federated Learning(GFL)has shown great potential in privacy protection and distributed intelligence through distributed collaborative training of graph-structured data without sharing raw information.However,existing GFL approaches often lack the capability for comprehensive feature extraction and adaptive optimization,particularly in non-independent and identically distributed(NON-IID)scenarios where balancing global structural understanding and local node-level detail remains a challenge.To this end,this paper proposes a novel framework called GFL-SAR(Graph Federated Collaborative Learning Framework Based on Structural Amplification and Attention Refinement),which enhances the representation learning capability of graph data through a dual-branch collaborative design.Specifically,we propose the Structural Insight Amplifier(SIA),which utilizes an improved Graph Convolutional Network(GCN)to strengthen structural awareness and improve modeling of topological patterns.In parallel,we propose the Attentive Relational Refiner(ARR),which employs an enhanced Graph Attention Network(GAT)to perform fine-grained modeling of node relationships and neighborhood features,thereby improving the expressiveness of local interactions and preserving critical contextual information.GFL-SAR effectively integrates multi-scale features from every branch via feature fusion and federated optimization,thereby addressing existing GFL limitations in structural modeling and feature representation.Experiments on standard benchmark datasets including Cora,Citeseer,Polblogs,and Cora_ML demonstrate that GFL-SAR achieves superior performance in classification accuracy,convergence speed,and robustness compared to existing methods,confirming its effectiveness and generalizability in GFL tasks.
文摘Deep learning attentionmechanisms have achieved remarkable progress in computer vision,but still face limitations when handling images with ambiguous boundaries and uncertain feature representations.Conventional attention modules such as SE-Net,CBAM,ECA-Net,and CA adopt a deterministic paradigm,assigning fixed scalar weights to features without modeling ambiguity or confidence.To overcome these limitations,this paper proposes the Fuzzy Attention Network Layer(FANL),which integrates intuitionistic fuzzy set theory with convolutional neural networks to explicitly represent feature uncertainty through membership(μ),non-membership(ν),and hesitation(π)degrees.FANLconsists of four coremodules:(1)feature dimensionality reduction via global pooling,(2)fuzzymodeling using learnable clustering centers,(3)adaptive attention generation through weighted fusion of fuzzy components,and(4)feature refinement through residual connections.A cross-layer guidance mechanism is further introduced to enhance hierarchical feature propagation,allowing high-level semantic features to incorporate fine-grained texture information from shallow layers.Comprehensive experiments on three benchmark datasets—PathMNIST-30000,full PathMNIST,and Blood MNIST—demonstrate the effectiveness and generalizability of FANL.The model achieves 84.41±0.56%accuracy and a 1.69%improvement over the baseline CNN while maintaining lightweight computational complexity.Ablation studies show that removing any component causes a 1.7%–2.0%performance drop,validating the synergistic contribution of each module.Furthermore,FANL provides superior uncertainty calibration(ECE=0.0452)and interpretable selective prediction under uncertainty.Overall,FANL presents an efficient and uncertaintyaware attention framework that improves both accuracy and reliability,offering a promising direction for robust visual recognition under ambiguous or noisy conditions.