电力系统的安全稳定运行是保障国家能源安全和经济发展的关键,而这在很大程度上依赖于对电力物联设备故障的准确预测。当前,随着电力物联网技术的发展,大量的数据被采集,但这些数据的潜在价值尚未得到充分挖掘,这在一定程度上限制了故...电力系统的安全稳定运行是保障国家能源安全和经济发展的关键,而这在很大程度上依赖于对电力物联设备故障的准确预测。当前,随着电力物联网技术的发展,大量的数据被采集,但这些数据的潜在价值尚未得到充分挖掘,这在一定程度上限制了故障预测的准确性,影响了电力系统的可靠运行。针对这一问题,该文提出了一种创新的基于GraphSAGE(Graph Sample and Aggregate)算法的电力物联设备故障预测。该方法通过PowerGraph数据集,将电力物联设备故障场景细分为四类,利用GraphSAGE模型的特性,深入学习和分析节点特征与边特征,从而实现对物联设备故障的有效预测。实验结果表明,该方法准确率达到97.5%,相较于其它传统方法,准确率提高了0.39%~6.21%,同时GraphSAGE模型实现了快速训练。该方法为电力物联设备安全稳定运行提供重要决策支持,能够对动态和相互联系的复杂系统进行更精细的分析,并增强电力系统运营部门对潜在干扰的预见和应对能力。展开更多
Computational approaches for predicting drug-target interactions(DTIs)are pivotal in advancing drug discovery.Current methodologies leveraging heterogeneous networks often fall short in fully integrating both local an...Computational approaches for predicting drug-target interactions(DTIs)are pivotal in advancing drug discovery.Current methodologies leveraging heterogeneous networks often fall short in fully integrating both local and global network information.To comprehensively consider network information,we propose DHGT-DTI,a novel deep learning-based approach for DTI prediction.Specifically,we capture the local and global structural information of the network from both neighborhood and meta-path per-spectives.In the neighborhood perspective,we employ a heterogeneous graph neural network(HGNN),which extends Graph Sample and Aggregate(GraphSAGE)to handle diverse node and edge types,effectively learning local network structures.In the meta-path perspective,we introduce a Graph Transformer with residual connections to model higher-order relationships defined by meta-paths,such as"drug-disease-drug",and use an attention mechanism to fuse information across multiple meta-paths.The learned features from these dual perspectives are synergistically integrated for DTI prediction via a matrix decomposition method.Furthermore,DHGT-DTI reconstructs not only the DTI network but also auxiliary networks to bolster prediction accuracy.Comprehensive experiments on two benchmark datasets validate the superiority of DHGT-DTI over existing baseline methods.Additionally,case studies on six drugs used to treat Parkinson's disease not only validate the practical utility of DHGT-DTI but also highlight its broader potential in accelerating drug discovery for other diseases.展开更多
环状RNA (circRNA)是一类内源性的非编码RNA,许多研究表明circRNA在复杂疾病中发挥着重要作用。然而,由于circRNA的功能复杂性和实验验证的高成本,传统的实验方法难以高效挖掘circRNA与疾病的关联关系,因此迫切需要高效的计算方法来揭示...环状RNA (circRNA)是一类内源性的非编码RNA,许多研究表明circRNA在复杂疾病中发挥着重要作用。然而,由于circRNA的功能复杂性和实验验证的高成本,传统的实验方法难以高效挖掘circRNA与疾病的关联关系,因此迫切需要高效的计算方法来揭示circRNA与疾病的关联关系。在现有数据库的基础上,本文提出了一种基于GraphSAGE模型的circRNA与疾病关联预测方法,通过整合circRNA相似性、疾病相似性以及已知的circRNA-disease关联数据构建异质图,随后借助GraphSAGE模型获得异质图中节点对应特征的高阶聚合表示,从而有效预测circRNA-disease关联。实验结果表明,GraphSAGE模型的AUC值为0.921,F1-score为0.865,Precision为0.879,Recall为0.852,以上四个评估指标均优于现有的DWNN-RLS和RWR模型。总之,GraphSAGE是预测circRNA-disease关联的有效方法。Circular RNA (circRNA) is a class of endogenous non-coding RNAs. Many studies have shown that circRNA plays an important role in complex diseases. However, due to the functional complexity of circRNA and the high cost of experimental verification, it is difficult for traditional experimental methods to efficiently mine the association between circRNA and disease, so efficient computational methods are urgently needed to reveal the association between circRNA and disease. Based on the existing database, this paper proposed a method for predicting the association between circRNA and disease based on GraphSAGE model. By integrating circRNA similarity, disease similarity and known circRNA-disease association data, a heterogeneous graph network was constructed, and then a high-level aggregated representation of the corresponding features of nodes in the heterogeneous graph network was obtained by GraphSAGE model, so as to effectively predict the circRNA-disease association. The experimental results demonstrate that the GraphSAGE model achieves an AUC of 0.921, F1-score of 0.865, Precision of 0.879 and Recall of 0.852, all of which were better than the existing DWNN-RLS and RWR models. In conclusion, GraphSAGE is an effective method to predict the association of circRNA-disease.展开更多
当前Web追踪领域主要使用浏览器指纹对用户进行追踪。针对浏览器指纹追踪技术存在指纹随时间动态变化、不易长期追踪等问题,提出一种关注节点和边缘特征的改进图采样聚合算法(An Improved Graph SAmple and AGgregatE with Both Node an...当前Web追踪领域主要使用浏览器指纹对用户进行追踪。针对浏览器指纹追踪技术存在指纹随时间动态变化、不易长期追踪等问题,提出一种关注节点和边缘特征的改进图采样聚合算法(An Improved Graph SAmple and AGgregatE with Both Node and Edge Features,NE-GraphSAGE)用于浏览器指纹追踪。首先以浏览器指纹为节点、指纹之间特征相似度为边构建图数据。其次对图神经网络中的GraphSAGE算法进行改进使其不仅能关注节点特征,而且能捕获边缘信息并对边缘分类,从而识别指纹。最后将NE-GraphSAGE算法与Eckersley算法、FPStalker算法和LSTM算法进行对比,验证NE-GraphSAGE算法的识别效果。实验结果表明,NE-GraphSAGE算法在准确率和追踪时长上均有不同程度的提升,最大追踪时长可达80天,相比其他3种算法性能更优,验证了NE-GraphSAGE算法对浏览器指纹长期追踪的能力。展开更多
Diabetic Kidney Disease (DKD) is a common chronic complication of diabetes. Despite advancements in accurately identifying biomarkers for detecting and diagnosing this harmful disease, there remains an urgent need for...Diabetic Kidney Disease (DKD) is a common chronic complication of diabetes. Despite advancements in accurately identifying biomarkers for detecting and diagnosing this harmful disease, there remains an urgent need for new biomarkers to enable early detection of DKD. In this study, we modeled publicly available transcriptome datasets as a graph problem and used GraphSAGE Neural Networks (GNNs) to identify potential biomarkers. The GraphSAGE model effectively learned representations that captured the intricate interactions, dependencies among genes, and disease-specific gene expression patterns necessary to classify samples as DKD and Control. We finally extracted the features of importance;the identified set of genes exhibited an impressive ability to distinguish between healthy and unhealthy samples, even though these genes differ from previous research findings. The unexpected biomarker variations in this study suggest more exploration and validation studies for discovering biomarkers in DKD. In conclusion, our study showcases the effectiveness of modeling transcriptome data as a graph problem, demonstrates the use of GraphSAGE models for biomarker discovery in DKD, and advocates for integrating advanced machine-learning techniques in DKD biomarker research, emphasizing the need for a holistic approach to unravel the intricacies of biological systems.展开更多
Network attacks have become a critical issue in the internet security domain.Artificial intelligence technology-based detection methodologies have attracted attention;however,recent studies have struggled to adapt to ...Network attacks have become a critical issue in the internet security domain.Artificial intelligence technology-based detection methodologies have attracted attention;however,recent studies have struggled to adapt to changing attack patterns and complex network environments.In addition,it is difficult to explain the detection results logically using artificial intelligence.We propose a method for classifying network attacks using graph models to explain the detection results.First,we reconstruct the network packet data into a graphical structure.We then use a graph model to predict network attacks using edge classification.To explain the prediction results,we observed numerical changes by randomly masking and calculating the importance of neighbors,allowing us to extract significant subgraphs.Our experiments on six public datasets demonstrate superior performance with an average F1-score of 0.960 and accuracy of 0.964,outperforming traditional machine learning and other graph models.The visual representation of the extracted subgraphs highlights the neighboring nodes that have the greatest impact on the results,thus explaining detection.In conclusion,this study demonstrates that graph-based models are suitable for network attack detection in complex environments,and the importance of graph neighbors can be calculated to efficiently analyze the results.This approach can contribute to real-world network security analyses and provide a new direction in the field.展开更多
Graph Neural Networks(GNNs),as a deep learning framework specifically designed for graph-structured data,have achieved deep representation learning of graph data through message passing mechanisms and have become a co...Graph Neural Networks(GNNs),as a deep learning framework specifically designed for graph-structured data,have achieved deep representation learning of graph data through message passing mechanisms and have become a core technology in the field of graph analysis.However,current reviews on GNN models are mainly focused on smaller domains,and there is a lack of systematic reviews on the classification and applications of GNN models.This review systematically synthesizes the three canonical branches of GNN,Graph Convolutional Network(GCN),Graph Attention Network(GAT),and Graph Sampling Aggregation Network(GraphSAGE),and analyzes their integration pathways from both structural and feature perspectives.Drawing on representative studies,we identify three major integration patterns:cascaded fusion,where heterogeneous modules such as Convolutional Neural Network(CNN),Long Short-Term Memory(LSTM),and GraphSAGE are sequentially combined for hierarchical feature learning;parallel fusion,where multi-branch architectures jointly encode complementary graph features;and feature-level fusion,which employs concatenation,weighted summation,or attention-based gating to adaptively merge multi-source embeddings.Through these patterns,integrated GNNs achieve enhanced expressiveness,robustness,and scalability across domains including transportation,biomedicine,and cybersecurity.展开更多
Data-driven prediction of time series is significant in many scientific research fields such as global climate change and weather forecast.For global monthly mean temperature series,considering the strong potential of...Data-driven prediction of time series is significant in many scientific research fields such as global climate change and weather forecast.For global monthly mean temperature series,considering the strong potential of deep neural network for extracting data features,this paper proposes a data-driven model,ResGraphNet,which improves the prediction accuracy of time series by an embedded residual module in GraphSAGE layers.The experimental results of a global mean temperature dataset,HadCRUT5,show that compared with 11 traditional prediction technologies,the proposed ResGraphNet obtains the best accuracy.The error indicator predicted by the proposed ResGraphNet is smaller than that of the other 11 prediction models.Furthermore,the performance on seven temperature datasets shows the excellent generalization of the ResGraphNet.Finally,based on our proposed ResGraphNet,the predicted 2022 annual anomaly of global temperature is 0.74722℃,which provides confidence for limiting warming to 1.5℃ above pre-industrial levels.展开更多
文摘电力系统的安全稳定运行是保障国家能源安全和经济发展的关键,而这在很大程度上依赖于对电力物联设备故障的准确预测。当前,随着电力物联网技术的发展,大量的数据被采集,但这些数据的潜在价值尚未得到充分挖掘,这在一定程度上限制了故障预测的准确性,影响了电力系统的可靠运行。针对这一问题,该文提出了一种创新的基于GraphSAGE(Graph Sample and Aggregate)算法的电力物联设备故障预测。该方法通过PowerGraph数据集,将电力物联设备故障场景细分为四类,利用GraphSAGE模型的特性,深入学习和分析节点特征与边特征,从而实现对物联设备故障的有效预测。实验结果表明,该方法准确率达到97.5%,相较于其它传统方法,准确率提高了0.39%~6.21%,同时GraphSAGE模型实现了快速训练。该方法为电力物联设备安全稳定运行提供重要决策支持,能够对动态和相互联系的复杂系统进行更精细的分析,并增强电力系统运营部门对潜在干扰的预见和应对能力。
基金the National Natural Science Foundation of China(Grant Nos.:62272288,U22A2041)Fundamental Research Funds for the Central Universities,Shaanxi Normal University(Grant No.:GK202302006)the Scientific Research Fund of Hunan Provincial Education Department of China(Grant No.:22B0097).
文摘Computational approaches for predicting drug-target interactions(DTIs)are pivotal in advancing drug discovery.Current methodologies leveraging heterogeneous networks often fall short in fully integrating both local and global network information.To comprehensively consider network information,we propose DHGT-DTI,a novel deep learning-based approach for DTI prediction.Specifically,we capture the local and global structural information of the network from both neighborhood and meta-path per-spectives.In the neighborhood perspective,we employ a heterogeneous graph neural network(HGNN),which extends Graph Sample and Aggregate(GraphSAGE)to handle diverse node and edge types,effectively learning local network structures.In the meta-path perspective,we introduce a Graph Transformer with residual connections to model higher-order relationships defined by meta-paths,such as"drug-disease-drug",and use an attention mechanism to fuse information across multiple meta-paths.The learned features from these dual perspectives are synergistically integrated for DTI prediction via a matrix decomposition method.Furthermore,DHGT-DTI reconstructs not only the DTI network but also auxiliary networks to bolster prediction accuracy.Comprehensive experiments on two benchmark datasets validate the superiority of DHGT-DTI over existing baseline methods.Additionally,case studies on six drugs used to treat Parkinson's disease not only validate the practical utility of DHGT-DTI but also highlight its broader potential in accelerating drug discovery for other diseases.
文摘环状RNA (circRNA)是一类内源性的非编码RNA,许多研究表明circRNA在复杂疾病中发挥着重要作用。然而,由于circRNA的功能复杂性和实验验证的高成本,传统的实验方法难以高效挖掘circRNA与疾病的关联关系,因此迫切需要高效的计算方法来揭示circRNA与疾病的关联关系。在现有数据库的基础上,本文提出了一种基于GraphSAGE模型的circRNA与疾病关联预测方法,通过整合circRNA相似性、疾病相似性以及已知的circRNA-disease关联数据构建异质图,随后借助GraphSAGE模型获得异质图中节点对应特征的高阶聚合表示,从而有效预测circRNA-disease关联。实验结果表明,GraphSAGE模型的AUC值为0.921,F1-score为0.865,Precision为0.879,Recall为0.852,以上四个评估指标均优于现有的DWNN-RLS和RWR模型。总之,GraphSAGE是预测circRNA-disease关联的有效方法。Circular RNA (circRNA) is a class of endogenous non-coding RNAs. Many studies have shown that circRNA plays an important role in complex diseases. However, due to the functional complexity of circRNA and the high cost of experimental verification, it is difficult for traditional experimental methods to efficiently mine the association between circRNA and disease, so efficient computational methods are urgently needed to reveal the association between circRNA and disease. Based on the existing database, this paper proposed a method for predicting the association between circRNA and disease based on GraphSAGE model. By integrating circRNA similarity, disease similarity and known circRNA-disease association data, a heterogeneous graph network was constructed, and then a high-level aggregated representation of the corresponding features of nodes in the heterogeneous graph network was obtained by GraphSAGE model, so as to effectively predict the circRNA-disease association. The experimental results demonstrate that the GraphSAGE model achieves an AUC of 0.921, F1-score of 0.865, Precision of 0.879 and Recall of 0.852, all of which were better than the existing DWNN-RLS and RWR models. In conclusion, GraphSAGE is an effective method to predict the association of circRNA-disease.
文摘当前Web追踪领域主要使用浏览器指纹对用户进行追踪。针对浏览器指纹追踪技术存在指纹随时间动态变化、不易长期追踪等问题,提出一种关注节点和边缘特征的改进图采样聚合算法(An Improved Graph SAmple and AGgregatE with Both Node and Edge Features,NE-GraphSAGE)用于浏览器指纹追踪。首先以浏览器指纹为节点、指纹之间特征相似度为边构建图数据。其次对图神经网络中的GraphSAGE算法进行改进使其不仅能关注节点特征,而且能捕获边缘信息并对边缘分类,从而识别指纹。最后将NE-GraphSAGE算法与Eckersley算法、FPStalker算法和LSTM算法进行对比,验证NE-GraphSAGE算法的识别效果。实验结果表明,NE-GraphSAGE算法在准确率和追踪时长上均有不同程度的提升,最大追踪时长可达80天,相比其他3种算法性能更优,验证了NE-GraphSAGE算法对浏览器指纹长期追踪的能力。
文摘Diabetic Kidney Disease (DKD) is a common chronic complication of diabetes. Despite advancements in accurately identifying biomarkers for detecting and diagnosing this harmful disease, there remains an urgent need for new biomarkers to enable early detection of DKD. In this study, we modeled publicly available transcriptome datasets as a graph problem and used GraphSAGE Neural Networks (GNNs) to identify potential biomarkers. The GraphSAGE model effectively learned representations that captured the intricate interactions, dependencies among genes, and disease-specific gene expression patterns necessary to classify samples as DKD and Control. We finally extracted the features of importance;the identified set of genes exhibited an impressive ability to distinguish between healthy and unhealthy samples, even though these genes differ from previous research findings. The unexpected biomarker variations in this study suggest more exploration and validation studies for discovering biomarkers in DKD. In conclusion, our study showcases the effectiveness of modeling transcriptome data as a graph problem, demonstrates the use of GraphSAGE models for biomarker discovery in DKD, and advocates for integrating advanced machine-learning techniques in DKD biomarker research, emphasizing the need for a holistic approach to unravel the intricacies of biological systems.
基金supported by the MSIT(Ministry of Science and ICT),Republic of Korea,under the ICAN(ICT Challenge and Advanced Network of HRD)support program(IITP-2025-RS-2023-00259497)supervised by the IITP(Institute for Information&Communications Technology Planning&Evaluation)and was supported by Institute of Information&Communications Technology Planning&Evaluation(IITP)grant funded by the Republic of Korea government(MSIT)(No.IITP-2025-RS-2023-00254129+1 种基金Graduate School of Metaverse Convergence(Sungkyunkwan University))was supported by the Basic Science Research Program of the National Research Foundation(NRF)funded by the Republic of Korean government(MSIT)(No.RS-2024-00346737).
文摘Network attacks have become a critical issue in the internet security domain.Artificial intelligence technology-based detection methodologies have attracted attention;however,recent studies have struggled to adapt to changing attack patterns and complex network environments.In addition,it is difficult to explain the detection results logically using artificial intelligence.We propose a method for classifying network attacks using graph models to explain the detection results.First,we reconstruct the network packet data into a graphical structure.We then use a graph model to predict network attacks using edge classification.To explain the prediction results,we observed numerical changes by randomly masking and calculating the importance of neighbors,allowing us to extract significant subgraphs.Our experiments on six public datasets demonstrate superior performance with an average F1-score of 0.960 and accuracy of 0.964,outperforming traditional machine learning and other graph models.The visual representation of the extracted subgraphs highlights the neighboring nodes that have the greatest impact on the results,thus explaining detection.In conclusion,this study demonstrates that graph-based models are suitable for network attack detection in complex environments,and the importance of graph neighbors can be calculated to efficiently analyze the results.This approach can contribute to real-world network security analyses and provide a new direction in the field.
基金funded by Guangzhou Huashang University(2024HSZD01,HS2023JYSZH01).
文摘Graph Neural Networks(GNNs),as a deep learning framework specifically designed for graph-structured data,have achieved deep representation learning of graph data through message passing mechanisms and have become a core technology in the field of graph analysis.However,current reviews on GNN models are mainly focused on smaller domains,and there is a lack of systematic reviews on the classification and applications of GNN models.This review systematically synthesizes the three canonical branches of GNN,Graph Convolutional Network(GCN),Graph Attention Network(GAT),and Graph Sampling Aggregation Network(GraphSAGE),and analyzes their integration pathways from both structural and feature perspectives.Drawing on representative studies,we identify three major integration patterns:cascaded fusion,where heterogeneous modules such as Convolutional Neural Network(CNN),Long Short-Term Memory(LSTM),and GraphSAGE are sequentially combined for hierarchical feature learning;parallel fusion,where multi-branch architectures jointly encode complementary graph features;and feature-level fusion,which employs concatenation,weighted summation,or attention-based gating to adaptively merge multi-source embeddings.Through these patterns,integrated GNNs achieve enhanced expressiveness,robustness,and scalability across domains including transportation,biomedicine,and cybersecurity.
基金Supported by the National Natural Science Foundation of China under Grant 41974137.
文摘Data-driven prediction of time series is significant in many scientific research fields such as global climate change and weather forecast.For global monthly mean temperature series,considering the strong potential of deep neural network for extracting data features,this paper proposes a data-driven model,ResGraphNet,which improves the prediction accuracy of time series by an embedded residual module in GraphSAGE layers.The experimental results of a global mean temperature dataset,HadCRUT5,show that compared with 11 traditional prediction technologies,the proposed ResGraphNet obtains the best accuracy.The error indicator predicted by the proposed ResGraphNet is smaller than that of the other 11 prediction models.Furthermore,the performance on seven temperature datasets shows the excellent generalization of the ResGraphNet.Finally,based on our proposed ResGraphNet,the predicted 2022 annual anomaly of global temperature is 0.74722℃,which provides confidence for limiting warming to 1.5℃ above pre-industrial levels.