The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to u...The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to understand complex mobility patterns.Deep learning techniques,such as graph neural networks(GNNs),are popular for their ability to capture spatio-temporal dependencies.However,these models often become overly complex due to the large number of hyper-parameters involved.In this study,we introduce Dynamic Multi-Graph Spatial-Temporal Graph Neural Ordinary Differential Equation Networks(DMST-GNODE),a framework based on ordinary differential equations(ODEs)that autonomously discovers effective spatial-temporal graph neural network(STGNN)architectures for traffic prediction tasks.The comparative analysis of DMST-GNODE and baseline models indicates that DMST-GNODE model demonstrates superior performance across multiple datasets,consistently achieving the lowest Root Mean Square Error(RMSE)and Mean Absolute Error(MAE)values,alongside the highest accuracy.On the BKK(Bangkok)dataset,it outperformed other models with an RMSE of 3.3165 and an accuracy of 0.9367 for a 20-min interval,maintaining this trend across 40 and 60 min.Similarly,on the PeMS08 dataset,DMST-GNODE achieved the best performance with an RMSE of 19.4863 and an accuracy of 0.9377 at 20 min,demonstrating its effectiveness over longer periods.The Los_Loop dataset results further emphasise this model’s advantage,with an RMSE of 3.3422 and an accuracy of 0.7643 at 20 min,consistently maintaining superiority across all time intervals.These numerical highlights indicate that DMST-GNODE not only outperforms baseline models but also achieves higher accuracy and lower errors across different time intervals and datasets.展开更多
微表情检测旨在视频中定位幅度微弱、时间短暂的表情区间。其难点在于有效提取面部区域间的动态关联特征和多尺度时序特征,进而精准捕捉面部各区域微小动作之间的关联。针对这些问题,提出了一种融合自适应图注意力和多尺度可变空洞卷积...微表情检测旨在视频中定位幅度微弱、时间短暂的表情区间。其难点在于有效提取面部区域间的动态关联特征和多尺度时序特征,进而精准捕捉面部各区域微小动作之间的关联。针对这些问题,提出了一种融合自适应图注意力和多尺度可变空洞卷积的微表情检测网络(AG-DDNet)。通过引入参数可学习矩阵来实现键值对的特征变换,通过计算面部区域特征向量间的相似度得到动态邻接矩阵,并结合图注意力机制计算区域间权重系数,实现特征的动态融合;采用了多尺度可变空洞卷积模块,通过自适应池化与卷积组合的预测器生成动态感受野,从而实现多尺度的特征提取;引入基于Fisher信息矩阵的自然梯度优化机制,通过Fisher Adam优化器有效捕捉参数空间的几何结构信息,实现学习率的精确自适应调整,从而显著增强了模型对微表情和宏表情的协同检测能力。在微表情检测任务中,该算法与同类代表性算法相比,在CAS(ME)2数据集和SAMM Long Videos数据集上的性能分别提升了54.20%和20.11%。与最新算法相比,两个数据集上的提升幅度分别为38.43%和6.81%,有效证明了该方法在长视频微表情检测任务上的优越性能。展开更多
为解决篇章级多事件抽取中事件及论元角色间全局语义关联缺失、文档信息利用不足的问题,提出了基于论元关联和图神经网络的篇章级多事件抽取(document-level multi-event extraction based on argument correlation and graph neural ne...为解决篇章级多事件抽取中事件及论元角色间全局语义关联缺失、文档信息利用不足的问题,提出了基于论元关联和图神经网络的篇章级多事件抽取(document-level multi-event extraction based on argument correlation and graph neural network,DEEACG)模型。首先,使用基于变换器的双向编码器表示(bidirectional encoder representations from Transformers,BERT)模块获取实体,并引入实体共事件性预测任务,增强实体间的语义关联。接着,引入可学习的事件代理节点,构建包含实体、上下文和代理节点的异构图,通过特征线性调制图神经网络(graph neural network with feature-wise linear modulation,GNN-FiLM)与多头自注意力机制,实现多事件间的全局交互与语义融合。然后,通过多层感知机进行事件类型检测。最后,构建双投影空间建模论元关联,采用Bron-Kerbosch算法提取图中极大团作为候选论元组合,并结合多头注意力实现论元角色分类。结果表明,DEEACG模型在中文金融公告(Chinese financial announcements,ChFinAnn)数据集的多事件抽取任务中性能明显提升,与关系增强文档级事件抽取(relation-enabled document-level event extraction,ReDEE)模型相比,F1均值提升了2.1个百分点。该研究证实DEEACG模型能有效捕捉多事件间语义关联,适用于篇章级多事件抽取任务。展开更多
The prediction of regional traffic flows is important for traffic control and management in an intelligent traffic system.With the help of deep neural networks,the convolutional neural network or residual neural netwo...The prediction of regional traffic flows is important for traffic control and management in an intelligent traffic system.With the help of deep neural networks,the convolutional neural network or residual neural network,which can be applied only to regular grids,is adopted to capture the spatial dependence for flow prediction.However,the obtained regions are always irregular considering the road network and administrative boundaries;thus,dividing the city into grids is inaccurate for prediction.In this paper,we propose a new model based on multi-graph convolutional network and gated recurrent unit(MGCN-GRU)to predict traffic flows for irregular regions.Specifically,we first construct heterogeneous inter-region graphs for a city to reflect the rela-tionships among regions.In each graph,nodes represent the irregular regions and edges represent the relationship types between regions.Then,we propose a multi-graph convolutional network to fuse different inter-region graphs and additional attributes.The GRU is further used to capture the temporal dependence and to predict future traffic flows.Experimental results based on three real-world large-scale datasets(public bicycle system dataset,taxi dataset,and dockless bike-sharing dataset)show that our MGCN-GRU model outperforms a variety of existing methods.展开更多
文摘The ability to accurately predict urban traffic flows is crucial for optimising city operations.Consequently,various methods for forecasting urban traffic have been developed,focusing on analysing historical data to understand complex mobility patterns.Deep learning techniques,such as graph neural networks(GNNs),are popular for their ability to capture spatio-temporal dependencies.However,these models often become overly complex due to the large number of hyper-parameters involved.In this study,we introduce Dynamic Multi-Graph Spatial-Temporal Graph Neural Ordinary Differential Equation Networks(DMST-GNODE),a framework based on ordinary differential equations(ODEs)that autonomously discovers effective spatial-temporal graph neural network(STGNN)architectures for traffic prediction tasks.The comparative analysis of DMST-GNODE and baseline models indicates that DMST-GNODE model demonstrates superior performance across multiple datasets,consistently achieving the lowest Root Mean Square Error(RMSE)and Mean Absolute Error(MAE)values,alongside the highest accuracy.On the BKK(Bangkok)dataset,it outperformed other models with an RMSE of 3.3165 and an accuracy of 0.9367 for a 20-min interval,maintaining this trend across 40 and 60 min.Similarly,on the PeMS08 dataset,DMST-GNODE achieved the best performance with an RMSE of 19.4863 and an accuracy of 0.9377 at 20 min,demonstrating its effectiveness over longer periods.The Los_Loop dataset results further emphasise this model’s advantage,with an RMSE of 3.3422 and an accuracy of 0.7643 at 20 min,consistently maintaining superiority across all time intervals.These numerical highlights indicate that DMST-GNODE not only outperforms baseline models but also achieves higher accuracy and lower errors across different time intervals and datasets.
文摘微表情检测旨在视频中定位幅度微弱、时间短暂的表情区间。其难点在于有效提取面部区域间的动态关联特征和多尺度时序特征,进而精准捕捉面部各区域微小动作之间的关联。针对这些问题,提出了一种融合自适应图注意力和多尺度可变空洞卷积的微表情检测网络(AG-DDNet)。通过引入参数可学习矩阵来实现键值对的特征变换,通过计算面部区域特征向量间的相似度得到动态邻接矩阵,并结合图注意力机制计算区域间权重系数,实现特征的动态融合;采用了多尺度可变空洞卷积模块,通过自适应池化与卷积组合的预测器生成动态感受野,从而实现多尺度的特征提取;引入基于Fisher信息矩阵的自然梯度优化机制,通过Fisher Adam优化器有效捕捉参数空间的几何结构信息,实现学习率的精确自适应调整,从而显著增强了模型对微表情和宏表情的协同检测能力。在微表情检测任务中,该算法与同类代表性算法相比,在CAS(ME)2数据集和SAMM Long Videos数据集上的性能分别提升了54.20%和20.11%。与最新算法相比,两个数据集上的提升幅度分别为38.43%和6.81%,有效证明了该方法在长视频微表情检测任务上的优越性能。
文摘为解决篇章级多事件抽取中事件及论元角色间全局语义关联缺失、文档信息利用不足的问题,提出了基于论元关联和图神经网络的篇章级多事件抽取(document-level multi-event extraction based on argument correlation and graph neural network,DEEACG)模型。首先,使用基于变换器的双向编码器表示(bidirectional encoder representations from Transformers,BERT)模块获取实体,并引入实体共事件性预测任务,增强实体间的语义关联。接着,引入可学习的事件代理节点,构建包含实体、上下文和代理节点的异构图,通过特征线性调制图神经网络(graph neural network with feature-wise linear modulation,GNN-FiLM)与多头自注意力机制,实现多事件间的全局交互与语义融合。然后,通过多层感知机进行事件类型检测。最后,构建双投影空间建模论元关联,采用Bron-Kerbosch算法提取图中极大团作为候选论元组合,并结合多头注意力实现论元角色分类。结果表明,DEEACG模型在中文金融公告(Chinese financial announcements,ChFinAnn)数据集的多事件抽取任务中性能明显提升,与关系增强文档级事件抽取(relation-enabled document-level event extraction,ReDEE)模型相比,F1均值提升了2.1个百分点。该研究证实DEEACG模型能有效捕捉多事件间语义关联,适用于篇章级多事件抽取任务。
基金the National Natural Science Foundation of China(No.61903109)the Zhejiang Provincial Natural Science Foundation of China(No.LY19F030021)。
文摘The prediction of regional traffic flows is important for traffic control and management in an intelligent traffic system.With the help of deep neural networks,the convolutional neural network or residual neural network,which can be applied only to regular grids,is adopted to capture the spatial dependence for flow prediction.However,the obtained regions are always irregular considering the road network and administrative boundaries;thus,dividing the city into grids is inaccurate for prediction.In this paper,we propose a new model based on multi-graph convolutional network and gated recurrent unit(MGCN-GRU)to predict traffic flows for irregular regions.Specifically,we first construct heterogeneous inter-region graphs for a city to reflect the rela-tionships among regions.In each graph,nodes represent the irregular regions and edges represent the relationship types between regions.Then,we propose a multi-graph convolutional network to fuse different inter-region graphs and additional attributes.The GRU is further used to capture the temporal dependence and to predict future traffic flows.Experimental results based on three real-world large-scale datasets(public bicycle system dataset,taxi dataset,and dockless bike-sharing dataset)show that our MGCN-GRU model outperforms a variety of existing methods.