Using BlazePose on Spatial Temporal Graph Convolutional Networks for Action Recognition (Cited by: 2)
1
Authors: Motasem S. Alsawadi, El-Sayed M. El-kenawy, Miguel Rio. Source: Computers, Materials & Continua (SCIE, EI), 2023, No. 1, pp. 19-36, 18 pages.
The ever-growing volume of available visual data (i.e., videos and pictures uploaded by internet users) has attracted the attention of the computer vision research community. Finding efficient ways to extract knowledge from these sources is therefore imperative. Recently, the BlazePose system was released for skeleton extraction from images, oriented to mobile devices. With this skeleton graph representation in place, a Spatial-Temporal Graph Convolutional Network can be implemented to predict the action. We hypothesize that just by changing the skeleton input data for a different set of joints that offers more information about the action of interest, it is possible to increase the performance of the Spatial-Temporal Graph Convolutional Network for human action recognition (HAR) tasks. Hence, in this study, we present the first implementation of the BlazePose skeleton topology upon this architecture for action recognition. Moreover, we propose the Enhanced-BlazePose topology, which can achieve better results than its predecessor. Additionally, we propose different skeleton detection thresholds that can improve accuracy even further. We reached a top-1 accuracy of 40.1% on the Kinetics dataset. For the NTU-RGB+D dataset, we achieved 87.59% and 92.1% accuracy for the Cross-Subject and Cross-View evaluation criteria, respectively.
Keywords: action recognition; BlazePose; graph neural network; OpenPose; skeleton; spatial temporal graph convolution network
Skeleton Split Strategies for Spatial Temporal Graph Convolution Networks
2
Authors: Motasem S. Alsawadi, Miguel Rio. Source: Computers, Materials & Continua (SCIE, EI), 2022, No. 6, pp. 4643-4658, 16 pages.
Action recognition has been recognized as an activity in which individuals' behaviour can be observed. Assembling profiles of regular activities, such as activities of daily living, can support identifying trends in the data during critical events. A skeleton representation of the human body has proven effective for this task. The skeletons are presented in graph form. However, the topology of a graph is not structured like Euclidean-based data. Therefore, a new set of methods to perform the convolution operation upon the skeleton graph is proposed. Our proposal is based on the Spatial Temporal Graph Convolutional Network (ST-GCN) framework. In this study, we propose an improved set of label mapping methods for the ST-GCN framework. We introduce three split techniques (full distance split, connection split, and index split) as an alternative approach for the convolution operation. The experiments presented in this study were trained on two benchmark datasets, NTU-RGB+D and Kinetics, to evaluate performance. Our results indicate that our split techniques outperform the previous partition strategies and are more stable during training, without using the additional edge-importance-weighting training parameter. Therefore, our proposal can provide a more realistic solution for real-time applications centred on daily-living activity recognition systems for indoor environments.
Keywords: skeleton split strategies; spatial temporal graph convolutional neural networks; skeleton joints; action recognition
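Editor's illustration: the "convolution operation upon the skeleton graph" mentioned in the entry above can be sketched in a few lines. This is a minimal sketch only. It assumes the commonly used symmetric normalization D^(-1/2) (A + I) D^(-1/2) and a hypothetical 4-joint chain skeleton; it is not the split strategy proposed in the paper, and all names below are illustrative.

```python
import math


def normalized_adjacency(edges, num_joints):
    """Return D^(-1/2) (A + I) D^(-1/2) for an undirected skeleton graph."""
    a = [[0.0] * num_joints for _ in range(num_joints)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    for i in range(num_joints):
        a[i][i] = 1.0  # self-loop so each joint keeps its own feature
    deg = [sum(row) for row in a]
    return [
        [a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(num_joints)]
        for i in range(num_joints)
    ]


def matmul(x, y):
    """Plain nested-list matrix product."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]


def graph_conv(features, edges, weight):
    """One spatial graph-convolution step: A_hat @ X @ W."""
    a_hat = normalized_adjacency(edges, len(features))
    return matmul(matmul(a_hat, features), weight)


# Hypothetical 4-joint chain 0-1-2-3 with one scalar feature per joint.
EDGES = [(0, 1), (1, 2), (2, 3)]
FEATURES = [[1.0], [0.0], [0.0], [0.0]]
IDENTITY_WEIGHT = [[1.0]]
OUTPUT = graph_conv(FEATURES, EDGES, IDENTITY_WEIGHT)
```

With the identity weight, the feature placed on joint 0 spreads only to its direct neighbour, joint 1. Partition or split strategies such as those discussed in the entries here change how neighbours are grouped and weighted, which this sketch does not attempt.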
Local-global dynamic correlations based spatial-temporal convolutional network for traffic flow forecasting
3
Authors: ZHANG Hong, GONG Lei, ZHAO Tianxin, ZHANG Xijun, WANG Hongyan. Source: High Technology Letters (EI, CAS), 2024, No. 4, pp. 370-379, 10 pages.
Traffic flow forecasting plays a crucial role in intelligent traffic systems (ITS) and is the key technology for realizing dynamic traffic guidance and active traffic control. Aiming at the complex local and global spatial-temporal dynamic characteristics of traffic flow, this paper proposes a new traffic flow forecasting model, the spatial-temporal attention graph neural network (STA-GNN), which combines an attention mechanism (AM) with a spatial-temporal convolutional network. The model learns the hidden dynamic local spatial correlations of the traffic network by combining the dynamic adjacency matrix constructed by the graph learning layer with the graph convolutional network (GCN). The local temporal correlations of traffic flow at different scales are extracted by stacking multiple convolutional kernels in a temporal convolutional network (TCN). The global spatial-temporal dependencies of long time sequences of traffic flow are captured by the spatial-temporal attention mechanism (STAtt), which enhances the global spatial-temporal modeling and the representational ability of the model. Experimental results on two datasets, METR-LA and PEMS-BAY, show that the proposed STA-GNN model outperforms common baseline models in forecasting accuracy.
Keywords: traffic flow forecasting; graph convolutional network (GCN); temporal convolutional network (TCN); attention mechanism (AM)
Occluded Gait Emotion Recognition Based on Multi-Scale Suppression Graph Convolutional Network
4
Authors: Yuxiang Zou, Ning He, Jiwu Sun, Xunrui Huang, Wenhua Wang. Source: Computers, Materials & Continua (SCIE, EI), 2025, No. 1, pp. 1255-1276, 22 pages.
In recent years, gait-based emotion recognition has been widely applied in the field of computer vision. However, existing gait emotion recognition methods typically rely on complete human skeleton data, and their accuracy declines significantly when the data is occluded. To enhance the accuracy of gait emotion recognition under occlusion, this paper proposes a Multi-scale Suppression Graph Convolutional Network (MS-GCN). The MS-GCN consists of three main components: a Joint Interpolation Module (JI Module), a Multi-scale Temporal Convolution Network (MS-TCN), and a Suppression Graph Convolutional Network (SGCN). The JI Module completes the spatially occluded skeletal joints using the K-Nearest Neighbors (KNN) interpolation method. The MS-TCN employs convolutional kernels of various sizes to comprehensively capture the emotional information embedded in the gait, compensating for the temporal occlusion of gait information. The SGCN extracts more non-prominent human gait features by suppressing the extraction of key body part features, thereby reducing the negative impact of occlusion on emotion recognition results. The proposed method is evaluated on two comprehensive datasets: Emotion-Gait, containing 4227 real gaits from sources like BML, ICT-Pollick, and ELMD, and 1000 synthetic gaits generated using STEP-Gen technology; and ELMB, consisting of 3924 gaits, with 1835 labeled with emotions such as “Happy,” “Sad,” “Angry,” and “Neutral.” On the standard datasets Emotion-Gait and ELMB, the proposed method achieved accuracies of 0.900 and 0.896, respectively, attaining performance comparable to other state-of-the-art methods. Furthermore, on occlusion datasets, the proposed method significantly mitigates the performance degradation caused by occlusion, and its accuracy is significantly higher than that of other methods.
Keywords: KNN interpolation; multi-scale temporal convolution; suppression graph convolutional network; gait emotion recognition; human skeleton
Human Motion Prediction Based on Multi-Level Spatial and Temporal Cues Learning
5
Authors: Jiayi Geng, Yuxuan Wu, Wenbo Lu, Pengxiang Su, Amel Ksibi, Wei Li, Zaffar Ahmed Shaikh, Di Gai. Source: Computers, Materials & Continua, 2025, No. 11, pp. 3689-3707, 19 pages.
Predicting human motion based on historical motion sequences is a fundamental problem in computer vision and is at the core of many applications. Existing approaches primarily focus on encoding spatial dependencies among human joints while ignoring temporal cues and the complex relationships across non-consecutive frames. These limitations hinder the model's ability to generate accurate predictions over longer time horizons and in scenarios with complex motion patterns. To address these problems, we propose a novel multi-level spatial and temporal learning model, which consists of a Cross Spatial Dependencies Encoding Module (CSM) and a Dynamic Temporal Connection Encoding Module (DTM). Specifically, the CSM is designed to capture complementary local and global spatial dependency information at both the joint level and the joint-pair level. We further present the DTM to encode diverse temporal evolution contexts and compress motion features to a deep level, enabling the model to capture both short-term and long-term dependencies efficiently. Extensive experiments on the Human3.6M and CMU Mocap datasets demonstrate that our model achieves state-of-the-art performance in both short-term and long-term predictions, outperforming existing methods by up to 20.3% in accuracy. Furthermore, ablation studies confirm the significant contributions of the CSM and DTM to prediction accuracy.
Keywords: human motion prediction; spatial dependencies learning; temporal context learning; graph convolutional networks; transformer
Research on an Improved ST-GCN Human Action Recognition Method Fusing Spatial-Temporal Attention (Cited by: 1)
6
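Authors: 雷建云, 梁钧, 夏梦, 张慧丽, 田祚汉. Source: 《中南民族大学学报(自然科学版)》, 2025, No. 4, pp. 526-535, 10 pages.
To address the problem that existing skeleton-based human action recognition algorithms cannot fully exploit the spatial-temporal features of motion, an improved graph convolutional network model that fuses spatial-temporal attention is proposed. The model contains a spatial attention mechanism and a temporal attention mechanism, which extract the global spatial-temporal features of an action along the spatial and temporal dimensions, respectively. The two are fused into a unified spatial-temporal graph convolutional network (ST-GCN) framework, enabling end-to-end training. Comparative experiments on two public datasets, Kinetics and NTU RGB+D, show that on the NTU-RGB+D dataset the improved model achieves a Top-1 accuracy of 82.37% under the CS criterion and 89.84% under the CV criterion, improving Top-1 accuracy by 0.87% and Top-5 accuracy by 1.54%, respectively, over the original ST-GCN algorithm. On the Kinetics dataset, the improved model achieves an accuracy of 31.78%, 1.08% higher than ST-GCN. This verifies the effectiveness of the improved method.
Keywords: graph convolutional network; skeleton data; action recognition; spatial-temporal attention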
A Spatio-Temporal Heterogeneity Data Accuracy Detection Method Fused by GCN and TCN
7
Authors: Tao Liu, Kejia Zhang, Jingsong Yin, Yan Zhang, Zihao Mu, Chunsheng Li, Yanan Hu. Source: Computer Systems Science & Engineering (SCIE, EI), 2023, No. 11, pp. 2563-2582, 20 pages.
Spatio-temporal heterogeneous data is the data basis for decision-making in many fields, and checking its accuracy can provide data support for decisions. Because of the randomness, complexity, and global and local correlation of spatio-temporal heterogeneous data in the temporal and spatial dimensions, traditional detection methods cannot guarantee both detection speed and accuracy. Therefore, this article proposes a method for detecting the accuracy of spatio-temporal heterogeneous data by fusing graph convolution and temporal convolution networks. First, the geographic weighting function is introduced and improved to quantify the degree of association between nodes and to calculate the weighted adjacency value, simplifying the complex topology. Second, spatio-temporal convolutional units are designed based on graph convolutional neural networks and temporal convolutional networks to improve detection speed and accuracy. Finally, the proposed method is compared with three methods, ARIMA, T-GCN, and STGCN, in real scenarios to verify its effectiveness in terms of detection speed, detection accuracy, and stability. The experimental results show that the RMSE, MAE, and MAPE of this method are the smallest in the cases of simple and complex connectivity, at 13.82/12.08, 2.77/2.41, and 16.70/14.73, respectively. It also has the shortest detection time, 672.31/887.36, respectively. In addition, the evaluation results are the same under different processing time periods and complex topology environments, which indicates that the method has the highest detection accuracy and has good research value and application prospects.
Keywords: spatiotemporal heterogeneity data; data accuracy; complex topology structure; graph convolutional networks; temporal convolutional networks
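Editor's illustration: the entry above reports RMSE, MAE, and MAPE. For readers unfamiliar with these metrics, a minimal sketch under their standard textbook definitions follows. The paper's exact formulas and data are not given in the abstract, so the sample values here are hypothetical.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent; assumes no true value is zero."""
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical observations and predictions, for illustration only.
Y_TRUE = [100.0, 200.0, 400.0]
Y_PRED = [110.0, 180.0, 400.0]

print(rmse(Y_TRUE, Y_PRED), mae(Y_TRUE, Y_PRED), mape(Y_TRUE, Y_PRED))
```

RMSE penalizes large individual errors more heavily than MAE, while MAPE expresses error relative to the true value, which is why the three are often reported together.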
Multi-Polar Evolution of Global Inventive Talent Flow Network: An Endogenous Migration Model and Empirical Analysis
8
Authors: Zheng Jianghuai, Sun Dongqing, Dai Wei, Shi Lei. Source: China Economist, 2025, No. 4, pp. 80-100, 21 pages.
The global clustering of inventive talent shapes innovation capacity and drives economic growth. For China, this process is especially crucial in sustaining its development momentum. This paper draws on data from the EPO Worldwide Patent Statistical Database (PATSTAT) to extract global inventive talent mobility information and analyzes the spatial structural evolution of the global inventive talent flow network. The study finds that this network is undergoing a multi-polar transformation, characterized by the rising importance of a few central countries (such as the United States, Germany, and China) and the increasing marginalization of many peripheral countries. In response to this phenomenon, the paper constructs an endogenous migration model and tests it empirically using the Temporal Exponential Random Graph Model (TERGM). The results reveal several endogenous mechanisms driving global inventive talent flows, including reciprocity, path dependence, convergence effects, transitivity, and cyclic structures, all of which contribute to the network's multi-polar trend. In addition, differences in regional industrial structures significantly influence talent mobility choices and are a decisive factor in the formation of poles within the multi-polar landscape. Based on these findings, the paper suggests fostering two-way channels for talent exchange between China and other global innovation hubs to enhance international collaboration and knowledge flow, and reducing the migration costs and institutional barriers faced by R&D personnel to encourage greater mobility of high-skilled talent. Furthermore, the government is advised to use regional strengths in high-tech industries as a lever to capture competitive advantages in emerging technologies and products, ultimately strengthening the country's position in the global innovation landscape.
Keywords: inventive talent flow network; multipolarity; spatial structural evolution; regional industrial structure disparities; temporal exponential random graph model (TERGM)
Human Fall Detection with an Improved ST-GCN
9
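Authors: 王世刚, 邓珍妮, 饶淼淼. Source: 《计算机系统应用》, 2025, No. 8, pp. 159-168, 10 pages.
To address problems of the ST-GCN algorithm in action recognition, such as the need to predefine the human skeleton topology graph and accuracy that leaves room for improvement, a fall detection algorithm combining OpenPose with an improved ST-GCN is proposed. The OpenPose algorithm extracts human skeleton keypoint data, which is fed into the improved ST-GCN algorithm for action recognition. The ST-GCN algorithm is improved in two ways: an adaptive graph convolution module is introduced, which dynamically adjusts the graph structure to make feature extraction more flexible across different action types; and an attention mechanism module is introduced to further improve recognition performance. Results on public datasets show that, on the NTU-RGB+D 60 dataset, top-1 accuracy on X-Sub and X-View improves by 2.2% and 2.5%, respectively, compared with the unimproved algorithm; on the Kinetics-Skeleton dataset, top-1 and top-5 accuracy improve by 3.1% and 4%, respectively. Accuracy on a self-built dataset improves by 4.7%. The experimental results show that the proposed algorithm meets the requirements of practical applications.
Keywords: spatial-temporal graph convolution; human pose estimation; fall detection; computer vision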
Research on an Improved ST-GCN Single-Person Pose Estimation Algorithm
10
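Authors: 史健婷, 王印冉, 詹怀远. Source: 《计算机技术与发展》, 2025, No. 1, pp. 61-66, 6 pages.
In recent years, single-person pose estimation has been widely applied in many fields. Reducing the dependence of single-person pose estimation algorithms on labeled data while improving their accuracy is a challenging but very important topic in computer vision. To address this problem, this paper proposes a method based on an improved Spatio-Temporal Graph Convolutional Network (ST-GCN). On the basis of the original ST-GCN, the lightweight MoveNet neural network is integrated, and its keypoint recognition capability is used to solve the problem that ST-GCN requires keypoint data to be annotated in advance. The SimAM attention mechanism is introduced to solve the problem that the original ST-GCN cannot adequately distinguish the important information in the channels and treats all information equally. A combined ReLU6-Sigmoid activation function is added to solve the problems of training fluctuation and insufficient nonlinear fitting with the original activation function. In short, the detection accuracy of the original spatio-temporal graph convolutional network is improved, dependence on labeled data during application is reduced, and the fluctuation of loss and precision during training is lowered. The effectiveness of the improved network is demonstrated on the FLORENCE 3D ACTIONS dataset. The results show that the accuracy of the improved network increases from 0.8695 to 0.956521, and the F1 score increases from 0.887566 to 0.965432.
Keywords: computer vision; improved spatio-temporal graph convolutional neural network; model fusion; SimAM; ReLU6-Sigmoid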
ST-GCN Human Action Recognition Based on a New Partition Strategy (Cited by: 6)
11
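Authors: 杨世强, 李卓, 王金华, 贺朵, 李琦, 李德信. Source: 《计算机集成制造系统》 (EI, CSCD, PKU Core), 2023, No. 12, pp. 4040-4050, 11 pages.
Human action recognition is an important technology in fields such as intelligent surveillance, human-computer interaction, and robotics. Action recognition methods based on human skeleton sequences have an inherent advantage when facing complex backgrounds and variations in body scale, viewpoint, and motion speed. The spatial-temporal graph convolutional network model (ST-GCN) delivers excellent recognition performance in human behavior recognition. To address the problem that the partition strategy in ST-GCN attends only to local motion, a new partition strategy is designed. By associating the root node with more distant nodes, it strengthens the information links between body parts and the links between local motions. The neighborhood of the root node is divided into five regions (the root node itself, the centripetal group, the far centripetal group, the centrifugal group, and the far centrifugal group), and each region is assigned a different weight, improving the model's perception of whole-body motion. Finally, experiments were carried out on public datasets and in real scenes. The results show that a Top-1 classification accuracy of 31.1% was obtained on the large-scale Kinetics-skeleton dataset, 0.4% higher than the original model; on the two subsets of NTU-RGB+D, Top-1 accuracies of 83.7% and 91.6% were obtained, 2.3% and 3.3% higher than the original model. In real scenes, the proposed model has a high recognition rate for actions with pronounced and distinctive changes, such as push-ups and jogging, and a lower recognition rate for local movements and actions with similar changes, such as clapping and head shaking, leaving room for further improvement.
Keywords: action recognition; deep learning; spatial-temporal graph convolutional network model; partition strategy; skeleton sequence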
Human Behavior Recognition Based on Associated Partitioning and ST-GCN (Cited by: 10)
12
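Authors: 刘锁兰, 顾嘉晖, 王洪元, 张云鹏. Source: 《计算机工程与应用》 (CSCD, PKU Core), 2021, No. 13, pp. 168-175, 8 pages.
Skeleton-based action recognition has attracted wide attention because it is unaffected by the physical characteristics of the human body and conveys the information important for human behavior recognition simply and clearly. Traditional skeleton modeling usually relies on manually set traversal rules, which limits expressive power and makes generalization difficult. Therefore, building on the spatial-temporal graph convolutional network (ST-GCN) model that has been popular in recent years, a new partition strategy for dividing skeleton joints is proposed. Compared with the original partition method, this strategy strengthens the relationships between relative body positions, which helps improve the temporal and spatial association of skeleton joint information. In addition, different learning rates are set across training iterations to further improve recognition accuracy. Recognition performance is compared with existing methods on two large-scale datasets of different natures, Kinetics and NTU-RGB+D, and the experimental results demonstrate the effectiveness of the method.
Keywords: behavior recognition; joints; spatial-temporal graph convolutional network (ST-GCN); partition strategy; learning rate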
Recognition of Unsafe Behavior of Air Traffic Controllers Based on ST-GCN (Cited by: 5)
13
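Authors: 王超, 徐楚昕, 董杰, 王志锋. Source: 《中国安全科学学报》 (CAS, CSCD, PKU Core), 2023, No. 5, pp. 42-48, 7 pages.
To prevent and supervise violations in air traffic control (ATC) work, intelligent video analysis technology is used to study an unsafe behavior recognition model suited to controllers working in a seated posture. First, the concealed nature of controllers' unsafe working behaviors is analyzed, five typical unsafe behaviors are identified (stretching, dozing, sleeping with the head down, sleeping with the head tilted, and sleeping half-reclined), and a video dataset of controllers' unsafe working states (CUWS) is constructed. Second, a skeleton keypoint extension algorithm that can describe a controller's seated posture is proposed; based on the spatial-temporal graph convolutional network (ST-GCN), an unsafe behavior recognition model, ATC-ST-GCN, is built for seated controllers with occluded legs, and the workflow for recognizing controllers' unsafe behavior is given. Finally, the ATC-ST-GCN model is trained and tested on the CUWS dataset, and validation experiments are carried out on actual surveillance video from a control room. The results show that the model can recognize the five typical unsafe behaviors on a limited validation dataset with an accuracy of 93.65%. The experimental results demonstrate that the model has a degree of scientific validity and effectiveness.
Keywords: spatial-temporal graph convolutional network (ST-GCN); air traffic control (ATC); unsafe behavior; controller; behavior recognition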
GraphHeat-ChebNet Tunnel Deformation Prediction Model Based on Multi-Feature Fusion (Cited by: 1)
14
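Authors: 熊安萍, 李梦凡, 龙林波. Source: 《重庆邮电大学学报(自然科学版)》 (CSCD, PKU Core), 2023, No. 1, pp. 164-175, 12 pages.
Predicting tunnel deformation is one component of tunnel structural anomaly detection. To fully exploit the spatial-temporal correlation of deformation features and to predict the deformation of multiple cross-sections of the tunnel lining simultaneously, a GraphHeat-ChebNet tunnel deformation prediction model based on multi-feature fusion is proposed. The model uses two graph convolution networks (GCNs), GraphHeat and ChebNet, to extract the low-frequency and high-frequency parts of the feature signal, respectively, and to capture the spatial correlation of deformation features. A ConvGRUs network extracts the temporal correlation of the features, and a three-stage fusion method preserves the extracted information. To address the insufficiency of the experimental data along the time dimension, a two-layer sliding window mechanism is introduced. In addition, in experimental comparisons with other models and algorithms on different datasets, the proposed model's error metrics for one-day and two-day predictions are better than those of the other models, and its prediction error is low for most nodes. This indicates that the model is little affected by the number of sample nodes, predicts one-day and two-day deformation well, has a strong ability to learn features and spatial-temporal patterns, and generalizes well.
Keywords: tunnel deformation; prediction model; fused spatial-temporal data; sliding window; graph convolutional network (GCN)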
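Editor's illustration: the entry above mentions a sliding window mechanism for stretching limited time-series data. The abstract does not specify how the two-layer variant works, so the sketch below shows only the basic single-level idea of cutting a series into overlapping (input, target) training pairs. The function name, parameters, and readings are hypothetical.

```python
def sliding_windows(series, input_len, horizon, stride=1):
    """Split a sequence into (input, target) pairs.

    Each pair uses `input_len` consecutive values as input and the
    following `horizon` values as the prediction target.
    """
    if input_len <= 0 or horizon <= 0 or stride <= 0:
        raise ValueError("input_len, horizon and stride must be positive")
    pairs = []
    last_start = len(series) - input_len - horizon
    for start in range(0, last_start + 1, stride):
        window = series[start:start + input_len]
        target = series[start + input_len:start + input_len + horizon]
        pairs.append((window, target))
    return pairs


# Hypothetical daily deformation readings (arbitrary units).
READINGS = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
PAIRS = sliding_windows(READINGS, input_len=3, horizon=2)
```

With six readings, a three-value input, and a two-value horizon, the series yields two overlapping training pairs; a series shorter than `input_len + horizon` yields none.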
Spatial-Temporal Graph-CoordAttention Network for Traffic Flow Forecasting (Cited by: 2)
15
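Authors: 刘建松, 康雁, 李浩, 王韬, 王海宁. Source: 《计算机科学》 (CSCD, PKU Core), 2023, No. S01, pp. 558-564, 7 pages.
Traffic forecasting is an important research component of urban intelligent transportation systems, making travel more efficient and safer. Because of complex temporal and spatial dependencies, accurately forecasting traffic flow remains a great challenge. In recent years, graph convolutional networks (GCN) have shown great potential in traffic forecasting, but GCN-based models tend to capture temporal and spatial dependencies separately, neglecting the dynamic correlation between them and failing to fuse them well. In addition, previous methods use the real-world static traffic network to construct the spatial adjacency matrix, which may overlook dynamic spatial dependencies. To overcome these limitations and improve model performance, a novel spatial-temporal Graph-CoordAttention network (STGCA) is proposed. Specifically, a spatial-temporal synchronization module is proposed to model the interplay of spatial-temporal dependencies at different time steps. Then, a dynamic graph learning scheme is proposed that mines latent graph information from the data correlations between traffic flows. In comparative experiments against existing baseline models on four public datasets, STGCA shows excellent performance.
Keywords: traffic flow forecasting; spatial-temporal forecasting; graph convolutional network; attention mechanism; spatial-temporal dependency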
Video summarization with a graph convolutional attention network (Cited by: 3)
16
Authors: Ping LI, Chao TANG, Xianghua XU. Source: Frontiers of Information Technology & Electronic Engineering (SCIE, EI, CSCD), 2021, No. 6, pp. 902-913, 12 pages.
Video summarization has established itself as a fundamental technique for generating compact and concise video, which eases the management and browsing of large-scale video data. Existing methods fail to fully consider the local and global relations among video frames, leading to degraded summarization performance. To address this problem, we propose a graph convolutional attention network (GCAN) for video summarization. GCAN consists of two parts, embedding learning and context fusion, where embedding learning includes a temporal branch and a graph branch. In particular, GCAN uses dilated temporal convolution to model local cues and temporal self-attention to exploit global cues for video frames. It learns graph embeddings via a multi-layer graph convolutional network to reveal the intrinsic structure of frame samples. The context fusion part combines the output streams from the temporal branch and the graph branch to create a context-aware representation of frames, on which importance scores are evaluated to select representative frames for the video summary. Experiments on two benchmark databases, SumMe and TVSum, show that the proposed GCAN approach achieves superior performance compared with several state-of-the-art alternatives in three evaluation settings.
Keywords: temporal learning; self-attention mechanism; graph convolutional network; context fusion; video summarization
Design of an ST-GCN-Based Police Incident Recognition System for Police Patrol Robots (Cited by: 5)
17
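Authors: 胡丽军, 吴燕玲, 宋全军, 徐湛楠. Source: 《传感器与微系统》 (CSCD, PKU Core), 2023, No. 6, pp. 78-81, 4 pages.
To address the problems that existing incident recognition systems for police patrol robots recognize only a single type of incident and have a low recognition rate, an incident recognition system for police patrol robots is designed. It is based on the fusion of the spatial-temporal graph convolutional network (ST-GCN) and the OpenPose algorithm and targets three types of incident: fall, smash, and push. In tests in real scenes, the recognition rates for the three incident types were 85% for fall, 80% for smash, and 83% for push, with a real-time recognition frame rate of 10 fps. The experimental results show that the designed system can recognize multiple types of incident in real time, accurately, and reliably, and has high application value.
Keywords: spatial-temporal graph convolutional network; OpenPose algorithm; police incident recognition; police patrol robot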
S^(2)ANet: Combining local spectral and spatial point grouping for point cloud processing
18
Authors: Yujie LIU, Xiaorui SUN, Wenbin SHAO, Yafu YUAN. Source: 《虚拟现实与智能硬件(中英文)》 (EI), 2024, No. 4, pp. 267-279, 13 pages.
Background: Despite recent progress in 3D point cloud processing using deep convolutional neural networks, the inability to extract local features remains a challenging problem. In addition, existing methods consider only the spatial domain in the feature extraction process. Methods: In this paper, we propose a spectral and spatial aggregation convolutional network (S^(2)ANet), which combines spectral and spatial features for point cloud processing. First, we calculate the local frequency of the point cloud in the spectral domain. Then, we use the local frequency to group points and provide a spectral aggregation convolution module to extract the features of the points grouped by local frequency. We simultaneously extract local features in the spatial domain to supplement the final features. Results: S^(2)ANet was applied to several point cloud analysis tasks; it achieved state-of-the-art classification accuracies of 93.8%, 88.0%, and 83.1% on the ModelNet40, ShapeNetCore, and ScanObjectNN datasets, respectively. For indoor scene segmentation, training and testing were performed on the S3DIS dataset, and the mean intersection over union was 62.4%. Conclusions: The proposed S^(2)ANet can effectively capture the local geometric information of point clouds, thereby improving accuracy on various tasks.
Keywords: local frequency; spectral and spatial aggregation convolution; spectral group convolution; point cloud representation learning; graph convolutional network
An Expressway Holiday Traffic Flow Prediction Model Incorporating the ST-GCN Algorithm (Cited by: 2)
19
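Authors: 贾百强, 徐延军, 周涛. Source: 《上海船舶运输科学研究所学报》, 2022, No. 5, pp. 58-65, 8 pages.
To predict traffic flow on key expressway sections during holidays efficiently and accurately, an expressway holiday traffic flow prediction model incorporating the Spatio-Temporal Graph Convolutional Network (ST-GCN) is proposed. The model uses important traffic indicators such as average speed and traffic flow as elements of a traffic state evaluation system to forecast the overall expressway traffic situation during holidays; by incorporating the ST-GCN algorithm and jointly considering spatial-temporal characteristics, it obtains predictions of relatively high accuracy. The model's effectiveness is verified using holiday traffic information from expressways in the Ningxia Hui Autonomous Region. The results show that, compared with other commonly used prediction models, the model is more accurate, more stable, and more robust, and its predictions can serve as a reference for expressway management and operation.
Keywords: spatio-temporal graph convolutional network (ST-GCN) model; flow prediction; expressway; holidays; interactive prediction; online learning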
An Early Warning System Based on an ST-GCN Short-Term Road Condition Prediction Algorithm (Cited by: 1)
20
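Authors: 李长亮. Source: 《上海船舶运输科学研究所学报》, 2023, No. 1, pp. 49-54, 6 pages.
To improve the accuracy of expressway vehicle speed prediction, and to address the partial loss of temporal and spatial correlation in existing speed prediction models, an early warning system based on a short-term road condition prediction algorithm using the Spatio-Temporal Graph Convolution Network (ST-GCN) is proposed. The algorithm jointly considers the effects of temporal and spatial correlation and, from real-time and historical traffic data, builds an ST-GCN model to analyze and predict traffic flow speed and road conditions over a future period. The predictions are pushed to multicolor intelligent information boards deployed along the expressway, and the information displayed on them guides the behavior of drivers and passengers, thereby reducing the accident rate and improving expressway throughput.
Keywords: short-term road condition prediction; speed prediction; spatio-temporal graph convolutional network (ST-GCN); attention network; long short-term memory network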