Human Action Recognition System Based on Topic Model (基于话题模型的视频动作识别系统研究)

Abstract: Recognizing human actions in video sequences is a challenging problem in computer vision. This paper adopts the bag-of-words approach from text processing and represents each video clip as a document. Spatio-temporal interest points are detected as the points whose local neighborhood varies most in both the space and time domains, and the visual features extracted at these points serve as the words of the document. A topic model is then applied to the video documents to analyze their latent topics, and the actions are recognized by applying a discriminative rule to the distribution of topics in each video. The system is tested on public data sets, both simple and complex, and the experimental results show that the approach is comparable to or better than the published state-of-the-art methods.
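The bag-of-words step described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not say how the visual vocabulary is built, so the codebook here is assumed to be given (in practice it might come from clustering descriptors, for example with k-means), and the 2-D descriptors, the 3-word codebook, and the function names `quantize` and `video_to_document` are all hypothetical. Each interest-point descriptor is mapped to its nearest codeword, and the video becomes a vector of word counts that a topic model could take as input.

```python
from collections import Counter
from math import dist  # Euclidean distance, available since Python 3.8


def quantize(descriptor, codebook):
    """Return the index of the codeword nearest to `descriptor`."""
    return min(range(len(codebook)), key=lambda k: dist(descriptor, codebook[k]))


def video_to_document(descriptors, codebook):
    """Turn a video's interest-point descriptors into visual-word counts."""
    counts = Counter(quantize(d, codebook) for d in descriptors)
    # One count per codeword, in codebook order (zero if the word never occurs).
    return [counts.get(k, 0) for k in range(len(codebook))]


# Hypothetical toy data: a 3-word codebook and five 2-D descriptors.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
descriptors = [(0.1, -0.2), (0.9, 1.2), (4.8, 5.1), (1.1, 0.8), (5.3, 4.9)]

histogram = video_to_document(descriptors, codebook)
print(histogram)  # [1, 2, 2]
```

The resulting count vector plays the role of a document's term frequencies. Real spatio-temporal descriptors would have many more dimensions, and the vocabulary would typically contain hundreds or thousands of words.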

Author: Shi Wei (施惟)

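Affiliation: MOE-Microsoft Key Laboratory for Intelligent Computing and Intelligent Systems, Shanghai Jiao Tong University; Department of Computer Science and Engineering, Shanghai Jiao Tong University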

Source: Computer and Modernization (《计算机与现代化》), 2013, No. 4, pp. 1-4 (4 pages)

Funding: National Natural Science Foundation of China (Grant No. 61272251)

Keywords: human action recognition; spatio-temporal interest point; bag-of-words model; topic model

Classification code: TP391.4 [Automation and Computer Technology: Computer Application Technology]
