期刊文献+

基于张量表示的直推式多模态视频语义概念检测 被引量:10

Transductive Multi-Modality Video Semantic Concept Detection with Tensor Representation
在线阅读 下载PDF
导出
摘要 提出了一种基于高阶张量表示的视频语义分析与理解框架.在此框架中,视频镜头首先被表示成由视频中所包含的文本、视觉和听觉等多模态数据构成的三阶张量;其次,基于此三阶张量表达及视频的时序关联共生特性设计了一种子空间嵌入降维方法,称为张量镜头;由于直推式学习从已知样本出发能对特定的未知样本进行学习和识别.最后在这个框架中提出了一种基于张量镜头的直推式支持张量机算法,它不仅保持了张量镜头所在的流形空间的本征结构,而且能够将训练集合外数据直接映射到流形子空间,同时充分利用未标记样本改善分类器的学习性能.实验结果表明,该方法能够有效地进行视频镜头的语义概念检测. A higher-order tensor framework for video analysis and understanding is proposed in this paper. In this framework, image frame, audio and text are represented, which are the three modalities in video shots as data points by the 3rd-order tensor. Then a subspace embedding and dimension reduction method is proposed, which explicitly considers the manifold structure of the tensor space from temporal-sequenced associated co-occurring multimodal media data in video. It is called TensorShot approach. Transductive learning uses a large amount of unlabeled data together with the labeled data to build better classifiers. A transductive support tensor machines algorithm is proposed to train effective classifier. This algorithm preserves the intrinsic structure of the submanifold where tensorshots are sampled, and is also able to map out-of-sample data points directly. Moreover, the utilization of unlabeled data improves classification ability. Experimental results show that this method improves the performance of video semantic concept detection.
出处 《软件学报》 EI CSCD 北大核心 2008年第11期2853-2868,共16页 Journal of Software
基金 Supponed by the National Natural Science Foundation of China under Grant Nos.60603096 60533090(国家自然科学基金) the National High-Tech Research and Development Plan of China under Grant No.2006AA010107(国家高技术研究发展计划(863) the N~ional Key Technology R&D Program 0f China under Grant No.2007BAH11B01(国家科技支撑计划) the Program for Changjiang Scholars and Innovative Research Team in University ofChina under Grant Nos.IRT0652 PCSIRT(长江学者和创新团队发展计划)
关键词 多模态 张量镜头 时序关联共生 高阶SVD 降维 直推式支持张量机 multi-modality TensorShot temporal associated cooccurrence (TAC) higher order SVD (HOSVD) dimensionality reduction transductive support tensor machine (TSTM)
  • 相关文献

参考文献2

二级参考文献32

  • 1[1]Vapnik V. The Nature of Statistical Learning Theory. New York: Springer-Verlag, 1995.
  • 2[2]Stitson MO, Weston JAE, Gammerman A, Vovk V, Vapnik V. Theory of support vector machines. Technical Report, CSD-TR-96-17, Computational Intelligence Group, Royal Holloway: University of London, 1996.
  • 3[3]Cortes C, Vapnik V. Support vector networks. Machine Learning, 1995,20:273~297.
  • 4[4]Vapnik V. Statistical Learning Theory. John Wiley and Sons, 1998.
  • 5[5]Gammerman A, Vapnik V, Vowk V. Learning by transduction. In: Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence. Wisconsin, 1998. 148~156.
  • 6[6]Joachims T. Transductive inference for text classification using support vector machines. In: Proceedings of the 16th International Conference on Machine Learning (ICML). San Francisco: Morgan Kaufmann Publishers, 1999. 200~209.
  • 7[7]Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. In: Haussler D, ed. Proceedings of the 5th Annual ACM Workshop on Computational Learning Theory. Pittsburgh, PA: ACM Press, 1992. 144~152.
  • 8[8]Burges CJC. Simplified support vector decision rules. In: Saitta L, ed. Proceedings of the 13th International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann Publishers, 1996. 71~77.
  • 9[9]Osuna E, Freund R, Girosi F. An improved training algorithm for support vector machines. In: Proceedings of the IEEE NNSP'97. Amelia Island, FL, 1997. 276~285.
  • 10[10]Joachims T. Making large-scale SVM learning practical. In: Scholkopf, Burges C, Smola A, eds. Advances in Kernel Methods--Support Vector Learning B. MIT Press, 1999.

共引文献121

同被引文献96

引证文献10

二级引证文献52

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部