Online video scene clustering by competitive incremental NMF

Cited by: 3
Authors
Bucak, Serhat Selcuk [1]
Gunsel, Bilge [2]
Affiliations
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
[2] Istanbul Tech Univ, Dept Elect & Commun Engn, Multimedia Signal Proc & Pattern Recognit Lab, TR-34469 Maslak, Turkey
Keywords
Online video segmentation; Unsupervised video clustering; Matrix factorization
DOI
10.1007/s11760-011-0264-2
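Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract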
Efficient clustering and categorization of video are becoming increasingly vital in applications such as video summarization and content-based representation. The large volume of video data is the biggest challenge in this task, because most clustering techniques suffer in both accuracy and efficiency on high-dimensional data. In addition, most video applications require online processing, so clustering must also be performed online. This paper presents an online video scene clustering/segmentation method based on incremental nonnegative matrix factorization (INMF), which has been shown to be a powerful content representation tool for high-dimensional data. The proposed algorithm (Comp-INMF) enables online representation of video content and increases efficiency significantly by integrating a competitive learning scheme into INMF. It offers a systematic solution to the issue of rank selection in nonnegative matrix factorization, which is equivalent to specifying the number of clusters. Clustering performance is evaluated on TRECVID video sequences and compared against baseline methods, including Adaptive Resonance Theory (ART), to demonstrate the efficiency and efficacy of the proposed video clustering scheme. Results reported in terms of recall, precision, and F1 measures show that the labeling accuracy of the algorithm is notable, especially in edit-effect regions, which are a challenging point in video analysis.
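To illustrate the abstract's remark that choosing the NMF rank amounts to choosing the number of clusters, here is a minimal sketch using plain batch NMF with multiplicative updates. It is not the authors' Comp-INMF: there is no incremental update and no competitive learning. The function names `nmf` and `cluster_labels`, the synthetic "frames", and all parameter values are assumptions made for illustration only.

```python
import numpy as np

def nmf(V, rank, n_iter=300, seed=0, eps=1e-9):
    """Batch NMF, V ~ W @ H, via multiplicative updates (illustrative only)."""
    rng = np.random.default_rng(seed)
    n_features, n_samples = V.shape
    W = rng.random((n_features, rank)) + eps   # basis vectors, one per cluster
    H = rng.random((rank, n_samples)) + eps    # per-frame encoding weights
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def cluster_labels(W, H):
    """Assign each frame to the basis vector with the largest contribution."""
    scale = W.sum(axis=0)                      # remove the W/H scaling ambiguity
    return np.argmax(H * scale[:, None], axis=0)

# Synthetic data (assumption): two "scenes", each with its own nonnegative
# feature prototype, ten frames per scene, plus a little noise.
rng = np.random.default_rng(1)
scene_a = np.array([1.0, 1.0, 1.0, 0.0, 0.0, 0.0])
scene_b = np.array([0.0, 0.0, 0.0, 1.0, 1.0, 1.0])
frames = np.stack([scene_a] * 10 + [scene_b] * 10, axis=1)  # shape (6, 20)
V = frames + 0.01 * rng.random(frames.shape)

W, H = nmf(V, rank=2)          # rank 2 means two clusters
labels = cluster_labels(W, H)
print(labels)
```

Here the rank is fixed by hand at 2. According to the abstract, the proposed method addresses exactly this choice in a systematic way, which the sketch does not attempt to reproduce.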
Pages: 723-739
Page count: 17
Related papers
50 in total
  • [31] YouTube Nation: Precarity and Agency in India's Online Video Scene
    Kumar, Sangeet
    INTERNATIONAL JOURNAL OF COMMUNICATION, 2016, 10 : 5608 - 5625
  • [32] Incremental Gradient on the Grassmannian for Online Foreground and Background Separation in Subsampled Video
    He, Jun
    Balzano, Laura
    Szlam, Arthur
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1568 - 1575
  • [33] Transform Invariant Video Fingerprinting by NMF
    Gursoy, Ozan
    Gunsel, Bilge
    Sengor, Neslihan
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2009, 5702 : 452 - 459
  • [34] Incremental Clustering for Hierarchical Clustering
    Narita, Kakeru
    Hochin, Teruhisa
    Nomiya, Hiroki
    2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 102 - 107
  • [35] Unsupervised Video Action Clustering via Motion-Scene Interaction Constraint
    Peng, Bo
    Lei, Jianjun
    Fu, Huazhu
    Zhang, Changqing
    Chua, Tat-Seng
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (01) : 131 - 144
  • [36] Unsupervised sports video scene clustering and its applications to story units detection
    Zhang, WG
    Ye, QX
    Xing, LY
    Huang, QM
    Gao, W
    Visual Communications and Image Processing 2005, Pts 1-4, 2005, 5960 : 446 - 455
  • [37] Video Scene Segmentation Using Time Constraint Dominant-Set Clustering
    Zeng, Xianglin
    Zhang, Xiaoqin
    Hu, Weiming
    Li, Wanqing
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 637 - +
  • [38] Incremental Graph Clustering for Efficient Retrieval from Streaming Egocentric Video Data
    Chandrasekhar, Vijay
    Wu Min
    Li Xiaoli
    Tan, Cheston
    Li Liyuan
    Hwee, Lim Joo
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2631 - 2636
  • [39] Online Parametric NMF for Speech Enhancement
    Kavalekalam, Mathew Shaji
    Nielsen, Jesper Kjaer
    Shi, Liming
    Christensen, Mads Graesboll
    Boldt, Jesper
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2320 - 2324
  • [40] Video abstraction based on the visual attention model and online clustering
    Ji, Qing-Ge
    Fang, Zhi-Dang
    Xie, Zhen-Hua
    Lu, Zhe-Ming
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (03) : 241 - 253