Online video scene clustering by competitive incremental NMF

Cited by: 3

Authors:

Bucak, Serhat Selcuk [1]

Gunsel, Bilge [2]

Affiliations:

[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA

[2] Istanbul Tech Univ, Dept Elect & Commun Engn, Multimedia Signal Proc & Pattern Recognit Lab, TR-34469 Maslak, Turkey

Source:

SIGNAL IMAGE AND VIDEO PROCESSING | 2013 / Volume 7 / Issue 04

Keywords:

Online video segmentation; Unsupervised video clustering; Matrix factorization;

DOI:

10.1007/s11760-011-0264-2

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification code:

0808 ; 0809 ;

Abstract:

Efficient clustering and categorization of video are becoming increasingly vital in applications such as video summarization and content-based representation. The large volume of video data is the biggest challenge in this task, because most clustering techniques lose both accuracy and efficiency on high-dimensional data. In addition, most video applications require online processing, so clustering must also be done online. This paper presents an online video scene clustering/segmentation method based on incremental nonnegative matrix factorization (INMF), which has been shown to be a powerful content representation tool for high-dimensional data. The proposed algorithm (Comp-INMF) enables online representation of video content and increases efficiency significantly by integrating a competitive learning scheme into INMF. It brings a systematic solution to the issue of rank selection in nonnegative matrix factorization, which is equivalent to specifying the number of clusters. Clustering performance is evaluated on TRECVID video sequences and compared against baseline methods, including Adaptive Resonance Theory (ART), to demonstrate the efficiency and efficacy of the proposed scheme. Results reported in terms of recall, precision and F1 measures show that the labeling accuracy of the algorithm is notable, especially at edit effect regions, which are a challenging point in video analysis.
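
The link the abstract draws between NMF rank and the number of clusters can be illustrated with a minimal sketch. This is not the Comp-INMF algorithm from the paper: it is plain batch NMF with the standard multiplicative updates for the Frobenius objective, and it assumes the rank is given rather than selected automatically. The function names `nmf` and `cluster_labels`, the toy matrix `V`, and all parameter values are hypothetical and chosen only for illustration. Each column of `V` stands for one frame's nonnegative feature vector, and each frame is assigned to the basis vector with the largest encoding coefficient.

```python
import numpy as np

def nmf(V, rank, n_iter=500, seed=0, eps=1e-9):
    """Batch NMF, V ~ W @ H, via multiplicative updates (Frobenius loss).

    Illustrative only: the paper's method is incremental and competitive,
    which this sketch does not attempt to reproduce.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    W = rng.random((n_features, rank)) + eps   # basis vectors (columns)
    H = rng.random((rank, n_frames)) + eps     # per-frame encodings
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def cluster_labels(W, H):
    """Label each frame by its dominant basis vector.

    Rows of H are rescaled by the norm of the matching column of W so that
    coefficients are comparable across components.
    """
    scale = np.linalg.norm(W, axis=0)
    return np.argmax(H * scale[:, None], axis=0)

# Toy data (assumed): 6 frames, 6 features, two "scenes" with disjoint features.
V = np.array([
    [5, 4, 6, 0, 0, 0],
    [5, 4, 6, 0, 0, 0],
    [5, 4, 6, 0, 0, 0],
    [0, 0, 0, 3, 7, 5],
    [0, 0, 0, 3, 7, 5],
    [0, 0, 0, 3, 7, 5],
], dtype=float)

W, H = nmf(V, rank=2)          # rank 2 corresponds to two clusters
labels = cluster_labels(W, H)  # one label per frame (column of V)
print(labels)
```

In this sketch the rank must be supplied by hand, which is exactly the choice the abstract says Comp-INMF handles systematically.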


Pages: 723-739

Page count: 17

