Joint Summarization of Large-scale Collections of Web Images and Videos for Storyline Reconstruction

Cited by: 78
Authors
Kim, Gunhee [1]
Sigal, Leonid [1]
Xing, Eric P. [2]
Affiliations
[1] Disney Res Pittsburgh, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
LASSO
DOI
10.1109/CVPR.2014.538
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we address the problem of jointly summarizing large sets of Flickr images and YouTube videos. Starting from the intuition that the characteristics of the two media types are different yet complementary, we develop a fast and easily parallelizable approach for creating not only high-quality video summaries but also novel structural summaries of online images as storyline graphs. The storyline graphs can illustrate various events or activities associated with the topic in the form of a branching network. The video summarization is achieved by diversity ranking on the similarity graphs between images and video frames. The reconstruction of storyline graphs is formulated as the inference of sparse time-varying directed graphs from a set of photo streams with the assistance of videos. For evaluation, we collect datasets of 20 outdoor activities, consisting of 2.7M Flickr images and 16K YouTube videos. Due to the large-scale nature of our problem, we evaluate our algorithm via crowdsourcing using Amazon Mechanical Turk. In our experiments, we demonstrate that the proposed joint summarization approach outperforms other baselines and our own methods using videos or images only.
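The abstract describes video summarization as diversity ranking on a similarity graph. As a rough illustration of that general idea only, the sketch below uses a simple greedy, MMR-style selection rather than the ranking method used in the paper. The function name `greedy_diverse_ranking`, the `trade_off` parameter, and the toy similarity matrix are all hypothetical.

```python
def greedy_diverse_ranking(similarity, k, trade_off=0.5):
    """Pick up to k item indices from a symmetric similarity matrix.

    Hypothetical stand-in for diversity ranking: each step picks the item
    that is central in the graph but not redundant with earlier picks.
    """
    n = len(similarity)
    # Centrality: average similarity of each item to all items.
    centrality = [sum(row) / n for row in similarity]
    selected = []
    candidates = set(range(n))
    while candidates and len(selected) < k:
        def score(i):
            # Redundancy: highest similarity to anything already selected.
            redundancy = max((similarity[i][j] for j in selected), default=0.0)
            return trade_off * centrality[i] - (1.0 - trade_off) * redundancy
        # Sorting makes tie-breaking deterministic (lowest index wins).
        best = max(sorted(candidates), key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy similarity graph (assumed data): frames 0 and 1 are near-duplicates,
# as are frames 2 and 3.
similarity = [
    [1.0, 0.9, 0.1, 0.2],
    [0.9, 1.0, 0.1, 0.1],
    [0.1, 0.1, 1.0, 0.8],
    [0.2, 0.1, 0.8, 1.0],
]
summary = greedy_diverse_ranking(similarity, k=2)
print(summary)  # [0, 2]
```

On this toy graph the selection takes one frame from each near-duplicate pair, which is the behavior a diversity-aware summary aims for.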
Pages: 4225-4232
Number of pages: 8