Joint Summarization of Large-scale Collections of Web Images and Videos for Storyline Reconstruction

被引:78
|
作者
Kim, Gunhee [1 ]
Sigal, Leonid [1 ]
Xing, Eric P. [2 ]
机构
[1] Disney Res Pittsburgh, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
LASSO;
D O I
10.1109/CVPR.2014.538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we address the problem of jointly summarizing large sets of Flickr images and YouTube videos. Starting from the intuition that the characteristics of the two media types are different yet complementary, we develop a fast and easily-parallelizable approach for creating not only high-quality video summaries but also novel structural summaries of online images as storyline graphs. The storyline graphs can illustrate various events or activities associated with the topic in a form of a branching network. The video summarization is achieved by diversity ranking on the similarity graphs between images and video frames. The reconstruction of storyline graphs is formulated as the inference of sparse time-varying directed graphs from a set of photo streams with assistance of videos. For evaluation, we collect the datasets of 20 outdoor activities, consisting of 2.7M Flickr images and 16K YouTube videos. Due to the large-scale nature of our problem, we evaluate our algorithm via crowdsourcing using Amazon Mechanical Turk. In our experiments, we demonstrate that the proposed joint summarization approach outperforms other baselines and our own methods using videos or images only.
引用
收藏
页码:4225 / 4232
页数:8
相关论文
共 50 条
  • [31] Web tools for large-scale 3D biological images and atlases
    Husz, Zsolt L.
    Burton, Nicholas
    Hill, Bill
    Milyaev, Nestor
    Baldock, Richard A.
    BMC BIOINFORMATICS, 2012, 13
  • [32] Web tools for large-scale 3D biological images and atlases
    Husz, Zsolt L.
    Burton, Nicholas
    Hill, Bill
    Milyaev, Nestor
    Baldock, Richard A.
    BMC Bioinformatics, 2012, 13 (01)
  • [33] Large Scale Learning and Recognition of Faces in Web Videos
    Zhao, Ming
    Yagnik, Jay
    Adam, Hartwig
    Bau, David
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 901 - 907
  • [34] Web Page Harvesting for Automatized Large-scale Digital Images Anomaly Detection
    Kowalczyk, Marcin
    Malanowska, Agnieszka
    Mazurczyk, Wojciech
    Cabaj, Krzysztof
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, ARES 2022, 2022,
  • [35] Improved camouflaged detection in the large-scale images and videos with minimum boundary contrast in detection technique
    Xu, Zhenyu
    Wang, Jinming
    Hu, Fengjun
    Abbas, Ghulam
    Touti, Ezzeddine
    Albekairi, Mohammed
    El-Hamrawy, Osama I.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [36] Improved camouflaged detection in the large-scale images and videos with minimum boundary contrast in detection technique
    Xu, Zhenyu
    Wang, Jinming
    Hu, Fengjun
    Abbas, Ghulam
    Touti, Ezzeddine
    Albekairi, Mohammed
    El-Hamrawy, Osama I.
    Expert Systems with Applications, 2024, 249
  • [37] Large-scale automatic reconstruction of neuronal processes from electron microscopy images
    Kaynig, Verena
    Vazquez-Reina, Amelio
    Knowles-Barley, Seymour
    Roberts, Mike
    Jones, Thouis R.
    Kasthuri, Narayanan
    Miller, Eric
    Lichtman, Jeff
    Pfister, Hanspeter
    MEDICAL IMAGE ANALYSIS, 2015, 22 (01) : 77 - 88
  • [38] Large-Scale Web Page Classification
    Marath, Sathi T.
    Shepherd, Michael
    Milios, Evangelos
    Duffy, Jack
    2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, : 1813 - 1822
  • [39] Large-Scale Web Data Analysis
    Leskovec, Jure
    IEEE INTELLIGENT SYSTEMS, 2011, 26 (01) : 11 - 11
  • [40] Linguistics in large-scale Web search
    Gulla, JA
    Auran, PG
    Risvik, KM
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2002, 2553 : 218 - 222