VIVA: visual information retrieval in video archives

被引:0
|
作者
Markus Mühling
Nikolaus Korfhage
Kader Pustu-Iren
Joanna Bars
Mario Knapp
Hicham Bellafkir
Markus Vogelbacher
Daniel Schneider
Angelika Hörth
Ralph Ewerth
Bernd Freisleben
机构
[1] University of Marburg,Department of Mathematics and Computer Science
[2] TIB – Leibniz Information Centre for Science and Technology,L3S Research Center
[3] Leibniz University Hannover,undefined
[4] German Broadcasting Archive,undefined
关键词
Visual information retrieval; Video mining; Video retrieval; Deep learning; German broadcasting archive;
D O I
暂无
中图分类号
学科分类号
摘要
Video retrieval methods, e.g., for visual concept classification, person recognition, and similarity search, are essential to perform fine-grained semantic search in large video archives. However, such retrieval methods often have to be adapted to the users’ changing search requirements: which concepts or persons are frequently searched for, what research topics are currently important or will be relevant in the future? In this paper, we present VIVA, a software tool for building content-based video retrieval methods based on deep learning models. VIVA allows non-expert users to conduct visual information retrieval for concepts and persons in video archives and to add new people or concepts to the underlying deep learning models as new requirements arise. For this purpose, VIVA provides a novel semi-automatic data acquisition workflow including a web crawler, image similarity search, as well as review and user feedback components to reduce the time-consuming manual effort for collecting training samples. We present experimental retrieval results using VIVA for four use cases in the context of a historical video collection of the German Broadcasting Archive based on about 34,000 h of television recordings from the former German Democratic Republic (GDR). We evaluate the performance of deep learning models built using VIVA for 91 GDR specific concepts and 98 personalities from the former GDR as well as the performance of the image and person similarity search approaches.
引用
收藏
页码:319 / 333
页数:14
相关论文
共 50 条
  • [1] VIVA: visual information retrieval in video archives
    Muehling, Markus
    Korfhage, Nikolaus
    Pustu-Iren, Kader
    Bars, Joanna
    Knapp, Mario
    Bellafkir, Hicham
    Vogelbacher, Markus
    Schneider, Daniel
    Hoerth, Angelika
    Ewerth, Ralph
    Freisleben, Bernd
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2022, 23 (04) : 319 - 333
  • [2] Visual Information Retrieval in Endoscopic Video Archives
    Carlos, Jennifer Roldan
    Lux, Mathias
    Giro-i-Nieto, Xavier
    Munoz, Pia
    Anagnostopoulos, Nektarios
    2015 13TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2015,
  • [3] Video Mining on Historical Footage-A Practical Report Project "Visual Information Search in Video Archives" (VIVA)
    Pustu-Iren, Kader
    Bars, Joanna
    Muehling, Markus
    Korfhage, Nikolaus
    Hoerth, Angelika
    Freisleben, Bernd
    Ewerth, Ralph
    BIBLIOTHEK FORSCHUNG UND PRAXIS, 2020, 44 (03) : 436 - 444
  • [4] Composition and retrieval of visual information for video databases
    Cheng, PJ
    Yang, WP
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2001, 12 (06): : 627 - 656
  • [5] Actor based video indexing and retrieval using visual information
    Islam, Mohammad Khairul
    Lee, Soon-Tak
    Baek, Joong-Hwan
    ADVANCES IN NATURAL COMPUTATION, PT 2, 2006, 4222 : 492 - 501
  • [6] Story Based Video Retrieval using Deep Visual and Textual Information
    Hassan, Muhammad A.
    Saleem, Summra
    Khan, Muhammad Zeeshan
    Khan, Muhammad Usman Ghani
    2019 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING AND DIGITAL SYSTEMS (C-CODE), 2019, : 166 - 171
  • [7] Visual information retrieval
    Gupta, A
    Jain, R
    COMMUNICATIONS OF THE ACM, 1997, 40 (05) : 70 - 79
  • [8] AUTOMATION AND INFORMATION RETRIEVAL IN ARCHIVES - BROAD CONCEPTS
    CAMPBELL, RR
    AMERICAN ARCHIVIST, 1967, 30 (02): : 279 - 286
  • [9] MOTION INFORMATION FOR VIDEO RETRIEVAL
    Tahayna, Bashar
    Belkhatir, Mohammed
    Alhashmi, Saadat
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 870 - 873
  • [10] A visual information retrieval tool
    Zhang, J
    ASIS 2000: PROCEEDINGS OF THE 63RD ASIS ANNUAL MEETING, VOL 37, 2000, 2000, 37 : 248 - 257