Summarizing Lecture Videos by Key Handwritten Content Regions

被引:9
|
作者
Kota, Bhargava Urala [1 ]
Ahmed, Saleem [1 ]
Stone, Alexander [1 ]
Davila, Kenny [1 ]
Setlur, Srirangaraj [1 ]
Govindaraju, Venu [1 ]
机构
[1] SUNY BUFFALO, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
基金
美国国家科学基金会;
关键词
RECOGNITION;
D O I
10.1109/ICDARW.2019.30058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel method for summarization of whiteboard lecture videos using key handwritten content regions. A deep neural network is used for detecting bounding boxes that contain semantically meaningful groups of handwritten content. A neural network embedding is learnt, under triplet loss, from the detected regions in order to discriminate between unique handwritten content. The detected regions along with embeddings at every frame of the lecture video are used to extract unique handwritten content across the video which are presented as the video summary. Additionally, a spatiotemporal index is constructed from the video which records the time and location of each individual summary region in the video which can potentially be used for content-based search and navigation. We train and test our methods on the publicly available AccessMath dataset. We use the DetEval scheme to benchmark our summarization by recall of unique ground truth objects (92.09%) and average number of summary regions (128) compared to the ground truth (88).
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [1] Automated Detection of Handwritten Whiteboard Content in Lecture Videos for Summarization
    Kota, Bhargava Urala
    Davila, Kenny
    Stone, Alexander
    Setlur, Srirangaraj
    Govindaraju, Venu
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 19 - 24
  • [2] LectYS: A System for Summarizing Lecture Videos on YouTube
    Yoo, Taewon
    Jeong, Hyewon
    Lee, Donghwan
    Jung, Hyunggu
    26TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES (IUI '21 COMPANION), 2021, : 90 - 92
  • [3] Generalized framework for summarization of fixed-camera lecture videos by detecting and binarizing handwritten content
    Bhargava Urala Kota
    Kenny Davila
    Alexander Stone
    Srirangaraj Setlur
    Venu Govindaraju
    International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 221 - 233
  • [4] Generalized framework for summarization of fixed-camera lecture videos by detecting and binarizing handwritten content
    Kota, Bhargava Urala
    Davila, Kenny
    Stone, Alexander
    Setlur, Srirangaraj
    Govindaraju, Venu
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (03) : 221 - 233
  • [5] BLACKBOARD CONTENT CLASSIFICATION FOR LECTURE VIDEOS
    Imran, Ali Shariq
    Cheikh, Faouzi Alaya
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [6] Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos
    Medida, Lakshmi Haritha
    Ramani, Kasarapu
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (04) : 336 - 344
  • [7] Summarizing Videos with Attention
    Fajtl, Jiri
    Sokeh, Hajar Sadeghi
    Argyriou, Vasileios
    Monekosso, Dorothy
    Remagnino, Paolo
    COMPUTER VISION - ACCV 2018 WORKSHOPS, 2019, 11367 : 39 - 54
  • [8] Visual Search Engine for Handwritten and Typeset Math in Lecture Videos and LATEX Notes
    Davila, Kenny
    Zanibbi, Richard
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 50 - 55
  • [9] Summarizing Videos by Key frame extraction using SSIM and other Visual Features
    Sandhu, Sharanjeet Kaur
    Agarwal, Anupam
    6TH INTERNATIONAL CONFERENCE ON COMPUTER & COMMUNICATION TECHNOLOGY (ICCCT-2015), 2015, : 209 - 213
  • [10] Pre-Course Key Segment Analysis of Online Lecture Videos
    Che, Xiaoyin
    Staubitz, Thomas
    Yang, Haojin
    Meinel, Christoph
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT), 2016, : 416 - 420