Summarizing Lecture Videos by Key Handwritten Content Regions

Cited by: 9
Authors
Kota, Bhargava Urala [1 ]
Ahmed, Saleem [1 ]
Stone, Alexander [1 ]
Davila, Kenny [1 ]
Setlur, Srirangaraj [1 ]
Govindaraju, Venu [1 ]
Affiliation
[1] SUNY BUFFALO, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
Funding
National Science Foundation (USA);
Keywords
RECOGNITION;
DOI
10.1109/ICDARW.2019.30058
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We introduce a novel method for summarizing whiteboard lecture videos using key handwritten content regions. A deep neural network detects bounding boxes that contain semantically meaningful groups of handwritten content. A neural network embedding is learnt, under triplet loss, from the detected regions in order to discriminate between unique handwritten content. The detected regions, along with their embeddings at every frame of the lecture video, are used to extract the unique handwritten content across the video, which is presented as the video summary. Additionally, a spatiotemporal index is constructed that records the time and location of each individual summary region in the video; this index can potentially be used for content-based search and navigation. We train and test our methods on the publicly available AccessMath dataset. We use the DetEval scheme to benchmark our summarization by recall of unique ground truth objects (92.09%) and by the average number of summary regions (128) compared to the ground truth (88).
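The abstract does not give the details of the embedding objective or of how unique regions are selected. As a rough illustration only, the sketch below shows the standard hinge form of a triplet loss and a simple greedy de-duplication of region embeddings by distance. The Euclidean metric, the `margin` and `threshold` values, and the function names `triplet_loss` and `unique_regions` are all assumptions made for this example, not the authors' implementation.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard hinge-form triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is at least `margin` farther from
    the anchor than the positive is. The margin value is an assumption.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def unique_regions(embeddings, threshold=0.5):
    """Greedy de-duplication (hypothetical helper, not from the paper).

    Keeps the index of an embedding only if it lies farther than
    `threshold` from every embedding already kept.
    """
    kept = []
    for idx, emb in enumerate(embeddings):
        if all(euclidean(emb, embeddings[k]) > threshold for k in kept):
            kept.append(idx)
    return kept


if __name__ == "__main__":
    # Two near-duplicate pairs of toy 2-D embeddings.
    regions = [[0.0, 0.0], [0.1, 0.0], [2.0, 2.0], [2.0, 2.1]]
    print(unique_regions(regions, threshold=0.5))
    print(triplet_loss([0.0, 0.0], [3.0, 4.0], [0.0, 1.0], margin=1.0))
```

In this toy setting, regions whose embeddings fall within the threshold of an already-kept region are treated as repeated detections of the same content; a real system would also need to account for temporal ordering and detection noise.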
Pages: 13-18
Page count: 6