Summarizing Lecture Videos by Key Handwritten Content Regions

被引:9
|
作者
Kota, Bhargava Urala [1 ]
Ahmed, Saleem [1 ]
Stone, Alexander [1 ]
Davila, Kenny [1 ]
Setlur, Srirangaraj [1 ]
Govindaraju, Venu [1 ]
机构
[1] SUNY BUFFALO, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
基金
美国国家科学基金会;
关键词
RECOGNITION;
D O I
10.1109/ICDARW.2019.30058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel method for summarization of whiteboard lecture videos using key handwritten content regions. A deep neural network is used for detecting bounding boxes that contain semantically meaningful groups of handwritten content. A neural network embedding is learnt, under triplet loss, from the detected regions in order to discriminate between unique handwritten content. The detected regions along with embeddings at every frame of the lecture video are used to extract unique handwritten content across the video which are presented as the video summary. Additionally, a spatiotemporal index is constructed from the video which records the time and location of each individual summary region in the video which can potentially be used for content-based search and navigation. We train and test our methods on the publicly available AccessMath dataset. We use the DetEval scheme to benchmark our summarization by recall of unique ground truth objects (92.09%) and average number of summary regions (128) compared to the ground truth (88).
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [31] Automated Summarization of Lecture Videos
    Vimalaksha, Anusha
    Prekash, Abhijit
    Vinay, Siddarth
    Kumar, N. S.
    2018 IEEE TENTH INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR EDUCATION (T4E), 2018, : 126 - 129
  • [32] A RESOURCE ALLOCATION FRAMEWORK FOR SUMMARIZING TEAM SPORT VIDEOS
    Chen, Fan
    De Vleeschouwer, Christophe
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 4349 - 4352
  • [33] Summarizing Rushes Videos by Motion, Object, and Event Understanding
    Wang, Feng
    Ngo, Chong-Wah
    IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) : 76 - 87
  • [34] Cursive Handwritten Segmentation and Recognition for Instructional Videos
    Imran, Ali Shariq
    Chanda, Sukalpa
    Cheikh, Faouzi Alaya
    Franke, Katrin
    Pal, Umapada
    8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 155 - 160
  • [35] SUMMARIZING THE PERFORMANCES OF A BACKGROUND SUBTRACTION ALGORITHM MEASURED ON SEVERAL VIDEOS
    Pierard, S.
    Van Droogenbroeck, M.
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3234 - 3238
  • [36] Probabilistic Skimlets Fusion for Summarizing Multiple Consumer Landmark Videos
    Zhang, Luming
    Gao, Yue
    Hong, Richang
    Hu, Yuxing
    Ji, Rongrong
    Dai, Qionghai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (01) : 40 - 49
  • [37] EMBEDDED SPARSE CODING FOR SUMMARIZING MULTI-VIEW VIDEOS
    Panda, Rameswar
    Das, Abir
    Roy-Chowdhury, Mut K.
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 191 - 195
  • [38] Efficient Anomaly Detection Algorithms for Summarizing Low Quality Videos
    Kwan, Chiman
    Zhou, Jin
    Wang, Zheshen
    Li, Baoxin
    PATTERN RECOGNITION AND TRACKING XXIX, 2018, 10649
  • [39] A NEW APPROACH FOR EXTRACTING AND SUMMARIZING ABNORMAL ACTIVITIES IN SURVEILLANCE VIDEOS
    Zhang, Yihao
    Lin, Weiyao
    Zhang, Guangwei
    Luo, Chuanfei
    Jiang, Dong
    Yao, Chunlian
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,
  • [40] Summarizing egocentric videos using deep features and optimal clustering
    Sahu, Abhimanyu
    Chowdhury, Ananda S.
    NEUROCOMPUTING, 2020, 398 : 209 - 221