Summarizing Lecture Videos by Key Handwritten Content Regions

Cited by: 9

Authors:

Kota, Bhargava Urala [1]

Ahmed, Saleem [1]

Stone, Alexander [1]

Davila, Kenny [1]

Setlur, Srirangaraj [1]

Govindaraju, Venu [1]

Affiliation:

[1] SUNY BUFFALO, Dept Comp Sci & Engn, Buffalo, NY 14260 USA

Source:

2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 8TH INTERNATIONAL WORKSHOP ON CAMERA-BASED DOCUMENT ANALYSIS AND RECOGNITION, VOL 4 | 2019

Funding:

National Science Foundation (USA)

Keywords:

RECOGNITION;

DOI:

10.1109/ICDARW.2019.30058

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We introduce a novel method for summarizing whiteboard lecture videos using key handwritten content regions. A deep neural network detects bounding boxes that contain semantically meaningful groups of handwritten content. A neural network embedding is learned from the detected regions under a triplet loss in order to discriminate between unique handwritten content. The detected regions and their embeddings at every frame of the lecture video are used to extract the unique handwritten content across the video, which is presented as the video summary. Additionally, a spatiotemporal index is constructed from the video; it records the time and location of each individual summary region and can potentially be used for content-based search and navigation. We train and test our methods on the publicly available AccessMath dataset. We use the DetEval scheme to benchmark our summarization by recall of unique ground truth objects (92.09%) and by the average number of summary regions (128) compared to the ground truth (88).
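The abstract names two ideas that a small example may make more concrete: an embedding trained under triplet loss, and a spatiotemporal index of unique regions. The following is only an illustrative sketch, not the authors' implementation. The Euclidean distance, the `margin` and `threshold` values, the greedy first-match rule, and the names `triplet_loss`, `SummaryRegion`, and `build_summary_index` are all assumptions introduced here; the record does not specify any of them.

```python
import math
from dataclasses import dataclass


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is at least `margin` farther
    from the anchor than the positive is.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


@dataclass
class SummaryRegion:
    """One unique content region with where and when it appears (hypothetical layout)."""
    embedding: list
    box: tuple        # (x1, y1, x2, y2) at first appearance
    first_frame: int
    last_frame: int


def build_summary_index(detections, threshold=0.5):
    """Greedily group per-frame detections into unique regions.

    detections: iterable of (frame_index, box, embedding).
    A detection joins the first existing region whose stored embedding
    lies within `threshold`; otherwise it starts a new region.
    """
    index = []
    for frame, box, emb in detections:
        match = None
        for region in index:
            if euclidean(region.embedding, emb) < threshold:
                match = region
                break
        if match is None:
            index.append(SummaryRegion(list(emb), tuple(box), frame, frame))
        else:
            match.last_frame = max(match.last_frame, frame)
    return index
```

Under these assumptions, each entry of the returned list stores a bounding box plus first and last frame numbers, which is the kind of time-and-location record that content-based search and navigation would query.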


Pages: 13-18

Page count: 6

