Retrieving Instructional Video Content from Speech and Text Information

被引：3

作者：

Kothawade, Ashwini Y. ^{[1
]}

Patil, Dipak R. ^{[1
]}

机构：

[1] Amruvahini Coll Engn, Dept Informat Technol, Sangamner, India

来源：

PROCEEDINGS OF THE INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2015, VOL 2 | 2016年 / 439卷

关键词：

OCR; ASR; Video content retrieval; Instructional videos; e-Learning; Tele-lecture; Tesseract OCR; Video lectures indexing;

D O I：

10.1007/978-981-10-0755-2_33

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The interest of today's generation to learn from video lectures is becoming popular due to its considerable advantages and easy availability than classroom learning. To involve into this, many institutes and organizations are using this method for teaching and learning. An enormous amount of data is generated in video lecturing form. To extract the desired information from the desired video from this vast video information available on internet becomes difficult. In this paper, we have used techniques for automatically retrieving the information from video files to collect it as a metadata for those files. For efficient retrieval of text from videos we use the OCR (Optical Character Recognition) tool to extract text from slides and ASR (Automatic Speech Recognition) tool for recognizing information from speech given by the speaker. First, we do segmentation and classification of video frames for identifying the key frames. Then the OCR and ASR tool is used for extracting the information from video slides and audio speech respectively. The collected data can be stored as a metadata for the file. Finally, the search can be made more efficient by applying clustering and ontology concept.

引用

页码：311 / 322

页数：12

共 50 条

[1] Content Based Lecture Video Retrieval Using Speech and Video Text Information
Yang, Haojin
Meinel, Christoph
IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2014, 7 (02): : 142 - 154
[2] Optimized Searching of Video Based On Speech and Video Text Content
Vigneshwari, G.
Juliet, A. Noble Mary
PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORKS SECURITY (ICSNS 2015), 2015,
[3] Instructional video content analysis using audio information
Li, Ying
Dorai, Chitra
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2264 - 2274
[4] A QUERY LANGUAGE FOR RETRIEVING INFORMATION FROM HIERARCHICAL TEXT STRUCTURES
MACLEOD, IA
COMPUTER JOURNAL, 1991, 34 (03): : 254 - 264
[5] Extracting text information for content-based video retrieval
Xu, Lei
Wang, Kongqiao
ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2008, 4903 : 58 - 69
[6] Combining partial information from speech and text
Fogerty, Daniel
Iftikhar, Irraj
Madorskiy, Rachel
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 147 (02): : EL189 - EL195
[7] Browsing and retrieving video content in a unified framework
Rui, Y
Huang, TS
Mehrotra, S
1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 9 - 14
[8] RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
Mdhaffar, Salima
Bonastre, Jean-Francois
Tommasi, Marc
Tomashenko, Natalia
Esteve, Yannick
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6767 - 6771
[9] Genetic Information as Instructional Content
Romano, Daniele
HUMANA MENTE-JOURNAL OF PHILOSOPHICAL STUDIES, 2007, (01): : 47 - 48
[10] Genetic information as instructional content
Stegmann, UE
PHILOSOPHY OF SCIENCE, 2005, 72 (03) : 425 - 443

← 1 2 3 4 5 →