Retrieving Instructional Video Content from Speech and Text Information

被引:3
|
作者
Kothawade, Ashwini Y. [1 ]
Patil, Dipak R. [1 ]
机构
[1] Amruvahini Coll Engn, Dept Informat Technol, Sangamner, India
关键词
OCR; ASR; Video content retrieval; Instructional videos; e-Learning; Tele-lecture; Tesseract OCR; Video lectures indexing;
D O I
10.1007/978-981-10-0755-2_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The interest of today's generation to learn from video lectures is becoming popular due to its considerable advantages and easy availability than classroom learning. To involve into this, many institutes and organizations are using this method for teaching and learning. An enormous amount of data is generated in video lecturing form. To extract the desired information from the desired video from this vast video information available on internet becomes difficult. In this paper, we have used techniques for automatically retrieving the information from video files to collect it as a metadata for those files. For efficient retrieval of text from videos we use the OCR (Optical Character Recognition) tool to extract text from slides and ASR (Automatic Speech Recognition) tool for recognizing information from speech given by the speaker. First, we do segmentation and classification of video frames for identifying the key frames. Then the OCR and ASR tool is used for extracting the information from video slides and audio speech respectively. The collected data can be stored as a metadata for the file. Finally, the search can be made more efficient by applying clustering and ontology concept.
引用
收藏
页码:311 / 322
页数:12
相关论文
共 50 条
  • [31] Retrieving amateur video from a small collection
    Petrelli, D
    Auld, D
    Gurrin, C
    Smeaton, A
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2005, 3652 : 487 - 488
  • [32] Ivia: Interactive Video Intelligent Agent Framework for Instructional Video Information Retrieval
    Khan, Emdad
    AlSalem, Adel
    12TH INTERNATIONAL EDUCATIONAL TECHNOLOGY CONFERENCE - IETC 2012, 2012, 64 : 186 - 191
  • [33] RETRIEVING TEMPORAL INFORMATION FROM MEMORY
    BURROWS, D
    OKADA, R
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1981, 18 (02) : 62 - 62
  • [34] Retrieving Information from Multiple Sources
    Roy, Anurag
    Ghosh, Kripabandhu
    Basu, Moumita
    Gupta, Parth
    Ghosh, Saptarshi
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 43 - 44
  • [35] Retrieving information from scientific periodicals
    Clarke, M
    LEARNED PUBLISHING, 1996, 9 (04) : 219 - 223
  • [36] Spatial-temporal semantic grouping of instructional video content
    Liu, TC
    Kender, JR
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2003, 2728 : 362 - 372
  • [37] Retrieving information from a hierarchical plan
    Schneider, Darryl W.
    Logan, Gordon D.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2007, 33 (06) : 1076 - 1091
  • [38] RETRIEVING INFORMATION FROM KORSAKOFF PATIENTS
    BOLLER, F
    GARDNER, H
    BUTTERS, N
    NEUROLOGY, 1973, 23 (04) : 400 - &
  • [39] Constructing a speech audio–video corpus by aligning long segments of speech and text
    Karpukhin I.A.
    Konushin A.S.
    Moscow University Computational Mathematics and Cybernetics, 2017, 41 (2) : 97 - 103
  • [40] INFORMATION-PROCESSING APPROACH - ORGANIZING INSTRUCTIONAL CONTENT
    CASON, GJ
    EDUCATIONAL TECHNOLOGY, 1975, 15 (10) : 21 - 25