Retrieving Instructional Video Content from Speech and Text Information

被引:3
|
作者
Kothawade, Ashwini Y. [1 ]
Patil, Dipak R. [1 ]
机构
[1] Amruvahini Coll Engn, Dept Informat Technol, Sangamner, India
关键词
OCR; ASR; Video content retrieval; Instructional videos; e-Learning; Tele-lecture; Tesseract OCR; Video lectures indexing;
D O I
10.1007/978-981-10-0755-2_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The interest of today's generation to learn from video lectures is becoming popular due to its considerable advantages and easy availability than classroom learning. To involve into this, many institutes and organizations are using this method for teaching and learning. An enormous amount of data is generated in video lecturing form. To extract the desired information from the desired video from this vast video information available on internet becomes difficult. In this paper, we have used techniques for automatically retrieving the information from video files to collect it as a metadata for those files. For efficient retrieval of text from videos we use the OCR (Optical Character Recognition) tool to extract text from slides and ASR (Automatic Speech Recognition) tool for recognizing information from speech given by the speaker. First, we do segmentation and classification of video frames for identifying the key frames. Then the OCR and ASR tool is used for extracting the information from video slides and audio speech respectively. The collected data can be stored as a metadata for the file. Finally, the search can be made more efficient by applying clustering and ontology concept.
引用
收藏
页码:311 / 322
页数:12
相关论文
共 50 条
  • [41] A Development of Instructional Video for Increasing Learners' Motivation and Content Mastery in Video Learning Environment
    Kaewsa-Ard, Atima
    29TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2021), VOL II, 2021, : 480 - 486
  • [42] Automatic text extraction from video for content-based annotation and retrieval
    Shim, JC
    Dorai, C
    Bolle, R
    FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 618 - 620
  • [43] Constructing explanatory models from text-based information: Why instructional tools help
    James, Katherine
    Goldman, Susan R.
    CONTEMPORARY EDUCATIONAL PSYCHOLOGY, 2020, 63
  • [44] Integrated Video and Text for Content-based Access to Video Databases
    Haitao Jiang
    Danilo Montesi
    Ahmed K. Elmagarmid
    Multimedia Tools and Applications, 1999, 9 : 227 - 249
  • [45] Integrated video and text for content-based access to video databases
    Jiang, HT
    Montesi, D
    Elmagarmid, AK
    MULTIMEDIA TOOLS AND APPLICATIONS, 1999, 9 (03) : 227 - 249
  • [46] RETRIEVING SPEECH SAMPLES WITH SIMILAR EMOTIONAL CONTENT USING A TRIPLET LOSS FUNCTION
    Harvill, John
    AbdelWahab, Mohammed
    Lotfian, Reza
    Busso, Carlos
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7400 - 7404
  • [47] Text information extraction in images and video: a survey
    Jung, K
    Kim, KI
    Jain, AK
    PATTERN RECOGNITION, 2004, 37 (05) : 977 - 997
  • [48] COLOMBIAN DIALECT RECOGNITION BASED ON INFORMATION EXTRACTED FROM SPEECH AND TEXT SIGNALS
    Escobar-Grisales, D.
    Rios-Urrego, C. D.
    Lopez-Santander, D. A.
    Gallo-Aristizabal, J. D.
    Vasquez-Correa, J. C.
    Noeth, E.
    Orozco-Arroyave, J. R.
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 556 - 563
  • [49] Recognising and Retrieving the Meaning of Thirukkural from Speech Utterances
    Bharathi, B.
    Sridevi, G.
    Varshitha, G. J.
    2017 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2017,
  • [50] Video retrieval using speech and image information
    Hauptmann, AG
    Jin, R
    Ng, TD
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2003, 2003, 5021 : 148 - 159