Recognition and Transition Frame Detection of Arabic News Captions for Video Retrieval

被引:0
|
作者
Iwata, Seiya [1 ]
Ohyama, Wataru [1 ]
Wakabayashi, Tetsushi [1 ]
Kimura, Fumitaka [1 ]
机构
[1] Mie Univ, Grad Sch Engn, 1577 Kurimamachiya Cho, Tsu, Mie 5148507, Japan
关键词
OCR; News caption recognition; Arabic word recognition; Combing noise; Video retrieval; Moving news caption; TEXT DETECTION; EXTRACTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The authors have conducted studies on recognizing Arabic news captions to develop a system for video retrieval to index and edit Arabic broadcast programs daily received and stored in big database. This paper describes a dedicated OCR for recognizing low resolution news captions in video images. News caption recognition system consisting of text line extraction, word segmentation and segmentation-recognition of words is developed and the performance was experimentally evaluated using datasets of frame images extracted from AIJazeera broadcasting programs. Character recognition of moving news caption is difficult due to combing noise yielded by the interlacing of scan lines. A technique to detect and eliminate the combing noise to correctly recognize the moving news caption is proposed. This paper also proposes a technique based on inter-frame text difference to detect transition frame of still news captions. The technique to detect transition frames is necessary for efficient video retrieve and play. The proposed technique is experimentally tested and shown to be robust to quick motion of the background and is able to detect the transition frame correctly with the F-measure higher than 90%. When compared with the ABBY FineReader 11 (R) commercial OCR the dedicated OCR improves the recall of the Arabic characters in AIJazeera broadcasting news from 70.74% to 95.85% for non-interlaced moving news captions and from 23.82% to 96.29% for interlaced moving news captions.
引用
收藏
页码:4005 / 4010
页数:6
相关论文
共 50 条
  • [1] Detection and retrieval of captions in news video
    Luo, M
    Bai, XS
    Xu, GG
    VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 233 - 238
  • [2] Recognition and Connection of Moving Captions in Arabic TV News
    Iwata, Seiya
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 163 - 167
  • [3] Caption Detection, Localization and Type Recognition in Arabic News Video
    Zedan, Ibrahim A.
    Elsayed, Khaled M.
    Emary, Eid
    INTERNATIONAL CONFERENCE ON INFORMATICS AND SYSTEMS (INFOS 2016), 2016, : 114 - 120
  • [4] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Sato, T
    Kanade, T
    Hughes, EK
    Smith, MA
    Satoh, S
    MULTIMEDIA SYSTEMS, 1999, 7 (05) : 385 - 395
  • [5] Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames
    Zayene, Oussama
    Touj, Sameh Masmoudi
    Hennebert, Jean
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    JOURNAL OF IMAGING, 2018, 4 (02)
  • [6] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Toshio Sato
    Takeo Kanade
    Ellen K. Hughes
    Michael A. Smith
    Shin'ichi Satoh
    Multimedia Systems, 1999, 7 : 385 - 395
  • [7] Content-based news video retrieval with closed captions and time alignment
    Kim, YT
    Kim, JG
    Chang, HS
    Kang, K
    Kim, J
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 879 - 884
  • [8] Towards Leveraging Closed Captions for News Retrieval
    Blanco, Roi
    De Francisci Morales, Gianmarco
    Silvestri, Fabrizio
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 135 - 136
  • [9] Arabic text detection in news video using RetinaNet
    Manita, Sameh
    Mansouri, Sadek
    Zrigui, Mounir
    Berchech, Salma
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 796 - 803
  • [10] Detection and Recognition of Arabic Text in Video Frames
    Ohyama, Wataru
    Iwata, Seiya
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 7, 2017, : 20 - 24