Video-text extraction and recognition

Cited by: 0
Authors
Chen, TB [1]
Ghosh, D [1]
Ranganath, S [1]
Affiliation
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The detection and recognition of text in video is an important issue in automated content-based indexing of visual information in video archives. In this paper, we present a comprehensive system for extracting and recognizing artificial text from unconstrained, general-purpose videos. Exploiting the temporal feature of videos, an edge-detection-based text segmentation method is applied only to selected frames to extract text from a video scene. Subsequently, a combination of techniques, including multiple frame integration, gray-scale filtering, entropy-based thresholding and line adjacency graphs, is used to enhance the detected text areas. Finally, character recognition is accomplished using the character side profiles. Results obtained from experiments on uncompressed MPEG-I video clips demonstrate the effectiveness of our proposed system.
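The abstract names multiple frame integration and entropy-based thresholding but gives no formulas for either. The following is a minimal sketch under stated assumptions, not the authors' implementation: frames are taken to be already aligned gray-scale images stored as nested lists, integration is assumed to be a per-pixel mean, and the threshold is chosen with a Kapur-style maximum-entropy criterion. The function names `integrate_frames`, `entropy_threshold` and `binarize` are hypothetical.

```python
import math


def integrate_frames(frames):
    """Average aligned gray-scale frames pixel by pixel (assumed integration rule)."""
    n = len(frames)
    rows, cols = len(frames[0]), len(frames[0][0])
    return [[sum(f[r][c] for f in frames) / n for c in range(cols)]
            for r in range(rows)]


def entropy_threshold(image, levels=256):
    """Return the gray level t that maximizes the summed entropy of the two
    classes {0..t} and {t+1..levels-1} (Kapur-style criterion, an assumption)."""
    hist = [0] * levels
    for row in image:
        for v in row:
            hist[min(levels - 1, max(0, int(round(v))))] += 1
    total = sum(hist)
    probs = [h / total for h in hist]

    best_t, best_score = 0, float("-inf")
    cum = 0.0
    for t in range(levels - 1):
        cum += probs[t]
        w0, w1 = cum, 1.0 - cum
        if w0 < 1e-12 or w1 < 1e-12:
            continue  # one class is empty, so the split is not meaningful
        h0 = -sum((p / w0) * math.log(p / w0) for p in probs[:t + 1] if p > 0)
        h1 = -sum((p / w1) * math.log(p / w1) for p in probs[t + 1:] if p > 0)
        if h0 + h1 > best_score:
            best_score, best_t = h0 + h1, t
    return best_t


def binarize(image, t):
    """Mark pixels brighter than t as text (1), everything else as background (0)."""
    return [[1 if v > t else 0 for v in row] for row in image]


if __name__ == "__main__":
    # Toy data: bright text pixels stay fixed while the background fluctuates.
    frames = [
        [[10, 200, 30], [200, 20, 10]],
        [[20, 200, 10], [200, 30, 20]],
        [[30, 200, 20], [200, 10, 30]],
    ]
    merged = integrate_frames(frames)
    t = entropy_threshold(merged)
    print(t, binarize(merged, t))
```

Averaging is only one plausible integration rule; a per-pixel minimum or maximum is another common choice when the text is known to be brighter or darker than a moving background.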
Pages: A319-A322
Number of pages: 4
Related papers
50 in total
  • [41] Robust Video-Text Retrieval Via Noisy Pair Calibration
    Zhang, Huaiwen
    Yang, Yang
    Qi, Fan
    Qian, Shengsheng
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8632 - 8645
  • [42] SEMANTIC-PRESERVING METRIC LEARNING FOR VIDEO-TEXT RETRIEVAL
    Choo, Sungkwon
    Ha, Seong Jong
    Lee, Joonsoo
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2388 - 2392
  • [43] Video-Text Representation Learning via Differentiable Weak Temporal Alignment
    Ko, Dohwan
    Choi, Joonmyung
    Ko, Juyeon
    Noh, Shinyeong
    On, Kyoung-Woon
    Kim, Eun-Sol
    Kim, Hyunwoo J.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5006 - 5015
  • [44] Exposing the Limits of Video-Text Models through Contrast Sets
    Park, Jae Sung
    Shen, Sheng
    Farhadi, Ali
    Darrell, Trevor
    Choi, Yejin
    Rohrbach, Anna
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3574 - 3586
  • [45] STACKED CONVOLUTIONAL DEEP ENCODING NETWORK FOR VIDEO-TEXT RETRIEVAL
    Zhao, Rui
    Zheng, Kecheng
    Zha, Zheng-jun
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [46] Improving Transformer with Dynamic Convolution and Shortcut for Video-Text Retrieval
    Liu, Zhi
    Cai, Jincen
    Zhang, Mengmeng
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (07): : 2407 - 2424
  • [47] Unified Coarse-to-Fine Alignment for Video-Text Retrieval
    Wang, Ziyang
    Sung, Yi-Lin
    Cheng, Feng
    Bertasius, Gedas
    Bansal, Mohit
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2804 - 2815
  • [48] ActBERT: Learning Global-Local Video-Text Representations
    Zhu, Linchao
    Yang, Yi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 8743 - 8752
  • [49] Extraction of text regions and recognition of characters from video inputs
    Kim, JR
    Moon, YS
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 767 - 774
  • [50] LOOK, TELL AND MATCH: REFINING VIDEO-TEXT RETRIEVAL WITH SEMANTIC INFORMATION
    Zhu Jinkuan
    Hu Weiyi
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,