Video-text extraction and recognition

被引:0
|
作者
Chen, TB [1 ]
Ghosh, D [1 ]
Ranganath, S [1 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection and recognition of text from video is an important issue in automated content-based indexing of visual information in video archives. In this paper, we present a comprehensive system for extracting and recognizing artificial text from unconstrained, general-purpose videos. Exploiting the temporal,feature of videos, an edge-detection-based text segmentation method is applied only on selective frames for extracting text from a video scene. Subsequently, a combination of techniques including multiple frame integration, gray-scale filtering, entropy-based thresholding and line adjacency graphs is used to enhance the detected text areas. Finally, character recognition is accomplished by using the character side profiles. Results obtained from experiments on uncompressed MPEG-I video clips demonstrate the effectiveness of our proposed system.
引用
收藏
页码:A319 / A322
页数:4
相关论文
共 50 条
  • [21] Video Question Answering with Iterative Video-Text Co-tokenization
    Piergiovanni, A. J.
    Morton, Kairo
    Kuo, Weicheng
    Ryoo, Michael S.
    Angelova, Anelia
    COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 76 - 94
  • [22] Bridging Video-text Retrieval with Multiple Choice Questions
    Ge, Yuying
    Ge, Yixiao
    Liu, Xihui
    Li, Dian
    Shan, Ying
    Qie, Xiaohu
    Luo, Ping
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16146 - 16155
  • [23] Video text extraction from images for character recognition
    Amarapur, Basavaraj
    Patil, Nagaraj
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 95 - +
  • [24] Guided Graph Attention Learning for Video-Text Matching
    Li, Kunpeng
    Liu, Chang
    Stopa, Mike
    Amano, Jun
    Fu, Yun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
  • [25] Survey on Video-Text Cross-Modal Retrieval
    Chen, Lei
    Xi, Yimeng
    Liu, Libo
    Computer Engineering and Applications, 2024, 60 (04) : 1 - 20
  • [26] SViTT: Temporal Learning of Sparse Video-Text Transformers
    Li, Yi
    Min, Kyle
    Tripathi, Subarna
    Vasconcelos, Nuno
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18919 - 18929
  • [27] HANet: Hierarchical Alignment Networks for Video-Text Retrieval
    Wu, Peng
    He, Xiangteng
    Tang, Mingqian
    Lv, Yiliang
    Liu, Jing
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3518 - 3527
  • [28] Dual Encoding Integrating Key Frame Extraction for Video-text Cross-modal Entity Resolution
    Zeng Z.
    Cao J.
    Weng N.
    Jiang G.
    Fan Q.
    Binggong Xuebao/Acta Armamentarii, 2022, 43 (05): : 1107 - 1116
  • [29] Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
    Lin, Chengzhi
    Wu, Ancong
    Liang, Junwei
    Zhang, Jun
    Ge, Wenhang
    Zheng, Wei-Shi
    Shen, Chunhua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [30] Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval
    Hao, Xiaoshuai
    Zhang, Wanqian
    Wu, Dayan
    Zhu, Fei
    Li, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18962 - 18972