A Unified Video Text Detection Method with Network Flow

被引:7
|
作者
Yang, Xue-Hang [1 ,2 ]
He, Wenhao [1 ,2 ]
Yin, Fei [1 ]
Liu, Cheng-Lin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
基金
中国国家自然科学基金;
关键词
IMAGES;
D O I
10.1109/ICDAR.2017.62
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection in videos has many application needs but has drawn less attention than that in images. Existing methods for video text detection perform unsatisfactorily because of the insufficient utilization of spatial and temporal information. In this paper, we propose a novel video text detection method with network flow based tracking. The system first applies a newly proposed Fully Convolutional Neural Network (FCN) based scene text detection method [1] to detect texts in individual frames and then track proposals in adjacent frames with a motion-based method. Next, the text association problem is formulated into a cost-flow network and text trajectories are derived from the network with a min-cost flow algorithm. At last, the trajectories are post-processed to improve the precision accuracy. The method can detect multi-oriented scene text in videos and incorporate spatial and temporal information efficiently. Experimental results show that the method improves the detection performance remarkably on benchmark datasets, e.g., by a 15.66% increase of ATA (Average Tracking Accuracy) on ICDAR video scene text dataset.
引用
收藏
页码:331 / 336
页数:6
相关论文
共 50 条
  • [21] A method for automatic score box detection and text recognition in soccer video
    Tabii, Youness
    Thami, Rachid Oulad Haj
    International Review on Computers and Software, 2009, 4 (02) : 188 - 191
  • [22] An automatic video text detection method based on BP-adaboost
    Hui Wu
    Bei-ji Zou
    Yu-qian Zhao
    Hong-pu Fu
    Multimedia Tools and Applications, 2016, 75 : 7715 - 7738
  • [23] A method of Text Event Detection and Image Enhancement based on Aerial Video
    Sun, Guangmin
    Liu, Xiaopeng
    Zhao, Dequn
    ISIE: 2009 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, 2009, : 307 - 310
  • [24] An efficient method for text detection in video based on stroke width similarity
    Dinh, Viet Cuong
    Chun, Seong Soo
    Cha, Scungwook
    Ryu, Hanjin
    Sull, Sanghoon
    COMPUTER VISION - ACCV 2007, PT I, PROCEEDINGS, 2007, 4843 : 200 - 209
  • [25] Feature flow: In-network feature flow estimation for video object detection
    Jin, Ruibing
    Lin, Guosheng
    Wen, Changyun
    Wang, Jianliang
    Liu, Fayao
    PATTERN RECOGNITION, 2022, 122
  • [26] Flow driven attention network for video salient object detection
    Zhou, Feng
    Shuai, Hui
    Liu, Qingshan
    Guo, Guodong
    IET IMAGE PROCESSING, 2020, 14 (06) : 997 - 1004
  • [27] Video Text Detection Based on Text Edge Map
    Huang, Xiaodong
    Wang, Qin
    Zhu, Lishang
    Liu, Kehua
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1003 - 1007
  • [28] A Unified Framework for Multioriented Text Detection and Recognition
    Yao, Cong
    Bai, Xiang
    Liu, Wenyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (11) : 4737 - 4749
  • [29] Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation
    Liu, Jiawei
    Wang, Weining
    Chen, Sihan
    Zhu, Xinxin
    Liu, Jing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 141 - 153
  • [30] A Video Text Detection and Tracking System
    Yusufu, Tuoerhongjiang
    Wang, Yiqing
    Fang, Xiangzhong
    2013 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2013, : 522 - 529