A Unified Video Text Detection Method with Network Flow

被引:7
|
作者
Yang, Xue-Hang [1 ,2 ]
He, Wenhao [1 ,2 ]
Yin, Fei [1 ]
Liu, Cheng-Lin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
IMAGES;
D O I
10.1109/ICDAR.2017.62
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection in videos has many application needs but has drawn less attention than that in images. Existing methods for video text detection perform unsatisfactorily because of the insufficient utilization of spatial and temporal information. In this paper, we propose a novel video text detection method with network flow based tracking. The system first applies a newly proposed Fully Convolutional Neural Network (FCN) based scene text detection method [1] to detect texts in individual frames and then track proposals in adjacent frames with a motion-based method. Next, the text association problem is formulated into a cost-flow network and text trajectories are derived from the network with a min-cost flow algorithm. At last, the trajectories are post-processed to improve the precision accuracy. The method can detect multi-oriented scene text in videos and incorporate spatial and temporal information efficiently. Experimental results show that the method improves the detection performance remarkably on benchmark datasets, e.g., by a 15.66% increase of ATA (Average Tracking Accuracy) on ICDAR video scene text dataset.
引用
收藏
页码:331 / 336
页数:6
相关论文
共 50 条
  • [1] An Improved Method of Video Text Detection
    Liu, Hua-ying
    Yuan, Wei
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [2] Text Flow: A Unified Text Detection System in Natural Scene Images
    Tian, Shangxuan
    Pan, Yifeng
    Huang, Chang
    Lu, Shijian
    Yu, Kai
    Tan, Chew Lim
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4651 - 4659
  • [3] Video Text Detection with Text Edges and Convolutional Neural Network
    Hu, Ping
    Wang, Weiqiang
    Lu, Ke
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 675 - 679
  • [4] A Unified Deep Neural Network for Scene Text Detection
    Li, Yixin
    Ma, Jinwen
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 101 - 112
  • [5] A Video Text Detection Method Based on Key Text Points
    Li, Zhi
    Liu, Guizhong
    Qian, Xueming
    Wang, Chen
    Ma, Yana
    Yang, Yang
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 284 - 295
  • [6] A Novel Video Image Text Detection Method
    Zhou, Lin
    Ping, Xijian
    Gao, Haolin
    Xu, Sen
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (04): : 1140 - 1152
  • [7] A Novel Video Image Text Detection Method
    Zhou, Lin
    Ping, Xijian
    Gao, Haolin
    Xu, Sen
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (03): : 941 - 953
  • [8] VIDEO TEXT DETECTION WITH FULLY CONVOLUTIONAL NETWORK AND TRACKING
    Wang, Yang
    Wang, Lan
    Su, Feng
    Shi, Jiahao
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1738 - 1743
  • [9] A Robust Symmetry-based Method for Scene/Video Text Detection Through Neural Network
    Wu, Yirui
    Wang, Wenhai
    Palaiahnakote, Shivakumara
    Lu, Tong
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1249 - 1254
  • [10] Method for text detection in video based on homogeneity mapping
    Huang, Jianhua
    Wu, Rui
    Liu, Jiafeng
    Tang, Xianglong
    Gaojishu Tongxin/Chinese High Technology Letters, 2007, 17 (03): : 249 - 254