A Unified Video Text Detection Method with Network Flow

被引:7
|
作者
Yang, Xue-Hang [1 ,2 ]
He, Wenhao [1 ,2 ]
Yin, Fei [1 ]
Liu, Cheng-Lin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
基金
中国国家自然科学基金;
关键词
IMAGES;
D O I
10.1109/ICDAR.2017.62
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection in videos has many application needs but has drawn less attention than that in images. Existing methods for video text detection perform unsatisfactorily because of the insufficient utilization of spatial and temporal information. In this paper, we propose a novel video text detection method with network flow based tracking. The system first applies a newly proposed Fully Convolutional Neural Network (FCN) based scene text detection method [1] to detect texts in individual frames and then track proposals in adjacent frames with a motion-based method. Next, the text association problem is formulated into a cost-flow network and text trajectories are derived from the network with a min-cost flow algorithm. At last, the trajectories are post-processed to improve the precision accuracy. The method can detect multi-oriented scene text in videos and incorporate spatial and temporal information efficiently. Experimental results show that the method improves the detection performance remarkably on benchmark datasets, e.g., by a 15.66% increase of ATA (Average Tracking Accuracy) on ICDAR video scene text dataset.
引用
收藏
页码:331 / 336
页数:6
相关论文
共 50 条
  • [31] Thresholding video images for text detection
    Du, EY
    Chang, CI
    Thouin, PD
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 919 - 922
  • [32] Multiresolution text detection in video frames
    Anthimopoulos, Marios
    Gatos, Basilis
    Pratikakis, Ioannis
    VISAPP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOLUME IU/MTSV, 2007, : 161 - +
  • [33] Text Localization and Detection for News Video
    Song, Yu
    Wang, Wenhong
    ICIC 2009: SECOND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTING SCIENCE, VOL 2, PROCEEDINGS: IMAGE ANALYSIS, INFORMATION AND SIGNAL PROCESSING, 2009, : 98 - 101
  • [34] A new approach for video text detection
    Cai, M
    Song, JQ
    Lyu, MR
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 117 - 120
  • [35] VIDEO FRAMES TEXT DETECTION THROUGH BAYESIAN CLASSIFICATION AND BOUNDARY GROWING METHOD
    Nancy, A.
    Jayapriya, D.
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [36] A unified text extraction method for instructional videos
    Tang, LJ
    Kender, JR
    2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 2893 - 2896
  • [37] Automatic text detection in video frames based on bootstrap artificial neural network and CED
    Yan, H
    Zhang, Y
    Hou, ZG
    Tan, M
    WSCG'2003, VOL 11, NO 2, CONFERENCE PROCEEDINGS, 2003, : 226 - 231
  • [38] Video Detection Algorithm Using an Optical Flow Calculation Method
    Glowacz, Andrzej
    Mikrut, Zbigniew
    Pawlik, Piotr
    MULTIMEDIA COMMUNICATIONS, SERVICES AND SECURITY, 2012, 287 : 118 - +
  • [39] FOTS: Fast Oriented Text Spotting with a Unified Network
    Liu, Xuebo
    Liang, Ding
    Yan, Shi
    Chen, Dagui
    Qiao, Yu
    Yan, Junjie
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5676 - 5685
  • [40] Video Scene Text Frames Categorization for Text Detection and Recognition
    Qin, Longfei
    Shivakumara, Palaiahnakote
    Lu, Tong
    Pal, Umapada
    Tan, Chew Lim
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891