Video-text extraction and recognition

被引：0

作者：

Chen, TB ^{[1
]}

Ghosh, D ^{[1
]}

Ranganath, S ^{[1
]}

机构：

[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore

来源：

TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING | 2004年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The detection and recognition of text from video is an important issue in automated content-based indexing of visual information in video archives. In this paper, we present a comprehensive system for extracting and recognizing artificial text from unconstrained, general-purpose videos. Exploiting the temporal,feature of videos, an edge-detection-based text segmentation method is applied only on selective frames for extracting text from a video scene. Subsequently, a combination of techniques including multiple frame integration, gray-scale filtering, entropy-based thresholding and line adjacency graphs is used to enhance the detected text areas. Finally, character recognition is accomplished by using the character side profiles. Results obtained from experiments on uncompressed MPEG-I video clips demonstrate the effectiveness of our proposed system.

引用

页码：A319 / A322

页数：4

共 50 条

[41] Robust Video-Text Retrieval Via Noisy Pair Calibration
Zhang, Huaiwen
Yang, Yang
Qi, Fan
Qian, Shengsheng
Xu, Changsheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8632 - 8645
[42] SEMANTIC-PRESERVING METRIC LEARNING FOR VIDEO-TEXT RETRIEVAL
Choo, Sungkwon
Ha, Seong Jong
Lee, Joonsoo
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2388 - 2392
[43] Video-Text Representation Learning via DifferentiableWeak Temporal Alignment
Ko, Dohwan
Choi, Joonmyung
Ko, Juyeon
Noh, Shinyeong
On, Kyoung-Woon
Kim, Eun-Sol
Kim, Hyunwoo J.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5006 - 5015
[44] Exposing the Limits of Video-Text Models through Contrast Sets
Park, Jae Sung
Shen, Sheng
Farhadi, Ali
Darrell, Trevor
Choi, Yejin
Rohrbach, Anna
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3574 - 3586
[45] STACKED CONVOLUTIONAL DEEP ENCODING NETWORK FOR VIDEO-TEXT RETRIEVAL
Zhao, Rui
Zheng, Kecheng
Zha, Zheng-jun
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[46] Improving Transformer with Dynamic Convolution and Shortcut for Video-Text Retrieval
Liu, Zhi
Cai, Jincen
Zhang, Mengmeng
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (07): : 2407 - 2424
[47] Unified Coarse-to-Fine Alignment for Video-Text Retrieval
Wang, Ziyang
Sung, Yi-Lin
Cheng, Feng
Bertasius, Gedas
Bansal, Mohit
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2804 - 2815
[48] ActBERT: Learning Global-Local Video-Text Representations
Zhu, Linchao
Yang, Yi
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 8743 - 8752
[49] Extraction of text regions and recognition of characters from video inputs
Kim, JR
Moon, YS
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 767 - 774
[50] LOOK, TELL AND MATCH: REFINING VIDEO-TEXT RETRIEVAL WITH SEMANTIC INFORMATION
Zhu Jinkuan
Hu Weiyi
2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,

← 1 2 3 4 5 →