Video-text extraction and recognition

被引：0

作者：

Chen, TB ^{[1
]}

Ghosh, D ^{[1
]}

Ranganath, S ^{[1
]}

机构：

[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore

来源：

TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING | 2004年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The detection and recognition of text from video is an important issue in automated content-based indexing of visual information in video archives. In this paper, we present a comprehensive system for extracting and recognizing artificial text from unconstrained, general-purpose videos. Exploiting the temporal,feature of videos, an edge-detection-based text segmentation method is applied only on selective frames for extracting text from a video scene. Subsequently, a combination of techniques including multiple frame integration, gray-scale filtering, entropy-based thresholding and line adjacency graphs is used to enhance the detected text areas. Finally, character recognition is accomplished by using the character side profiles. Results obtained from experiments on uncompressed MPEG-I video clips demonstrate the effectiveness of our proposed system.

引用

页码：A319 / A322

页数：4

共 50 条

[21] Video Question Answering with Iterative Video-Text Co-tokenization
Piergiovanni, A. J.
Morton, Kairo
Kuo, Weicheng
Ryoo, Michael S.
Angelova, Anelia
COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 76 - 94
[22] Bridging Video-text Retrieval with Multiple Choice Questions
Ge, Yuying
Ge, Yixiao
Liu, Xihui
Li, Dian
Shan, Ying
Qie, Xiaohu
Luo, Ping
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16146 - 16155
[23] Video text extraction from images for character recognition
Amarapur, Basavaraj
Patil, Nagaraj
2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 95 - +
[24] Guided Graph Attention Learning for Video-Text Matching
Li, Kunpeng
Liu, Chang
Stopa, Mike
Amano, Jun
Fu, Yun
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
[25] Survey on Video-Text Cross-Modal Retrieval
Chen, Lei
Xi, Yimeng
Liu, Libo
Computer Engineering and Applications, 2024, 60 (04) : 1 - 20
[26] SViTT: Temporal Learning of Sparse Video-Text Transformers
Li, Yi
Min, Kyle
Tripathi, Subarna
Vasconcelos, Nuno
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18919 - 18929
[27] HANet: Hierarchical Alignment Networks for Video-Text Retrieval
Wu, Peng
He, Xiangteng
Tang, Mingqian
Lv, Yiliang
Liu, Jing
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3518 - 3527
[28] Dual Encoding Integrating Key Frame Extraction for Video-text Cross-modal Entity Resolution
Zeng Z.
Cao J.
Weng N.
Jiang G.
Fan Q.
Binggong Xuebao/Acta Armamentarii, 2022, 43 (05): : 1107 - 1116
[29] Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
Lin, Chengzhi
Wu, Ancong
Liang, Junwei
Zhang, Jun
Ge, Wenhang
Zheng, Wei-Shi
Shen, Chunhua
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[30] Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval
Hao, Xiaoshuai
Zhang, Wanqian
Wu, Dayan
Zhu, Fei
Li, Bo
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18962 - 18972

← 1 2 3 4 5 →