Video-text extraction and recognition

被引：0

作者：

Chen, TB ^{[1
]}

Ghosh, D ^{[1
]}

Ranganath, S ^{[1
]}

机构：

[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117548, Singapore

来源：

TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING | 2004年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The detection and recognition of text from video is an important issue in automated content-based indexing of visual information in video archives. In this paper, we present a comprehensive system for extracting and recognizing artificial text from unconstrained, general-purpose videos. Exploiting the temporal,feature of videos, an edge-detection-based text segmentation method is applied only on selective frames for extracting text from a video scene. Subsequently, a combination of techniques including multiple frame integration, gray-scale filtering, entropy-based thresholding and line adjacency graphs is used to enhance the detected text areas. Finally, character recognition is accomplished by using the character side profiles. Results obtained from experiments on uncompressed MPEG-I video clips demonstrate the effectiveness of our proposed system.

引用

页码：A319 / A322

页数：4

共 50 条

[1] Activity Recognition applications from Contextual Video-Text Fusion
Levchuk, Georgiy
Shabarekh, Charlotte
2015 IEEE WINTER APPLICATIONS AND COMPUTER VISION WORKSHOPS (WACVW), 2015, : 1 - 3
[2] WILL VIDEO-TEXT SYSTEMS TRAVEL WELL
不详
ELECTRONICS, 1978, 51 (19): : 24 - 24
[3] Video Text Extraction and Recognition: A Survey
Pooja
Dhir, Renu
PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 1366 - 1373
[4] Alignment of Image-Text and Video-Text Datasets
Ozkose, Yunus Emre
Gokce, Zeynep
Duygulu, Pinar
2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
[5] Learning Video-Text Aligned Representations for Video Captioning
Shi, Yaya
Xu, Haiyang
Yuan, Chunfeng
Li, Bing
Hu, Weiming
Zha, Zheng-Jun
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
[6] A NOVEL CONVOLUTIONAL ARCHITECTURE FOR VIDEO-TEXT RETRIEVAL
Li, Zheng
Guo, Caili
Yang, Bo
Feng, Zerun
Zhang, Hao
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[7] Multi-event Video-Text Retrieval
Zhang, Gengyuan
Ren, Jisen
Gu, Jindong
Tresp, Volker
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22056 - 22066
[8] Deep learning for video-text retrieval: a review
Zhu, Cunjuan
Jia, Qi
Chen, Wei
Guo, Yanming
Liu, Yu
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (01)
[9] Progressive Semantic Matching for Video-Text Retrieval
Liu, Hongying
Luo, Ruyi
Shang, Fanhua
Niu, Mantang
Liu, Yuanyuan
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5083 - 5091
[10] A Framework for Video-Text Retrieval with Noisy Supervision
Vaseqi, Zahra
Fan, Pengnan
Clark, James
Levine, Martin
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 373 - 383

← 1 2 3 4 5 →