Automatic Video Captioning via Multi-channel Sequential Encoding

被引:2
|
作者
Zhang, Chenyang [1 ]
Tian, Yingli [1 ]
机构
[1] CUNY City Coll, Dept Elect Engn, New York, NY 10031 USA
关键词
Video captioning; Long-short-term-memory; Sequential encoding; American Sign Language;
D O I
10.1007/978-3-319-48881-3_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel two-stage video captioning framework composed of (1) a multi-channel video encoder and (2) a sentence-generating language decoder. Both of the encoder and decoder are based on recurrent neural networks with long-short-term-memory cells. Our system can take videos of arbitrary lengths as input. Compared with the previous sequence-to-sequence video captioning frameworks, the proposed model is able to handle multiple channels of video representations and jointly learn how to combine them. The proposed model is evaluated on two large-scale movie datasets (MPII Corpus and Montreal Video Description) and one YouTube dataset (Microsoft Video Description Corpus) and achieves the state-of-the-art performances. Furthermore, we extend the proposed model towards automatic American Sign Language recognition. To evaluate the performance of our model on this novel application, a new dataset for ASL video description is collected based on YouTube videos. Results on this dataset indicate that the proposed framework on ASL recognition is promising and will significantly benefit the independent communication between ASL users and others.
引用
收藏
页码:146 / 161
页数:16
相关论文
共 50 条
  • [21] Multi-channel video impulse radar for landmine detection
    Yarovoy, A
    Schukin, A
    Kaploun, I
    Ligthart, L
    DETECTION AND REMEDIATION TECHNOLOGIES FOR MINES AND MINELIKE TARGETS VI, PTS 1 AND 2, 2001, 4394 : 662 - 670
  • [22] Multi-channel video streaming server for surveillance systems
    Wijnhoven, RGJ
    Jaspers, EGT
    de With, PHN
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, PROCEEDINGS, 2004, : 353 - 358
  • [23] Multi-channel Automatic Calibration System of Pressure Sensor
    Jin Wanyu
    Zuo Siran
    Sun Dehui
    Wang Zhongyu
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 506 - 510
  • [24] Multi-channel digital automatic ultrasonic detecting system
    Tang, JJ
    Ni, QZ
    Wang, YF
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 2572 - 2574
  • [25] Moving Target Detection in Multi-Channel Quantum Video
    Yan, Fei
    Iliyasu, Abdullah M.
    Khan, Asif R.
    Yang, Huamin
    2015 IEEE 9TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING (WISP), 2015, : 136 - 140
  • [26] Embedded Multi-Channel Video Encoder Based on DSP
    Yao Chunlian
    Li Wei
    Meng Qinglei
    Gao Lihua
    SEC 2008: PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL SYMPOSIUM ON EMBEDDED COMPUTING, 2008, : 124 - +
  • [27] A multi-Channel Video Multiplexer with High Isolation Switch
    Huang Xiaozong
    Liu Luncai
    Huang Wengang
    Huang Zhihua
    2013 IEEE INTERNATIONAL CONFERENCE OF ELECTRON DEVICES AND SOLID-STATE CIRCUITS (EDSSC), 2013,
  • [28] MULTI-CHANNEL, AUTOMATIC SEALED CONTACT TEST SET
    GHAEL, PR
    THOMAS, GL
    WESTERN ELECTRIC ENGINEER, 1972, 16 (01): : 24 - &
  • [29] On utilizing multi-channel to provide scheduled video delivery
    Lin, CS
    Chang, TY
    Hsieh, JR
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 817 - 820
  • [30] Construction of a Multi-Channel Video Wireless Transmission System
    Chen, Qizhou
    Jun, Wang
    Zhou, Runnan
    2015 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND INTELLIGENT CONTROL (ISIC 2015), 2015, : 197 - 200