Clockwork Convnets for Video Semantic Segmentation

被引:124
|
作者
Shelhamer, Evan [1 ]
Rakelly, Kate [1 ]
Hoffman, Judy [1 ]
Darrell, Trevor [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
10.1007/978-3-319-49409-8_69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have seen tremendous progress in still-image segmentation; however the naive application of these state-of-the-art algorithms to every video frame requires considerable computation and ignores the temporal continuity inherent in video. We propose a video recognition framework that relies on two key observations: (1) while pixels may change rapidly from frame to frame, the semantic content of a scene evolves more slowly, and (2) execution can be viewed as an aspect of architecture, yielding purpose-fit computation schedules for networks. We define a novel family of "clockwork" convnets driven by fixed or adaptive clock signals that schedule the processing of different layers at different update rates according to their semantic stability. We design a pipeline schedule to reduce latency for real-time recognition and a fixed-rate schedule to reduce overall computation. Finally, we extend clockwork scheduling to adaptive video processing by incorporating data-driven clocks that can be tuned on unlabeled video. The accuracy and efficiency of clockwork convnets are evaluated on the Youtube-Objects, NYUD, and Cityscapes video datasets.
引用
收藏
页码:852 / 868
页数:17
相关论文
共 50 条
  • [1] Entire Deformable ConvNets for semantic segmentation
    Yu, Bingqi
    Jiao, Licheng
    Liu, Xu
    Li, Lingling
    Liu, Fang
    Yang, Shuyuan
    Tang, Xu
    KNOWLEDGE-BASED SYSTEMS, 2022, 250
  • [2] Handling Missing Annotations for Semantic Segmentation with Deep ConvNets
    Petit, Olivier
    Thome, Nicolas
    Charnoz, Arnaud
    Hostettler, Alexandre
    Soler, Luc
    DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, DLMIA 2018, 2018, 11045 : 20 - 28
  • [3] Spatiotemporal Semantic Video Segmentation
    Galmar, E.
    Athanasiadis, Th
    Huet, B.
    Avrithis, Y.
    2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 578 - +
  • [4] Semantic Segmentation Facilitates Semantic Communication in Surveillance Video
    Ma, Wenbo
    Xie, Yu
    Wang, Congyan
    Zheng, Kaipeng
    Chen, Mingkai
    2024 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA, ICCC, 2024,
  • [5] SEMANTIC SEGMENTATION OF VEGETATION IMAGES ACQUIRED BY UNMANNED AERIAL VEHICLES USING AN ENSEMBLE OF CONVNETS
    Nogueira, Keiller
    dos Santos, Jefersson A.
    Cancian, Leonardo
    Borges, Bruno D.
    Silva, Thiago S. F.
    Morellato, Leonor Patricia
    Torres, Ricardo da S.
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 3787 - 3790
  • [6] Deep Video Dehazing With Semantic Segmentation
    Ren, Wenqi
    Zhang, Jingang
    Xu, Xiangyu
    Ma, Lin
    Cao, Xiaochun
    Meng, Gaofeng
    Liu, Wei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1895 - 1908
  • [7] Semantic video scene segmentation and transfer
    Gritti, Tommaso
    Damkat, Chris
    Monaci, Gianluca
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 122 : 172 - 181
  • [8] A pothole video dataset for semantic segmentation
    Ihsan, Muhammad
    Amrizal, Muhammad Alfian
    Harjoko, Agus
    DATA IN BRIEF, 2024, 53
  • [9] Semantic segmentation and description for video transcoding
    Cavallaro, A
    Steiger, O
    Ebrahimi, T
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 597 - 600
  • [10] Semantic color extraction and semantic shot segmentation for soccer video
    Niu Z.-X.
    Li J.
    Gao X.-B.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2010, 37 (04): : 613 - 618