Clockwork Convnets for Video Semantic Segmentation

Cited by: 124

Authors:

Shelhamer, Evan [1]

Rakelly, Kate [1]

Hoffman, Judy [1]

Darrell, Trevor [1]

Affiliations:

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

Source:

COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III | 2016 / Vol. 9915

Keywords:

DOI:

10.1007/978-3-319-49409-8_69

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Recent years have seen tremendous progress in still-image segmentation; however, the naive application of these state-of-the-art algorithms to every video frame requires considerable computation and ignores the temporal continuity inherent in video. We propose a video recognition framework that relies on two key observations: (1) while pixels may change rapidly from frame to frame, the semantic content of a scene evolves more slowly, and (2) execution can be viewed as an aspect of architecture, yielding purpose-fit computation schedules for networks. We define a novel family of "clockwork" convnets driven by fixed or adaptive clock signals that schedule the processing of different layers at different update rates according to their semantic stability. We design a pipeline schedule to reduce latency for real-time recognition and a fixed-rate schedule to reduce overall computation. Finally, we extend clockwork scheduling to adaptive video processing by incorporating data-driven clocks that can be tuned on unlabeled video. The accuracy and efficiency of clockwork convnets are evaluated on the Youtube-Objects, NYUD, and Cityscapes video datasets.
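The fixed-rate schedule described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the class name `ClockworkPipeline`, the scalar "features", and the sum used to fuse stage outputs are all assumptions made for illustration. The only idea taken from the abstract is that each stage has its own clock, so stages with slower clocks reuse a cached output on frames where they are not scheduled to run.

```python
from typing import Callable, List, Optional, Sequence

# Hypothetical stand-in for a network stage: maps one feature value to another.
Stage = Callable[[float], float]


class ClockworkPipeline:
    """Toy fixed-rate clockwork schedule (illustrative assumption, not the paper's code).

    Stage i is recomputed only on frames where t % periods[i] == 0;
    on all other frames its cached output is reused.
    """

    def __init__(self, stages: Sequence[Stage], periods: Sequence[int]) -> None:
        if len(stages) != len(periods):
            raise ValueError("need one clock period per stage")
        if any(p < 1 for p in periods):
            raise ValueError("clock periods must be positive integers")
        self.stages = list(stages)
        self.periods = list(periods)
        self.cache: List[Optional[float]] = [None] * len(self.stages)
        self.calls = [0] * len(self.stages)  # how often each stage actually ran

    def step(self, t: int, frame: float) -> float:
        x = frame
        for i, (stage, period) in enumerate(zip(self.stages, self.periods)):
            # Run the stage if its clock fires, or if nothing is cached yet.
            if self.cache[i] is None or t % period == 0:
                self.cache[i] = stage(x)
                self.calls[i] += 1
            x = self.cache[i]
        # Assumed fusion: sum the (fresh or cached) outputs of all stages,
        # so fresh shallow outputs still affect frames where deep stages are skipped.
        return sum(self.cache)


if __name__ == "__main__":
    # The shallow stage updates every frame, the deep stage every second frame.
    pipeline = ClockworkPipeline([lambda x: x + 1, lambda x: x * 2], periods=[1, 2])
    outputs = [pipeline.step(t, float(t)) for t in range(4)]
    print(outputs, pipeline.calls)
```

In this sketch the saving shows up in `calls`: over four frames the slow stage runs half as often as the fast one. An adaptive clock, as mentioned in the abstract, would replace the `t % period == 0` test with a data-driven decision; the form of that decision is not specified here.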

Pages: 852-868

Page count: 17
