Cross Pixel Optical-Flow Similarity for Self-supervised Learning

被引：24

作者：

Mahendran, Aravindh ^{[1
]}

Thewlis, James ^{[1
]}

Vedaldi, Andrea ^{[1
]}

机构：

[1] Univ Oxford, Visual Geometry Grp, Oxford, England

来源：

COMPUTER VISION - ACCV 2018, PT V | 2019年 / 11365卷

基金：

英国工程与自然科学研究理事会;

关键词：

Self-supervised learning; Motion; Convolutional neural network;

D O I：

10.1007/978-3-030-20873-8_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel method for learning convolutional neural image representations without manual supervision. We use motion cues in the form of optical-flow, to supervise representations of static images. The obvious approach of training a network to predict flow from a single image can be needlessly difficult due to intrinsic ambiguities in this prediction task. We instead propose a much simpler learning goal: embed pixels such that the similarity between their embeddings matches that between their optical-flow vectors. At test time, the learned deep network can be used without access to video or flow information and transferred to tasks such as image classification, detection, and segmentation. Our method, which significantly simplifies previous attempts at using motion for self-supervision, achieves state-of-the-art results in self-supervision using motion cues, and is overall state of the art in self-supervised pre-training for semantic image segmentation, as demonstrated on standard benchmarks.

引用

页码：99 / 116

页数：18

共 50 条

[41] Self-supervised learning based on Transformer for flow reconstruction and prediction
Xu, Bonan
Zhou, Yuanye
Bian, Xin
PHYSICS OF FLUIDS, 2024, 36 (02)
[42] SELF-SUPERVISED LEARNING OF OPTICAL FLOW, DEPTH, CAMERA POSE AND RIGIDITY SEGMENTATION WITH OCCLUSION HANDLING
Abdein, Rokia
Xiang, Xuezhi
Lv, Ning
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 6 - 10
[43] Similarity contrastive estimation for image and video soft contrastive self-supervised learning
Denize, Julien
Rabarisoa, Jaonary
Orcesi, Astrid
Herault, Romain
MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
[44] A New Self-supervised Method for Supervised Learning
Yang, Yuhang
Ding, Zilin
Cheng, Xuan
Wang, Xiaomin
Liu, Ming
INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155
[45] Similarity contrastive estimation for image and video soft contrastive self-supervised learning
Julien Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
Machine Vision and Applications, 2023, 34
[46] SKILL: SIMILARITY-AWARE KNOWLEDGE DISTILLATION FOR SPEECH SELF-SUPERVISED LEARNING
Zampierin, Luca
Hacene, Ghouthi Boukli
Nguyen, Bac
Ravanelli, Mirco
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 675 - 679
[47] SAM: Self-Supervised Learning of Pixel-Wise Anatomical Embeddings in Radiological Images
Yan, Ke
Cai, Jinzheng
Jin, Dakai
Miao, Shun
Guo, Dazhou
Harrison, Adam P.
Tang, Youbao
Xiao, Jing
Lu, Jingjing
Lu, Le
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (10) : 2658 - 2669
[48] Improving sub-pixel accuracy in ultrasound localization microscopy using supervised and self-supervised deep learning
Zhang, Zeng
Hwang, Misun
Kilbaugh, Todd J.
Katz, Joseph
MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (04)
[49] Reverse Optical Flow for Self-Supervised Adaptive Autonomous Robot Navigation
A. Lookingbill
J. Rogers
D. Lieb
J. Curry
S. Thrun
International Journal of Computer Vision, 2007, 74 : 287 - 302
[50] SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION
Khare, Aparna
Parthasarathy, Srinivas
Sundaram, Shiva
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 381 - 388

← 1 2 3 4 5 →