Cross Pixel Optical-Flow Similarity for Self-supervised Learning

Cited by: 24
Authors
Mahendran, Aravindh [1]
Thewlis, James [1]
Vedaldi, Andrea [1]
Affiliation
[1] Univ Oxford, Visual Geometry Grp, Oxford, England
Source
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
Self-supervised learning; Motion; Convolutional neural network;
DOI
10.1007/978-3-030-20873-8_7
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a novel method for learning convolutional neural image representations without manual supervision. We use motion cues, in the form of optical flow, to supervise representations of static images. The obvious approach of training a network to predict flow from a single image can be needlessly difficult because of the intrinsic ambiguities in this prediction task. We instead propose a much simpler learning goal: embed pixels such that the similarity between their embeddings matches that between their optical-flow vectors. At test time, the learned deep network can be used without access to video or flow information and transferred to tasks such as image classification, detection, and segmentation. Our method, which significantly simplifies previous attempts at using motion for self-supervision, achieves state-of-the-art results in self-supervision using motion cues, and is overall state of the art in self-supervised pre-training for semantic image segmentation, as demonstrated on standard benchmarks.
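The learning goal stated in the abstract (pixel embeddings whose pairwise similarities match those of the corresponding optical-flow vectors) can be illustrated with a small NumPy sketch. The abstract does not specify the similarity measure, the normalization, or the loss, so everything below is an assumption made for illustration: cosine similarity, a row-wise softmax with a hypothetical `temperature` parameter, and a cross-entropy between the two resulting distributions. The function names are likewise hypothetical and are not taken from the paper.

```python
import numpy as np


def cosine_similarity_matrix(x, eps=1e-8):
    """Pairwise cosine similarity between the rows of x; returns shape (N, N)."""
    unit = x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)
    return unit @ unit.T


def row_softmax(scores):
    """Numerically stable softmax applied to each row."""
    shifted = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(shifted)
    return weights / weights.sum(axis=1, keepdims=True)


def similarity_matching_loss(embeddings, flows, temperature=0.1):
    """Cross-entropy between flow-similarity and embedding-similarity rows.

    embeddings: (N, D) array, one embedding per sampled pixel.
    flows:      (N, 2) array, one optical-flow vector per sampled pixel.
    Assumption: cosine similarity + softmax; the paper may use other kernels.
    """
    target = row_softmax(cosine_similarity_matrix(flows) / temperature)
    predicted = row_softmax(cosine_similarity_matrix(embeddings) / temperature)
    return float(-(target * np.log(predicted + 1e-12)).sum(axis=1).mean())


# Toy data: six sampled pixels with random 2-D flow vectors.
rng = np.random.default_rng(0)
flows = rng.normal(size=(6, 2))

# Zero-padding the flows gives embeddings with identical pairwise cosine
# similarities, so these embeddings match the flow similarities exactly.
aligned = np.hstack([flows, np.zeros((6, 3))])
random_embeddings = rng.normal(size=(6, 5))

loss_aligned = similarity_matching_loss(aligned, flows)
loss_random = similarity_matching_loss(random_embeddings, flows)
print(f"aligned: {loss_aligned:.4f}  random: {loss_random:.4f}")
```

Under these assumptions the loss is smallest when the embedding similarities reproduce the flow similarities, which is why the zero-padded embeddings score no worse than the random ones. At test time only the embedding network would be needed, consistent with the abstract's statement that no video or flow input is required.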
Pages: 99-116
Page count: 18