SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

被引:0
|
作者
Zhou, Shili [1 ]
He, Ruian [1 ]
Tan, Weimin [1 ]
Yan, Bo [1 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Shanghai, Peoples R China
基金
上海市自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Flow Estimation aims to find the 2D dense motion field between two frames. Due to the limitation of model structures and training datasets, existing methods often rely too much on local clues and ignore the integrity of objects, resulting in fragmented motion estimation. Through theoretical analysis, we find the pre-trained large vision models are helpful in optical flow estimation, and we notice that the recently famous Segment Anything Model (SAM) demonstrates a strong ability to segment complete objects, which is suitable for solving the fragmentation problem. We thus propose a solution to embed the frozen SAM image encoder into FlowFormer to enhance object perception. To address the challenge of in-depth utilizing SAM in non-segmentation tasks like optical flow estimation, we propose an Optical Flow Task-Specific Adaption scheme, including a Context Fusion Module to fuse the SAM encoder with the optical flow context encoder, and a Context Adaption Module to adapt the SAM features for optical flow task with Learned Task-Specific Embedding. Our proposed SAMFlow model reaches 0.86/2.10 clean/final EPE and 3.55/12.32 EPE/F1-all on Sintel and KITTI-15 training set, surpassing Flowformer by 8.5%/9.9% and 13.2%/16.3%. Furthermore, our model achieves state-of-the-art performance on the Sintel and KITTI-15 benchmarks, ranking #1 among all two-frame methods on Sintel clean pass.
引用
收藏
页码:7695 / 7703
页数:9
相关论文
共 50 条
  • [41] Automated galaxy sizes in Euclid images using the Segment Anything Model
    Vega-Ferrero, J.
    Buitrago, F.
    Fernandez-Iglesias, J.
    Raji, S.
    Sahelices, B.
    Sanchez, H. Dominguez
    ASTRONOMY & ASTROPHYSICS, 2025, 693
  • [42] EyeSAM: Unveiling the Potential of Segment Anything Model in Ophthalmic Image Segmentation
    da Silva, Alan Sousa
    Naik, Gunjan
    Bagga, Pallavi
    Soornro, Taha
    Reis, Ana P. Ribeiro
    Zhang, Gongyu
    Waisberg, Ethan
    Kandakji, Lynn
    Liu, Siyin
    Fu, Dun Jack
    Woof, Wiliam
    Moghul, Ismail
    Balaskas, Konstantinos
    Pontikos, Nikolas
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [43] Tooth segmentation on multimodal images using adapted segment anything model
    Peijuan Wang
    Hanjie Gu
    Yuliang Sun
    Scientific Reports, 15 (1)
  • [44] An efficient fine tuning strategy of segment anything model for polyp segmentation
    Mingyan Wang
    Cun Xu
    Kefeng Fan
    Scientific Reports, 15 (1)
  • [45] Evaluation and Improvement of Segment Anything Model for Interactive Histopathology Image Segmentation
    Kim, SeungKyu
    Oh, Hyun-Jic
    Min, Seonghui
    Jeong, Won-Ki
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023 WORKSHOPS, 2023, 14393 : 245 - 255
  • [46] Data Efficiency of Segment Anything Model for Optic Disc and Cup Segmentation
    Yii, Fabian
    MacGillivray, Tom
    Bernabeu, Miguel O.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2023 WORKSHOPS, 2023, 14394 : 336 - 346
  • [47] Enhancing Agricultural Image Segmentation with an Agricultural Segment Anything Model Adapter
    Li, Yaqin
    Wang, Dandan
    Yuan, Cao
    Li, Hao
    Hu, Jing
    SENSORS, 2023, 23 (18)
  • [48] Knowledge Distillation with Segment Anything (SAM) Model for Planetary Geological Mapping
    Julka, Sahib
    Granitzer, Michael
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I, 2024, 14505 : 68 - 77
  • [49] DeSAM: Decoupled Segment Anything Model for Generalizable Medical Image Segmentation
    Gao, Yifan
    Xia, Wei
    Hu, Dingdu
    Wang, Wenkui
    Gao, Xin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, 2024, 15012 : 509 - 519
  • [50] SamSLAM: A Visual SLAM Based on Segment Anything Model for Dynamic Environment
    Chen, Xianhao
    Wang, Tengyue
    Mai, Haonan
    Yang, Liangjing
    2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, : 91 - 97