SAMFlow: Eliminating Any Fragmentation in Optical Flow with Segment Anything Model

被引:0
|
作者
Zhou, Shili [1 ]
He, Ruian [1 ]
Tan, Weimin [1 ]
Yan, Bo [1 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Shanghai, Peoples R China
基金
上海市自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Flow Estimation aims to find the 2D dense motion field between two frames. Due to the limitation of model structures and training datasets, existing methods often rely too much on local clues and ignore the integrity of objects, resulting in fragmented motion estimation. Through theoretical analysis, we find the pre-trained large vision models are helpful in optical flow estimation, and we notice that the recently famous Segment Anything Model (SAM) demonstrates a strong ability to segment complete objects, which is suitable for solving the fragmentation problem. We thus propose a solution to embed the frozen SAM image encoder into FlowFormer to enhance object perception. To address the challenge of in-depth utilizing SAM in non-segmentation tasks like optical flow estimation, we propose an Optical Flow Task-Specific Adaption scheme, including a Context Fusion Module to fuse the SAM encoder with the optical flow context encoder, and a Context Adaption Module to adapt the SAM features for optical flow task with Learned Task-Specific Embedding. Our proposed SAMFlow model reaches 0.86/2.10 clean/final EPE and 3.55/12.32 EPE/F1-all on Sintel and KITTI-15 training set, surpassing Flowformer by 8.5%/9.9% and 13.2%/16.3%. Furthermore, our model achieves state-of-the-art performance on the Sintel and KITTI-15 benchmarks, ranking #1 among all two-frame methods on Sintel clean pass.
引用
收藏
页码:7695 / 7703
页数:9
相关论文
共 50 条
  • [1] Segment and Recognize Anything at Any Granularity
    Li, Feng
    Zhang, Hao
    Sun, Peize
    Zou, Xueyan
    Liu, Shilong
    Li, Chunyuan
    Yang, Jianwei
    Zhang, Lei
    Gao, Jianfeng
    COMPUTER VISION - ECCV 2024, PT XLVIII, 2025, 15106 : 467 - 484
  • [2] Detect Any Shadow: Segment Anything for Video Shadow Detection
    Wang, Yonghui
    Zhou, Wengang
    Mao, Yunyao
    Li, Houqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3782 - 3794
  • [3] MeSAM: Multiscale Enhanced Segment Anything Model for Optical Remote Sensing Images
    Zhou, Xichuan
    Liang, Fu
    Chen, Lihui
    Liu, Haijun
    Song, Qianqian
    Vivone, Gemine
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [4] Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping
    Li, Wenwen
    Hsu, Chia-Yu
    Wang, Sizhe
    Yang, Yezhou
    Lee, Hyunho
    Liljedahl, Anna
    Witharana, Chandi
    Yang, Yili
    Rogers, Brendan M.
    Arundel, Samantha T.
    Jones, Matthew B.
    McHenry, Kenton
    Solis, Patricia
    REMOTE SENSING, 2024, 16 (05)
  • [5] Segment anything model for medical images?
    Huang, Yuhao
    Yang, Xin
    Liu, Lian
    Zhou, Han
    Chang, Ao
    Zhou, Xinrui
    Chen, Rusi
    Yu, Junxuan
    Chen, Jiongquan
    Chen, Chaoyu
    Liu, Sijing
    Chi, Haozhe
    Hu, Xindi
    Yue, Kejuan
    Li, Lei
    Grau, Vicente
    Fan, Deng-Ping
    Dong, Fajin
    Ni, Dong
    MEDICAL IMAGE ANALYSIS, 2024, 92
  • [6] Matte anything: Interactive natural image matting with segment anything model
    Yao, Jingfeng
    Wang, Xinggang
    Ye, Lang
    Liu, Wenyu
    IMAGE AND VISION COMPUTING, 2024, 147
  • [7] BubSAM: Bubble segmentation and shape reconstruction based on Segment Anything Model of bubbly flow
    Xu, Haohan
    Feng, Xin
    Pu, Yuqi
    Wang, Xiaoyue
    Huang, Dingwang
    Zhang, Weipeng
    Duan, Xiaoxia
    Chen, Jie
    Yang, Chao
    AICHE JOURNAL, 2024, 70 (12)
  • [8] Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization
    Lai, Yingxin
    Luo, Zhiming
    Yu, Zitong
    BIOMETRIC RECOGNITION, CCBR 2023, 2023, 14463 : 180 - 190
  • [9] Explain Any Concept: Segment Anything Meets Concept-Based Explanation
    Sun, Ao
    Ma, Pingchuan
    Yuan, Yuanyuan
    Wang, Shuai
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Zero-shot moving ship segmentation based on segment anything network and optical flow network
    Liu, Wenhui
    Qiao, Yulong
    Xing, Zhengyi
    Zhao, Yue
    ELECTRONICS LETTERS, 2025, 61 (01)