SFGAN: Unsupervised Generative Adversarial Learning of 3D Scene Flow from the 3D Scene Self

被引:12
|
作者
Wang, Guangming [1 ]
Jiang, Chaokang [2 ]
Shen, Zehang [1 ]
Miao, Yanzi [2 ]
Wang, Hesheng [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Engn Res Ctr Intelligent Control & Manag, Key Lab Marine Intelligent Equipment & Syst,Minis, Key Lab Syst Control & Informat Proc,Dept Automat, Shanghai 200240, Peoples R China
[2] China Univ Min & Technol, Engn Res Ctr Intelligent Control Underground Spac, Minist Educ, Sch Informat & Control Engn,Adv Robot Res Ctr, Xuzhou 221116, Jiangsu, Peoples R China
关键词
3D point clouds; generative adversarial network; scene flow estimation; soft correspondence; unsupervised learning;
D O I
10.1002/aisy.202100197
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene flow tracks the 3D motion of each point in adjacent point clouds. It provides fundamental 3D motion perception for autonomous driving and server robot. Although red green blue depth (RGBD) camera or light detection and ranging (LiDAR) capture discrete 3D points in space, the objects and motions usually are continuous in the macroworld. That is, the objects keep themselves consistent as they flow from the current frame to the next frame. Based on this insight, the generative adversarial networks (GAN) is utilized to self-learn 3D scene flow without ground truth. The fake point cloud is synthesized from the predicted scene flow and the point cloud of the first frame. The adversarial training of the generator and discriminator is realized through synthesizing indistinguishable fake point cloud and discriminating the real point cloud and the synthesized fake point cloud. The experiments on Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI) dataset show that our method realizes promising results. Just as human, the proposed method can identify the similar local structures of two adjacent frames even without knowing the ground truth scene flow. Then, the local correspondence can be correctly estimated, and further the scene flow is correctly estimated. An interactive preprint version of the article can be found here: .
引用
收藏
页数:10
相关论文
共 50 条
  • [21] A Generative Model for 3D Urban Scene Understanding from Movable Platforms
    Geiger, Andreas
    Lauer, Martin
    Urtasun, Raquel
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [22] What Matters for 3D Scene Flow Network
    Wang, Guangming
    Hu, Yunzhe
    Liu, Zhe
    Zhou, Yiyang
    Tomizuka, Masayoshi
    Zhan, Wei
    Wang, Hesheng
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13693 LNCS : 38 - 55
  • [23] JOINT 3D ESTIMATION OF VEHICLES AND SCENE FLOW
    Menze, M.
    Heipke, C.
    Geiger, A.
    ISPRS GEOSPATIAL WEEK 2015, 2015, II-3 (W5): : 427 - 434
  • [24] What Matters for 3D Scene Flow Network
    Wang, Guangming
    Hu, Yunzhe
    Liu, Zhe
    Zhou, Yiyang
    Tomizuka, Masayoshi
    Zhan, Wei
    Wang, Hesheng
    COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 : 38 - 55
  • [25] Rigid Scene Flow for 3D LiDAR Scans
    Dewan, Ayush
    Caselitz, Tim
    Tipaldi, Gian Diego
    Burgard, Wolfram
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 1765 - 1770
  • [26] Learning to Exploit Stability for 3D Scene Parsing
    Du, Yilun
    Liu, Zhijian
    Basevi, Hector
    Leonardis, Ales
    Freeman, William T.
    Tenenbaum, Joshua B.
    Wu, Jiajun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [27] Spatially Invariant Unsupervised 3D Object-Centric Learning and Scene Decomposition
    Wang, Tianyu
    Liu, Miaomiao
    Ng, Kee Siong
    COMPUTER VISION, ECCV 2022, PT XXIII, 2022, 13683 : 120 - 135
  • [28] Deep Scene Flow Learning: From 2D Images to 3D Point Clouds
    Harbin Engineering University, School of Information and Communication Engineering, Heilongjiang, Harbin
    150001, China
    不详
    150001, China
    不详
    ON
    K1N 6N5, Canada
    IEEE Trans Pattern Anal Mach Intell, 2024, 1 (185-208):
  • [29] Deep Scene Flow Learning: From 2D Images to 3D Point Clouds
    Xiang, Xuezhi
    Abdein, Rokia
    Li, Wei
    El Saddik, Abdulmotaleb
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (01) : 185 - 208
  • [30] Automatic 3D object placement for 3D scene generation
    Akazawa, Y
    Okada, Y
    Niijima, K
    MODELLING AND SIMULATION 2003, 2003, : 316 - 318