Highly Interactive Self-Supervised Learning for Multi-Modal Trajectory Prediction

被引:0
|
作者
Xie, Wenda [1 ]
Liu, Yahui [1 ]
Zhao, Hongxia [1 ]
Guo, Chao [1 ]
Dai, Xingyuan [1 ,2 ]
Lv, Yisheng [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] Changan Univ, Minist Educ, Engn Res Ctr Highway Infrastruct Digitalizat, Xian 710064, Peoples R China
来源
IFAC PAPERSONLINE | 2024年 / 58卷 / 10期
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Automatic driving; Self-supervised learning; Trajectory prediction; Deep learning; Intelligent Transportation;
D O I
10.1016/j.ifacol.2024.07.345
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To ensure the safety of autonomous vehicles, trajectory prediction is critical as it enables vehicles to anticipate the movements of surrounding agents, thereby facilitating the planning of secure and strategic driving routes. However, striking a trade-off between predictive accuracy and training costs has always been an intricate challenge. This paper introduces a groundbreaking framework for trajectory prediction known as Highly Interactive Self-Supervised Learning (HI-SSL), a methodology based on self-supervised learning (SSL) that has yet to be thoroughly investigated in the realm of trajectory prediction. The cornerstone of the aforementioned framework is Interactive Masking, which leverages a novel trajectory masking strategy facilitating self-supervised learning tasks that not only enhance prediction accuracy but also eliminate the need for manual annotations. Experiments conducted on the Argoverse motion forecasting dataset demonstrate that our approach achieves competitive performance to prior methods that depend on supervised learning without additional annotation costs. Copyright (C) 2024 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页码:231 / 236
页数:6
相关论文
共 50 条
  • [1] Self-Supervised Distilled Learning for Multi-modal Misinformation Identification
    Mu, Michael
    Das Bhattacharjee, Sreyasee
    Yuan, Junsong
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2818 - 2827
  • [2] SELF-SUPERVISED LEARNING OF MULTI-MODAL COOPERATION FOR SAR DESPECKLING
    Gaya, Victor
    Dalsasso, Emanuele
    Denis, Loic
    Tupin, Florence
    Pinel-Puyssegur, Beatrice
    Guerin, Cyrielle
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 2180 - 2183
  • [3] Self-Supervised Entity Alignment Based on Multi-Modal Contrastive Learning
    Bo Liu
    Ruoyi Song
    Yuejia Xiang
    Junbo Du
    Weijian Ruan
    Jinhui Hu
    IEEE/CAA Journal of Automatica Sinica, 2022, 9 (11) : 2031 - 2033
  • [4] Multi-modal Food Recommendation Using Clustering and Self-supervised Learning
    Zhang, Yixin
    Zhou, Xin
    Meng, Qianwen
    Zhu, Fanglin
    Xu, Yonghui
    Shen, Zhiqi
    Cui, Lizhen
    PRICAI 2024: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2025, 15281 : 269 - 281
  • [5] Self-Supervised Entity Alignment Based on Multi-Modal Contrastive Learning
    Liu, Bo
    Song, Ruoyi
    Xiang, Yuejia
    Du, Junbo
    Ruan, Weijian
    Hu, Jinhui
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (11) : 2031 - 2033
  • [6] Self-Supervised Multi-Modal Learning for Collaborative Robotic Grasp-Throw
    Hou, Yanxu
    Fang, Zihan
    Li, Jun
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4250 - 4256
  • [7] Self-supervised multi-modal fusion network for multi-modal thyroid ultrasound image diagnosis
    Xiang, Zhuo
    Zhuo, Qiuluan
    Zhao, Cheng
    Deng, Xiaofei
    Zhu, Ting
    Wang, Tianfu
    Jiang, Wei
    Lei, Baiying
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [8] Self-supervised Multi-Modal Video Forgery Attack Detection
    Zhao, Chenhui
    Li, Xiang
    Younes, Rabih
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [9] Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning
    Ye, Yiwen
    Xie, Yutong
    Zhang, Jianpeng
    Chen, Ziyang
    Wu, Qi
    Xia, Yong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 11114 - 11124
  • [10] Self-supervised opinion summarization with multi-modal knowledge graph
    Lingyun Jin
    Jingqiang Chen
    Journal of Intelligent Information Systems, 2024, 62 : 191 - 208