Highly Interactive Self-Supervised Learning for Multi-Modal Trajectory Prediction

Cited by: 0
Authors
Xie, Wenda [1]
Liu, Yahui [1]
Zhao, Hongxia [1]
Guo, Chao [1]
Dai, Xingyuan [1, 2]
Lv, Yisheng [1, 3]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] Changan Univ, Minist Educ, Engn Res Ctr Highway Infrastruct Digitalizat, Xian 710064, Peoples R China
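Source
IFAC PAPERSONLINE | 2024 / Vol. 58 / Issue 10
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China;
Keywords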
Automatic driving; Self-supervised learning; Trajectory prediction; Deep learning; Intelligent Transportation;
DOI
10.1016/j.ifacol.2024.07.345
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
To ensure the safety of autonomous vehicles, trajectory prediction is critical as it enables vehicles to anticipate the movements of surrounding agents, thereby facilitating the planning of secure and strategic driving routes. However, striking a trade-off between predictive accuracy and training costs has always been an intricate challenge. This paper introduces a framework for trajectory prediction known as Highly Interactive Self-Supervised Learning (HI-SSL), a methodology based on self-supervised learning (SSL), which has yet to be thoroughly investigated in the realm of trajectory prediction. The cornerstone of the framework is Interactive Masking, a novel trajectory masking strategy that supports self-supervised learning tasks which not only enhance prediction accuracy but also eliminate the need for manual annotations. Experiments conducted on the Argoverse motion forecasting dataset demonstrate that our approach achieves performance competitive with prior methods that depend on supervised learning, without additional annotation costs. Copyright (C) 2024 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/).
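The abstract does not spell out how Interactive Masking selects what to hide, so the following is only a generic sketch of the broader idea it builds on: hiding part of an observed trajectory and scoring a model on how well it reconstructs the hidden part, so that the training signal comes from the data itself rather than from manual labels. Everything here is an assumption made for illustration, not the authors' method: the function names (`mask_trajectory`, `linear_fill`, `reconstruction_loss`), the uniform random choice of hidden timesteps, the `mask_ratio` parameter, and the linear interpolation that stands in for a learned network.

```python
import random


def mask_trajectory(traj, mask_ratio=0.5, seed=None):
    """Hide a random subset of timesteps (hypothetical uniform masking).

    traj is a list of (x, y) points. Returns the masked copy, where hidden
    points are replaced by None, and the sorted list of hidden indices.
    At least one point is always left visible.
    """
    if len(traj) < 2:
        raise ValueError("need at least two points to mask one and keep one")
    rng = random.Random(seed)
    n = len(traj)
    k = min(n - 1, max(1, int(round(n * mask_ratio))))
    hidden = sorted(rng.sample(range(n), k))
    hidden_set = set(hidden)
    masked = [None if t in hidden_set else pt for t, pt in enumerate(traj)]
    return masked, hidden


def linear_fill(masked):
    """Toy stand-in for a learned model: interpolate between visible points.

    Hidden points before the first or after the last visible point are
    filled by copying the nearest visible point.
    """
    visible = [t for t, pt in enumerate(masked) if pt is not None]
    filled = list(masked)
    for t, pt in enumerate(masked):
        if pt is not None:
            continue
        prev = max((v for v in visible if v < t), default=None)
        nxt = min((v for v in visible if v > t), default=None)
        if prev is None:
            filled[t] = masked[nxt]
        elif nxt is None:
            filled[t] = masked[prev]
        else:
            w = (t - prev) / (nxt - prev)
            (x0, y0), (x1, y1) = masked[prev], masked[nxt]
            filled[t] = (x0 + w * (x1 - x0), y0 + w * (y1 - y0))
    return filled


def reconstruction_loss(pred, traj, hidden):
    """Mean squared displacement error, computed over hidden timesteps only."""
    total = 0.0
    for t in hidden:
        dx = pred[t][0] - traj[t][0]
        dy = pred[t][1] - traj[t][1]
        total += dx * dx + dy * dy
    return total / len(hidden)


if __name__ == "__main__":
    # A straight-line trajectory; the target is the trajectory itself.
    traj = [(float(t), 2.0 * t) for t in range(10)]
    masked, hidden = mask_trajectory(traj, mask_ratio=0.5, seed=0)
    pred = linear_fill(masked)
    print("hidden timesteps:", hidden)
    print("reconstruction loss:", reconstruction_loss(pred, traj, hidden))
```

The loss is computed only over the hidden timesteps, which is the usual convention in masked-reconstruction objectives: the visible points are given to the model, so scoring them would reward copying. In a real pipeline the interpolation would be replaced by a trainable encoder and decoder, and a strategy such as the paper's Interactive Masking would replace the uniform random choice.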
Pages: 231-236
Page count: 6