Highly Interactive Self-Supervised Learning for Multi-Modal Trajectory Prediction

被引：0

作者：

Xie, Wenda ^{[1
]}

Liu, Yahui ^{[1
]}

Zhao, Hongxia ^{[1
]}

Guo, Chao ^{[1
]}

Dai, Xingyuan ^{[1
,2
]}

Lv, Yisheng ^{[1
,3
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China

[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China

[3] Changan Univ, Minist Educ, Engn Res Ctr Highway Infrastruct Digitalizat, Xian 710064, Peoples R China

来源：

IFAC PAPERSONLINE | 2024年 / 58卷 / 10期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Automatic driving; Self-supervised learning; Trajectory prediction; Deep learning; Intelligent Transportation;

D O I：

10.1016/j.ifacol.2024.07.345

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To ensure the safety of autonomous vehicles, trajectory prediction is critical as it enables vehicles to anticipate the movements of surrounding agents, thereby facilitating the planning of secure and strategic driving routes. However, striking a trade-off between predictive accuracy and training costs has always been an intricate challenge. This paper introduces a groundbreaking framework for trajectory prediction known as Highly Interactive Self-Supervised Learning (HI-SSL), a methodology based on self-supervised learning (SSL) that has yet to be thoroughly investigated in the realm of trajectory prediction. The cornerstone of the aforementioned framework is Interactive Masking, which leverages a novel trajectory masking strategy facilitating self-supervised learning tasks that not only enhance prediction accuracy but also eliminate the need for manual annotations. Experiments conducted on the Argoverse motion forecasting dataset demonstrate that our approach achieves competitive performance to prior methods that depend on supervised learning without additional annotation costs. Copyright (C) 2024 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/)

引用

页码：231 / 236

页数：6

共 50 条

[1] Self-Supervised Distilled Learning for Multi-modal Misinformation Identification
Mu, Michael
Das Bhattacharjee, Sreyasee
Yuan, Junsong
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2818 - 2827
[2] SELF-SUPERVISED LEARNING OF MULTI-MODAL COOPERATION FOR SAR DESPECKLING
Gaya, Victor
Dalsasso, Emanuele
Denis, Loic
Tupin, Florence
Pinel-Puyssegur, Beatrice
Guerin, Cyrielle
IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 2180 - 2183
[3] Self-Supervised Entity Alignment Based on Multi-Modal Contrastive Learning
Bo Liu
Ruoyi Song
Yuejia Xiang
Junbo Du
Weijian Ruan
Jinhui Hu
IEEE/CAA Journal of Automatica Sinica, 2022, 9 (11) : 2031 - 2033
[4] Multi-modal Food Recommendation Using Clustering and Self-supervised Learning
Zhang, Yixin
Zhou, Xin
Meng, Qianwen
Zhu, Fanglin
Xu, Yonghui
Shen, Zhiqi
Cui, Lizhen
PRICAI 2024: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2025, 15281 : 269 - 281
[5] Self-Supervised Entity Alignment Based on Multi-Modal Contrastive Learning
Liu, Bo
Song, Ruoyi
Xiang, Yuejia
Du, Junbo
Ruan, Weijian
Hu, Jinhui
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (11) : 2031 - 2033
[6] Self-Supervised Multi-Modal Learning for Collaborative Robotic Grasp-Throw
Hou, Yanxu
Fang, Zihan
Li, Jun
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4250 - 4256
[7] Self-supervised multi-modal fusion network for multi-modal thyroid ultrasound image diagnosis
Xiang, Zhuo
Zhuo, Qiuluan
Zhao, Cheng
Deng, Xiaofei
Zhu, Ting
Wang, Tianfu
Jiang, Wei
Lei, Baiying
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
[8] Self-supervised Multi-Modal Video Forgery Attack Detection
Zhao, Chenhui
Li, Xiang
Younes, Rabih
2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
[9] Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning
Ye, Yiwen
Xie, Yutong
Zhang, Jianpeng
Chen, Ziyang
Wu, Qi
Xia, Yong
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 11114 - 11124
[10] Self-supervised opinion summarization with multi-modal knowledge graph
Lingyun Jin
Jingqiang Chen
Journal of Intelligent Information Systems, 2024, 62 : 191 - 208

← 1 2 3 4 5 →