共 50 条
- [3] Knowledge-Enhanced Context Representation for Unbiased Scene Graph Generation WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 : 248 - 263
- [4] Weakly-supervised Video Scene Graph Generation via Unbiased Cross-modal Learning PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4574 - 4583
- [5] Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1157 - 1164
- [10] Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2001 - 2011