共 50 条
- [41] TrTr-CMR: Cross-Modal Reasoning Dual Transformer for Remote Sensing Image Captioning IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [42] Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2001 - 2011
- [44] Lightweight dense video captioning with cross-modal attention and knowledge-enhanced unbiased scene graph Complex & Intelligent Systems, 2023, 9 : 4995 - 5012
- [48] Cross-modal fusion for multi-label image classification with attention mechanism Computers and Electrical Engineering, 2022, 101