共 50 条
- [11] Hierarchical Context-aware Network for Dense Video Event Captioning 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 2004 - 2013
- [12] Survey of Dense Video Captioning Computer Engineering and Applications, 2023, 59 (12): : 28 - 48
- [13] Multirate Multimodal Video Captioning PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1877 - 1882
- [14] Streamlined Dense Video Captioning 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3581 - +
- [15] Multimodal attention-based transformer for video captioning Applied Intelligence, 2023, 53 : 23349 - 23368
- [19] Multimodal Video Captioning using Object-Auditory Information Fusion with Transformers PROCEEDINGS OF THE 2ND WORKSHOP ON USER-CENTRIC NARRATIVE SUMMARIZATION OF LONG VIDEOS, NARSUM 2023, 2023, : 51 - 56
- [20] Deep multimodal embedding for video captioning Multimedia Tools and Applications, 2019, 78 : 31793 - 31805