50 items in total
- [42] Richer Semantic Visual and Language Representation for Video Captioning. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017: 1871-1876
- [43] Semantic Tag Augmented XlanV Model for Video Captioning. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 4818-4822
- [45] Unsupervised Semantic Parsing of Video Collections. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 4480-4488
- [46] End-to-End Dense Video Captioning with Masked Transformer. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 8739-8748
- [48] Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 7190-7198