50 items in total
- [33] Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer. INTERSPEECH 2023, 2023: 2128-2132
- [34] Automated Audio Captioning with Epochal Difficult Captions for curriculum learning. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022: 1058-1063
- [35] Audio interface for immersive 3D-audio desktop applications. VECIMS'03: 2003 IEEE INTERNATIONAL SYMPOSIUM ON VIRTUAL ENVIRONMENTS, HUMAN-COMPUTER INTERFACES AND MEASUREMENT SYSTEMS, 2003: 179-182
- [36] UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 18063-18073
- [38] Evaluating Web Audio for Learning, Accessibility, and Distribution. AES: Journal of the Audio Engineering Society, 2022, 70 (11): 951-961
- [39] Evaluating Web Audio for Learning, Accessibility, and Distribution. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2022, 70 (11): 951-961
- [40] Explore and Tell: Embodied Visual Captioning in 3D Environments. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 2482-2491