共 50 条
- [21] Scalable 3D Captioning with Pretrained Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [22] SEEING AND HEARING TOO: AUDIO REPRESENTATION FOR VIDEO CAPTIONING 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 381 - 388
- [23] FeatureCut: An Adaptive Data Augmentation for Automated Audio Captioning PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 313 - 318
- [24] Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding INTERSPEECH 2024, 2024, : 1135 - 1139
- [25] Captioning and Indian Sign Language as Accessibility Tools in Universal Design SAGE OPEN, 2013, 3 (02): : 1 - 16
- [26] Closed Captioning for Accessibility of Hard of Hearing People in Educational Environments PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): : 305 - 306
- [27] A Transformer-based Audio Captioning Model with Keyword Estimation INTERSPEECH 2020, 2020, : 1977 - 1981
- [28] Automated audio captioning: an overview of recent progress and new challenges EURASIP Journal on Audio, Speech, and Music Processing, 2022
- [29] Scene Graph with 3D Information for Change Captioning PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5074 - 5082
- [30] Enhance Temporal Relations in Audio Captioning with Sound Event Detection INTERSPEECH 2023, 2023, : 4179 - 4183