50 entries in total
- [1] AUDIO DIFFERENCE LEARNING FOR AUDIO CAPTIONING. 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, pp. 1456-1460
- [2] TRAINING AUDIO CAPTIONING MODELS WITHOUT AUDIO. 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, pp. 371-375
- [3] CLOTHO: AN AUDIO CAPTIONING DATASET. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, pp. 736-740
- [4] Audio Captioning Based on Combined Audio and Semantic Embeddings. 2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, pp. 41-48
- [8] JOINT SPEECH RECOGNITION AND AUDIO CAPTIONING. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, pp. 7892-7896
- [9] DIVERSE AUDIO CAPTIONING VIA ADVERSARIAL TRAINING. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, pp. 8882-8886
- [10] RECAP: RETRIEVAL-AUGMENTED AUDIO CAPTIONING. 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, pp. 1161-1165