共 50 条
- [31] MITIGATING DATASET BIAS IN IMAGE CAPTIONING THROUGH CLIP CONFOUNDER-FREE CAPTIONING NETWORK 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1720 - 1724
- [33] Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding INTERSPEECH 2024, 2024, : 1135 - 1139
- [34] A Transformer-based Audio Captioning Model with Keyword Estimation INTERSPEECH 2020, 2020, : 1977 - 1981
- [35] Automated audio captioning: an overview of recent progress and new challenges EURASIP Journal on Audio, Speech, and Music Processing, 2022
- [36] Enhance Temporal Relations in Audio Captioning with Sound Event Detection INTERSPEECH 2023, 2023, : 4179 - 4183
- [39] Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer INTERSPEECH 2023, 2023, : 2128 - 2132
- [40] Automated Audio Captioning with Epochal Difficult Captions for curriculum learning PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1058 - 1063