- [42] Sieve: Multimodal Dataset Pruning Using Image Captioning Models. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024: 22423-22432
- [43] M-VAD names: a dataset for video captioning with naming. Multimedia Tools and Applications, 2019, 78: 14007-14027
- [44] Smartphone Audio Replay Attacks Dataset. 2021 9th International Workshop on Biometrics and Forensics (IWBF 2021), 2021
- [45] Towards Image Captioning for the Portuguese Language: Evaluation on a Translated Dataset. ICEIS: Proceedings of the 24th International Conference on Enterprise Information Systems - Vol 1, 2022: 384-393
- [46] Audio Set: An Ontology and Human-Labeled Dataset for Audio Events. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017: 776-780
- [47] Student Class Behavior Dataset: a video dataset for recognizing, detecting, and captioning students' behaviors in classroom scenes. Neural Computing & Applications, 2021, 33(14): 8335-8354
- [49] Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022: 971-975