共 50 条
- [1] Synchronising audio and ultrasound by learning cross-modal embeddings INTERSPEECH 2019, 2019, : 4100 - 4104
- [2] Cross-modal Embeddings for Video and Audio Retrieval COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 711 - 716
- [3] Token Embeddings Alignment for Cross-Modal Retrieval PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4555 - 4563
- [4] Adaptive Cross-Modal Embeddings for Image-Text Alignment THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12313 - 12320
- [6] Diachronic Cross-modal Embeddings PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2061 - 2069
- [7] Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR INTERSPEECH 2022, 2022, : 1016 - 1020
- [8] PERFECT MATCH: IMPROVED CROSS-MODAL EMBEDDINGS FOR AUDIO-VISUAL SYNCHRONISATION 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3965 - 3969
- [9] Learning Cross-modal Embeddings for Cooking Recipes and Food Images 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3068 - 3076