共 50 条
- [7] PRIMING THE VISUAL RECOGNITION OF SPOKEN WORDS JOURNAL OF SPEECH AND HEARING RESEARCH, 1995, 38 (06): : 1377 - 1386
- [8] Visual-Semantic Graph Matching for Visual Grounding MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4041 - 4050
- [10] PHONETIC-AND-SEMANTIC EMBEDDING OF SPOKEN WORDS WITH APPLICATIONS IN SPOKEN CONTENT RETRIEVAL 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 941 - 948