共 50 条
- [31] Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 2241 - 2244
- [32] Audio and Video-based Emotion Recognition using Multimodal Transformers 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2582 - 2588
- [34] INFORMATION RETRIEVAL METHODS FOR AUTOMATIC SPEECH RECOGNITION 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5550 - 5553
- [35] Prosodic and accentual information for automatic speech recognition IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (04): : 321 - 333
- [37] A multimodal emotion recognition model integrating speech, video and MoCAP Multimedia Tools and Applications, 2022, 81 : 32265 - 32286
- [38] The AhoSR Automatic Speech Recognition System ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 279 - 288
- [39] AN AUTOMATIC SPEECH RECOGNITION SYSTEM TABARCA REVISTA DE INFORMATICA Y AUTOMATICA, 1990, 23 (01): : 15 - 24
- [40] Automatic Indexing Algorithm of Golf Video Using Audio Information JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (05): : 441 - 446