Audio-Visual Voice Activity Detection Using Diffusion Maps

被引:0
|
作者
Department of Electrical Engineering, Technion-Israel Institute of Technology, Haifa [1 ]
32000, Israel
机构
来源
关键词
Number:; 1130/11; Acronym:; -; Sponsor:;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [21] Vehicle Detection and Classification using Audio-Visual cues
    Piyush, P.
    Rajan, Rajeev
    Mary, Leena
    Koshy, Bino I.
    2016 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2016, : 732 - 736
  • [22] Object category detection using audio-visual cues
    Luo, Jie
    Caputo, Barbara
    Zweig, Alon
    Bach, Joerg-Hendrik
    Anemueller, Joern
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 539 - 548
  • [23] Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
    Chien, Yung-Lun
    Chen, Hsin-Hao
    Yen, Ming-Chi
    Tsai, Shu-Wei
    Wang, Hsin-Min
    Tsao, Yu
    Chi, Tai-Shih
    INTERSPEECH 2023, 2023, : 5023 - 5026
  • [24] Joint Audio-Visual Deepfake Detection
    Zhou, Yipin
    Lim, Ser-Nam
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14780 - 14789
  • [25] Audio-Visual Detection Benefits in the Rat
    Gleiss, Stephanie
    Kayser, Christoph
    PLOS ONE, 2012, 7 (09):
  • [26] Incongruence Detection in Audio-Visual Processing
    Havlena, Michal
    Heller, Jan
    Kayser, Hendrik
    Bach, Joerg-Hendrik
    Anemueller, Joern
    Pajdla, Tomas
    DETECTION AND IDENTIFICATION OF RARE AUDIOVISUAL CUES, 2012, 384 : 67 - +
  • [27] Audio-visual talking face detection
    Li, MK
    Li, DG
    Dimitrova, N
    Sethi, I
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL II, PROCEEDINGS, 2003, : 473 - 476
  • [28] Audio-Visual Information Fusion Using Cross-modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments
    Zhou, Hengshun
    Du, Jun
    Chen, Hang
    Jing, Zijun
    Xiong, Shifu
    Lee, Chin-Hui
    INTERSPEECH 2021, 2021, : 341 - 345
  • [29] Audio-Visual Emotion Recognition using Gaussian Mixture Models for Face and Voice
    Metallinou, Angeliki
    Lee, Sungbok
    Narayanan, Shrikanth
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 250 - 257
  • [30] Information optimization in coupled audio-visual cortical maps
    Kardar, M
    Zee, A
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (25) : 15894 - 15897