Audio-visual biometric recognition by vector quantization

被引：3

作者：

Das, Amitava ^{[1
]}

Ghosh, Prasanta ^{[1
]}

机构：

[1] Microsoft Res India, Bangalore, Karnataka, India

来源：

2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP | 2006年

关键词：

D O I：

10.1109/SLT.2006.326843

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a Vector Quantization based bimodal (speech and face) biometric recognition method which delivers high performance amidst noise, illumination variations and occlusions (disguised mode) while requiring very little training data, memory storage and complexity of operation. A Transform VQ method delivers good face-recognition performance and a Text Dependent VQ method provides good recognition performance using speech. Simple fusion of two leads to a wider separation between the user-clusters in the combined feature space, leading to high performance.

引用

页码：166 / +

页数：2

共 50 条

[41] Audio-visual speech recognition using lstm and cnn
El Maghraby E.E.
Gody A.M.
Farouk M.H.
Recent Advances in Computer Science and Communications, 2021, 14 (06) : 2023 - 2039
[42] Audio-visual fuzzy fusion for robust speech recognition
Malcangi, M.
Ouazzane, K.
Patel, P.
2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
[43] Building a data corpus for audio-visual speech recognition
Chitu, Alin G.
Rothkrantz, Leon J. M.
EUROMEDIA '2007, 2007, : 88 - 92
[44] Audio-Visual Automatic Speech Recognition for Connected Digits
Wang, Xiaoping
Hao, Yufeng
Fu, Degang
Yuan, Chunwei
2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL III, PROCEEDINGS, 2008, : 328 - +
[45] Speaker and digit recognition by audio-visual lip biometrics
Faraj, Maycel Isaac
Bigun, Josef
ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 1016 - +
[46] Audio-Visual Speech Recognition in the Presence of a Competing Speaker
Shao, Xu
Barker, Jon
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1292 - 1295
[47] Dynamic Bayesian Networks for audio-visual speaker recognition
Li, DD
Yang, YC
Wu, ZH
ADVANCES IN BIOMETRICS, PROCEEDINGS, 2006, 3832 : 539 - 545
[48] DARE: Deceiving Audio-Visual speech Recognition model
Mishra, Saumya
Gupta, Anup Kumar
Gupta, Puneet
KNOWLEDGE-BASED SYSTEMS, 2021, 232
[49] Temporal aggregation of audio-visual modalities for emotion recognition
Birhala, Andreea
Ristea, Catalin Nicolae
Radoi, Anamaria
Dutu, Liviu Cristian
2020 43RD INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2020, : 305 - 308
[50] Audio-Visual Group Recognition Using Diffusion Maps
Keller, Yosi
Coifman, Ronald R.
Lafon, Stephane
Zucker, Steven W.
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (01) : 403 - 413

← 1 2 3 4 5 →