Audio-visual biometric recognition by vector quantization

被引：3

作者：

Das, Amitava ^{[1
]}

Ghosh, Prasanta ^{[1
]}

机构：

[1] Microsoft Res India, Bangalore, Karnataka, India

来源：

2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP | 2006年

关键词：

D O I：

10.1109/SLT.2006.326843

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a Vector Quantization based bimodal (speech and face) biometric recognition method which delivers high performance amidst noise, illumination variations and occlusions (disguised mode) while requiring very little training data, memory storage and complexity of operation. A Transform VQ method delivers good face-recognition performance and a Text Dependent VQ method provides good recognition performance using speech. Simple fusion of two leads to a wider separation between the user-clusters in the combined feature space, leading to high performance.

引用

页码：166 / +

页数：2

共 50 条

[31] Audio-visual modeling for bimodal speech recognition
Kaynak, MN
Zhi, Q
Cheok, AD
Sengupta, K
Chung, KC
2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 181 - 186
[32] Audio-Visual Recognition System in Compression Domain
Wong, Yee Wan
Seng, Kah Phooi
Ang, Li-Minn
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (05) : 637 - 646
[33] AUDIO-VISUAL RECOGNITION OF GOOSE FLOCKING BEHAVIOR
Steen, Kim Arild
Therkildsen, Ole Roland
Green, Ole
Karstoft, Henrik
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (07)
[34] Audio-visual system for robust speaker recognition
Chen, Q
Yang, JG
Gou, J
MLMTA '05: Proceedings of the International Conference on Machine Learning Models Technologies and Applications, 2005, : 97 - 103
[35] Bimodal fusion in audio-visual speech recognition
Zhang, XZ
Mersereau, RM
Clements, M
2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 964 - 967
[36] Multimodal Learning Using 3D Audio-Visual Data or Audio-Visual Speech Recognition
Su, Rongfeng
Wang, Lan
Liu, Xunying
2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
[37] Biometric person authentication with liveness detection based on audio-visual fusion
Chetty, Girija
Wagner, Michael
INTERNATIONAL JOURNAL OF BIOMETRICS, 2009, 1 (04) : 463 - 478
[38] Catching audio-visual mice:: The extrapolation of audio-visual speed
Hofbauer, MM
Wuerger, SM
Meyer, GF
Röhrbein, F
Schill, K
Zetzsche, C
PERCEPTION, 2003, 32 : 96 - 96
[39] Audio-Visual Speech Modeling for Continuous Speech Recognition
Dupont, Stephane
Luettin, Juergen
IEEE TRANSACTIONS ON MULTIMEDIA, 2000, 2 (03) : 141 - 151
[40] Speaker independent audio-visual continuous speech recognition
Liang, LH
Liu, XX
Zhao, YB
Pi, XB
Nefian, AV
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A25 - A28

← 1 2 3 4 5 →