Audio-visual biometric recognition by vector quantization

被引：3

作者：

Das, Amitava ^{[1
]}

Ghosh, Prasanta ^{[1
]}

机构：

[1] Microsoft Res India, Bangalore, Karnataka, India

来源：

2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP | 2006年

关键词：

D O I：

10.1109/SLT.2006.326843

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a Vector Quantization based bimodal (speech and face) biometric recognition method which delivers high performance amidst noise, illumination variations and occlusions (disguised mode) while requiring very little training data, memory storage and complexity of operation. A Transform VQ method delivers good face-recognition performance and a Text Dependent VQ method provides good recognition performance using speech. Simple fusion of two leads to a wider separation between the user-clusters in the combined feature space, leading to high performance.

引用

页码：166 / +

页数：2

共 50 条

[1] An audio-visual distance for audio-visual speech vector quantization
Girin, L
Foucher, E
Feng, G
1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 523 - 528
[2] Dynamic Audio-Visual Biometric Fusion for Person Recognition
Alsaedi, Najlaa Hindi
Jaha, Emad Sami
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 1283 - 1311
[3] Audio-Visual Biometric Recognition Via Joint Sparse Representations
Primorac, Rudi
Togneri, Roberto
Bennamoun, Mohammed
Sohel, Ferdous
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3031 - 3035
[4] Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey
Mandalapu, Hareesh
Reddy, Aravinda P. N.
Ramachandra, Raghavendra
Rao, Krothapalli Sreenivasa
Mitra, Pabitra
Prasanna, S. R. Mahadeva
Busch, Christoph
IEEE ACCESS, 2021, 9 : 37431 - 37455
[5] An audio-visual speech recognition with a new mandarin audio-visual database
Liao, Wen-Yuan
Pao, Tsang-Long
Chen, Yu-Te
Chang, Tsun-Wei
INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
[6] Audio-visual biometric based speaker identification
Kar, Biswajit
Bhatia, Sandeep
Dutta, P. K.
ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL IV, PROCEEDINGS, 2007, : 94 - 98
[7] Audio-visual affect recognition
Zeng, Zhihong
Tu, Jilin
Liu, Ming
Huang, Thomas S.
Pianfetti, Brian
Roth, Dan
Levinson, Stephen
IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (02) : 424 - 428
[8] Audio-visual gender recognition
Liu, Ming
Xu, Xun
Huang, Thomas S.
MIPPR 2007: PATTERN RECOGNITION AND COMPUTER VISION, 2007, 6788
[9] An audio-visual speech recognition system for testing new audio-visual databases
Pao, Tsang-Long
Liao, Wen-Yuan
VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2006, : 192 - +
[10] LEARNING CONTEXTUALLY FUSED AUDIO-VISUAL REPRESENTATIONS FOR AUDIO-VISUAL SPEECH RECOGNITION
Zhang, Zi-Qiang
Zhang, Jie
Zhang, Jian-Shu
Wu, Ming-Hui
Fang, Xin
Dai, Li-Rong
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1346 - 1350

← 1 2 3 4 5 →