A speaker identification system for video content analysis

Cited by: 0
Authors
Bi, Jing [1]
Liu, Shu-Chang [1]
Affiliation
[1] Beijing Univ Posts & Telecommun, Beijing 100088, Peoples R China
Source
2008 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS | 2008
Keywords
DOI
10.1109/IIH-MSP.2008.215
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent literature has increasingly proposed applying audio content analysis techniques to content-based video parsing. This paper presents our current work on a speaker identification system for video content analysis. The system differs from conventional ones in three respects: first, the soundtrack extracted from a video stream contains not only silence and speech but also music and environmental sound; second, the number of speakers in the video content is unknown; third, noise in the video can significantly degrade system performance. To address these issues, the system comprises the following basic components: audio classification and segmentation using rule-based and Support Vector Machine (SVM) based classifiers; speech clustering using spectral clustering, followed by speaker identification based on Gaussian Mixture Models (GMM); and speech enhancement based on spectral subtraction. Experiments are carried out on a database extracted from news, conversation, and movie videos. The results confirm the validity of the proposed system architecture.
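The GMM-based identification step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the model parameters, the two-dimensional feature frames, and the names `log_gauss_diag`, `gmm_log_likelihood`, and `identify_speaker` are all hypothetical, and diagonal covariances are assumed. Each speaker is represented by an already-trained GMM, and a speech segment is assigned to the speaker whose model gives the highest average frame log-likelihood.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(x, gmm):
    """Log-likelihood of one frame under a GMM given as (weight, mean, var) tuples."""
    terms = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))

def identify_speaker(frames, speaker_models):
    """Return the best-scoring speaker and the average log-likelihood per speaker."""
    scores = {
        name: sum(gmm_log_likelihood(f, gmm) for f in frames) / len(frames)
        for name, gmm in speaker_models.items()
    }
    return max(scores, key=scores.get), scores

# Hypothetical hand-set models (assumption): a real system would train
# each GMM on that speaker's enrollment speech, typically with EM.
speaker_models = {
    "speaker_a": [(0.5, (0.0, 0.0), (1.0, 1.0)), (0.5, (1.0, 1.0), (1.0, 1.0))],
    "speaker_b": [(0.5, (5.0, 5.0), (1.0, 1.0)), (0.5, (6.0, 6.0), (1.0, 1.0))],
}
frames = [(0.2, -0.1), (0.9, 1.1), (0.4, 0.6)]  # toy feature frames of one segment

best, scores = identify_speaker(frames, speaker_models)
print(best)
```

Averaging over frames keeps scores comparable across segments of different length. Since the abstract notes that the number of speakers is unknown, a practical system would also need a rejection threshold or the clustering step before this decision; the sketch omits both.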
Pages: 200-203
Page count: 4
Related papers
50 in total
  • [31] Speaker Identification System Under Noisy Conditions
    Alam, Md Shariful
    Zilany, Muhammad S. A.
    2019 5TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2019, : 566 - 569
  • [32] Real-time speaker identification system
    Al-Shboul, Bashar
    Alsawalqah, Hamad
    Lee, Dongman
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE: COMPUTER SCIENCE CHALLENGES, 2007, : 422 - +
  • [33] A MODULAR AND HYBRID CONNECTIONIST SYSTEM FOR SPEAKER IDENTIFICATION
    BENNANI, Y
    NEURAL COMPUTATION, 1995, 7 (04) : 791 - 798
  • [34] Continuous Speech Recognition and Identification of the Speaker System
    Guffanti, Diego
    Martinez, Danilo
    Paladines, Jose
    Sarmiento, Andrea
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY & SYSTEMS (ICITS 2018), 2018, 721 : 767 - 776
  • [35] Modifications of KNN Classifier for Speaker Identification System
    Kacur, Juraj
    PROCEEDINGS OF ELMAR 2016 - 58TH INTERNATIONAL SYMPOSIUM ELMAR 2016, 2016, : 35 - 38
  • [36] An MFCC-based Speaker Identification System
    Leu, Fang-Yie
    Lin, Guan-Liang
    2017 IEEE 31ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2017, : 1055 - 1062
  • [37] Robust speaker identification system for voice changes
    Martinez Mascorro, Guillermo Arturo
    Aguilar Torres, Gualberto
    INGENIUS-REVISTA DE CIENCIA Y TECNOLOGIA, 2012, (08): : 45 - 53
  • [38] Speaker Identification System based on a Web Interface
    Kacur, Juraj
    Lapin, Ivan
    Durajka, Juraj
    Rozinaj, Gregor
    PROCEEDINGS ELMAR-2012, 2012, : 191 - 194
  • [39] CSLBP and OCLBP local descriptors for speaker identification from video sequences
    Chelali, Fatma zohra
    Djeradi, Amar
    PROCEEDINGS OF 2015 THIRD IEEE WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2015,
  • [40] A Speaker Identification system with verification method based on speaker relative threshold and HMM
    He, ZY
    Hu, QX
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 488 - 491