Fast multimedia contents retrieval by partially spoken query

被引:0
|
作者
Jeong, So-Young [1 ]
Han, Icksang [1 ]
Kwak, Byung-Kwan [1 ]
Cho, Jeongmi [1 ]
Kim, Jeongsu [1 ]
机构
[1] Samsung Elect Co Ltd, Samsung Adv Inst Technol, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present novel fast multi-pass decoding strategies for recognizing large named-entities on a low-resource embedded device and thus retrieving MP3 music using spoken query, which contains partial segments of whole music titles and artists. After acoustic-phonetic decoding in the first stage processing, we incorporate word boundary information with phonetic confusion matrix into next stage partial word matching. Then, we rescore candidate phone lists using more complex context-dependent acoustic model, whose outputs are the retrieved songs. We tested our retrieval system to the task of retrieving 1000 songs on a commercial MP3 player and could achieve about 15.5% relative improvements in response time over conventional frame-based multi-pass decoding method without sacrificing recognition rates.
引用
收藏
页码:839 / 840
页数:2
相关论文
共 50 条
  • [1] Ontology based user query interpretation for semantic multimedia contents retrieval
    Moo-Hun Lee
    Seungmin Rho
    Eui-In Choi
    Multimedia Tools and Applications, 2014, 73 : 901 - 915
  • [2] Ontology based user query interpretation for semantic multimedia contents retrieval
    Lee, Moo-Hun
    Rho, Seungmin
    Choi, Eui-In
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (02) : 901 - 915
  • [3] Spoken query processing for information retrieval
    Moreno-Daniel, A.
    Parthasarathy, S.
    Juang, B. H.
    Wilpon, J. G.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 121 - +
  • [4] Spoken information retrieval for multimedia databases
    Salgado-Garza, Luis R.
    Nolazco-Flores, Juan A.
    Díaz-López, Pablo D.
    ACS/IEEE Int. Conf. Comput. Syst. Applic., (146-150):
  • [5] Spoken Information Retrieval for Multimedia Databases
    Salgado-Garza, Luis R.
    Nolazco-Flores, Juan A.
    Diaz-Lopez, Pablo D.
    3RD ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, 2005, 2005,
  • [6] On the Effectiveness of Contextualisation Techniques in Spoken Query Spoken Content Retrieval
    Racca, David N.
    Jones, Gareth J. F.
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 933 - 936
  • [7] Spoken query processing for interactive information retrieval
    Crestani, F
    DATA & KNOWLEDGE ENGINEERING, 2002, 41 (01) : 105 - 124
  • [8] Enhancing Query Formulation for Spoken Document Retrieval
    Chen, Berlin
    Chen, Yi-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Yu, Kuen-Tyng
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (03) : 553 - 569
  • [9] Phonetic Query Expansion for Spoken Document Retrieval
    Mamou, Jonathan
    Ramabhadran, Bhuvana
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2106 - +
  • [10] Phonetic query expansion for spoken document retrieval
    Reyes-Barragan, Alejandro
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 57 - 64