Fast multimedia contents retrieval by partially spoken query

Authors
Jeong, So-Young [1]
Han, Icksang [1]
Kwak, Byung-Kwan [1]
Cho, Jeongmi [1]
Kim, Jeongsu [1]
Affiliations
[1] Samsung Electronics Co Ltd, Samsung Advanced Institute of Technology, Seoul, South Korea
DOI
Not available
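Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809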
Abstract
We present novel, fast multi-pass decoding strategies for recognizing large named entities on a low-resource embedded device, and thereby retrieving MP3 music from a spoken query that contains only partial segments of full song titles and artist names. After acoustic-phonetic decoding in the first stage, we combine word boundary information with a phonetic confusion matrix in the next stage, partial word matching. We then rescore the candidate phone lists using a more complex context-dependent acoustic model, whose outputs are the retrieved songs. We tested our retrieval system on the task of retrieving 1000 songs on a commercial MP3 player and achieved a relative improvement of about 15.5% in response time over a conventional frame-based multi-pass decoding method, without sacrificing recognition rate.
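The partial word matching stage can be illustrated with a minimal sketch. This is not the authors' implementation: the confusion costs, the phone symbols, the gap penalty, and the function names (`sub_cost`, `edit_cost`, `partial_match_cost`, `retrieve`) are all hypothetical, and a plain confusion-weighted edit distance stands in for whatever scoring the paper actually uses. The sketch only shows the idea of aligning a decoded phone sequence against spans of a title that start and end on word boundaries. The first-stage decoder and the final acoustic rescoring are not modeled.

```python
from typing import Dict, List, Tuple

# Hypothetical substitution costs between easily confused phones.
# Unlisted pairs of different phones cost 1.0 (assumption, not from the paper).
CONFUSION_COST: Dict[Tuple[str, str], float] = {
    ("p", "b"): 0.3,
    ("t", "d"): 0.3,
    ("m", "n"): 0.4,
}


def sub_cost(a: str, b: str) -> float:
    """Cost of decoding phone `a` where the lexicon has phone `b`."""
    if a == b:
        return 0.0
    return CONFUSION_COST.get((a, b), CONFUSION_COST.get((b, a), 1.0))


def edit_cost(query: List[str], target: List[str], gap: float = 1.0) -> float:
    """Confusion-weighted edit distance between two phone sequences."""
    prev = [j * gap for j in range(len(target) + 1)]
    for i, q in enumerate(query, 1):
        cur = [i * gap]
        for j, t in enumerate(target, 1):
            cur.append(min(prev[j] + gap,                  # delete q
                           cur[j - 1] + gap,               # insert t
                           prev[j - 1] + sub_cost(q, t)))  # substitute
        prev = cur
    return prev[-1]


def partial_match_cost(query: List[str], title_words: List[List[str]]) -> float:
    """Lowest cost of matching the query to any contiguous run of whole words."""
    best = float("inf")
    for start in range(len(title_words)):
        span: List[str] = []
        for end in range(start, len(title_words)):
            span = span + title_words[end]  # extend only at word boundaries
            best = min(best, edit_cost(query, span))
    return best


def retrieve(query: List[str],
             lexicon: Dict[str, List[List[str]]],
             top_n: int = 3) -> List[str]:
    """Return the `top_n` titles with the lowest partial-match cost."""
    scored = sorted((partial_match_cost(query, words), title)
                    for title, words in lexicon.items())
    return [title for _, title in scored[:top_n]]
```

In a multi-pass system of the kind the abstract describes, a short candidate list such as the one returned by `retrieve` would then be passed to the more expensive rescoring stage, so the cheap phone-level match only has to narrow the search, not make the final decision.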
Pages: 839-840
Page count: 2