Audio fingerprinting: Combining computer vision & data stream processing

Cited by: 0
Authors
Baluja, Shumeet [1]
Covell, Michele [1]
Affiliations
[1] Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
Source
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3 | 2007
Keywords
acoustic applications; acoustic signal processing; pattern recognition; music
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we present Waveprint, a novel system for audio identification. Waveprint uses a combination of computer-vision techniques and large-scale data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched. The resulting system has excellent identification capabilities for small snippets of audio that have been degraded in a variety of ways, including competing noise, poor recording quality, and cell-phone playback. We measure the tradeoffs between performance, memory usage, and computation through extensive experimentation. The system is more efficient in terms of memory usage and computation, while being more accurate, when compared with previous state-of-the-art systems.
Pages: 213 / +
Page count: 2
Related papers
50 in total
  • [21] Autopipelining for Data Stream Processing
    Tang, Yuzhe
    Gedik, Bugra
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (12) : 2344 - 2354
  • [22] Optimization of data stream processing
    Getta, JR
    Vossough, E
    SIGMOD RECORD, 2004, 33 (03) : 34 - 39
  • [23] Data stream query processing
    Koudas, N
    Srivastava, D
    FOURTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, PROCEEDINGS, 2003, : 374 - 374
  • [24] Combining computer graphics and computer vision for probabilistic visual robot navigation
    Heigl, B
    Denzler, J
    Niemann, H
    ENHANCED AND SYNTHETIC VISION 2000, 2000, 4023 : 226 - 235
  • [25] AN AUDIO FINGERPRINTING SYSTEM FOR LIVE VERSION IDENTIFICATION USING IMAGE PROCESSING TECHNIQUES
    Rafii, Zafar
    Coover, Bob
    Han, Jinyu
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [26] Combining brain computer interfaces with vision for object categorization
    Kapoor, Ashish
    Shenoy, Pradeep
    Tan, Desney
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2150 - +
  • [27] Combining vision and computer graphics for video motion capture
    Wilhelms, J
    Van Gelder, A
    VISUAL COMPUTER, 2003, 19 (06) : 360 - 376
  • [28] Combining vision and computer graphics for video motion capture
    Jane Wilhelms
    Allen Van Gelder
    The Visual Computer, 2003, 19 : 360 - 376
  • [29] Audio real-time processing for multimedia computer
    Zhang, Chengyun
    Xie, Zhiwen
    Xie, Bosun
    Diansheng Jishu/Audio Engineering, 2000, (01) : 19 - 21
  • [30] Data Processing Unit for Energy Saving in Computer Vision: Weapon Detection Use Case
    Perea-Trigo, Marina
    Lopez-Ortiz, Enrique J.
    Salazar-Gonzalez, Jose L.
    Alvarez-Garcia, Juan A.
    Vegas Olmos, J. J.
    ELECTRONICS, 2023, 12 (01)