Audio fingerprinting: Combining computer vision & data stream processing

被引:0
|
作者
Baluja, Shumeet [1 ]
Covell, Michele [1 ]
机构
[1] Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
来源
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3 | 2007年
关键词
acoustic applications; acoustic signal processing; pattern recognition; music;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present Waveprint, a novel system for audio identification. Waveprint uses a combination of computer-vision techniques and large-scale-data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched. The resulting system has excellent identification capabilities for small snippets of audio that have been degraded in a variety of manners, including competing noise, poor recording quality, and cell-phone playback. We measure the tradeoffs between performance, memory usage, and computation through extensive experimentation. The system is more efficient in terms of memory usage and computation, while being more accurate, when compared with previous state of the art systems.
引用
收藏
页码:213 / +
页数:2
相关论文
共 50 条
  • [1] UniStream: A Unified Stream Architecture Combining Configuration and Data Processing
    Yan, Jian
    Jin, Jifang
    Wang, Ying
    Zhou, Xuegong
    Leong, Philip
    Wang, Lingli
    2015 25TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2015,
  • [2] COMBINING COMPUTER VISION AND VIDEO PROCESSING TO ACHIEVE IMMERSIVE MOBILE VIDEOCONFERENCING
    Caviedes, Jorge E.
    Wu, Sin Lin
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2467 - 2471
  • [3] Computer vision based the research for a mass of data processing
    Liao, Qiang
    Liang, Depei
    Xu, Zongjun
    Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement & Diagnosis, 2000, 20 (SUPPL.): : 203 - 206
  • [4] A Stream Algebra for Computer Vision Pipelines
    Helala, Mohamed A.
    Pu, Ken Q.
    Qureshi, Faisal Z.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 800 - 807
  • [5] Conducting audio files via computer vision
    Murphy, D
    Andersen, TH
    Jensen, K
    GESTURE-BASED COMMUNICATION IN HUMAN-COMPUTER INTERACTION, 2003, 2915 : 529 - 540
  • [6] PARALLEL PROCESSING FOR COMPUTER VISION
    DELP, EJ
    MUDGE, TN
    SIEGEL, LJ
    SIEGEL, HJ
    PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1982, 336 : 161 - 167
  • [7] Special issue: Data and information fusion in image processing and computer vision
    Roberto, V
    Trucco, E
    PATTERN RECOGNITION, 2001, 34 (08) : 1513 - 1513
  • [8] Data and Computer Vision
    Russell, John
    ART BULLETIN, 2024, 106 (02): : 23 - 24
  • [9] A Cloud Platform for Big IoT Data Analytics by Combining Batch and Stream Processing Technologies
    Dissanayake, D. M. C.
    Jayasena, K. P. N.
    2017 NATIONAL INFORMATION TECHNOLOGY CONFERENCE (NITC), 2017, : 40 - 45
  • [10] Using computer vision to generate customized spatial audio
    Mohan, A
    Zotkin, DN
    DeMenthon, D
    Davis, LS
    Duraiswami, R
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 57 - 60