Audio fingerprinting: Combining computer vision & data stream processing

被引：0

作者：

Baluja, Shumeet ^{[1
]}

Covell, Michele ^{[1
]}

机构：

[1] Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3 | 2007年

关键词：

acoustic applications; acoustic signal processing; pattern recognition; music;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we present Waveprint, a novel system for audio identification. Waveprint uses a combination of computer-vision techniques and large-scale-data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched. The resulting system has excellent identification capabilities for small snippets of audio that have been degraded in a variety of manners, including competing noise, poor recording quality, and cell-phone playback. We measure the tradeoffs between performance, memory usage, and computation through extensive experimentation. The system is more efficient in terms of memory usage and computation, while being more accurate, when compared with previous state of the art systems.

引用

页码：213 / +

页数：2

共 50 条

[1] UniStream: A Unified Stream Architecture Combining Configuration and Data Processing
Yan, Jian
Jin, Jifang
Wang, Ying
Zhou, Xuegong
Leong, Philip
Wang, Lingli
2015 25TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2015,
[2] COMBINING COMPUTER VISION AND VIDEO PROCESSING TO ACHIEVE IMMERSIVE MOBILE VIDEOCONFERENCING
Caviedes, Jorge E.
Wu, Sin Lin
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2467 - 2471
[3] Computer vision based the research for a mass of data processing
Liao, Qiang
Liang, Depei
Xu, Zongjun
Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement & Diagnosis, 2000, 20 (SUPPL.): : 203 - 206
[4] A Stream Algebra for Computer Vision Pipelines
Helala, Mohamed A.
Pu, Ken Q.
Qureshi, Faisal Z.
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 800 - 807
[5] Conducting audio files via computer vision
Murphy, D
Andersen, TH
Jensen, K
GESTURE-BASED COMMUNICATION IN HUMAN-COMPUTER INTERACTION, 2003, 2915 : 529 - 540
[6] PARALLEL PROCESSING FOR COMPUTER VISION
DELP, EJ
MUDGE, TN
SIEGEL, LJ
SIEGEL, HJ
PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1982, 336 : 161 - 167
[7] Special issue: Data and information fusion in image processing and computer vision
Roberto, V
Trucco, E
PATTERN RECOGNITION, 2001, 34 (08) : 1513 - 1513
[8] Data and Computer Vision
Russell, John
ART BULLETIN, 2024, 106 (02): : 23 - 24
[9] A Cloud Platform for Big IoT Data Analytics by Combining Batch and Stream Processing Technologies
Dissanayake, D. M. C.
Jayasena, K. P. N.
2017 NATIONAL INFORMATION TECHNOLOGY CONFERENCE (NITC), 2017, : 40 - 45
[10] Using computer vision to generate customized spatial audio
Mohan, A
Zotkin, DN
DeMenthon, D
Davis, LS
Duraiswami, R
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 57 - 60

← 1 2 3 4 5 →