Similarity Based Join Over Audio Feeds in a Multimedia Data Stream Management System

被引:2
|
作者
Maison, Rafal [1 ]
Majda, Ewelina [2 ]
Dobrowolski, Andrzej P. [2 ]
Zakrzewicz, Maciej [3 ]
机构
[1] Alcatel Lucent, Murray Hill, NJ USA
[2] Mil Univ Technol, Fac Elect, Warsaw, Poland
[3] Poznan Univ Tech, Fac Comp, Poznan, Poland
关键词
CHANNELS;
D O I
10.1002/bltj.21599
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Over the last several years, processing of high performance data streams has become very important in various domains. A new type of data processing is needed for applications where input data streams are modeled as multimedia data streams, such as audio and video feeds. For example, in the public safety sector, monitoring and automatic identification of particular individuals suspected of terrorist or criminal activity requires the processing of complex audio and video streams, which is beyond the capabilities of a typical data stream management system (DSMS). The concept of a multimedia data stream management system (MMDSMS) has recently been introduced in order to effectively process continuous queries over dynamic multimedia data streams. In this paper, we address MMDSMS functionalities related to speaker recognition problems in the area of detecting individuals who may pose security threats. We focus on audio feed processing using our novel similarity-based join and on parameterization of the multimedia signal for the process of recognition. We propose a set of signal parameters which a clearly discriminate among individual voices by describing the signal using a homomorphic processing method. Our research was primarily focused on assessing the applicability of cepstral analysis in speech recognition systems, based on a set of acquired digitized voice samples. We developed a research prototype to assess the proposed concepts, and verified the effectiveness of our framework in a lab environment. (C) 2013 Alcatel-Lucent.
引用
收藏
页码:195 / 212
页数:18
相关论文
共 50 条
  • [31] Audio multimedia conferencing system based on the technology of speech recognition
    Wang, YB
    Zhang, XM
    Li, WC
    2000 IEEE ASIA-PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS: ELECTRONIC COMMUNICATION SYSTEMS, 2000, : 771 - 774
  • [32] Design of a MIDI-based audio system for multimedia applications
    Chen, Hsin-Chuan
    WMSCI 2007: 11TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS, 2007, : 229 - 232
  • [33] Techniques and data structures for efficient multimedia retrieval based on similarity
    Lu, GJ
    IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (03) : 372 - 384
  • [34] Learning Based Neural Similarity Metrics for Multimedia Data Mining
    Dianhui Wang
    Yong-Soo Kim
    Seok Cheon Park
    Chul Soo Lee
    Yoon Kyung Han
    Soft Computing, 2007, 11 : 335 - 340
  • [35] Learning based neural similarity metrics for multimedia data mining
    Wang, Dianhui
    Kim, Yong-Soo
    Park, Seok Cheon
    Lee, Chul Soo
    Han, Yoon Kyung
    SOFT COMPUTING, 2007, 11 (04) : 335 - 340
  • [36] Random Draw Forest: A Salient Index for Similarity Search over Multimedia Data
    Lu, Yangdi
    He, Wenbo
    Nabatchian, Amir
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [37] Similarity-Based Trust Management System: Data Validation Scheme
    Al Falasi, Hind
    Mohamed, Nader
    El-Syed, Hesham
    HYBRID INTELLIGENT SYSTEMS, HIS 2015, 2016, 420 : 141 - 153
  • [38] A-DSP: An Adaptive Join Algorithm for Dynamic Data Stream on Cloud System
    Fang, Junhua
    Zhang, Rong
    Zhao, Yan
    Zheng, Kai
    Zhou, Xiaofang
    Zhou, Aoying
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (05) : 1861 - 1876
  • [39] A multimedia database management system for medical data
    Stanescu, Liana
    Burdescu, Dumitru
    Brezovan, Marius
    Stoica, Cosmin
    SIGMAP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2007, : 371 - +
  • [40] Multimedia data management in a highway information system
    Wang, Kelvin C.P.
    Li, Xuyang
    Computing in Civil Engineering (New York), 1996, : 607 - 612