A pattern classification proposal for object-oriented audio coding in MPEG-4

被引:4
|
作者
Beritelli, F [1 ]
Casale, S [1 ]
Russo, M [1 ]
机构
[1] Univ Catania, Ist Informat & Telecomun, I-95125 Catania, Italy
关键词
D O I
10.1023/A:1019112310453
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The future MPEG-4 standard will adopt an object-oriented encoding strategy whereby an audio source is encoded at a very low bit-rate by adapting a suitable coding scheme to the local characteristics of the signal. One of the most delicate issues in this approach is that the overall performance of the audio encoder greatly depends on the accuracy with which the input signal is classified. This paper shows that the difficult problem of audio classification for object-oriented coding can be effectively solved by selecting a salient set of acoustic parameters and adopting a fuzzy model for each audio object, obtained by a soft computing-hybrid learning tool. The audio classifier proposed operates at two levels: recognition of the class to which the input signal belongs (talkspurt, music, noise, signaling tones) and then recognition of the subclass to which it belongs. The results obtained show that fuzzy logic is a valid alternative to the matching techniques of a traditional pattern recognition approach.
引用
收藏
页码:375 / 391
页数:17
相关论文
共 50 条
  • [1] A pattern classification proposal for object‐oriented audio coding in MPEG‐4
    Francesco Beritelli
    Salvatore Casale
    Marco Russo
    Telecommunication Systems, 1998, 9 : 375 - 391
  • [2] MPEG-4 natural audio coding
    Brandenburg, K
    Kunz, O
    Sugiyama, A
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2000, 15 (4-5) : 423 - 444
  • [3] Coding of natural audio in MPEG-4
    Quackenbush, SR
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3797 - 3800
  • [4] Lossless audio coding with MPEG-4 structured audio
    Vasiloglou, N
    Schafer, RW
    Hans, MC
    SECOND INTERNATIONAL CONFERENCE ON WEB DELIVERING OF MUSIC, PROCEEDINGS, 2002, : 184 - 191
  • [5] Coding of prediction residual in MPEG-4 standard for lossless audio coding (MPEG-4 ALS)
    Reznik, YA
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 1024 - 1027
  • [6] An introduction to MPEG-4 audio lossless coding
    Liebchen, T
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 1012 - 1015
  • [7] On the usefulness of object shape coding with MPEG-4
    Prati, A
    Cucchiara, R
    ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 483 - 490
  • [8] MPEG-4 low delay general audio coding
    Sporer, T
    Grill, B
    Herre, J
    VOICE OVER IP (VOIP) TECHNOLOGY, 2001, 4522 : 109 - 118
  • [9] HILN - The MPEG-4 parametric audio coding tools
    Purnhagen, H
    Meine, N
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 201 - 204
  • [10] Enhancement of MPEG-4 ALS lossless audio coding
    NTT Communication Science Laboratories, Atsugi-shi, 243-0198, Japan
    不详
    NTT Tech. Rev., 2007, 12