A pattern classification proposal for object-oriented audio coding in MPEG-4

Cited by: 4
Authors
Beritelli, F [1]
Casale, S [1]
Russo, M [1]
Affiliation
[1] Univ Catania, Ist Informat & Telecomun, I-95125 Catania, Italy
DOI
10.1023/A:1019112310453
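Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology];
Discipline classification code
0809;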
Abstract
The future MPEG-4 standard will adopt an object-oriented encoding strategy whereby an audio source is encoded at a very low bit-rate by adapting a suitable coding scheme to the local characteristics of the signal. One of the most delicate issues in this approach is that the overall performance of the audio encoder greatly depends on the accuracy with which the input signal is classified. This paper shows that the difficult problem of audio classification for object-oriented coding can be effectively solved by selecting a salient set of acoustic parameters and adopting a fuzzy model for each audio object, obtained by a soft computing-hybrid learning tool. The audio classifier proposed operates at two levels: recognition of the class to which the input signal belongs (talkspurt, music, noise, signaling tones) and then recognition of the subclass to which it belongs. The results obtained show that fuzzy logic is a valid alternative to the matching techniques of a traditional pattern recognition approach.
Pages: 375-391
Page count: 17
Related papers
50 in total
  • [21] Efficient bit allocation algorithm for MPEG-4 advanced audio coding
    Yang, Cheng-Han
    Hang, Hsueh-Ming
    2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 2119 - +
  • [22] Object-based rate control for MPEG-4 video object coding
    Chen, ZZ
    Ngan, KN
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 3, PROCEEDINGS, 2004, : 973 - 976
  • [23] VLSI implementation for portable application oriented MPEG-4 audio codec
    Liu, Peilin
    Liu, Lingzhi
    Deng, Ning
    Fu, Xuan
    Liu, Jiayan
    Liu, Qianru
    Zhang, Guocheng
    He, Bin
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 777 - +
  • [24] Speech coding in MPEG-4
    Edler, Bernd
    International Journal of Speech Technology, 1999, 2 (04): : 289 - 303
  • [25] Speech coding in MPEG-4
    Edler B.
    International Journal of Speech Technology, 1999, 2 (4) : 289 - 303
  • [26] The MPEG-4 structured audio standard
    Scheirer, ED
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3801 - 3804
  • [27] Synthetic and SNHC audio in MPEG-4
    Scheirer, ED
    Lee, YJ
    Wang, JW
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2000, 15 (4-5) : 445 - 461
  • [28] Technologies and functions of MPEG-4 audio
    Moriya, Takehiro
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2001, 55 (12):
  • [29] Object-based texture coding of moving video in MPEG-4
    Kaup, A
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (01) : 5 - 15
  • [30] Automatic Object Segmentation Algorithms for Sprite Coding using MPEG-4
    Krutz, Andreas
    Glantz, Alexander
    Sikora, Thomas
    Nunes, Paulo
    Pereira, Fernando
    PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 459 - +