A pattern classification proposal for object-oriented audio coding in MPEG-4

Cited by: 4
Authors
Beritelli, F [1]
Casale, S [1]
Russo, M [1]
Affiliations
[1] Univ Catania, Ist Informat & Telecomun, I-95125 Catania, Italy
DOI
10.1023/A:1019112310453
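Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract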
The future MPEG-4 standard will adopt an object-oriented encoding strategy whereby an audio source is encoded at a very low bit-rate by adapting a suitable coding scheme to the local characteristics of the signal. One of the most delicate issues in this approach is that the overall performance of the audio encoder greatly depends on the accuracy with which the input signal is classified. This paper shows that the difficult problem of audio classification for object-oriented coding can be effectively solved by selecting a salient set of acoustic parameters and adopting a fuzzy model for each audio object, obtained by a soft computing-hybrid learning tool. The audio classifier proposed operates at two levels: recognition of the class to which the input signal belongs (talkspurt, music, noise, signaling tones) and then recognition of the subclass to which it belongs. The results obtained show that fuzzy logic is a valid alternative to the matching techniques of a traditional pattern recognition approach.
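The two-level scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' classifier: the feature names (`zcr`, `flatness`, `energy_var`), the triangular membership parameters, and the subclass labels are all hypothetical, and the hand-written fuzzy models stand in for the ones the paper obtains with a hybrid learning tool. Only the four top-level class names come from the abstract.

```python
# Illustrative sketch only: features, membership parameters, and subclass
# labels are hypothetical and are not taken from the paper.

def tri(x, a, b, c):
    """Triangular membership: 0 outside (a, c), rising to 1 at the peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)

# Level 1: one fuzzy model per audio class (feature -> (a, b, c)).
CLASS_MODELS = {
    "talkspurt": {"zcr": (0.1, 0.3, 0.6), "flatness": (0.1, 0.3, 0.6)},
    "music": {"zcr": (0.0, 0.1, 0.3), "flatness": (0.0, 0.1, 0.3)},
    "noise": {"zcr": (0.4, 0.7, 1.0), "flatness": (0.5, 0.8, 1.0)},
    "signaling tones": {"zcr": (0.0, 0.05, 0.15), "flatness": (0.0, 0.02, 0.1)},
}

# Level 2: fuzzy models for the subclasses of a class (hypothetical labels).
SUBCLASS_MODELS = {
    "talkspurt": {
        "voiced": {"zcr": (0.0, 0.2, 0.45)},
        "unvoiced": {"zcr": (0.3, 0.6, 1.0)},
    },
    "noise": {
        "stationary": {"energy_var": (-0.1, 0.0, 0.4)},
        "non_stationary": {"energy_var": (0.2, 0.7, 1.1)},
    },
}

def score(model, features):
    """Degree of match: fuzzy AND (minimum) over per-feature memberships."""
    return min(tri(features[name], *abc) for name, abc in model.items())

def best_match(models, features):
    """Label whose fuzzy model yields the highest degree of match."""
    return max(models, key=lambda label: score(models[label], features))

def classify(features):
    """Recognize the class first, then the subclass within that class."""
    cls = best_match(CLASS_MODELS, features)
    sub_models = SUBCLASS_MODELS.get(cls)
    sub = best_match(sub_models, features) if sub_models else None
    return cls, sub
```

Calling `classify` with a dictionary of the three normalized features returns a `(class, subclass)` pair, with `None` as the subclass when no second-level models are defined for the recognized class.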
Pages: 375-391
Number of pages: 17