A pattern classification proposal for object-oriented audio coding in MPEG-4

被引:4
|
作者
Beritelli, F [1 ]
Casale, S [1 ]
Russo, M [1 ]
机构
[1] Univ Catania, Ist Informat & Telecomun, I-95125 Catania, Italy
关键词
D O I
10.1023/A:1019112310453
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The future MPEG-4 standard will adopt an object-oriented encoding strategy whereby an audio source is encoded at a very low bit-rate by adapting a suitable coding scheme to the local characteristics of the signal. One of the most delicate issues in this approach is that the overall performance of the audio encoder greatly depends on the accuracy with which the input signal is classified. This paper shows that the difficult problem of audio classification for object-oriented coding can be effectively solved by selecting a salient set of acoustic parameters and adopting a fuzzy model for each audio object, obtained by a soft computing-hybrid learning tool. The audio classifier proposed operates at two levels: recognition of the class to which the input signal belongs (talkspurt, music, noise, signaling tones) and then recognition of the subclass to which it belongs. The results obtained show that fuzzy logic is a valid alternative to the matching techniques of a traditional pattern recognition approach.
引用
收藏
页码:375 / 391
页数:17
相关论文
共 50 条
  • [31] Cascaded RLS-LMS prediction in MPEG-4 lossless audio coding
    Huang, H.
    Rahardja, S.
    Lin, X.
    Yu, R.
    Franti, P.
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5039 - 5042
  • [32] Cascaded RLS-LMS prediction in MPEG-4 Lossless Audio Coding
    Huang, Haibin
    Franti, Pasi
    Huang, Dongyan
    Rahardja, Susanto
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 554 - 562
  • [33] An Adaptive And Efficient Bit Allocation Scheme For MPEG-4 Advanced Audio Coding
    Shu Ruo
    Wu Lenan
    2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, PROCEEDINGS, 2009, : 312 - 315
  • [34] A Study of Using Least Squares Method in MPEG-4 Audio Lossless Coding
    You, Shingchern D.
    Wang, Chau-Jia
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [35] An object-oriented schema for querying audio
    Martinez, J
    Lutfi, R
    Gelgon, M
    OBJECT-ORIENTED INFORMATION SYSTEMS, PROCEEDINGS, 2002, 2425 : 76 - 81
  • [36] MPEG-4 synthetic image coding
    NTT Human Interface Laboratories, Yokosuka, Japan
    Kyokai Joho Imeji Zasshi, 12 (1986-1988):
  • [37] A low-complexity joint-coding method for mpeg-4 audio lossless coding encoders
    Cho, Choong Sang
    Kim, Je Woo
    Choi, Byeong Ho
    Kim, Dong Sun
    ICIC Express Letters, 2012, 6 (07): : 1713 - 1719
  • [38] Error concealment for shape in MPEG-4 object-based video coding
    Huang, C
    Salama, P
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (04) : 389 - 396
  • [39] Automatic MPEG-4 sprite coding—Comparison of integrated object segmentation algorithms
    Alexander Glantz
    Andreas Krutz
    Thomas Sikora
    Paulo Nunes
    Fernando Pereira
    Multimedia Tools and Applications, 2010, 49 : 483 - 512
  • [40] MPEG-4 coding of ultrasound sequences
    Lau, C
    Cabral, JE
    Rambhia, AH
    Kim, Y
    MEDICAL IMAGING 2000: IMAGE DISPLAY AND VISUALIZATION, 2000, 3976 : 573 - 579