Specification of audio representations in audio-related standards: Three audio representations: channel-based, object-based, and scene-based

被引:0
|
作者
Sugimoto, Takehiro [1 ]
机构
[1] NHK Japan Broadcasting Corp, Sci & Technol Res Labs, 1-10-11 Kinuta,Setagaya Ku, Tokyo 1578510, Japan
关键词
Audio representation; Loudspeaker layout; Audio-related standard; ITU-R; MPEG;
D O I
10.1250/ast.e24.65
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Currently, there are three mainstream audio representations, namely channel-based audio, object-based audio, and scene-based audio. The features of content expression differ among these audio representations, the details of which have been specified in the International Telecommunication Union: Radiocommunication Sector (ITU-R) Recommendations. The effective use of these audio representations in accordance with what is to be expressed in the content requires a deep understanding of the technical specifications and capabilities of the audio representations. This review first traces the evolution of loudspeaker layouts developed in recent years, i.e., a history of multichannelization, which is indispensable for the understanding of audio representations. Then, the position of each audio representation among various audio-related standards is described and the method of adopting and implementing each audio representation in other audio-related standards is reviewed using the Moving Picture Experts Group (MPEG) standards as examples.
引用
收藏
页码:311 / 319
页数:9
相关论文
共 50 条
  • [31] Perceptual Evaluation of Blind Source Separation in Object-Based Audio Production
    Coleman, Philip
    Liu, Qingju
    Francombe, Jon
    Jackson, Philip J. B.
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 558 - 567
  • [32] Object-based audio streaming over error-prone channels
    Marks, SK
    Gonzalez, R
    2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 261 - 264
  • [33] Algorithmic Spatialization Using Object-Based Audio and Indoor Positioning System
    Fan, Yuan-Yi
    LEONARDO MUSIC JOURNAL, 2019, 29 : 25 - 30
  • [34] Determination and Validation of Mix Parameters for Modifying Envelopment in Object-Based Audio
    Francombe, Jon
    Brookes, Tim
    Mason, Russell
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2018, 66 (03): : 127 - 145
  • [35] Audio classification based on MPEG-7 spectral basis representations
    Kim, HG
    Moreau, N
    Sikora, T
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (05) : 716 - 725
  • [36] PERCEPTUAL LOUDNESS COMPENSATION IN INTERACTIVE OBJECT-BASED AUDIO CODING SYSTEMS
    Paulus, Jouni
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 579 - 583
  • [37] Object-based and image-based object representations
    Samet, H
    ACM COMPUTING SURVEYS, 2004, 36 (02) : 159 - 217
  • [38] To catch a chorus: Using chroma-based representations for audio thumbnailing
    Bartsch, MA
    Wakefield, GH
    PROCEEDINGS OF THE 2001 IEEE WORKSHOP ON THE APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2001, : 15 - 18
  • [39] CONTENT-BASED REPRESENTATIONS OF AUDIO USING SIAMESE NEURAL NETWORKS
    Manocha, Pranay
    Badlani, Rohan
    Kumar, Anurag
    Shah, Ankit
    Elizalde, Benjamin
    Raj, Bhiksha
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 3136 - 3140
  • [40] A Neural Network Based Framework for Audio Scene Analysis in Audio Sensor Networks
    Li, Qi
    Ma, Huadong
    Zhao, Dong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2009, 2009, 5879 : 480 - 490