Specification of audio representations in audio-related standards: Three audio representations: channel-based, object-based, and scene-based

被引:0
|
作者
Sugimoto, Takehiro [1 ]
机构
[1] NHK Japan Broadcasting Corp, Sci & Technol Res Labs, 1-10-11 Kinuta,Setagaya Ku, Tokyo 1578510, Japan
关键词
Audio representation; Loudspeaker layout; Audio-related standard; ITU-R; MPEG;
D O I
10.1250/ast.e24.65
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Currently, there are three mainstream audio representations, namely channel-based audio, object-based audio, and scene-based audio. The features of content expression differ among these audio representations, the details of which have been specified in the International Telecommunication Union: Radiocommunication Sector (ITU-R) Recommendations. The effective use of these audio representations in accordance with what is to be expressed in the content requires a deep understanding of the technical specifications and capabilities of the audio representations. This review first traces the evolution of loudspeaker layouts developed in recent years, i.e., a history of multichannelization, which is indispensable for the understanding of audio representations. Then, the position of each audio representation among various audio-related standards is described and the method of adopting and implementing each audio representation in other audio-related standards is reviewed using the Moving Picture Experts Group (MPEG) standards as examples.
引用
收藏
页码:311 / 319
页数:9
相关论文
共 50 条
  • [21] A SOURCE SEPARATION EVALUATION METHOD IN OBJECT-BASED SPATIAL AUDIO
    Liu, Qingju
    Wang, Wenwu
    Jackson, Philip J. B.
    Cox, Trevor J.
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1088 - 1092
  • [22] Gestural Interactions with Object-Based Audio in an Internet of Sounds Ecosystem
    Mikolajczyk, Kurt
    Trolland, Sam
    Ilsar, Alon
    McCormack, Jon
    Ferguson, Sam
    Bown, Oliver
    2023 4TH INTERNATIONAL SYMPOSIUM ON THE INTERNET OF SOUNDS, 2023, : 324 - 332
  • [23] Correction to: Exploring object-based content adaptation for mobile audio
    Tim Walton
    Michael Evans
    David Kirk
    Frank Melchior
    Personal and Ubiquitous Computing, 2018, 22 : 721 - 721
  • [24] Personalized Object-Based Audio for Hearing Impaired TV Viewers
    Shirley, Ben
    Meadows, Melissa
    Malak, Fadi
    Woodcock, James
    Tidball, Ash
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): : 293 - 303
  • [25] Assessing Spatial Audio: A Listener-Centric Case Study on Object-Based and Ambisonic Audio Processing
    Malecki, Pawel
    Stefanska, Joanna
    Szydlowska, Maja
    ARCHIVES OF ACOUSTICS, 2024, 49 (03) : 331 - 343
  • [26] AUDIO EVENT DETECTION BASED ON LAYERED SYMBOLIC SEQUENCE REPRESENTATIONS
    Chin, Michele Lai
    Burred, Juan Jose
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1953 - 1956
  • [27] GUEST EDITORS' NOTE Special Issue on Object-Based Audio
    Davies, William J.
    Spors, Sascha
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2019, 67 (7-8): : 484 - 485
  • [28] Object-Based Benefits Without Object-Based Representations
    Fougnie, Daryl
    Cormiea, Sarah M.
    Alvarez, George A.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2013, 142 (03) : 621 - 626
  • [29] Algorithms for multiplex scheduling of object-based audio-visual presentations
    Kalva, H
    Eleftheriadis, A
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (12) : 1283 - 1293
  • [30] Object-based and image-based object representations
    Samet, Hanan
    ACM Comput Surv, 1600, 2 (159-217):