Byte Pair Encoding for Symbolic Music

被引:0
|
作者
Fradet, Nathan [1 ,2 ]
Gutowski, Nicolas [3 ]
Chhel, Fabien [3 ,4 ]
Briot, Jean-Pierre [1 ]
机构
[1] Sorbonne Univ, CNRS, LIP6, F-75005 Paris, France
[2] Aubay, Boulogne, France
[3] Univ Angers, LERIA, F-49000 Angers, France
[4] ESEO, ERIS, F-49100 Angers, France
来源
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When used with deep learning, the symbolic music modality is often coupled with language model architectures. To do so, the music needs to be tokenized, i.e. converted into a sequence of discrete tokens. This can be achieved by different approaches, as music can be composed of simultaneous tracks, of simultaneous notes with several attributes. Until now, the proposed tokenizations rely on small vocabularies of tokens describing the note attributes and time events, resulting in fairly long token sequences, and a sub-optimal use of the embedding space of language models. Recent research has put efforts on reducing the overall sequence length by merging embeddings or combining tokens. In this paper, we show that Byte Pair Encoding, a compression technique widely used for natural language, significantly decreases the sequence length while increasing the vocabulary size. By doing so, we leverage the embedding capabilities of such models with more expressive tokens, resulting in both better results and faster inference in generation and classification tasks. The source code is shared on Github(1), along with a companion website(2). Finally, BPE is directly implemented in MidiTok(3), allowing the reader to easily benefit from this method.
引用
收藏
页码:2001 / 2020
页数:20
相关论文
共 50 条
  • [31] Symbolic music representation in MPEG
    Bellini, P
    Nesi, P
    Zoia, G
    IEEE MULTIMEDIA, 2005, 12 (04) : 42 - 49
  • [32] Transforming Boolean relations by symbolic encoding
    Cabodi, G
    Quer, S
    Camurati, P
    CORRECT HARDWARE DESIGN AND VERIFICATION METHODS, 1995, 987 : 161 - 170
  • [33] Sub-symbolic encoding of words
    Vassallo, G
    Pilato, G
    Maggio, A
    Puglisi, A
    Gaglio, S
    AI(ASTERISK)IA 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2829 : 449 - 461
  • [34] Right and left: a symbolic pair (1981)
    Fachinelli, Elvio
    AUT AUT, 2011, (352): : 27 - 31
  • [35] PERCEPTUAL ENCODING OF EMOTIONAL SYMBOLIC GESTURES
    Haecker, Frank E. K.
    Flaisch, Tobias
    Renner, Britta
    Schupp, Harald T.
    PSYCHOPHYSIOLOGY, 2009, 46 : S133 - S133
  • [36] PathWalker: A Dynamic Symbolic Execution Tool Based on LLVM Byte Code Instrumentation
    Zhang Jun-xian
    Li Zhou-jun
    Zheng Xian-chen
    DEPENDABLE SOFTWARE ENGINEERING: THEORIES, TOOLS, AND APPLICATIONS, SETTA 2015, 2015, 9409 : 227 - 242
  • [37] Symbolic violence in rock music and technocumbia
    Moncada Landeta, Raul
    CHASQUI-REVISTA LATINOAMERICANA DE COMUNICACION, 2013, (123): : 82 - 90
  • [38] A Viewpoint Approach to Symbolic Music Transformation
    Bigo, Louis
    Conklin, Darrell
    MUSIC, MIND, AND EMBODIMENT, 2016, 9617 : 213 - 227
  • [39] Implementing a symbolic music processing system
    Filgueiras, M
    SOFTWARE-PRACTICE & EXPERIENCE, 1998, 28 (05): : 493 - 512
  • [40] Symbolic Music Genre Transfer with CycleGAN
    Brunner, Gino
    Wang, Yuyi
    Wattenhofer, Roger
    Zhao, Sumu
    2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 786 - 793