Recognition of the Main Melody in a Polyphonic Symbolic Score using Perceptual Knowledge

被引:5
|
作者
Friberg, Anders [1 ]
Ahlback, Sven [1 ]
机构
[1] KTH, S-10044 Stockholm, Sweden
基金
瑞典研究理事会;
关键词
PERFORMANCE;
D O I
10.1080/09298210903215900
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
It is in many cases easy for a human to identify the main melodic theme when listening to a music example. Melodic properties have been studied in several research projects, however, the differences between properties of the melody and properties of the accompaniment (non-melodic) voices have not been addressed until recently. A set of features relating to basic low-level statistical measures were selected considering general perceptual aspects. A new 'narrative' measure was designed intended to capture the amount of new unique material in each voice. The features were applied to a set of scores consisting of about 250 polyphonic ringtones consisting of MIDI versions of contemporary pop songs. All tracks were annotated into categories such as melody and accompaniment. Both multiple regression and support vector machines were applied on either the features directly or on a Gaussian transformation of the features. The resulting models predicted the correct melody in about 90% of the cases using a set of eight features. The results emphasize context as an important factor for determining the main melody. A previous version of the system has been used in a commercial system for modifying ring tones.
引用
收藏
页码:155 / 169
页数:15
相关论文
共 50 条
  • [21] Research on Vocal Information Processing using a Main Melody Extraction Algorithm
    Liu, Shengnan
    Wang, Xu
    IEIE Transactions on Smart Processing and Computing, 2024, 13 (04): : 322 - 327
  • [22] SKIER: A Symbolic Knowledge Integrated Model for Conversational Emotion Recognition
    Li, Wei
    Zhu, Luyao
    Mao, Rui
    Cambria, Erik
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13121 - 13129
  • [23] Improving symbolic data visualization for pattern recognition and knowledge discovery
    Umbleja, Kadri
    Ichino, Manabu
    Yaguchi, Hiroyuki
    VISUAL INFORMATICS, 2020, 4 (01) : 23 - 31
  • [24] Face recognition using symbolic KDA in the framework of symbolic data analysis
    Hiremath, P. S.
    Prabhakar, C. J.
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 56 - +
  • [25] Incremental Polyphonic Audio to Score Alignment using Beat Tracking for Singer Robots
    Otsuka, Takuma
    Murata, Kazumasa
    Nakadai, Kazuhiro
    Takahashi, Toni
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 2289 - 2296
  • [26] Predominant instrument recognition from polyphonic music using feature fusion
    Ajayakumar, Roshni
    Rajan, Rajeev
    EMERGING TRENDS IN ENGINEERING, SCIENCE AND TECHNOLOGY FOR SOCIETY, ENERGY AND ENVIRONMENT, 2018, : 721 - 726
  • [27] PROBABILISTIC MODEL FOR MAIN MELODY EXTRACTION USING CONSTANT-Q TRANSFORM
    Fuentes, Benoit
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gael
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5357 - 5360
  • [28] RECOGNITION OF HARMONIC SOUNDS IN POLYPHONIC AUDIO USING A MISSING FEATURE APPROACH
    Giannoulis, Dimitrios
    Klapuri, Anssi
    Plumbley, Mark D.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8658 - 8662
  • [29] Musical Instrument Recognition in Polyphonic Audio Using Missing Feature Approach
    Giannoulis, Dimitrios
    Klapuri, Anssi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1805 - 1817
  • [30] Main Melody Extraction Using the Auditory Scene Analysis for the Humming Music Retrieval
    Kong Chenchen
    Yu Yibiao
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 27 - 31