Using local & global phonotactic features in Chinese dialect identification

被引:0
|
作者
Lim, BP [1 ]
Li, HZ [1 ]
Ma, B [1 ]
机构
[1] Inst Infocomm Res, Singapore, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional techniques for spoken language identification use variants of phone similarity and language model scoring, which represent local phonetic constraints in spoken language. In this paper, we explore the identification of Chinese dialects which share the same written script and have similar sound systems and syllable structures. As such, local phonetic constraints do not provide enough discriminative information among the dialects. We propose to use Latent Semantic Analysis (LSA) to extract global features that represent the high-order statistics in the co-occurrence of sounds. The experiments show that we can achieve the best performance by combining acoustic, n-gram language modeling and LSA scores. An accuracy of 99.23% is achieved in 4-way classification tests using 20-second speech sessions.
引用
收藏
页码:577 / 580
页数:4
相关论文
共 50 条
  • [1] American Dialect Identification using Phonotactic and Prosodic Features
    Etman, A.
    Beex, A. A.
    2015 SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2015, : 963 - 970
  • [2] Chinese dialect identification using segmental and prosodic features
    Chang, WW
    Tsai, WH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (04): : 1906 - 1913
  • [3] Chinese dialect identification using tone features based on pitch flux
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1029 - 1032
  • [4] CHIMPANZEE IDENTIFICATION USING GLOBAL AND LOCAL FEATURES
    Loos, Alexander
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 2347 - 2351
  • [5] Face Identification Using Local and Global Features
    Ammour, Basma
    Bouden, Toufik
    Boubchir, Larbi
    Biad, Souad
    2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 784 - 788
  • [6] Exploiting convolutional neural networks for phonotactic based dialect identification
    Najafian, Maryam
    Khurana, Sameer
    Shon, Suwon
    Ali, Ahmed
    Glass, James
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5174 - 5178
  • [7] Construction of a Phonotactic Dialect Corpus using Semiautomatic Annotation
    Schwartz, Reva
    Shen, Wade
    Campbell, Joseph
    Paget, Shelley
    Vonwiller, Julie
    Estival, Dominique
    Cieri, Christopher
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2472 - +
  • [8] Routine identification of melanoma disease using global and local features
    Kavya, S. P.
    Saranya, S.
    2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2016,
  • [9] Neural network classifiers for language identification using phonotactic and prosodic features
    Mary, L
    Rao, KS
    Yegnanarayana, B
    2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 404 - 408
  • [10] Chinese Dialect Identification Using SC-GMM
    Gu, Ming-liang
    Zhang, Biao
    MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 3292 - +