An efficient algorithm for clustering short spoken utterances

Cited by: 0
Authors
Liu, Z [1]
Affiliation
[1] AT&T Labs Res, Middletown, NJ 07748 USA
Source
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Spoken dialogue systems that provide automated customer service at call centers are becoming more prevalent. Determining a set of call types for such a dialogue system by analyzing a large volume of unstructured spoken utterances is time consuming. The traditional hierarchical agglomerative clustering (HAC) algorithm can bootstrap the call types in an unsupervised way, yet its time and space complexities are huge, especially for large data sets. Based on our observation that spoken utterances containing fewer than ten terms are common in spoken dialogue systems, we proposed an efficient HAC algorithm for short utterances. By exploiting the particular properties of short utterances, we significantly reduced both the time and the space complexities of the clustering algorithm.
Pages: 593-596
Page count: 4
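For context on the baseline that the abstract calls expensive, the following is a minimal sketch of a naive HAC over short utterances. It is not the algorithm proposed in the paper, whose details the abstract does not give. The function names, the Jaccard distance on term sets, the average-linkage rule, and the stopping threshold are all assumptions chosen for illustration. The sketch stores every pairwise distance and rescans all cluster pairs before each merge, which is where the quadratic space and high time cost come from.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Distance between two term sets: 1 - |a & b| / |a | b|."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def naive_hac(utterances, threshold=0.7):
    """Baseline average-linkage HAC (illustrative, not the paper's method).

    Returns clusters as sorted lists of utterance indices.
    """
    # Represent each utterance as a set of lowercase whitespace-split terms.
    term_sets = [frozenset(u.lower().split()) for u in utterances]
    n = len(term_sets)

    # Full pairwise distance table: O(n^2) space.
    dist = {
        (i, j): jaccard_distance(term_sets[i], term_sets[j])
        for i, j in combinations(range(n), 2)
    }
    clusters = [[i] for i in range(n)]

    def linkage(c1, c2):
        # Average distance over all cross-cluster pairs.
        total = sum(dist[(min(i, j), max(i, j))] for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        # Scan every pair of clusters to find the closest one.
        (p, q), best = min(
            (
                ((p, q), linkage(clusters[p], clusters[q]))
                for p, q in combinations(range(len(clusters)), 2)
            ),
            key=lambda item: item[1],
        )
        if best > threshold:
            break  # Remaining clusters are too dissimilar to merge.
        merged = sorted(clusters[p] + clusters[q])
        clusters = [c for k, c in enumerate(clusters) if k not in (p, q)]
        clusters.append(merged)
    return sorted(clusters)
```

Calling `naive_hac(["pay my bill", "pay bill", "check my balance", "check balance"], threshold=0.5)` groups the two billing utterances and the two balance utterances into separate clusters. The example utterances are made up for illustration and do not come from the paper's data.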