An efficient algorithm for clustering short spoken utterances

被引：0

作者：

Liu, Z ^{[1
]}

机构：

[1] AT&T Labs Res, Middletown, NJ 07748 USA

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Nowadays, spoken dialogue systems which provide automated customer service at call centers become more prevalent. It is time consuming to determine a set of call types for the dialogue system by analyzing a large volume of unstructured spoken utterances. Traditional hierarchical agglomerative clustering (HAC) algorithm can bootstrap the call types in an unsupervised way, yet the time and space complexities are huge, especially for large data set. Based on our observation that spoken utterances containing less than ten terms are common in the spoken dialogue system, we proposed an efficient HAC algorithm for short utterances. By utilizing the particular properties of short utterances, we significantly reduced both the time and the space complexities of the clustering, algorithm.

引用

页码：593 / 596

页数：4

共 50 条

[41] CLUSTERING OF DISFLUENCY IN NONSTUTTERING CHILDRENS EARLY UTTERANCES
COLBURN, N
JOURNAL OF FLUENCY DISORDERS, 1985, 10 (01) : 51 - 58
[42] A robust unsupervised speaker clustering of speech utterances
Zhang, SL
Zhang, SW
Xu, B
PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 115 - 120
[43] Entropy-based Recognition of Anomalous Answers for Efficient Grading of Short Answers with an Evolutionary Clustering Algorithm
Lui, Andrew Kwok-Fai
Ng, Sin-Chun
Cheung, Stella Wing-Nga
2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 3091 - 3098
[44] An Efficient Clustering Algorithm for Multivariate Time Series
Zhou, Da-Zhuo
Zhang, Bo
EBM 2010: INTERNATIONAL CONFERENCE ON ENGINEERING AND BUSINESS MANAGEMENT, VOLS 1-8, 2010, : 5190 - 5193
[45] Cure: An efficient clustering algorithm for large databases
Guha, S
Rastogi, R
Shim, K
INFORMATION SYSTEMS, 2001, 26 (01) : 35 - 58
[46] An Efficient Clustering Algorithm for k-Anonymisation
Grigorios Loukides
Jian-Hua Shao
Journal of Computer Science and Technology, 2008, 23 : 188 - 202
[47] An efficient incremental protein sequence clustering algorithm
Vijaya, PA
Murty, MN
Subramanian, DK
IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 409 - 413
[48] Efficient pose clustering using a randomized algorithm
Olson, CF
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1997, 23 (02) : 131 - 147
[49] An efficient algorithm for clustering search engine results
Zhang, Hui
Pang, Bin
Xie, Ke
Wu, Hui
COMPUTATIONAL INTELLIGENCE AND SECURITY, 2007, 4456 : 661 - 671
[50] Squeezer: An efficient algorithm for clustering categorical data
Zengyou He
Xiaofei Xu
Shengchun Deng
Journal of Computer Science and Technology, 2002, 17 : 611 - 624

← 1 2 3 4 5 →