Text Classification from Labeled and Unlabeled Documents using EM

被引:114
作者
Kamal Nigam
Andrew Kachites Mccallum
Sebastian Thrun
Tom Mitchell
机构
[1] Carnegie Mellon University,School of Computer Science
[2] Just Research,School of Computer Science
[3] Carnegie Mellon University,School of Computer Science
[4] Carnegie Mellon University,School of Computer Science
[5] Carnegie Mellon University,undefined
来源
Machine Learning | 2000年 / 39卷
关键词
text classification; Expectation-Maximization; integrating supervised and unsupervised learning; combining labeled and unlabeled data; Bayesian learning;
D O I
暂无
中图分类号
学科分类号
摘要
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. This is important because in many text classification problems obtaining training labels is expensive, while large quantities of unlabeled documents are readily available.
引用
收藏
页码:103 / 134
页数:31
相关论文
共 23 条
[1]  
Castelli V.(1995)On the exponential value of labeled samples Pattern Recognition Letters 16 105-111
[2]  
Cover T.M.(1977)Maximum likelihood from incomplete data via the EM algorithm Journal of the Royal Statistical Society 39 1-38
[3]  
Dempster A.P.(1998)Approximate statistical tests for comparing supervised classification learning algorithms Neural Computation 10 1895-1923
[4]  
Laird N.M.(1997)On the optimality of the simple Bayesian classifier under zero-one loss Machine Learning 29 103-130
[5]  
Rubin D.B.(1997)On bias, variance, 0/1-loss, and the curse-of-dimensionality Data Mining and Knowledge Discovery 1 55-77
[6]  
Dietterich T.G.(1994)Supervised learning from incomplete data via an EM approach Advances in neural information processing systems 6 120-127
[7]  
Domingos P.(1995)A sequential algorithm for training text classifiers: Corrigendum and additional data SIGIR Forum 29 13-19
[8]  
Pazzani M.(1997)Threading electronic mail: A preliminary study Information Processing and Management 33 209-217
[9]  
Friedman J.H.(1997)A mixture of experts classifier with learning based on both labelled and unlabelled data Advances in Neural Information Processing Systems 9 571-577
[10]  
Ghahramani Z.(1983)A universal prior for integers and estimation by minimum description length Annals of Statistics 11 416-431