Semisupervised learning of hierarchical latent trait models for data visualization

被引:4
|
作者
Nabney, IT [1 ]
Sun, Y
Tino, P
Kabán, A
机构
[1] Aston Univ, Neural Comp Res Grp, Birmingham B4 7ET, W Midlands, England
[2] Univ Hertfordshire, Sch Comp Sci, Hatfield AL10 9AB, Herts, England
[3] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
基金
英国生物技术与生命科学研究理事会;
关键词
hierarchical model; latent trait model; magnification factors; data visualization; document mining;
D O I
10.1109/TKDE.2005.49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, we have developed the hierarchical Generative Topographic Mapping (HGTM), an interactive method for visualization of large high-dimensional real-valued data sets. In this paper, we propose a more general visualization system by extending HGTM in three ways, which allows the user to visualize a wider range of data sets and better support the model development process. 1) We integrate HGTM with noise models from the exponential family of distributions. The basic building block is the Latent Trait Model (LTM). This enables us to visualize data of inherently discrete nature, e. g., collections of documents, in a hierarchical manner. 2) We give the user a choice of initializing the child plots of the current plot in either interactive, or automatic mode. In the interactive mode, the user selects "regions of interest," whereas in the automatic mode, an unsupervised minimum message length (MML)-inspired construction of a mixture of LTMs is employed. The unsupervised construction is particularly useful when high-level plots are covered with dense clusters of highly overlapping data projections, making it difficult to use the interactive mode. Such a situation often arises when visualizing large data sets. 3) We derive general formulas for magnification factors in latent trait models. Magnification factors are a useful tool to improve our understanding of the visualization plots, since they can highlight the boundaries between data clusters. We illustrate our approach on a toy example and evaluate it on three more complex real data sets.
引用
收藏
页码:384 / 400
页数:17
相关论文
共 50 条
  • [41] On the Visualization of Hierarchical Multivariate Data
    Zheng, Boyan
    Sadlo, Filip
    2021 IEEE 14TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2021), 2021, : 136 - 145
  • [42] Dynamic visualization of hierarchical data
    Senay, H
    Saltz, JS
    HUMAN VISION AND ELECTRONIC IMAGING II, 1997, 3016 : 451 - 458
  • [43] Identification of Nonlinear Latent Hierarchical Models
    Kong, Lingjing
    Huang, Biwei
    Xie, Feng
    Xing, Eric
    Chi, Yuejie
    Zhang, Kun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [44] LOGISTIC LATENT TRAIT MODELS WITH LINEAR CONSTRAINTS
    FISCHER, GH
    PSYCHOMETRIKA, 1983, 48 (01) : 3 - 26
  • [45] THE UTILITY OF LATENT TRAIT MODELS IN PSYCHIATRIC EPIDEMIOLOGY
    DUNCANJONES, P
    GRAYSON, DA
    MORAN, PAP
    PSYCHOLOGICAL MEDICINE, 1986, 16 (02) : 391 - 405
  • [46] PARAMETER-ESTIMATION IN LATENT TRAIT MODELS
    RIGDON, SE
    TSUTAKAWA, RK
    PSYCHOMETRIKA, 1983, 48 (04) : 567 - 574
  • [47] A PROCEDURE FOR COMPARING LOGISTIC LATENT TRAIT MODELS
    WALLER, MI
    JOURNAL OF EDUCATIONAL MEASUREMENT, 1981, 18 (02) : 119 - 125
  • [48] IDENTIFIABILITY OF HIERARCHICAL LATENT ATTRIBUTE MODELS
    Gu, Yuqi
    Xu, Gongjun
    STATISTICA SINICA, 2023, 33 (04) : 2561 - 2591
  • [49] Hierarchical marginal models with latent uncertainty
    Colombi, Roberto
    Giordano, Sabrina
    Gottard, Anna
    Iannario, Maria
    SCANDINAVIAN JOURNAL OF STATISTICS, 2019, 46 (02) : 595 - 620
  • [50] PARAMETER-ESTIMATION IN LATENT TRAIT MODELS
    RIGDON, SE
    TSUTAKAWA, RK
    BIOMETRICS, 1982, 38 (04) : 1120 - 1120