Confidence estimation for t-SNE embeddings using random forest

被引:0
|
作者
Busra Ozgode Yigin
Gorkem Saygili
机构
[1] Tilburg University,Cognitive Sciences and Artificial Intelligence, Tilburg School of Humanities and Digital Sciences
来源
International Journal of Machine Learning and Cybernetics | 2022年 / 13卷
关键词
T-SNE; Confidence score; Embedding; Dimensionality reduction; Random forest;
D O I
暂无
中图分类号
学科分类号
摘要
Dimensionality reduction algorithms are commonly used for reducing the dimension of multi-dimensional data to visualize them on a standard display. Although many dimensionality reduction algorithms such as the t-distributed Stochastic Neighborhood Embedding aim to preserve close neighborhoods in low-dimensional space, they might not accomplish that for every sample of the data and eventually produce erroneous representations. In this study, we developed a supervised confidence estimation algorithm for detecting erroneous samples in embeddings. Our algorithm generates a confidence score for each sample in an embedding based on a distance-oriented score and a random forest regressor. We evaluate its performance on both intra- and inter-domain data and compare it with the neighborhood preservation ratio as our baseline. Our results showed that the resulting confidence score provides distinctive information about the correctness of any sample in an embedding compared to the baseline. The source code is available at https://github.com/gsaygili/dimred.
引用
收藏
页码:3981 / 3992
页数:11
相关论文
共 50 条
  • [31] Geochemical characterisation of rock hydration processes using t-SNE
    Horrocks, Tom
    Holden, Eun-Jung
    Wedge, Daniel
    Wijns, Chris
    Fiorentini, Marco
    COMPUTERS & GEOSCIENCES, 2019, 124 : 46 - 57
  • [32] Visualization and Detection of Changes in Brain States Using t-SNE
    Parmar, Harshit S.
    Mitra, Sunanda
    Nutter, Brian
    Long, Rodney
    Antani, Sameer
    2020 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION (SSIAI 2020), 2020, : 14 - 17
  • [33] Nonlinear Manifold Embedding on Keyword Spotting using t-SNE
    Retsinas, George
    Stamatopoulos, Nikolaos
    Louloudis, Georgios
    Sfikas, Giorgos
    Gatos, Basilis
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 487 - 492
  • [34] GPGPU Linear Complexity t-SNE Optimization
    Pezzotti, Nicola
    Thijssen, Julian
    Mordvintsev, Alexander
    Hollt, Thomas
    van Lew, Baldur
    Lelieveldt, Boudewijn P. F.
    Eisemann, Elmar
    Vilanova, Anna
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (01) : 1172 - 1181
  • [35] Seeing data as t-SNE and UMAP do
    Marx, Vivien
    NATURE METHODS, 2024, 21 (06) : 930 - 933
  • [36] Demonstrating the Evolution of GANs Through t-SNE
    Costa, Victor
    Lourenco, Nuno
    Correia, Joao
    Machado, Penousal
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2021, 2021, 12694 : 618 - 633
  • [37] Application of t-SNE to human genetic data
    Li, Wentian
    Cerise, Jane E.
    Yang, Yaning
    Han, Henry
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2017, 15 (04)
  • [38] Using t-SNE to Evaluate the Brand Style of New Mice Design
    Wang, Hung-Hsiang
    Chen, Chih-Ping
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 451 - 452
  • [39] Dimensionality reduction and visualisation of hyperspectral ink data using t-SNE
    Devassy, Binu Melit
    George, Sony
    FORENSIC SCIENCE INTERNATIONAL, 2020, 311
  • [40] Ant-SNE: Tracking Community Evolution via Animated t-SNE
    Nguyen, Ngan V. T.
    Dang, Tommy
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 330 - 341