TopicLens: Efficient Multi-Level Visual Topic Exploration of Large-Scale Document Collections

Cited by: 60
Authors
Kim, Minjeong [1]
Kang, Kyeongpil [1]
Park, Deokgun [2]
Choo, Jaegul [1]
Elmqvist, Niklas [2]
Affiliations
[1] Korea Univ, Seoul, South Korea
[2] Univ Maryland, College Pk, MD 20742 USA
Funding
National Research Foundation of Singapore;
Keywords
topic modeling; nonnegative matrix factorization; t-distributed stochastic neighbor embedding; magic lens; text analytics; NONNEGATIVE MATRIX; VISUALIZATION; ANALYTICS;
DOI
10.1109/TVCG.2016.2598445
Chinese Library Classification number
TP31 [Computer Software];
Subject classification codes
081202; 0835;
Abstract
Topic modeling, which reveals underlying topics of a document corpus, has been actively adopted in visual analytics for large-scale document collections. However, due to its significant processing time and non-interactive nature, topic modeling has so far not been tightly integrated into a visual analytics workflow. Instead, most such systems are limited to utilizing a fixed, initial set of topics. Motivated by this gap in the literature, we propose a novel interaction technique called TopicLens that allows a user to dynamically explore data through a lens interface where topic modeling and the corresponding 2D embedding are efficiently computed on the fly. To support this interaction in real time while maintaining view consistency, we propose a novel efficient topic modeling method and a semi-supervised 2D embedding algorithm. Our work is based on improving state-of-the-art methods such as nonnegative matrix factorization and t-distributed stochastic neighbor embedding. Furthermore, we have built a web-based visual analytics system integrated with TopicLens. We use this system to measure the performance and the visualization quality of our proposed methods. We provide several scenarios showcasing the capability of TopicLens using real-world datasets.
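As a rough illustration of the nonnegative matrix factorization that the abstract names as a building block, the sketch below factors a toy term-document matrix V into nonnegative factors W (terms by topics) and H (topics by documents) using the textbook multiplicative-update rule. This is not the efficient method proposed in the paper; the toy matrix `V`, the helper names, and the parameters `k`, `iters`, `seed`, and `eps` are all assumptions made for the example.

```python
import random


def matmul(a, b):
    """Multiply two dense matrices stored as lists of rows."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b] for row in a]


def transpose(a):
    """Return the transpose of a dense matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def frobenius_error(v, w, h):
    """Squared Frobenius norm of V - WH."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))


def nmf(v, k, iters=200, seed=0, eps=1e-9):
    """Textbook multiplicative-update NMF (assumed here, not the paper's variant)."""
    rng = random.Random(seed)
    n_terms, n_docs = len(v), len(v[0])
    # Random positive initialization keeps every update well defined.
    w = [[rng.random() + eps for _ in range(k)] for _ in range(n_terms)]
    h = [[rng.random() + eps for _ in range(n_docs)] for _ in range(k)]
    for _ in range(iters):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[a][j] * num[a][j] / (den[a][j] + eps) for j in range(n_docs)]
             for a in range(k)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][a] * num[i][a] / (den[i][a] + eps) for a in range(k)]
             for i in range(n_terms)]
    return w, h


# Hypothetical toy corpus: rows are terms, columns are documents.
V = [
    [3, 2, 0, 0],
    [2, 3, 0, 0],
    [0, 0, 4, 1],
    [0, 0, 1, 4],
]

if __name__ == "__main__":
    W, H = nmf(V, k=2)
    print("reconstruction error:", frobenius_error(V, W, H))
    for doc in range(len(V[0])):
        weights = [H[topic][doc] for topic in range(len(H))]
        print("document", doc, "dominant topic:", weights.index(max(weights)))
```

Each column of H gives a document's topic weights, so the dominant topic per document is the row index with the largest value in that column. A lens-style interaction would presumably rerun such a factorization on only the documents under the lens, which is why the cost per run matters.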
Pages: 151-160
Page count: 10
Related papers
50 in total
  • [31] Multi-level image representation for large-scale image-based instance retrieval
    Deng, Qili
    Wu, Shuai
    Wen, Jie
    Xu, Yong
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (01) : 33 - 39
  • [32] Model Predictive Control for Large-scale Urban Traffic Networks with a Multi-level Hierarchy
    Lin, Shu
    Ling, Taiyu
    Xi, Yugeng
    2013 16TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS - (ITSC), 2013, : 211 - 216
  • [33] Multi-level application-based traffic characterization in a large-scale wireless network
    Ploumidis, Manolis
    Papadopouli, Maria
    Karagiannis, Thomas
    2007 IEEE INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS, VOL 1, 2007, : 388 - 396
  • [34] Concentric Layered Architecture for Multi-Level Clustering in Large-Scale Wireless Sensor Networks
    Singh, Harmanpreet
    Singh, Damanpreet
    2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 467 - 471
  • [35] Efficient Search and Browsing of Large-Scale Video Collections with Vibro
    Hezel, Nico
    Schall, Konstantin
    Jung, Klaus
    Barthel, Kai Uwe
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 487 - 492
  • [36] Efficient Very Large Scale Integration Architecture of Multi-Level Discrete Wavelet Transform
    Pan, Zhang
    Wei, Zhang
    ACTA OPTICA SINICA, 2019, 39 (04)
  • [37] Efficient Large-Scale Image Data Set Exploration: Visual Concept Network and Image Summarization
    Yang, Chunlei
    Feng, Xiaoyi
    Peng, Jinye
    Fan, Jianping
    ADVANCES IN MULTIMEDIA MODELING, PT II, 2011, 6524 : 111 - 121
  • [38] An Effective and Computationally Efficient Approach for Anonymizing Large-Scale Physical Activity Data: Multi-Level Clustering-Based Anonymization
    Parameshwarappa, Pooja
    Chen, Zhiyuan
    Koru, Gunes
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2020, 14 (03) : 72 - 94
  • [39] Mesoscale explorer: Visual exploration of large-scale molecular models
    Rose, Alexander
    Sehnal, David
    Goodsell, David S.
    Autin, Ludovic
    PROTEIN SCIENCE, 2024, 33 (10)
  • [40] Interactive visual exploration of halos in large-scale cosmology simulation
    Guihua Shan
    Maojin Xie
    Feng’An Li
    Yang Gao
    Xuebin Chi
    Journal of Visualization, 2014, 17 : 145 - 156