TopicLens: Efficient Multi-Level Visual Topic Exploration of Large-Scale Document Collections

被引:60
|
作者
Kim, Minjeong [1 ]
Kang, Kyeongpil [1 ]
Park, Deokgun [2 ]
Choo, Jaegul [1 ]
Elmqvist, Niklas [2 ]
机构
[1] Korea Univ, Seoul, South Korea
[2] Univ Maryland, College Pk, MD 20742 USA
基金
新加坡国家研究基金会;
关键词
topic modeling; nonnegative matrix factorization; t-distributed stochastic neighbor embedding; magic lens; text analytics; NONNEGATIVE MATRIX; VISUALIZATION; ANALYTICS;
D O I
10.1109/TVCG.2016.2598445
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Topic modeling, which reveals underlying topics of a document corpus, has been actively adopted in visual analytics for large-scale document collections. However, due to its significant processing time and non-interactive nature, topic modeling has so far not been tightly integrated into a visual analytics workflow. Instead, most such systems are limited to utilizing a fixed, initial set of topics. Motivated by this gap in the literature, we propose a novel interaction technique called TopicLens that allows a user to dynamically explore data through a lens interface where topic modeling and the corresponding 2D embedding are efficiently computed on the fly. To support this interaction in real time while maintaining view consistency, we propose a novel efficient topic modeling method and a semi-supervised 2D embedding algorithm. Our work is based on improving state-of-the-art methods such as nonnegative matrix factorization and t-distributed stochastic neighbor embedding. Furthermore, we have built a web-based visual analytics system integrated with TopicLens. We use this system to measure the performance and the visualization quality of our proposed methods. We provide several scenarios showcasing the capability of TopicLens using real-world datasets.
引用
收藏
页码:151 / 160
页数:10
相关论文
共 50 条
  • [21] Merit: multi-level graph embedding refinement framework for large-scale graph
    Weishuai Che
    Zhaowei Liu
    Yingjie Wang
    Jinglei Liu
    Complex & Intelligent Systems, 2024, 10 : 1303 - 1318
  • [22] Tenant Placement Strategies within Multi-Level Large-Scale Shopping Centers
    Yuo, Tony Shun-Te
    Lizieri, Colin
    JOURNAL OF REAL ESTATE RESEARCH, 2013, 35 (01) : 25 - 51
  • [23] Merit: multi-level graph embedding refinement framework for large-scale graph
    Che, Weishuai
    Liu, Zhaowei
    Wang, Yingjie
    Liu, Jinglei
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 10 (1) : 1303 - 1318
  • [24] Reduction method for large-scale multi-level capacitated lot sizing problem
    Xiong, Hongyun
    He, Yue
    Changsha Tiedao Xuyuan Xuebao/Journal of Changsha Railway University, 2000, 18 (02): : 45 - 49
  • [25] IMPROVING LARGE-SCALE FACE IMAGE RETRIEVAL USING MULTI-LEVEL FEATURES
    Chen, Xiaojing
    An, Le
    Bhanu, Bir
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 4367 - 4371
  • [26] Visualization of Large-Scale Urban Models through Multi-Level Relief Impostors
    Andujar, C.
    Brunet, P.
    Chica, A.
    Navazo, I.
    COMPUTER GRAPHICS FORUM, 2010, 29 (08) : 2456 - 2468
  • [27] Multi-level placement for large-scale mixed-size IC designs
    ACM SIGDA; IEEE Circuits and Systems Society; IEICE (Institute of Electronics, Information and Communication Engineers); IPSJ (Information Processing Society of Japan) (Institute of Electrical and Electronics Engineers Inc., United States):
  • [28] The role of large-scale eddies in the nonlinear equilibration of a multi-level QG model
    Solomon, A
    Stone, PH
    11TH CONFERENCE ON ATMOSPHERIC AND OCEANIC FLUID DYNAMICS, 1997, : 360 - 362
  • [29] Multi-purpose, multi-level feature modeling of large-scale industrial software systems
    Daniela Rabiser
    Herbert Prähofer
    Paul Grünbacher
    Michael Petruzelka
    Klaus Eder
    Florian Angerer
    Mario Kromoser
    Andreas Grimmer
    Software & Systems Modeling, 2018, 17 : 913 - 938
  • [30] Multi-purpose, multi-level feature modeling of large-scale industrial software systems
    Rabiser, Daniela
    Praehofer, Herbert
    Gruenbacher, Paul
    Petruzelka, Michael
    Eder, Klaus
    Angerer, Florian
    Kromoser, Mario
    Grimmer, Andreas
    SOFTWARE AND SYSTEMS MODELING, 2018, 17 (03): : 913 - 938