GENERATING A VISUAL OVERVIEW OF LARGE DIACHRONIC DOCUMENT COLLECTIONS BASED ON THE DETECTION OF TOPIC CHANGE

被引:0
|
作者
Holz, Florian [1 ]
Teresniak, Sven [1 ]
Heyer, Gerhard [1 ]
Scheuermann, Gerik [2 ]
机构
[1] Univ Leipzig, Inst Comp Sci, Nat Language Proc Grp, Leipzig, Germany
[2] Univ Leipzig, Inst Comp Sci, Image & Signal Proc Grp, Leipzig, Germany
关键词
Visual analytics; Document collection summarization; Browsing document collections; Topic detection; Change of meaning; Volatility; Time-sliced corpora;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large digital diachronic document collections are a central source of information in science, business, and for the general public. One challenge for the efficient visualization of these collections is the automatic calculation and visualization of the main topics. These topics can then serve as the basis for an overview of the content and any subsequent interactive visual analysis. We introduce the new language processing concept of volatility of terms measured as the change of the context of terms. We demonstrate that volatility can serve as an excellent basis for the visual overview of large collections using two different examples.
引用
收藏
页码:153 / 156
页数:4
相关论文
共 50 条
  • [1] Topic-based Coordination for Visual Analysis of Evolving Document Collections
    Eler, Danilo Medeiros
    Paulovich, Fernando Vieira
    Ferreira de Oliveira, Maria Cristina
    Minghim, Rosane
    INFORMATION VISUALIZATION, IV 2009, PROCEEDINGS, 2009, : 149 - 155
  • [2] CONTRAVIS: Contrastive and Visual Topic Modeling for Comparing Document Collections
    Le, Tuan V. M.
    Akoglu, Leman
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 928 - 938
  • [3] Topic modeling for mediated access to very large document collections
    Muresan, G
    Harper, DJ
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (10): : 892 - 910
  • [4] TopicLens: Efficient Multi-Level Visual Topic Exploration of Large-Scale Document Collections
    Kim, Minjeong
    Kang, Kyeongpil
    Park, Deokgun
    Choo, Jaegul
    Elmqvist, Niklas
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) : 151 - 160
  • [5] Generating hierarchical document indices from common denominators in large document collections
    OKane, KC
    INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (01) : 105 - 115
  • [7] Topic Similarity Networks: Visual Analytics for Large Document Sets
    Maiya, Arun S.
    Rolfe, Robert M.
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 364 - 372
  • [8] Noise Document Detection for Document Retrieval Based on Topic Match
    Noh, Yunseok
    Park, Seong-Bae
    ADVANCED SCIENCE LETTERS, 2017, 23 (10) : 9478 - 9481
  • [9] MULTIMODAL TOPIC MODELING BASED GEO-ANNOTATION FOR SOCIAL EVENT DETECTION IN LARGE PHOTO COLLECTIONS
    Xu, Bin
    Fan, Guoliang
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3319 - 3323
  • [10] VISTopic: A Visual Analytics System for Making Sense of Large Document Collections using Hierarchical Topic Modeling (vol 1, pg 40, 2017)
    Yang, Yi
    Yao, Quanming
    Qu, Huamin
    VISUAL INFORMATICS, 2021, 5 (01): : 44 - 44