Semantic Network Analysis Pipeline-Interactive Text Mining Framework for Exploration of Semantic Flows in Large Corpus of Text

被引:1
|
作者
Cenek, Martin [1 ,4 ]
Bulkow, Rowan [2 ]
Pak, Eric [3 ]
Oyster, Levi [3 ]
Ching, Boyd [3 ]
Mulagada, Ashika [1 ]
机构
[1] Univ Portland, Comp Sci, Portland, OR 90203 USA
[2] Resource Data Inc, Anchorage, AK 99503 USA
[3] Univ Alaska Anchorage, Comp Sci, Anchorage, AK 99508 USA
[4] 5000 N Willamette Blvd, Portland, OR 97203 USA
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 24期
关键词
semantic concept; text mining; computational linguistics; language processing; natural language processing; interactive visualization; MODEL;
D O I
10.3390/app9245302
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Historical topic modeling and semantic concepts exploration in a large corpus of unstructured text remains a hard, opened problem. Despite advancements in natural languages processing tools, statistical linguistics models, graph theory and visualization, there is no framework that combines these piece-wise tools under one roof. We designed and constructed a Semantic Network Analysis Pipeline (SNAP) that is available as an open-source web-service that implements work-flow needed by a data scientist to explore historical semantic concepts in a text corpus. We define a graph theoretic notion of a semantic concept as a flow of closely related tokens through the corpus of text. The modular work-flow pipeline processes text using natural language processing tools, statistical content narrowing, creates semantic networks from lexical token chaining, performs social network analysis of token networks and creates a 3D visualization of the semantic concept flows through corpus for interactive concept exploration. Finally, we illustrate the framework's utility to extract the information from a text corpus of Herman Melville's novel Moby Dick, the transcript of the 2015-2016 United States (U.S.) Senate Hearings on Environment and Public Works, and the Australian Broadcast Corporation's short news articles on rural and science topics.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Identification of Occupant Dissatisfaction Factors in Newly Constructed Apartments: Text Mining and Semantic Network Analysis
    Noh, Seok-Ho
    Jo, Inho
    Han, SangHyeok
    Moon, Sungkon
    Kim, Jae-Jun
    BUILDINGS, 2023, 13 (12)
  • [22] Applying text mining and semantic network analysis to investigate effects of perceived crowding in the service sector
    Ellahi, Abida
    Ul Ain, Qurat
    Rehman, Hafiz Mudassir
    Hossain, Md Billal
    Illes, Csaba Balint
    Rehman, Mobashar
    COGENT BUSINESS & MANAGEMENT, 2023, 10 (02):
  • [23] Text mining using nonnegative matrix factorization and latent semantic analysis
    Hassani, Ali
    Iranmanesh, Amir
    Mansouri, Najme
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (20): : 13745 - 13766
  • [24] A Novel Web Text Mining Method based on Semantic Polarity Analysis
    Yu, Li
    Li, Qiang
    2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 5116 - +
  • [25] Text mining using nonnegative matrix factorization and latent semantic analysis
    Hassani, Ali
    Iranmanesh, Amir
    Mansouri, Najme
    Neural Computing and Applications, 2021, 33 (20) : 13745 - 13766
  • [26] Text mining using nonnegative matrix factorization and latent semantic analysis
    Ali Hassani
    Amir Iranmanesh
    Najme Mansouri
    Neural Computing and Applications, 2021, 33 : 13745 - 13766
  • [27] Latent semantic analysis for text categorization using neural network
    Yu, Bo
    Xu, Zong-ben
    Li, Cheng-hua
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (08) : 900 - 904
  • [28] Word Net-based lexical semantic classification for text corpus analysis
    龙军
    王鲁达
    李祖德
    张祖平
    杨柳
    Journal of Central South University, 2015, 22 (05) : 1833 - 1840
  • [29] Semantic Framework-Based Defect Text Mining Technique and Application in Power Grid
    Cao J.
    Chen L.
    Qiu J.
    Wang H.
    Ying G.
    Zhang B.
    1600, Power System Technology Press (41): : 637 - 643
  • [30] A lightweight semantic-enhanced interactive network for efficient short-text matching
    Yu, Chuanming
    Xue, Haodong
    An, Lu
    Li, Gang
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2023, 74 (02) : 283 - 300