Improving the Keyword Co-occurrence Analysis: An Integrated Semantic Similarity Approach

被引:7
|
作者
Bhuyan, A. [1 ]
Sanguri, K. [1 ]
Sharma, H. [1 ]
机构
[1] Indian Inst Management Kashipur, Operat Management & Decis Sci, Kashipur, India
关键词
Co-occurrence analysis; Semantic similarity; Network analysis; Text mining; Urban mobility; CO-WORD ANALYSIS; SCIENCE;
D O I
10.1109/IEEM50564.2021.9673030
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Bibliometric studies help yield useful information about the quantity and quality of research works in a particular academic domain. We used the popular bibliometric technique of co-occurrence analysis to explore emergent topical areas in the field of urban mobility. Our work contributes from a methodological perspective to improve the conventional co-occurrence analysis method. The modified co-occurrence analysis is two-fold: firstly, we used the Rapid Automatic Keyword Extraction (RAKE) algorithm to extract keywords from abstracts of documents in a corpus to generate a new co-occurrence matrix. Secondly, we produced "semantic similarity" between each keyword in matrix form, which is combined with the co-occurrence matrix of the extracted keywords from documents in a corpus to yield a weighted co-occurrence matrix. We analyzed the unweighted and weighted matrices in terms of their network structure and cluster quality. We demonstrated that the weighted matrix shows network structures with higher modularity and superior cluster quality than its unweighted counterpart. These observations are consistent in terms of more meaningful content and greater ease of exposition in the emergent themes.
引用
收藏
页码:482 / 487
页数:6
相关论文
共 50 条
  • [1] Co-occurrence and Semantic Similarity Based Hybrid Approach for Improving Automatic Query Expansion in Information Retrieval
    Singh, Jagendra
    Sharan, Aditi
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2015, 2015, 8956 : 415 - 418
  • [2] A better index for analysis of co-occurrence and similarity
    Mainali, Kumar P.
    Slud, Eric
    Singer, Michael C.
    Fagan, William F.
    SCIENCE ADVANCES, 2022, 8 (04)
  • [3] Computing Text Semantic Similarity with Syntactic Network of Co-occurrence Distance
    Jiao Y.
    Jing M.
    Kang F.
    Data Analysis and Knowledge Discovery, 2019, 3 (12) : 93 - 100
  • [4] Co-occurrence and similarity revisited
    Fernando Chirigati
    Nature Computational Science, 2022, 2 : 67 - 67
  • [5] Co-occurrence and similarity revisited
    Chirigati, Fernando
    NATURE COMPUTATIONAL SCIENCE, 2022, 2 (02): : 67 - 67
  • [6] A Metric Approach to Hot Topics in Biomedicine via Keyword Co-occurrence
    Jane HQin
    Jean JWang
    Fred YYe
    JournalofDataandInformationScience, 2019, 4 (04) : 13 - 25
  • [7] A Metric Approach to Hot Topics in Biomedicine via Keyword Co-occurrence
    Jane H.Qin
    Jean J.Wang
    Fred Y.Ye
    Journal of Data and Information Science, 2019, (04) : 13 - 25
  • [8] A Metric Approach to Hot Topics in Biomedicine via Keyword Co-occurrence
    Qin, Jane H.
    Wang, Jean J.
    Ye, Fred Y.
    JOURNAL OF DATA AND INFORMATION SCIENCE, 2019, 4 (04) : 13 - 25
  • [9] The influence of label co-occurrence and semantic similarity on children's inductive generalization
    Matlen, Bryan J.
    Fisher, Anna V.
    Godwin, Karrie E.
    FRONTIERS IN PSYCHOLOGY, 2015, 6
  • [10] Use of Neighbor Sentence Co-occurrence to Improve Word Semantic Similarity Detection
    Loukachevitch, Natalia
    Alekseev, Aleksey
    PROCEEDINGS OF THE INTERNATIONAL FRUCT CONFERENCE ON INTELLIGENCE, SOCIAL MEDIA AND WEB (ISMW FRUCT 2016), 2016, : 36 - 42