Detecting the knowledge structure of bioinformatics by mining full-text collections

被引:56
|
作者
Song, Min [1 ]
Kim, Su Yeon [1 ]
机构
[1] Yonsei Univ, Dept Lib & Informat Sci, Seoul 120749, South Korea
关键词
Text mining; PubMed Central; Bioinformatics; COCITATION ANALYSIS; AUTHOR COCITATION; CITATION; PAGERANK;
D O I
10.1007/s11192-012-0900-9
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Bioinformatics is a fast-growing, diverse research field that has recently gained much public attention. Even though there are several attempts to understand the field of bioinformatics by bibliometric analysis, the proposed approach in this paper is the first attempt at applying text mining techniques to a large set of full-text articles to detect the knowledge structure of the field. To this end, we use PubMed Central full-text articles for bibliometric analysis instead of relying on citation data provided in Web of Science. In particular, we develop text mining routines to build a custom-made citation database as a result of mining full-text. We present several interesting findings in this study. First, the majority of the papers published in the field of bioinformatics are not cited by others (63 % of papers received less than two citations). Second, there is a linear, consistent increase in the number of publications. Particularly year 2003 is the turning point in terms of publication growth. Third, most researches of bioinformatics are driven by USA-based institutes followed by European institutes. Fourth, the results of topic modeling and word co-occurrence analysis reveal that major topics focus more on biological aspects than on computational aspects of bioinformatics. However, the top 10 ranked articles identified by PageRank are more related to computational aspects. Fifth, visualization of author co-citation analysis indicates that researchers in molecular biology or genomics play a key role in connecting sub-disciplines of bioinformatics.
引用
收藏
页码:183 / 201
页数:19
相关论文
共 50 条
  • [1] Detecting the knowledge structure of bioinformatics by mining full-text collections
    Min Song
    Su Yeon Kim
    Scientometrics, 2013, 96 : 183 - 201
  • [2] SAGE full-text collections
    不详
    PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2003, 37 (04) : 271 - 271
  • [3] Full-text searching
    Olson, MA
    DR DOBBS JOURNAL, 1999, 24 (05): : 10 - 10
  • [4] THE FULL-TEXT IDEAL
    MARCUS, J
    DATABASE-THE MAGAZINE OF ELECTRONIC DATABASE REVIEWS, 1995, 18 (06): : 83 - 85
  • [5] FULL-TEXT DATABASES
    SIDDIQUI, MA
    ONLINE REVIEW, 1991, 15 (06): : 367 - 372
  • [6] A Knowledge Discovery from Full-Text Document Collections Using Clustering and Interpretable Genetic-Fuzzy Systems
    Rudzinski, Filip
    MULTIMEDIA AND NETWORK INFORMATION SYSTEMS, 2019, 833 : 434 - 443
  • [7] FULL-TEXT DATABASES
    TENOPIR, C
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1984, 19 : 215 - 246
  • [8] Humanities full-text
    Williams, H
    LIBRARY JOURNAL, 2003, 128 (05) : 124 - 124
  • [9] A text-mining system for extracting metabolic reactions from full-text articles
    Czarnecki, Jan
    Nobeli, Irene
    Smith, Adrian M.
    Shepherd, Adrian J.
    BMC BIOINFORMATICS, 2012, 13
  • [10] A text-mining system for extracting metabolic reactions from full-text articles
    Jan Czarnecki
    Irene Nobeli
    Adrian M Smith
    Adrian J Shepherd
    BMC Bioinformatics, 13