Detecting the knowledge structure of bioinformatics by mining full-text collections

被引:56
|
作者
Song, Min [1 ]
Kim, Su Yeon [1 ]
机构
[1] Yonsei Univ, Dept Lib & Informat Sci, Seoul 120749, South Korea
关键词
Text mining; PubMed Central; Bioinformatics; COCITATION ANALYSIS; AUTHOR COCITATION; CITATION; PAGERANK;
D O I
10.1007/s11192-012-0900-9
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Bioinformatics is a fast-growing, diverse research field that has recently gained much public attention. Even though there are several attempts to understand the field of bioinformatics by bibliometric analysis, the proposed approach in this paper is the first attempt at applying text mining techniques to a large set of full-text articles to detect the knowledge structure of the field. To this end, we use PubMed Central full-text articles for bibliometric analysis instead of relying on citation data provided in Web of Science. In particular, we develop text mining routines to build a custom-made citation database as a result of mining full-text. We present several interesting findings in this study. First, the majority of the papers published in the field of bioinformatics are not cited by others (63 % of papers received less than two citations). Second, there is a linear, consistent increase in the number of publications. Particularly year 2003 is the turning point in terms of publication growth. Third, most researches of bioinformatics are driven by USA-based institutes followed by European institutes. Fourth, the results of topic modeling and word co-occurrence analysis reveal that major topics focus more on biological aspects than on computational aspects of bioinformatics. However, the top 10 ranked articles identified by PageRank are more related to computational aspects. Fifth, visualization of author co-citation analysis indicates that researchers in molecular biology or genomics play a key role in connecting sub-disciplines of bioinformatics.
引用
收藏
页码:183 / 201
页数:19
相关论文
共 50 条
  • [21] VIDEODISCS FOR FULL-TEXT SEARCHING
    SCHIPMA, PB
    ZIEMER, SM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 183 (MAR): : 26 - CINF
  • [22] General science full-text
    Stoklosa, K
    LIBRARY JOURNAL, 2003, 128 (04) : 129 - 129
  • [23] FULL-TEXT ONLINE RETRIEVAL
    COLBERT, AW
    ONLINE, 1988, 12 (02): : 91 - 91
  • [24] WHERE FULL-TEXT IS VIABLE
    COTTON, PL
    ONLINE REVIEW, 1987, 11 (02): : 87 - 93
  • [25] Full-text linking projects
    Hoffman, DJ
    ONLINE, 2001, 25 (01): : 40 - +
  • [26] FULL-TEXT SOURCES ON COMPUSERV
    MARCUS, J
    DATABASE-THE MAGAZINE OF ELECTRONIC DATABASE REVIEWS, 1995, 18 (03): : 91 - 93
  • [27] RESEARCH INTO FULL-TEXT RETRIEVAL
    OJALA, M
    DATABASE, 1990, 13 (04): : 78 - 80
  • [28] The weaknesses of full-text searching
    Beall, Jeffrey
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 2008, 34 (05): : 438 - 444
  • [29] PMC text mining subset in BioC: about three million full-text articles and growing
    Comeau, Donald C.
    Wei, Chih-Hsuan
    Dogan, Rezarta Islamaj
    Lu, Zhiyong
    BIOINFORMATICS, 2019, 35 (18) : 3533 - 3535
  • [30] SEARCHING FULL-TEXT PERIODICALS - HOW FULL IS FULL
    PAGELL, R
    DATABASE, 1987, 10 (05): : 33 - 36