Detecting the knowledge structure of bioinformatics by mining full-text collections

Cited by: 56

Authors:

Song, Min [1]

Kim, Su Yeon [1]

Affiliation:

[1] Yonsei Univ, Dept Lib & Informat Sci, Seoul 120749, South Korea

Source:

SCIENTOMETRICS | 2013 / Vol. 96 / Issue 01

Keywords:

Text mining; PubMed Central; Bioinformatics; Cocitation analysis; Author cocitation; Citation; PageRank

DOI:

10.1007/s11192-012-0900-9

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Discipline classification code:

081203; 0835

Abstract:

Bioinformatics is a fast-growing, diverse research field that has recently gained much public attention. Although there have been several attempts to understand the field of bioinformatics through bibliometric analysis, the approach proposed in this paper is the first attempt to apply text mining techniques to a large set of full-text articles to detect the knowledge structure of the field. To this end, we use PubMed Central full-text articles for bibliometric analysis instead of relying on citation data provided in Web of Science. In particular, we develop text mining routines that build a custom-made citation database by mining the full text. We present several interesting findings in this study. First, the majority of the papers published in the field of bioinformatics are not cited by others (63% of papers received fewer than two citations). Second, there is a linear, consistent increase in the number of publications; in particular, 2003 is the turning point in terms of publication growth. Third, most bioinformatics research is driven by USA-based institutes, followed by European institutes. Fourth, the results of topic modeling and word co-occurrence analysis reveal that major topics focus more on biological aspects than on computational aspects of bioinformatics. However, the top 10 ranked articles identified by PageRank are more related to computational aspects. Fifth, visualization of author co-citation analysis indicates that researchers in molecular biology or genomics play a key role in connecting sub-disciplines of bioinformatics.


Pages: 183-201

Number of pages: 19
