A data analytic approach to quantifying scientific impact

被引:47
作者
Cao, Xuanyu [1 ]
Chen, Yan [1 ]
Liu, K. J. Ray [1 ]
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
关键词
Citation prediction; Data analytic approach; CITATION IMPACT; H-INDEX; NORMALIZATION; PREDICTION; POWER; TOOL;
D O I
10.1016/j.joi.2016.02.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Citation is perhaps the mostly used metric to evaluate the scientific impact of papers. Various measures of the scientific impact of researchers and journals rely heavily on the citations of papers. Furthermore, in many practical applications, people may need to know not only the current citations of a paper, but also a prediction of its future citations. However, the complex heterogeneous temporal patterns of the citation dynamics make the predictions of future citations rather difficult. The existing state-of-the-art approaches used parametric methods that require long period of data and have poor performance on some scientific disciplines. In this paper, we present a simple yet effective and robust data analytic method to predict future citations of papers from a variety of disciplines. With rather short-term (e.g., 3 years after the paper is published) citation data, the proposed approach can give accurate estimate of future citations, outperforming state-of-the-art prediction methods significantly. Extensive experiments confirm the robustness of the proposed approach across various journals of different disciplines. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:471 / 484
页数:14
相关论文
共 38 条
[1]   Predicting scientific success [J].
Acuna, Daniel E. ;
Allesina, Stefano ;
Kording, Konrad P. .
NATURE, 2012, 489 (7415) :201-202
[2]   Modelling count response variables in informetric studies: Comparison among count, linear, and lognormal regression models [J].
Ajiferuke, Isola ;
Famoye, Felix .
JOURNAL OF INFORMETRICS, 2015, 9 (03) :499-513
[3]  
Bishop C., 2006, Pattern recognition and machine learning, P423
[4]   How to improve the prediction based on citation impact percentiles for years shortly after the publication date? [J].
Bornmann, Lutz ;
Leydesdorff, Loet ;
Wang, Jian .
JOURNAL OF INFORMETRICS, 2014, 8 (01) :175-180
[5]   Which percentile-based approach should be preferred for calculating normalized citation impact values? An empirical comparison of five approaches including a newly developed citation-rank approach (P100) [J].
Bornmann, Lutz ;
Leydesdorff, Loet ;
Wang, Jian .
JOURNAL OF INFORMETRICS, 2013, 7 (04) :933-944
[6]   Inventor team size as a predictor of the future citation impact of patents [J].
Breitzman, Anthony ;
Thomas, Patrick .
SCIENTOMETRICS, 2015, 103 (02) :631-647
[7]   Characterizing and Modeling Citation Dynamics [J].
Eom, Young-Ho ;
Fortunato, Santo .
PLOS ONE, 2011, 6 (09)
[8]  
Ferraty F., 2006, SPR S STAT
[9]   Normalization of Mendeley reader counts for impact assessment [J].
Haunschild, Robin ;
Bornmann, Lutz .
JOURNAL OF INFORMETRICS, 2016, 10 (01) :62-73
[10]   Bibliometric indicators of young authors in astrophysics: Can later stars be predicted? [J].
Havemann, Frank ;
Larsen, Birger .
SCIENTOMETRICS, 2015, 102 (02) :1413-1434