Identifying a better measure of relatedness for mapping science

被引:142
作者
Klavans, R
Boyack, KW
机构
[1] SciTech Strategies Inc, Berwyn, PA 19312 USA
[2] Sandia Natl Labs, Albuquerque, NM 87185 USA
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2006年 / 57卷 / 02期
关键词
D O I
10.1002/asi.20274
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Measuring the relatedness between bibliometric units (journals, documents, authors, or words) is a central task in bibliometric analysis. Relatedness measures are used for many different tasks, among them the generating of maps, or visual pictures, showing the relationship between all items from these data. Despite the importance of these tasks, there has been little written on how to quantitatively evaluate the accuracy of relatedness measures or the resulting maps. The authors propose a new framework for assessing the performance of relatedness measures and visualization algorithms that contains four factors: accuracy, coverage, scalability, and robustness. This method was applied to 10 measures of journal-journal relatedness to determine the best measure. The 10 relatedness measures were then used as inputs to a visualization algorithm to create an additional 10 measures of journal-journal relatedness based on the distances between pairs of journals in two-dimensional space. This second step determines robustness (i.e., which measure remains best after dimension reduction). Results show that, for low coverage (under 50%) the Pearson correlation is the most accurate raw relatedness measure. However, the best overall measure, both at high coverage, and after dimension reduction, is the cosine index or a modified cosine index. Results also showed that the visualization algorithm increased local accuracy for most measures. Possible reasons for this counterintuitive finding are discussed.
引用
收藏
页码:251 / 263
页数:13
相关论文
共 46 条
[1]  
[Anonymous], 1979, EVALUATION FACTORS A
[2]   Indicators in a research institute: A multi-level classification of scientific journals [J].
Bassecoulard, E ;
Zitt, M .
SCIENTOMETRICS, 1999, 44 (03) :323-345
[3]  
Batagelj V., 1998, Connections, V21, P47
[4]   Domain visualization using VxInsight® for science and technology management [J].
Boyack, KW ;
Wylie, BN ;
Davidson, GS .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (09) :764-774
[5]  
Chen Chaomei., 2003, Mapping Scientific Frontiers: The Quest for Knowledge Visualization
[6]   Visualizing and tracking the growth of competing paradigms: Two case studies [J].
Chen, CM ;
Cribbin, T ;
Macredie, R ;
Morar, S .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (08) :678-689
[7]   Constrained Steiner trees in Halin graphs [J].
Chen, GT ;
Burkard, RE .
RAIRO-OPERATIONS RESEARCH, 2003, 37 (03) :179-193
[8]   Cluster stability and the use of noise in interpretation of clustering [J].
Davidson, GS ;
Wylie, BN ;
Boyack, KW .
IEEE SYMPOSIUM ON INFORMATION VISUALIZATION 2001, PROCEEDINGS, 2001, :23-30
[9]  
de Chazal P, 1998, P ANN INT IEEE EMBS, V20, P1422, DOI 10.1109/IEMBS.1998.747150
[10]   Journal as markers of intellectual space: Journal co-citation analysis of information Retrieval area, 1987-1997 [J].
Ding, Y ;
Chowdhury, GG ;
Foo, S .
SCIENTOMETRICS, 2000, 47 (01) :55-73