Disambiguating authors by pairwise classification

Cited by: 4
Authors
Lin Q. [1 ]
Wang B. [2 ]
Du Y. [3 ]
Wang X. [3 ]
Li Y. [1 ]
Chen S. [2 ]
Affiliations
[1] Department of Computer Science, Huazhong University of Science and Technology
[2] Department of Computer Science, Nanjing University of Aeronautics and Astronautics
[3] Department of Computer Science, Tsinghua University
Funding
National Natural Science Foundation of China
Keywords
arnetminer; disambiguating; pairwise classification;
DOI
10.1016/S1007-0214(10)70114-0
Abstract
Name ambiguity is a critical problem in many applications, particularly in online bibliography systems such as DBLP, ACM, and CiteSeerX. Despite many studies, the problem remains unresolved and is becoming more serious with the increasing popularity of Web 2.0. This paper addresses the problem in the academic researcher social network ArnetMiner using a supervised method that exploits all side information, including co-author, organization, paper citation, title similarity, author's homepage, web constraint, and user feedback. The method automatically determines the number of persons k. Tests on the researcher social network with up to 100 different names show that the method significantly outperforms a baseline that uses an unsupervised attribute-augmented graph clustering algorithm.
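The abstract does not give the classifier, the feature definitions, or the procedure that determines k, so the following is only a minimal sketch of the general idea of pairwise classification, not the authors' method. It assumes three illustrative features (shared co-authors, same organization, title word overlap), hand-picked weights standing in for a trained classifier, and a union-find merge of pairs predicted to be the same person, so that k falls out as the number of resulting clusters. All names, weights, and sample records are hypothetical.

```python
from itertools import combinations


def pair_features(a, b):
    """Illustrative features for two papers that carry the same author name."""
    shared_coauthors = len(set(a["coauthors"]) & set(b["coauthors"]))
    same_org = 1.0 if a["org"] and a["org"] == b["org"] else 0.0
    words_a = set(a["title"].lower().split())
    words_b = set(b["title"].lower().split())
    union = words_a | words_b
    title_sim = len(words_a & words_b) / len(union) if union else 0.0
    return [float(shared_coauthors), same_org, title_sim]


# Hypothetical weights standing in for a classifier trained on labeled pairs.
WEIGHTS = [1.5, 1.0, 2.0]
BIAS = -1.2


def same_person(a, b):
    """Linear decision rule: a positive score means 'same person'."""
    score = BIAS + sum(w * x for w, x in zip(WEIGHTS, pair_features(a, b)))
    return score > 0


def cluster(papers):
    """Merge papers linked by positive pairwise predictions (union-find)."""
    parent = list(range(len(papers)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(papers)), 2):
        if same_person(papers[i], papers[j]):
            parent[find(i)] = find(j)
    return [find(i) for i in range(len(papers))]


# Made-up records sharing one ambiguous author name.
papers = [
    {"title": "Graph clustering for name disambiguation",
     "coauthors": ["A. Li", "B. Chen"], "org": "Univ X"},
    {"title": "Name disambiguation in digital libraries",
     "coauthors": ["A. Li"], "org": "Univ X"},
    {"title": "Protein folding dynamics",
     "coauthors": ["C. Roy"], "org": "Institute Y"},
]

labels = cluster(papers)
k = len(set(labels))  # number of distinct persons is not fixed in advance
print(labels, k)
```

In this sketch k is never supplied as a parameter; it is simply the count of clusters left after merging. A real system would learn the weights from labeled pairs and would likely need a more careful merge step, since a single false-positive pair can join two otherwise separate clusters.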
Pages: 668-677
Page count: 9
Related papers
50 in total
  • [1] Disambiguating Authors by Pairwise Classification
    林泉
    王波
    杜圆
    王雪至
    李玉华
    陈松灿
    Tsinghua Science and Technology, 2010, 15 (06) : 668 - 677
  • [2] Disambiguating Authors in Academic Search Engines
    Zhang, Long
    Chen, Guohua
    Tang, Yong
    Cai, Zurui
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT II, 2013, 8181 : 511 - 514
  • [3] Disambiguating authors in citations on the web and authorship correlations
    Peng, Hsin-Tsung
    Lu, Cheng-Yu
    Hsu, William
    Ho, Jan-Ming
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 10521 - 10532
  • [4] Disambiguating Authors in Academic Publications using Random Forests
    Treeratpituk, Pucktada
    Giles, C. Lee
    JCDL 09: PROCEEDINGS OF THE 2009 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, 2009, : 39 - 48
  • [5] Topological-collaborative approach for disambiguating authors’ names in collaborative networks
    Diego R. Amancio
    Osvaldo N. Oliveira jr
    Luciano da F. Costa
    Scientometrics, 2015, 102 : 465 - 485
  • [6] On Disambiguating Authors: Collaboration Network Reconstruction in a Bottom-up Manner
    Li, Na
    Zhu, Renyu
    Zhou, Xiaoxu
    He, Xiangnan
    Cai, Wenyuan
    Gao, Ming
    Zhou, Aoying
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 888 - 899
  • [7] Classification by pairwise coupling
    Hastie, T
    Tibshirani, R
    ANNALS OF STATISTICS, 1998, 26 (02): : 451 - 471
  • [8] Efficient pairwise classification
    Park, Sang-Hyeun
    Fuernkranz, Johannes
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 658 - +
  • [9] Classification by pairwise coupling
    Hastie, T
    Tibshirani, R
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 507 - 513
  • [10] Topological-collaborative approach for disambiguating authors' names in collaborative networks
    Amancio, Diego R.
    Oliveira, Osvaldo N., Jr.
    Costa, Luciano da F.
    SCIENTOMETRICS, 2015, 102 (01) : 465 - 485