Author name disambiguation using a graph model with node splitting and merging based on bibliographic information

Authors
Dongwook Shin
Taehwan Kim
Joongmin Choi
Jungsun Kim
Affiliation
[1] Hanyang University,Department of Computer Science and Engineering
Source
Scientometrics | 2014 / Vol. 100
Keywords
Author name disambiguation; Graph model; Namesake resolution; Heteronymous name resolution; Digital library
Abstract
Author ambiguity mainly arises when several different authors express their names in the same way, generally known as the namesake problem, and also when the name of one author is expressed in many different ways, referred to as the heteronymous name problem. These author ambiguity problems have long been an obstacle to efficient information retrieval in digital libraries, causing incorrect identification of authors and impeding correct classification of their publications. Distinguishing those authors is a nontrivial task, especially when very limited information about them is available. In this paper, we propose a graph-based approach to author name disambiguation, in which a graph model is constructed from co-author relations and author ambiguity is resolved by graph operations such as vertex (or node) splitting and merging based on co-authorship. In our framework, called a Graph Framework for Author Disambiguation (GFAD), the namesake problem is solved by splitting an author vertex involved in multiple cycles of co-authorship, and the heteronymous name problem is handled by merging multiple author vertices having similar names if those vertices are connected to a common vertex. Experiments were carried out with the real DBLP and Arnetminer collections, and the performance of GFAD was compared with three representative unsupervised author name disambiguation systems. We confirm that GFAD shows better overall performance from the perspective of representative evaluation metrics. An additional contribution is that we released the refined DBLP collection to the public to facilitate organizing a performance benchmark for future systems on author disambiguation.
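The two graph operations named in the abstract can be illustrated with a minimal Python sketch. It is not the GFAD implementation: the name-similarity rule (`names_compatible`), the splitting criterion (grouping the co-authors of a vertex by whether they remain connected once that vertex is removed), all function names, and the sample data are assumptions made only for illustration.

```python
from collections import defaultdict
from itertools import combinations


def build_coauthor_graph(papers):
    """Return an adjacency map: author name -> set of co-author names."""
    graph = defaultdict(set)
    for authors in papers:
        for name in authors:
            graph[name]  # touch the key so single-author papers get a vertex
        for a, b in combinations(authors, 2):
            if a != b:
                graph[a].add(b)
                graph[b].add(a)
    return dict(graph)


def names_compatible(a, b):
    """Assumed similarity rule: same last name, and one first name is a
    prefix of the other once a trailing period is dropped."""
    pa, pb = a.lower().split(), b.lower().split()
    if pa[-1] != pb[-1]:
        return False
    fa, fb = pa[0].rstrip("."), pb[0].rstrip(".")
    return fa.startswith(fb) or fb.startswith(fa)


def merge_heteronyms(graph):
    """Merge similar names that are connected to a common vertex.
    Returns a map from every name to its representative name."""
    parent = {v: v for v in graph}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    for a, b in combinations(sorted(graph), 2):
        if names_compatible(a, b) and graph[a] & graph[b]:
            ra, rb = find(a), find(b)
            if ra != rb:
                # Keep the longer (fuller) spelling as the representative.
                keep, drop = (ra, rb) if len(ra) >= len(rb) else (rb, ra)
                parent[drop] = keep
    return {v: find(v) for v in graph}


def split_namesakes(graph, v):
    """Group the co-authors of v into circles that stay connected to each
    other when v is removed; each group would become one split vertex."""
    unseen = set(graph[v])
    groups = []
    while unseen:
        start = unseen.pop()
        component, stack = {start}, [start]
        while stack:  # depth-first search that never passes through v
            u = stack.pop()
            for w in graph[u]:
                if w != v and w not in component:
                    component.add(w)
                    stack.append(w)
        group = component & graph[v]
        unseen -= group
        groups.append(group)
    return groups


# Hypothetical records: each inner list is the author list of one paper.
papers = [
    ["Wei Wang", "Alice Kim", "Bob Lee"],
    ["Wei Wang", "Alice Kim"],
    ["Wei Wang", "Carol Park", "Dan Cho"],
    ["J. Smith", "Mary Jones"],
    ["John Smith", "Mary Jones"],
]
graph = build_coauthor_graph(papers)
mapping = merge_heteronyms(graph)
groups = split_namesakes(graph, "Wei Wang")
```

In this toy data, "J. Smith" and "John Smith" share the co-author "Mary Jones" and are merged, while "Wei Wang" has two co-author circles with no link between them and is therefore a candidate for splitting into two vertices. A real system would need a stricter similarity rule and further bibliographic evidence before merging or splitting.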
Pages: 15-50
Page count: 35