WEB PERSON NAME DISAMBIGUATION USING SOCIAL LINKS AND ENRICHED PROFILE INFORMATION

Cited by: 1
Authors
Emami, Hojjat [1]
Shirazi, Hossein [1]
Barforoush, Ahmad Abdollahzadeh [2]
Affiliations
[1] Malek Ashtar Univ Technol, Dept Informat & Commun Technol ICT, Social Network & Intelligent Syst Lab, Tehran, Iran
[2] Amirkabir Univ Technol, Comp Engn & IT Dept, Intelligent Syst Lab, Tehran, Iran
Keywords
Web mining; cross-document name disambiguation; social links; profile enrichment; clustering
DOI
10.4149/cai_2018_6_1485
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this article, we investigate the problem of cross-document person name disambiguation, which aims to resolve ambiguities between person names and to cluster web documents according to which of the different persons sharing the same name they refer to. Most previous work formulates cross-document name disambiguation as a clustering problem. These methods employ various syntactic and semantic features, drawn either from the local corpus or from distant knowledge bases, to compute similarities between entities and group similar entities. However, these approaches show limitations in robustness and performance. We propose an unsupervised, graph-based name disambiguation approach to improve on the performance and robustness of the state of the art. Our approach exploits both local information extracted from the given corpus and global information obtained from distant knowledge bases. We show the effectiveness of our approach by testing it on the standard WePS datasets. The experimental results are encouraging and show that our proposed method outperforms several baseline methods as well as its counterparts. The experiments show that our approach improves not only the performance but also the robustness of name disambiguation.
Pages: 1485-1515
Page count: 31
Related papers
41 in total
  • [1] Treatment of Social Media in Person Name Disambiguation in the Web
    Delgado, Agustin D.
    Martinez, Raquel
    Montalvo, Soto
    Fresno, Victor
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2016, (57): 117-124
  • [2] A Survey of Person Name Disambiguation on the Web
    Delgado, Agustin D.
    Montalvo, Soto
    Martinez Unanue, Raquel
    Fresno, Victor
    IEEE ACCESS, 2018, 6: 59496-59514
  • [3] Using Web Information for Author Name Disambiguation
    Pereira, Denilson Alves
    Ribeiro-Neto, Berthier
    Ziviani, Nivio
    Laender, Alberto H. F.
    Goncalves, Marcos Andre
    Ferreira, Anderson A.
    JCDL 09: PROCEEDINGS OF THE 2009 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, 2009: 49-58
  • [4] Person Name Disambiguation in the Web Using Adaptive Threshold Clustering
    Delgado, Agustin D.
    Martinez, Raquel
    Montalvo, Soto
    Fresno, Victor
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2017, 68 (07): 1751-1762
  • [5] Name disambiguation in person information mining
    Wei, Yu-Chuan
    Lin, Ming-Shun
    Chen, Hsin-Hsi
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 378 - +
  • [6] Person name disambiguation on the web in a multilingual context
    Delgado, Agustin D.
    Martinez, Raquel
    Montalvo, Soto
    Fresno, Victor
    INFORMATION SCIENCES, 2018, 465: 373-387
  • [7] An Unsupervised Algorithm for Person Name Disambiguation in the Web
    Delgado, Agustin D.
    Martinez, Raquel
    Fresno, Victor
    Montalvo, Soto
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2014, (53): 51-58
  • [8] Person name disambiguation in web pages using social network, compound words and latent topics
    Ono, Shingo
    Sato, Issei
    Yoshida, Minoru
    Nakagawa, Hiroshi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 260 - +
  • [9] Person name disambiguation of searching results using social network
    Lang, Jun
    Qin, Bing
    Song, Wei
    Liu, Long
    Liu, Ting
    Li, Sheng
    Jisuanji Xuebao/Chinese Journal of Computers, 2009, 32 (07): 1365-1374
  • [10] A Graph-based Approach to Person Name Disambiguation in Web
    Emami, Hojjat
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2019, 10 (02)