An Alignment Comparator for Entity Resolution with Multi-valued Attributes

被引:0
|
作者
Mazzucchi-Augel, Pablo N. [1 ]
Ceballos, Hector G. [1 ]
机构
[1] Tecnol Monterrey, Monterrey, Nuevo Leon, Mexico
关键词
Entity Resolution; Author Matching; Multi-Valued Attributes; Bibliographic databases;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Entity matching is a problem that concerns many data management processes. If we consider matching between entities represented by RDF individuals we might find attributes values lists with variable-length for some properties, which will lead us to the problem of comparing multi-valued attributes, e.g. comparing author names lists for determining publication matching. This matching technique would be more complex than comparing fixed-length records, but less complex than comparing XML documents. Instead of comparing a single string, representing the concatenation of these values, each value of one vector should be compared against all values of the other vector. We propose a set of heuristics to address the alignment and comparison process of multi-valued attributes and evaluate them in the context of bibliographic databases. Our first results show that it is possible to reduce the comparisons amount and provide an aggregated similarity metric that outperforms the average similarity of cross product comparisons.
引用
收藏
页码:272 / 284
页数:13
相关论文
共 50 条
  • [1] Optimization and design for multi-valued quantum comparator circuits
    Zhang, Tingyan
    Li, Yi
    Lu, Sijun
    Ai, Jingwen
    Bai, Mingqiang
    Mo, Zhiwen
    Du, Wenbo
    Jiang, Pengheng
    Liu, Jiawei
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2025, 58 (04)
  • [2] CMOS Multi-valued current comparator as protection circuit
    Samuel, Lino M.
    Patil, Savita Y.
    2013 INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN COMMUNICATION, CONTROL, SIGNAL PROCESSING AND COMPUTING APPLICATIONS (IEEE-C2SPCA-2013), 2013,
  • [3] Managing Multi-Valued Attributes in Spreadsheet Applications
    Churcher, Clare
    McLennan, Theresa
    Spray, Wendy
    PACIFIC ASIA CONFERENCE ON INFORMATION SYSTEMS 2005, SECTIONS 1-8 AND POSTER SESSIONS 1-6, 2005, : 169 - 182
  • [4] Bias of Importance Measures for Multi-valued Attributes and Solutions
    Deng, Houtao
    Runger, George
    Tuv, Eugene
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT II, 2011, 6792 : 293 - +
  • [5] Multi-valued Autoencoders for Multi-valued Neural Networks
    Hata, Ryusuke
    Murase, Kazuyuki
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4412 - 4417
  • [6] Automatic preference learning on numeric and multi-valued categorical attributes
    Marin, Lucas
    Moreno, Antonio
    Isern, David
    KNOWLEDGE-BASED SYSTEMS, 2014, 56 : 201 - 215
  • [7] A Novel Very Low-Complexity Multi-valued Logic Comparator in Nanoelectronics
    Seied Ali Hosseini
    Sajjad Etezadi
    Circuits, Systems, and Signal Processing, 2020, 39 : 223 - 244
  • [8] A Novel Very Low-Complexity Multi-valued Logic Comparator in Nanoelectronics
    Hosseini, Seied Ali
    Etezadi, Sajjad
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (01) : 223 - 244
  • [9] A semantic similarity measure for objects described with multi-valued categorical attributes
    Moreno, Antonio
    Valls, Aida
    Mata, Ferran
    Martinez, Sergio
    Marin, Lucas
    Vicient, Carlos
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE OF THE CATALAN ASSOCIATION FOR ARTIFICIAL INTELLIGENCE, 2013, 256 : 263 - 272
  • [10] Fast Feature Selection Using Partial Correlation for Multi-valued Attributes
    Lallich, S.
    Rakotomalala, R.
    LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 221 - 231