Methods for precise named entity matching in digital collections

被引:9
|
作者
Davis, PT [1 ]
Elson, DK [1 ]
Klavans, JL [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
关键词
D O I
10.1109/JCDL.2003.1204852
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we describe an interactive system, built within the context,of CLiMB project, which permits a user to locate the occurrences of named entities within a given text. The named entity tool was developed to identify references to a single art object (e.g. a particular building) with high precision in,text related to images of that object in a digital collection. We start with an authoritative list of art objects, and seek to match variants of these named entities in related text. Our approach is to "decay" entities into progressively more general variants while retaining high precision. As variants become more general, and thus more ambiguous, we propose methods to disambiguate intermediate results. Our results will be used to select records into which automatically generated metadata will be loaded.
引用
收藏
页码:125 / 127
页数:3
相关论文
共 50 条
  • [1] Effective Named Entity Recognition for Idiosyncratic Web Collections
    Prokofyev, Roman
    Demartini, Gianluca
    Cudre-Mauroux, Philippe
    WWW'14: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 397 - 407
  • [2] Two Test Collections for Retrieval Using Named Entity Markup
    Bremerman, Jacob
    Lawrie, Dawn
    Mayfield, James
    Oard, Douglas W.
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3265 - 3268
  • [3] Pair Hidden Markov Model for Named Entity Matching
    Nabende, Peter
    Tiedemann, Jorg
    Nerbonne, John
    INNOVATIONS AND ADVANCES IN COMPUTER SCIENCES AND ENGINEERING, 2010, : 497 - 502
  • [4] Comparison of Methods to Annotate Named Entity Corpora
    Komiya, Kanako
    Suzuki, Masaya
    Iwakura, Tomoya
    Sasaki, Minoru
    Shinnou, Hiroyuki
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2018, 17 (04)
  • [5] Named Entity Disambiguation for Archival Collections: Metadata, Wikidata, and Linked Data
    Polley, Katherine Louise
    Tompkins, Vivian Teresa
    Honick, Brendan John
    Qin, Jian
    Proceedings of the Association for Information Science and Technology, 2021, 58 (01) : 520 - 524
  • [6] Harnessing Historical Corrections to Build Test Collections for Named Entity Disambiguation
    Reitz, Florian
    DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2018, 2018, 11057 : 47 - 58
  • [7] Efficient methods for biomedical named entity recognition
    Chan, Shing-Kit
    Lam, Wai
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 729 - 735
  • [8] Character Gazetteer for Named Entity Recognition with Linear Matching Complexity
    Dlugolinsky, Stefan
    Nguyen, Giang
    Laclavik, Michal
    Seleng, Martin
    2013 THIRD WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2013, : 361 - 365
  • [9] Chinese Named Entity Recognition Methods Combined with Entity Boundary Cues
    Huang, Rong
    Chen, Yanping
    Hu, Ying
    Huang, Ruizhang
    Qin, Yongbin
    Computer Engineering and Applications, 2024, 60 (06) : 199 - 206
  • [10] Improving Disease Named Entity Recognition for Clinical Trial Matching
    Khan, Md Abdullah Al Hafiz
    Shamsuzzaman, Md
    Hasan, Sadid A.
    Sorower, Mohammad S.
    Liu, Joey
    Datla, Vivek
    Milosevic, Mladen
    Mankovich, Gabe
    van Ommering, Rob
    Dimitrova, Nevenka
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2541 - 2548