Handling conjunctions in named entities

被引:0
|
作者
Mazur, Pawel [1 ,2 ]
Dale, Robert [2 ]
机构
[1] Wroclaw Univ Technol, Inst Appl Informat, Wyb Wyspianskiego 27, PL-50370 Wroclaw, Poland
[2] Macquarie Univ, Ctr Language Technol, Sydney, NSW 2109, Australia
来源
LINGUISTICAE INVESTIGATIONES | 2007年 / 30卷 / 01期
关键词
named entity recognition; conjunctions; machine learning;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Although the literature contains reports of very high accuracy figures for the recognition of named entities in text, there are still some named entity phenomena that remain problematic for existing text processing systems. One of these is the ambiguity of conjunctions in candidate named entity strings, an all-too-prevalent problem in corporate and legal documents. In this paper, we distinguish four uses of the conjunction in these strings, and explore the use of a supervised machine learning approach to conjunction disambiguation trained on a very limited set of 'name internal' features that avoids the need for expensive lexical or semantic resources. We achieve 84% correctly classified examples using k-fold evaluation on a data set of 600 instances. We argue that further improvements are likely to require the use of wider domain knowledge and name external features.
引用
收藏
页码:49 / 68
页数:20
相关论文
共 50 条
  • [1] Handling conjunctions in named entities
    Dale, Robert
    Mazur, Pawel
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 131 - +
  • [2] Separating Named Entities
    Ulipova, Barbora
    Grac, Marek
    RASLAN 2014: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2014, : 91 - 96
  • [3] Named Entities for Computational Linguistics
    Golikova, Daria M.
    VOPROSY ONOMASTIKI-PROBLEMS OF ONOMASTICS, 2018, 15 (01): : 207 - 215
  • [4] Cluster analysis of named entities
    Kozareva, Z
    Silva, J
    Gamallo, P
    Lopes, G
    INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2004, : 429 - 433
  • [5] Indexing concepts and/or named entities
    Buizza, Pino
    JLIS.IT, 2011, 2 (02):
  • [6] Processing Named Entities in Text
    McNamee, Paul
    Mayfield, James C.
    Piatko, Christine D.
    JOHNS HOPKINS APL TECHNICAL DIGEST, 2011, 30 (01): : 31 - 40
  • [7] Identifying Named Entities as they are Typed
    Arora, Ravneet Singh
    Tsai, Chen-Tse
    Preotiuc-Pietro, Daniel
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 976 - 988
  • [8] Integrating Bilingual Named Entities Lexicon with Conditional Random Fields Model for Arabic Named Entities Recognition
    Hkiri, Emna
    Mallati, Souheyl
    Zrigui, Mounir
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 609 - 614
  • [9] Disambiguating named entities by semantic web
    Azari, Ideh
    Koohpeyma, Fateme
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SERVICE SYSTEM (CSSS), 2014, 109 : 741 - 744
  • [10] Community relation discovery by named entities
    Zhu, Jian-Han
    Goncalves, Alexandre L.
    Uren, Victoria S.
    Motta, Enrico
    Pacheco, Roberto
    Song, Da-Wei
    Rueger, Stefan
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1966 - +