Retrieving relatives from historical data

Cited by: 6
Authors
Hundt, Marianne [1]
Denison, David [2]
Schneider, Gerold [1]
Affiliations
[1] Univ Zurich, CH-8006 Zurich, Switzerland
[2] Univ Manchester, Manchester M13 9PL, Lancs, England
Source
LITERARY AND LINGUISTIC COMPUTING | 2012 / Vol. 27 / Issue 01
DOI
10.1093/llc/fqr049
Chinese Library Classification number
H0 [Linguistics];
Discipline classification code
030303; 0501; 050102
Abstract
Variation and change in relativization strategies have been well documented (e.g. Ball 1996: 46, Biber and Clark 2002, Biber, Johansson, Leech, Conrad and Finegan 1999, Johansson 2006, Lehmann 2002). Certain types of relative clause, namely that-relatives and zero relatives, were difficult to retrieve from plain-text corpora. Studies therefore relied either on manual extraction of data or on a subset of possible relativization strategies. In some text types, however, the zero relative is an important member of the class of possible relativizers. Recent advances in syntactic annotation should have made that-relatives and zero relatives more accessible to automatic retrieval. In this article, we test the precision and recall of searches on a modest-sized corpus, i.e. scientific texts from ARCHER (A Representative Corpus of Historical English Registers), as a preliminary to future work on the large corpora which are increasingly becoming available. The parser retrieved some false positives and at the same time missed some relevant data. We discuss structural reasons for both kinds of shortcoming as well as the possibilities and limitations of parser adaptation.
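As a rough illustration of the evaluation measures named in the abstract, the sketch below computes precision and recall for a set of automatically retrieved items against a manually checked gold standard. It is not the authors' code: the function name `precision_recall` and the sentence IDs are hypothetical, invented purely for the example, and the abstract does not state how the article's own figures were computed.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for two collections of item IDs."""
    retrieved = set(retrieved)
    relevant = set(relevant)
    true_positives = retrieved & relevant
    # Precision: share of retrieved items that are genuinely relevant.
    precision = len(true_positives) / len(retrieved) if retrieved else 0.0
    # Recall: share of relevant items that were actually retrieved.
    recall = len(true_positives) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical sentence IDs (illustrative only, not data from the article).
parser_hits = {"s01", "s02", "s03", "s05", "s08"}           # flagged by the parser query
gold_standard = {"s01", "s02", "s04", "s05", "s06", "s08"}  # found by manual annotation

precision, recall = precision_recall(parser_hits, gold_standard)
false_positives = sorted(parser_hits - gold_standard)  # retrieved but not relevant
missed = sorted(gold_standard - parser_hits)           # relevant but not retrieved

print(f"precision={precision:.2f} recall={recall:.2f}")
print("false positives:", false_positives)
print("missed:", missed)
```

The two set differences correspond to the two kinds of shortcoming the abstract mentions: false positives lower precision, and missed items lower recall.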
Pages: 3-16
Page count: 14