Retrieving relatives from historical data

被引:6
|
作者
Hundt, Marianne [1 ]
Denison, David [2 ]
Schneider, Gerold [1 ]
机构
[1] Univ Zurich, CH-8006 Zurich, Switzerland
[2] Univ Manchester, Manchester M13 9PL, Lancs, England
来源
LITERARY AND LINGUISTIC COMPUTING | 2012年 / 27卷 / 01期
关键词
D O I
10.1093/llc/fqr049
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Variation and change in relativization strategies has been well documented (e. g. Ball 1996: 46, Biber and Clark 2002, Biber, Johansson, Leech, Conrad and Finegan 1999, Johansson 2006, Lehmann 2002). Certain types of relative clause, namely that-relatives and zero relatives, were difficult to retrieve from plain-text corpora. Studies therefore either relied on manual extraction of data or a subset of possible relativization strategies. In some text types, however, the zero relative is an important member of the class of possible relativizers. Recent advances in syntactic annotation should have made that-relatives and zero relatives more accessible to automatic retrieval. In this article, we test precision and recall of searches on a modest-sized corpus, i.e. scientific texts from ARCHER (A Representative Corpus of Historical English Registers), as a preliminary to future work on the large corpora which are increasingly becoming available. The parser retrieved some false positives and at the same time missed some relevant data. We discuss structural reasons for both kinds of shortcoming as well as the possibilities and limitations of parser adaptation.
引用
收藏
页码:3 / 16
页数:14
相关论文
共 50 条
  • [21] Word embeddings for retrieving tabular data from research publications
    Alberto Berenguer
    Jose-Norberto Mazón
    David Tomás
    Machine Learning, 2024, 113 : 2227 - 2248
  • [22] RETRIEVING THE MISSING DATA FROM DIFFERENT INCOMPLETE SOFT SETS
    Srivastava, Julee
    Maddheshiya, Sudhir
    3C EMPRESA, 2022, 11 (02): : 104 - 114
  • [23] Enhanced Interface for Retrieving Glycan and Glycosylation Data from GlyGen
    Tiemeyer, Michael
    Kulkarni, Sujeet
    Kahsay, Robel
    Ranzinger, Rene
    Mazumder, Raja
    FASEB JOURNAL, 2021, 35
  • [24] Retrieving Unobserved Consideration Sets from Household Panel Data
    van Nierop, Erjen
    Bronnenberg, Bart
    Paap, Richard
    Wedel, Michel
    Franses, Philip Hans
    JOURNAL OF MARKETING RESEARCH, 2010, 47 (01) : 63 - 74
  • [25] Retrieving Clinical and Omic Data from Electronic Health Records
    Cabot, Chloe
    Lelong, Romain
    Grosjean, Julien
    Soualmia, Lina F.
    Darmoni, Stefan J.
    TRANSFORMING HEALTHCARE WITH THE INTERNET OF THINGS, 2016, 221 : 115 - 115
  • [26] THE MEASUREMENT OF CAPITAL: RETRIEVING INITIAL VALUES FROM PANEL DATA
    Chen, Xi
    Plotnikova, Tatiana
    REVIEW OF INCOME AND WEALTH, 2018, 64 (03) : 542 - 562
  • [27] Word embeddings for retrieving tabular data from research publications
    Berenguer, Alberto
    Mazon, Jose-Norberto
    Tomas, David
    MACHINE LEARNING, 2024, 113 (04) : 2227 - 2248
  • [28] Optimizing the algorithm for retrieving soil moisture from SMOS data
    Waldteufel, P.
    Richaume, P.
    Kerr, Y.
    Wigneron, J. -P
    Mahmoodi, A.
    Mialon, A.
    Vergely, J. -L
    Cabot, F.
    Ferrazzoli, P.
    Delwart, S.
    IGARSS: 2007 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-12: SENSING AND UNDERSTANDING OUR PLANET, 2007, : 3952 - +
  • [29] CONCERNING HISTORICAL AND LOCAL LEGENDS AND THEIR RELATIVES
    JASON, H
    JOURNAL OF AMERICAN FOLKLORE, 1971, 84 (331) : 134 - 144
  • [30] STORING AND RETRIEVING LOCATIONAL DATA
    CALOGERO, V
    BARTH, K
    ATHANASSOULIS, G
    COMPUTER-AIDED DESIGN, 1978, 10 (04) : 249 - 256