Reranking and Self-Training for Parser Adaptation

被引:0
|
作者
McClosky, David [1 ]
Charniak, Eugene [1 ]
Johnson, Mark [1 ]
机构
[1] Brown Univ, BLLIP, Providence, RI 02912 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Statistical parsers trained and tested on the Penn Wall Street Journal (WSJ) treebank have shown vast improvements over the last 10 years. Much of this improvement, however, is based upon an ever-increasing number of features to be trained on (typically) the WSJ treebank data. This has led to concern that such parsers may be too finely tuned to this corpus at the expense of portability to other genres. Such worries have merit. The standard "Charniak parser" checks in at a labeled precision-recall f-measure of 89.7% on the Penn WSJ test set, but only 82.9% on the test set from the Brown treebank corpus. This paper should allay these fears. In particular, we show that the reranking parser described in Charniak and Johnson (2005) improves performance of the parser on Brown to 85.2%. Furthermore, use of the self-training techniques described in (MeClosky et al., 2006) raise this to 87.8% (an error reduction of 28%) again without any use of labeled Brown data. This is remarkable since training the parser and reranker on labeled Brown data achieves only 88.4%.
引用
收藏
页码:337 / 344
页数:8
相关论文
共 50 条
  • [31] Unsupervised Adaptation of Question Answering Systems via Generative Self-training
    Rennie, Steven J.
    Marcheret, Etienne
    Mallinar, Neil
    Nahamoo, David
    Goel, Vaibhava
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1148 - 1157
  • [32] Unsupervised Domain Adaptation with Multiple Domain Discriminators and Adaptive Self-Training
    Spadotto, Teo
    Toldo, Marco
    Michieli, Umberto
    Zanuttigh, Pietro
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2845 - 2852
  • [33] Test-time adaptation via self-training with future information
    Wen, Xin
    Shen, Hao
    Zhao, Zhongqiu
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33012
  • [34] Combining Semantic Self-Supervision and Self-Training for Domain Adaptation in Semantic Segmentation
    Niemeijer, Joshua
    Schaefer, Joerg P.
    2021 IEEE INTELLIGENT VEHICLES SYMPOSIUM WORKSHOPS (IV WORKSHOPS), 2021, : 364 - 371
  • [35] Unsupervised Domain Adaptation for Medical Image Segmentation by Disentanglement Learning and Self-Training
    Xie, Qingsong
    Li, Yuexiang
    He, Nanjun
    Ning, Munan
    Ma, Kai
    Wang, Guoxing
    Lian, Yong
    Zheng, Yefeng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (01) : 4 - 14
  • [36] Reranking CCG Parser for Jazz Chord Sequences
    Hong Xuan Ong
    Nguyen Le Minh
    Tojo, Satoshi
    2016 EIGHTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2016, : 205 - 211
  • [37] An Evaluation of Self-training Styles for Domain Adaptation on the Task of Splice Site Prediction
    Herndon, Nic
    Caragea, Doina
    PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 1042 - 1047
  • [38] AT-ST: Self-training Adaptation Strategy for OCR in Domains with Limited Transcriptions
    Kiss, Martin
    Benes, Karel
    Hradis, Michal
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 463 - 477
  • [39] Source-Free Domain Adaptation for Question Answering with Masked Self-training
    Yin, Maxwell J.
    Dong, Yue
    Wang, Boyu
    Ling, Charles
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 721 - 737
  • [40] UNSUPERVISED DOMAIN ADAPTATION FOR SPEECH RECOGNITION VIA UNCERTAINTY DRIVEN SELF-TRAINING
    Khurana, Sameer
    Moritz, Niko
    Hori, Takaaki
    Le Roux, Jonathan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6553 - 6557