Unsupervised Discourse Constituency Parsing Using Viterbi EM

Cited by: 10
Authors
Nishida, Noriki [1]
Nakayama, Hideki [1]
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
Keywords
57;
DOI
10.1162/tacl_a_00312
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In this paper, we introduce an unsupervised discourse constituency parsing algorithm. We use Viterbi EM with a margin-based criterion to train a span-based discourse parser in an unsupervised manner. We also propose initialization methods for Viterbi training of discourse constituents based on our prior knowledge of text structures. Experimental results demonstrate that our unsupervised parser achieves comparable or even superior performance to fully supervised parsers. We also investigate discourse constituents that are learned by our method.
Pages: 215-230
Number of pages: 16
Related papers
50 items in total
  • [41] Unsupervised morphological parsing of Bengali
    Sajib Dasgupta
    Vincent Ng
    Language Resources and Evaluation, 2006, 40 : 311 - 330
  • [42] Multilingual Unsupervised Dependency Parsing with Unsupervised POS Tags
    Marecek, David
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I, 2015, 9413 : 72 - 82
  • [43] UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging
    Haenig, Christian
    Bordag, Stefan
    Quasthoff, Uwe
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1109 - 1114
  • [44] An Empirical Study for Vietnamese Constituency Parsing with Pre-training
    Tuan-Vi Tran
    Xuan-Thien Pham
    Duc-Vu Nguyen
    Kiet Van Nguyen
    Ngan Luu-Thuy Nguyen
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 234 - 239
  • [45] Event Detection in Hungarian Texts with Dependency and Constituency Parsing and WordNet
    Subecz, Zoltan
    2017 IEEE 14TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATICS, 2017, : 365 - 371
  • [46] Vietnamese Span-based Constituency Parsing with BERT Embedding
    Phan, Thi-Phuong-Uyen
    Huynh, Ngoc-Thanh-Tung
    Truong, Hung-Thinh
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 293 - 299
  • [47] Linear-Time Constituency Parsing with RNNs and Dynamic Programming
    Hong, Juneki
    Huang, Liang
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 477 - 483
  • [48] Unsupervised image segmentation using EM algorithm by histogram
    Huang, Zhi-Kai
    Liu, De-Hui
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2007, 4681 : 1275 - +
  • [49] Measurement of sentence similarity based on constituency parsing and dilated convolution
    Ji, MingYu
    Wang, ChenLong
    Liu, Gang
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 64 (03) : 252 - 259
  • [50] Sentence level discourse parsing using syntactic and lexical information
    Soricut, R
    Marcu, D
    HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 228 - 235