Data-Augmentation Method for BERT-based Legal Textual Entailment Systems in COLIEE Statute Law Task

被引:1
|
作者
Aoki, Yasuhiro [1 ]
Yoshioka, Masaharu [1 ,2 ,3 ,4 ]
Suzuki, Youta [1 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, N14 W9, Sapporo, Hokkaido, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo, Hokkaido, Japan
[3] Hokkaido Univ, Global Stn Big Data & Cybersecur, Sapporo, Hokkaido, Japan
[4] Hokkaido Univ, Inst Chem React Design & Discovery WPI ICReDD, Sapporo, Hokkaido, Japan
来源
REVIEW OF SOCIONETWORK STRATEGIES | 2022年 / 16卷 / 01期
关键词
D O I
10.1007/s12626-022-00104-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A legal textual entailment task is a task to recognize entailment between a law article and its statements. In the Competition on Legal Information Extraction/Entailment (COLIEE), this task is designed as a task to confirm the entailment of a yes/no answer from the given civil code article(s). Based on the development of deep-learning-based natural language processing tools such as bidirectional encoder representations from transformers (BERT), many participants in the task used such tools, and the best performance system of COLIEE 2020 was a BERT-based system. However, because of the limitation of the size of training data provided by the task organizer, training such tools to adapt to the variability of the questions is difficult. In this paper, we propose a data-augmentation method to make training data using civil code articles for understanding the syntactic structure of the questions and articles for entailment. Our BERT-based ensemble system, which uses this augmentation method, achieves the best performance (accuracy = 0.7037) in Task 4 of COLIEE 2021. We also introduce the results of additional experiments to discuss the characteristics of the proposed method.
引用
收藏
页码:175 / 196
页数:22
相关论文
共 4 条
  • [1] Data-Augmentation Method for BERT-based Legal Textual Entailment Systems in COLIEE Statute Law Task
    Yasuhiro Aoki
    Masaharu Yoshioka
    Youta Suzuki
    The Review of Socionetwork Strategies, 2022, 16 : 175 - 196
  • [2] BERT-Based Ensemble Model for Statute Law Retrieval and Legal Information Entailment
    Shao, Hsuan-Lei
    Chen, Yi-Chia
    Huang, Sieh-Chuen
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2020, 2021, 12758 : 226 - 239
  • [3] Legal Textual Entailment Using Ensemble of Rule-Based and BERT-Based Method with Data Augmentation by Related Article Generation
    Fujita, Masaki
    Onaga, Takaaki
    Ueyama, Ayaka
    Kano, Yoshinobu
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 13859 LNAI : 138 - 153
  • [4] Legal Textual Entailment Using Ensemble of Rule-Based and BERT-Based Method with Data Augmentation by Related Article Generation
    Fujita, Masaki
    Onaga, Takaaki
    Ueyama, Ayaka
    Kano, Yoshinobu
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2022 WORKSHOP, JURISIN 2022, JSAI 2022, 2023, 13859 : 138 - 153