Japanese legal term correction using random forest

被引:0
|
作者
Yamakoshi T. [1 ]
Ogawa Y. [1 ,2 ]
Komamizu T.
Toyama K.
机构
[1] Graduate School of Informatics, Nagoya University
[2] Information Technology Center/Graduate School of Informatics, Nagoya University
来源
基金
日本学术振兴会;
关键词
Japanese legal term; Legal term correction; Random forest;
D O I
10.1527/tjsai.H-J53
中图分类号
学科分类号
摘要
We propose a method that assists legislation drafters in finding inappropriate use of Japanese legal terms and their corrections from Japanese statutory sentences. In particular, we focus on sets of similar legal terms whose usages are strictly defined in legislation drafting rules that have been established over the years. In this paper, we first define input and output of legal term correction task. We regard it as a special case of sentence completion test with multiple choices. Next, we describe a legal term correction method for Japanese statutory sentences. Our method predicts suitable legal terms using Random Forest classifiers. The classifiers in our method use adjacent words to a target legal term as input features, and are optimized in various parameters including the number of adjacent words to be used for each legal term set. We conduct an experiment using actual statutory sentences from 3,983 existing acts and cabinet orders that consist of approximately 47M words in total. As for legal term sets, we pick 27 sets from legislation drafting manuals. The experimental result shows that our method outperformed existing modern word prediction methods using neural language models and that each Random Forest classifier utilizes characteristics of its corresponding legal term set. © 2020, Japanese Society for Artificial Intelligence. All rights reserved.
引用
收藏
相关论文
共 50 条
  • [1] Japanese Legal Term Correction Using Random Forests
    Yamakoshi, Takahiro
    Komamizu, Takahiro
    Ogawa, Yasuhiro
    Toyama, Katsuhiko
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS (JURIX 2018), 2018, 313 : 161 - 170
  • [2] Japanese Mistakable Legal Term Correction using Infrequency-aware BERT Classifier
    Yamakoshi, Takahiro
    Komamizu, Takahiro
    Ogawa, Yasuhiro
    Toyama, Katsuhiko
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4342 - 4351
  • [3] Japanese mistakable legal term correction using infrequency-aware bert classifier
    Yamakoshi T.
    Komamizu T.
    Ogawa Y.
    Toyama K.
    Transactions of the Japanese Society for Artificial Intelligence, 2020, 35 (04) : 1 - 17
  • [4] Correction for population stratification in random forest analysis
    Zhao, Yang
    Chen, Feng
    Zhai, Rihong
    Lin, Xihong
    Wang, Zhaoxi
    Su, Li
    Christiani, David C.
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2012, 41 (06) : 1798 - 1806
  • [5] A simple correction for population stratification in random forest analysis
    Zhao, Yang
    Zhai, Rihong
    Lin, Xihong
    Wang, Mike Zhaoxi
    Su, Li
    Christiani, David C.
    CANCER RESEARCH, 2012, 72
  • [6] Short-term prediction of groundwater level using improved random forest regression with a combination of random features
    Wang, Xuanhui
    Liu, Tailian
    Zheng, Xilai
    Peng, Hui
    Xin, Jia
    Zhang, Bo
    APPLIED WATER SCIENCE, 2018, 8 (05)
  • [7] Short-term prediction of groundwater level using improved random forest regression with a combination of random features
    Xuanhui Wang
    Tailian Liu
    Xilai Zheng
    Hui Peng
    Jia Xin
    Bo Zhang
    Applied Water Science, 2018, 8
  • [8] Using Random Forest To Model the Domain Applicability of Another Random Forest Model
    Sheridan, Robert P.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (11) : 2837 - 2850
  • [9] Diabetes detection using random forest classifier and risk score calculation using random forest regressor
    Kaur, Simarjeet
    Kaur, Damandeep
    Mayank, Mrinal
    Singh, Nongmeikapam Thoiba
    Artificial Intelligence, Blockchain, Computing and Security - Proceedings of the International Conference on Artificial Intelligence, Blockchain, Computing and Security, ICABCS 2023, 2024, 2 : 426 - 431
  • [10] Password Guessing Using Random Forest
    Wang, Ding
    Zou, Yunkai
    Zhang, Zijian
    Xiu, Kedong
    PROCEEDINGS OF THE 32ND USENIX SECURITY SYMPOSIUM, 2023, : 965 - 982