Japanese legal term correction using random forest

被引：0

作者：

Yamakoshi T. ^{[1
]}

Ogawa Y. ^{[1
,2
]}

Komamizu T.

Toyama K.

机构：

[1] Graduate School of Informatics, Nagoya University

[2] Information Technology Center/Graduate School of Informatics, Nagoya University

来源：

Trans. Jpn. Soc. Artif. Intell. | 2020年 / 1卷

基金：

日本学术振兴会;

关键词：

Japanese legal term; Legal term correction; Random forest;

D O I：

10.1527/tjsai.H-J53

中图分类号：

学科分类号：

摘要：

We propose a method that assists legislation drafters in finding inappropriate use of Japanese legal terms and their corrections from Japanese statutory sentences. In particular, we focus on sets of similar legal terms whose usages are strictly defined in legislation drafting rules that have been established over the years. In this paper, we first define input and output of legal term correction task. We regard it as a special case of sentence completion test with multiple choices. Next, we describe a legal term correction method for Japanese statutory sentences. Our method predicts suitable legal terms using Random Forest classifiers. The classifiers in our method use adjacent words to a target legal term as input features, and are optimized in various parameters including the number of adjacent words to be used for each legal term set. We conduct an experiment using actual statutory sentences from 3,983 existing acts and cabinet orders that consist of approximately 47M words in total. As for legal term sets, we pick 27 sets from legislation drafting manuals. The experimental result shows that our method outperformed existing modern word prediction methods using neural language models and that each Random Forest classifier utilizes characteristics of its corresponding legal term set. © 2020, Japanese Society for Artificial Intelligence. All rights reserved.

引用

共 50 条

[1] Japanese Legal Term Correction Using Random Forests
Yamakoshi, Takahiro
Komamizu, Takahiro
Ogawa, Yasuhiro
Toyama, Katsuhiko
LEGAL KNOWLEDGE AND INFORMATION SYSTEMS (JURIX 2018), 2018, 313 : 161 - 170
[2] Japanese Mistakable Legal Term Correction using Infrequency-aware BERT Classifier
Yamakoshi, Takahiro
Komamizu, Takahiro
Ogawa, Yasuhiro
Toyama, Katsuhiko
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4342 - 4351
[3] Japanese mistakable legal term correction using infrequency-aware bert classifier
Yamakoshi T.
Komamizu T.
Ogawa Y.
Toyama K.
Transactions of the Japanese Society for Artificial Intelligence, 2020, 35 (04) : 1 - 17
[4] Correction for population stratification in random forest analysis
Zhao, Yang
Chen, Feng
Zhai, Rihong
Lin, Xihong
Wang, Zhaoxi
Su, Li
Christiani, David C.
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2012, 41 (06) : 1798 - 1806
[5] A simple correction for population stratification in random forest analysis
Zhao, Yang
Zhai, Rihong
Lin, Xihong
Wang, Mike Zhaoxi
Su, Li
Christiani, David C.
CANCER RESEARCH, 2012, 72
[6] Short-term prediction of groundwater level using improved random forest regression with a combination of random features
Wang, Xuanhui
Liu, Tailian
Zheng, Xilai
Peng, Hui
Xin, Jia
Zhang, Bo
APPLIED WATER SCIENCE, 2018, 8 (05)
[7] Short-term prediction of groundwater level using improved random forest regression with a combination of random features
Xuanhui Wang
Tailian Liu
Xilai Zheng
Hui Peng
Jia Xin
Bo Zhang
Applied Water Science, 2018, 8
[8] Using Random Forest To Model the Domain Applicability of Another Random Forest Model
Sheridan, Robert P.
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (11) : 2837 - 2850
[9] Diabetes detection using random forest classifier and risk score calculation using random forest regressor
Kaur, Simarjeet
Kaur, Damandeep
Mayank, Mrinal
Singh, Nongmeikapam Thoiba
Artificial Intelligence, Blockchain, Computing and Security - Proceedings of the International Conference on Artificial Intelligence, Blockchain, Computing and Security, ICABCS 2023, 2024, 2 : 426 - 431
[10] Password Guessing Using Random Forest
Wang, Ding
Zou, Yunkai
Zhang, Zijian
Xiu, Kedong
PROCEEDINGS OF THE 32ND USENIX SECURITY SYMPOSIUM, 2023, : 965 - 982

← 1 2 3 4 5 →