Japanese legal term correction using random forest

被引:0
|
作者
Yamakoshi T. [1 ]
Ogawa Y. [1 ,2 ]
Komamizu T.
Toyama K.
机构
[1] Graduate School of Informatics, Nagoya University
[2] Information Technology Center/Graduate School of Informatics, Nagoya University
来源
基金
日本学术振兴会;
关键词
Japanese legal term; Legal term correction; Random forest;
D O I
10.1527/tjsai.H-J53
中图分类号
学科分类号
摘要
We propose a method that assists legislation drafters in finding inappropriate use of Japanese legal terms and their corrections from Japanese statutory sentences. In particular, we focus on sets of similar legal terms whose usages are strictly defined in legislation drafting rules that have been established over the years. In this paper, we first define input and output of legal term correction task. We regard it as a special case of sentence completion test with multiple choices. Next, we describe a legal term correction method for Japanese statutory sentences. Our method predicts suitable legal terms using Random Forest classifiers. The classifiers in our method use adjacent words to a target legal term as input features, and are optimized in various parameters including the number of adjacent words to be used for each legal term set. We conduct an experiment using actual statutory sentences from 3,983 existing acts and cabinet orders that consist of approximately 47M words in total. As for legal term sets, we pick 27 sets from legislation drafting manuals. The experimental result shows that our method outperformed existing modern word prediction methods using neural language models and that each Random Forest classifier utilizes characteristics of its corresponding legal term set. © 2020, Japanese Society for Artificial Intelligence. All rights reserved.
引用
收藏
相关论文
共 50 条
  • [41] Location recognition system using random forest
    Sunmin Lee
    Nammee Moon
    Journal of Ambient Intelligence and Humanized Computing, 2018, 9 : 1191 - 1196
  • [42] ANOMALY DETECTION BY USING RANDOM PROJECTION FOREST
    Chen, Fan
    Liu, Zicheng
    Sun, Ming-ting
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1210 - 1214
  • [43] Predicting student dropouts using random forest
    Devi, Kapila
    Ratnoo, Saroj
    JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2022, 25 (07) : 1579 - 1590
  • [44] Lung cancer prediction using random forest
    Rajini A.
    Jabbar M.A.
    Recent Advances in Computer Science and Communications, 2021, 14 (05) : 1650 - 1657
  • [45] A comparison of random forest based algorithms: random credal random forest versus oblique random forest
    Mantas, Carlos J.
    Castellano, Javier G.
    Moral-Garcia, Serafin
    Abellan, Joaquin
    SOFT COMPUTING, 2019, 23 (21) : 10739 - 10754
  • [46] Short-term electric power load forecasting using random forest and gated recurrent unit
    Venkataramana Veeramsetty
    K. Rajeshwar Reddy
    M. Santhosh
    Arjun Mohnot
    Gaurav Singal
    Electrical Engineering, 2022, 104 : 307 - 329
  • [47] Short-term electric power load forecasting using random forest and gated recurrent unit
    Veeramsetty, Venkataramana
    Reddy, K. Rajeshwar
    Santhosh, M.
    Mohnot, Arjun
    Singal, Gaurav
    ELECTRICAL ENGINEERING, 2022, 104 (01) : 307 - 329
  • [48] Forest signal detection for photon counting LiDAR using Random Forest
    Chen, Bowei
    Pang, Yong
    Li, Zengyuan
    Lu, Hao
    North, Peter
    Rosette, Jacqueline
    Yan, Min
    REMOTE SENSING LETTERS, 2020, 11 (01) : 37 - 46
  • [49] Correction: Understanding overfitting in random forest for probability estimation: a visualization and simulation study
    Lasai Barreñada
    Paula Dhiman
    Dirk Timmerman
    Anne-Laure Boulesteix
    Ben Van Calster
    Diagnostic and Prognostic Research, 9 (1)
  • [50] A grid model for vertical correction of precipitable water vapor over the Chinese mainland and surrounding areas using random forest
    Li, Junyu
    Wang, Yuxin
    Liu, Lilong
    Yao, Yibin
    Huang, Liangke
    Li, Feijuan
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2024, 17 (07) : 2569 - 2581