Extracting information from textual descriptions for actuarial applications

被引:3
|
作者
Manski, Scott [1 ]
Yang, Kaixu [1 ]
Lee, Gee Y. [1 ]
Maiti, Tapabrata [1 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
关键词
Actuarial modelling; Generalised additive models; GloVe; High dimensional; Lasso; Loss modelling; Risk analysis; Word embedding; Word similarity; Text analysis; REGRESSION; SELECTION;
D O I
10.1017/S1748499521000026
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Initial insurance losses are often reported with a textual description of the claim. The claims manager must determine the adequate case reserve for each known claim. In this paper, we present a framework for predicting the amount of loss given a textual description of the claim using a large number of words found in the descriptions. Prior work has focused on classifying insurance claims based on keywords selected by a human expert, whereas in this paper the focus is on loss amount prediction with automatic word selection. In order to transform words into numeric vectors, we use word cosine similarities and word embedding matrices. When we consider all unique words found in the training dataset and impose a generalised additive model to the resulting explanatory variables, the resulting design matrix is high dimensional. For this reason, we use a group lasso penalty to reduce the number of coefficients in the model. The scalable, analytical framework proposed provides for a parsimonious and interpretable model. Finally, we discuss the implications of the analysis, including how the framework may be used by an insurance company and how the interpretation of the covariates can lead to significant policy change. The code can be found in the TAGAM R package (github.com/scottmanski/TAGAM).
引用
收藏
页码:605 / 622
页数:18
相关论文
共 50 条
  • [21] PROCEDURAL MODELLING OF MONUMENTAL BUILDINGS FROM TEXTUAL DESCRIPTIONS
    Rodrigues, Roberto
    Coelho, Antonio
    Reis, Luis Paulo
    GRAPP 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS THEORY AND APPLICATIONS, 2010, : 130 - 133
  • [22] Inferring spatial relations from textual descriptions of images
    Elu, Aitzol
    Azkune, Gorka
    Lopez de Lacalle, Oier
    Arganda-Carreras, Ignacio
    Soroa, Aitor
    Agirre, Eneko
    PATTERN RECOGNITION, 2021, 113
  • [23] Data Model for Procedural Modelling from Textual Descriptions
    Rodrigues, Roberto
    Coelho, Antonio
    Reis, Luis Paulo
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [24] Children Activity Descriptions from Visual and Textual Associations
    Phon-Amnuaisuk, Somnuk
    Murata, Ken T.
    Pavarangkoon, Praphan
    Mizuhara, Takamichi
    Hadi, Shiqah
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2019, 11909 : 121 - 132
  • [25] Extracting business rules from web product descriptions
    Iwaihara, M
    Shiga, T
    Kozawa, V
    WEB INFORMATION SYSTEMS - WISE 2004, PROCEEDINGS, 2004, 3306 : 135 - 146
  • [26] Extracting generalized descriptions from a binary feedforward network
    Sedbrook, T
    APPLIED ARTIFICIAL INTELLIGENCE, 1998, 12 (04) : 309 - 327
  • [27] Extracting surface patches from complete range descriptions
    Fisher, RB
    Fitzgibbon, AW
    Eggert, D
    INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, 1997, : 148 - 154
  • [28] Extracting Code Segments and Their Descriptions from Research Articles
    Chatterjee, Preetha
    Gause, Benjamin
    Hedinger, Hunter
    Pollock, Lori
    2017 IEEE/ACM 14TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2017), 2017, : 91 - 101
  • [29] A method for extracting textual meaning
    Sandford, E
    Fraisse, S
    META, 1997, 42 (02) : 356 - 363
  • [30] Extracting information from text
    Chai, JY
    Biermann, AW
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 202 - 206