Extracting information from textual descriptions for actuarial applications

被引：3

作者：

Manski, Scott ^{[1
]}

Yang, Kaixu ^{[1
]}

Lee, Gee Y. ^{[1
]}

Maiti, Tapabrata ^{[1
]}

机构：

[1] Michigan State Univ, E Lansing, MI 48824 USA

来源：

ANNALS OF ACTUARIAL SCIENCE | 2021年 / 15卷 / 03期

关键词：

Actuarial modelling; Generalised additive models; GloVe; High dimensional; Lasso; Loss modelling; Risk analysis; Word embedding; Word similarity; Text analysis; REGRESSION; SELECTION;

D O I：

10.1017/S1748499521000026

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

Initial insurance losses are often reported with a textual description of the claim. The claims manager must determine the adequate case reserve for each known claim. In this paper, we present a framework for predicting the amount of loss given a textual description of the claim using a large number of words found in the descriptions. Prior work has focused on classifying insurance claims based on keywords selected by a human expert, whereas in this paper the focus is on loss amount prediction with automatic word selection. In order to transform words into numeric vectors, we use word cosine similarities and word embedding matrices. When we consider all unique words found in the training dataset and impose a generalised additive model to the resulting explanatory variables, the resulting design matrix is high dimensional. For this reason, we use a group lasso penalty to reduce the number of coefficients in the model. The scalable, analytical framework proposed provides for a parsimonious and interpretable model. Finally, we discuss the implications of the analysis, including how the framework may be used by an insurance company and how the interpretation of the covariates can lead to significant policy change. The code can be found in the TAGAM R package (github.com/scottmanski/TAGAM).

引用

页码：605 / 622

页数：18

共 50 条

[21] PROCEDURAL MODELLING OF MONUMENTAL BUILDINGS FROM TEXTUAL DESCRIPTIONS
Rodrigues, Roberto
Coelho, Antonio
Reis, Luis Paulo
GRAPP 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS THEORY AND APPLICATIONS, 2010, : 130 - 133
[22] Inferring spatial relations from textual descriptions of images
Elu, Aitzol
Azkune, Gorka
Lopez de Lacalle, Oier
Arganda-Carreras, Ignacio
Soroa, Aitor
Agirre, Eneko
PATTERN RECOGNITION, 2021, 113
[23] Data Model for Procedural Modelling from Textual Descriptions
Rodrigues, Roberto
Coelho, Antonio
Reis, Luis Paulo
2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
[24] Children Activity Descriptions from Visual and Textual Associations
Phon-Amnuaisuk, Somnuk
Murata, Ken T.
Pavarangkoon, Praphan
Mizuhara, Takamichi
Hadi, Shiqah
MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2019, 11909 : 121 - 132
[25] Extracting business rules from web product descriptions
Iwaihara, M
Shiga, T
Kozawa, V
WEB INFORMATION SYSTEMS - WISE 2004, PROCEEDINGS, 2004, 3306 : 135 - 146
[26] Extracting generalized descriptions from a binary feedforward network
Sedbrook, T
APPLIED ARTIFICIAL INTELLIGENCE, 1998, 12 (04) : 309 - 327
[27] Extracting surface patches from complete range descriptions
Fisher, RB
Fitzgibbon, AW
Eggert, D
INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, 1997, : 148 - 154
[28] Extracting Code Segments and Their Descriptions from Research Articles
Chatterjee, Preetha
Gause, Benjamin
Hedinger, Hunter
Pollock, Lori
2017 IEEE/ACM 14TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2017), 2017, : 91 - 101
[29] A method for extracting textual meaning
Sandford, E
Fraisse, S
META, 1997, 42 (02) : 356 - 363
[30] Extracting information from text
Chai, JY
Biermann, AW
PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 202 - 206

← 1 2 3 4 5 →