Who Are My Ancestors? Retrieving Family Relationships from Historical Texts

被引:2
|
作者
Efremova, Julia [1 ]
Garcia, Alejandro Montes [1 ]
Iriondo, Alfredo Bolt [1 ]
Calders, Toon [1 ,2 ]
机构
[1] Eindhoven Univ Technol, Eindhoven, Netherlands
[2] Univ Libre Bruxelles, Brussels, Belgium
来源
关键词
D O I
10.1007/978-3-319-41718-9_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an approach for automatically retrieving family relationships from a real-world collection of Dutch historical notary acts. We aim to retrieve relationships like husband - wife, parent - child, widow of, etc. Our approach includes person names extraction, reference disambiguation, candidate generation and family relationship prediction. Since we have a limited amount of training data, we evaluate different feature configurations based on the n-gram analysis. The best results were obtained by using a combination of bi-grams and trigrams of words together with the distance in words between two names. We evaluate our results for each type of the relationships in terms of precision, recall and f - score.
引用
收藏
页码:121 / 129
页数:9
相关论文
共 50 条