Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation

Cited by: 2
Authors
Scharpf, Philipp [1]
Schubotz, Moritz [2]
Gipp, Bela [3]
Affiliations
[1] Univ Konstanz, Constance, Germany
[2] FIZ Karlsruhe, Karlsruhe, Germany
[3] Univ Wuppertal, Wuppertal, Germany
Keywords
Entity Linking; Wikipedia; Wikidata; Recommender Systems
DOI
10.1145/3442442.3452348
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge bases that link mathematical expressions to their natural language names. To populate these databases, mathematical formulae must be annotated and linked to semantic concepts, which is very time-consuming. In this paper, we present our approach to structuring and speeding up this process with an application-driven strategy and an AI-aided system. We evaluate the quality and time savings of AI-generated formula and identifier annotation recommendations on a test selection of Wikipedia articles from the physics domain. Moreover, we evaluate the community acceptance of Wikipedia formula entity links and of Wikidata item creation and population to ground the formula semantics. Our evaluation shows that the AI guidance sped up the annotation process significantly, by a factor of 1.4 for formulae and 2.4 for identifiers. Our contributions were accepted in 88% of the edited Wikipedia articles and 67% of the Wikidata items. The "AnnoMathTeX" annotation recommender system is hosted by Wikimedia at annomathtex.wmflabs.org. In the future, our data refinement pipeline will be integrated seamlessly into the Wikimedia user interfaces.
Pages: 602-609
Number of pages: 8