Two-Tier Random Walk Based Relational Inference Algorithm

被引：0

作者：

Liu Q. ^{[1
]}

Han M.-H. ^{[1
]}

Jiang L.-Y. ^{[1
]}

Liu Y. ^{[1
]}

Geng J. ^{[1
]}

机构：

[1] School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu

来源：

| 1600年 / Science Press卷 / 40期

关键词：

Artificial intelligence; Knowledge base population; Path ranking algorithm; Random walk; Relational inference; Statistical relational learning;

D O I：

10.11897/SP.J.1016.2017.01275

中图分类号：

学科分类号：

摘要：

Relational inference is one of the crucial techniques for knowledge base population tasks, typical application scenarios include relationship prediction and entity linking. The challenging problem of relational inference is how to infer new relations between entities from the facts existed in the knowledge bases. The reasoning models adopted in current mainstream knowledge bases can be divided into two categories: the latent factor models and the random walk models. The latent factor models realize the reasoning by mapping the entities and relations into a low dimensional real-valued vector space, and then computing with corresponding vector similarity measures. The random walk models, however, are based on the first-order predicate logic to deduce the reasoning between the entities and reduce the algorithm complexity through stochastic algorithm. In comparison, the efficiency of the latent factor models usually suffer from their computational complexity caused by large-scale matrix computation operations. While the random walk models usually suffer from their low recall rates, due to the fact that it is difficult to fully utilize all of the available structure information provided by the knowledge bases with any random sampling design. This work studied the potential problems of the basic assumptions adopted by the existing random walk models, and proposed two new inference modeling assumptions thereby. Firstly, the random walk models represented by the Path Ranking Algorithm (PRA) adopt the unidimensionality assumption of the relationships in between the entities. In typical random walk models, the entity-relation-entity tuples that existed in the knowledge base are regarded as first-order Horn clauses, in which the relationships are treated as partial ordering relations between the subjects and the objects. Our hypothesis is that although the relation between two entities is literally, syntactically directional, the information conveyed by this relation is equally shared between the connected entities on both side, thus all of the relations are semantically bidirectional, which allows the relational reasoning algorithm to use the inverse relation semantics from the object to the subject for reasoning. Secondly, the PRA algorithm makes use of the first-order predicate logic for relational reasoning, it also introduces a random sampling scheme in order to avoid exhaustive search in the path space and to speed up the calculation process. However, we argue in this paper that this maybe one of the major reasons that explains why the PRA and alike algorithms can not make full use of the existing information in the knowledge base for relational inference tasks, and then we propose an alternative assumption for remedy, which claims that the topology structures of the relation-specific sub-graphs in knowledge bases can be exploited to improve the performance of the random-walk based relational inference algorithms. In order to verify the validity of the above assumptions and algorithm, we propose a novel relational inference algorithm based on a two-tier random walk strategy. Experimental results on benchmark datasets include WN18, FB15K, and FB40K, show that the proposed algorithm can be very effective in promoting the accuracy and recall rate of the random walk models. The proposed algorithm also outperforms other prevalent latent factor models on each data sets. © 2017, Science Press. All right reserved.

引用

页码：1275 / 1290

页数：15

共 24 条

[1] Nickel M., Murphy K., Tresp V., Et al., A review of relational machine learning for knowledge graphs: From multi-relational link prediction to automated knowledge graph construction, Proceedings of the IEEE, 104, 1, pp. 11-33, (2016)
[2] Wang Y.-Z., Jia Y.-T., Liu D.-W., Et al., Open web knowledge aided information search and data mining, Journal of Computer Research and Development, 52, 2, pp. 456-474, (2015)
[3] Choi E., Kwiatkowski T., Zettlemoyer L., Scalable semantic parsing with partial ontologies, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp. 1311-1320, (2015)
[4] West R., Gabrilovich E., Murphy K., Et al., Knowledge base completion via search-based question answering, Proceedings of the 23rd International World Wide Web Conference, pp. 515-526, (2014)
[5] Neelakantan A., Roth B., McCallum A., Compositional vector space models for knowledge base completion, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, pp. 156-166, (2015)
[6] Carlson A., Betteridge J., Kisiel B., Et al., Toward an architecture for never-ending language learning, Proceedings of the 24th AAAI Conference on Artificial Intelligence, 5, (2010)
[7] Mitchell T.M., Cohen W.W., Talukdar P.P., Et al., Never-ending learning, Proceedings of the 29th AAAI Conference on Artificial Intelligence, pp. 2302-2310, (2015)
[8] Dong X., Gabrilovich E., Heitz G., Et al., Knowledge vault: A web-scale approach to probabilistic knowledge fusion, Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 601-610, (2014)
[9] Zhong X.-Q., Liu Z., Ding P.-P., Construction of knowledge base on hybrid reasoning and its application, Chinese Journal of Computers, 35, 4, pp. 761-766, (2012)
[10] Gu R., Wang F.-F., Yuan C.-F., Et al., YARM: Efficient and scalable semantic reasoning engine based on MapReduce, Chinese Journal of Computers, 38, 1, pp. 74-85, (2015)

← 1 2 3 →