Survey on Data Integration Technologies for Relational Data and Knowledge Graph

被引:0
|
作者
Gao Y.-J. [1 ]
Ge C.-C. [2 ]
Guo Y.-X. [1 ]
Chen L. [1 ]
机构
[1] College of Computer Science and Technology, Zhejiang University, Hangzhou
[2] Data Intelligence Innovation Lab, Huawei Cloud Computing Technologies Co. Ltd., Hangzhou
来源
Ruan Jian Xue Bao/Journal of Software | 2023年 / 34卷 / 05期
关键词
data integration; knowledge graph (KG); relational data;
D O I
10.13328/j.cnki.jos.006808
中图分类号
学科分类号
摘要
Recently, big data is considered a critical strategic resource by many countries and regions. However, difficult data circulation and insufficient data regulation commonly exist in the big data era, thereby leading to the serious phenomenon of data silos, poor data quality, and difficulty in unleashing the potential of data elements. This provokes researchers to explore data integration techniques for breaking data barriers, enabling data sharing, improving data quality, and activating the potential of data elements. Relational data and knowledge graphs, as two significant forms of data organization and storage, have been widely applied in real life. To this end, this study focuses on relational data and knowledge graphs to summarize and analyze the key technologies of data integration, including entity resolution, data fusion, and data cleaning. Finally, it prospects future research directions. © 2023 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:2365 / 2391
页数:26
相关论文
共 181 条
  • [1] Reinsel D, Gantz J, Rydning J., The digitization of the world from edge to core, (2022)
  • [2] Big Data White Paper, (2021)
  • [3] Chen YG, Wang JC., A review of data integration, Computer Science, 31, 5, pp. 48-51, (2004)
  • [4] Yang XD, Peng ZY, Liu JQ, Li XH., An overview of information integration, Computer Science, 33, 7, pp. 55-59, (2006)
  • [5] Wang S, Peng YW, Lan H, Luo QW, Peng ZY., Survey and prospect: Data integration methodologies, Ruan Jian Xue Bao/Journal of Software, 31, 3, (2020)
  • [6] Getoor L, Machanavajjhala A., Entity resolution: Theory, practice & open challenges, Proc. of the VLDB Endowment, 5, 12, pp. 2018-2019, (2012)
  • [7] Sun ZQ, Zhang QH, Hu W, Wang CM, Chen MH, Akrami F, Li CK., A benchmarking study of embedding-based entity alignment for knowledge graphs, Proc. of the VLDB Endowment, 13, 12, pp. 2326-2340, (2020)
  • [8] Zhuang Y, Li GL, Feng JH., A survey on entity alignment of knowledge base, Journal of Computer Research and Development, 53, 1, pp. 165-192, (2016)
  • [9] Meng XF, Du ZJ., Research on the big data fusion: Issues and challenges, Journal of Computer Research and Development, 53, 2, pp. 231-246, (2016)
  • [10] Guo ZM, Zhou AY., Research on data quality and data cleaning: A survey, Ruan Jian Xue Bao/Journal of Software, 13, 11, pp. 2076-2082, (2002)