A Novel Data Cleaning Framework Based on Knowledge Graph

Citations: 0
Authors
Song, Yuanfeng [1]
Zhang, Danni [2]
Li, Xiaodong [1]
Luo, Kunming [3]
Liao, Jianming [3]
Affiliations
[1] Univ Elect Sci & Technol China, Informat Ctr, Chengdu, Peoples R China
[2] Southwest Jiaotong Univ, Informatizat & Network Management Off, Chengdu, Peoples R China
[3] Univ Elect Sci & Technol China, Sch CSE, Chengdu, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
data cleaning; knowledge graph; error repair; knowledge inference; EDITING RULES; FIXES
DOI
10.1109/BigCom57025.2022.00050
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In real-world applications, data cleaning has long been a challenge for both academia and industry. Unsuccessful data cleaning may lead to inaccurate analysis and untrustworthy decision-making. This paper proposes a novel knowledge graph-based data cleaning framework. By establishing a knowledge graph and the relationship patterns in the data, the framework obtains implicit and explicit relationships and uses them to perform pattern repair and inference repair on dirty data. Pattern repair includes both explicit and implicit relationship matching, while inference repair includes both attribution inference repair and rule inference repair. The experimental results show that the more association relations there are among data tables, the greater the improvement in cleaning efficiency; moreover, the more association knowledge the knowledge graph contains, the more pronounced the improvement in cleaning efficiency.
Pages: 350-355
Number of pages: 6
Related papers
  • [21] A Knowledge Graph based Framework for Web API Recommendation
    Kwapong, Benjamin A.
    Fletcher, Kenneth K.
    2019 IEEE WORLD CONGRESS ON SERVICES (IEEE SERVICES 2019), 2019, : 115 - 120
  • [22] A Framework for Service Semantic Description Based on Knowledge Graph
    Sun, Qitong
    Han, Jun
    Ma, Dianfu
    ELECTRONICS, 2021, 10 (09)
  • [23] A Knowledge Graph based Disaster Storyline Generation Framework
    Ni, Jinxin
    Liu, Xiang
    Zhou, Qifeng
    Cao, Langcai
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 4432 - 4437
  • [24] A data-centric framework of improving graph neural networks for knowledge graph embedding
    Cao, Yanan
    Lin, Xixun
    Wu, Yongxuan
    Shi, Fengzhao
    Shang, Yanmin
    Tan, Qingfeng
    Zhou, Chuan
    Zhang, Peng
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2025, 28 (01)
  • [25] A Universal Data Cleaning Framework Based on User Model
    Huang Yu
    Zhang Xiao-yi
    Yuan Zhen
    Jiang Guo-quan
    2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL II, 2009, : 200 - 202
  • [26] A FRAMEWORK FOR DATA CLEANING IN DATA WAREHOUSES
    Peng, Taoxin
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL DISI: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2008, : 473 - 478
  • [27] The Knowledge Graph as an Ontological Framework
    Hurlburt, George F.
    IT PROFESSIONAL, 2021, 23 (04) : 14 - 18
  • [28] A novel Knowledge Graph recommendation algorithm based on Graph Convolutional Network
    Guo, Hui
    Yang, Chengyong
    Zhou, Liqing
    Wei, Shiwei
    CONNECTION SCIENCE, 2024, 36 (01)
  • [29] SocialCCF: Graph-text Collaborative Cleaning Framework Based on Social Networks
    Zhang, Yun
    Jin, Zongze
    Liu, Fan
    Zhu, Weilin
    Mu, Weimin
    Wang, Weiping
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 742 - 747
  • [30] A Framework of Data Fusion Through Spatio-Temporal Knowledge Graph
    Zhang, Xiaohan
    Zhu, Xinning
    Wu, Jie
    Hu, Zheng
    Zhang, Chunhong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 216 - 228