Big Data Integration

被引:0
|
作者
Dong, Xin Luna [1 ]
Srivastava, Divesh [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
TRUTH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Big Data era is upon us: data is being generated, collected and analyzed at an unprecedented scale, and data-driven decision making is sweeping through all aspects of society. Since the value of data explodes when it can be linked and fused with other data, addressing the big data integration (BDI) challenge is critical to realizing the promise of Big Data. BDI differs from traditional data integration in many dimensions: (i) the number of data sources, even for a single domain, has grown to be in the tens of thousands, (ii) many of the data sources are very dynamic, as a huge amount of newly collected data are continuously made available, (iii) the data sources are extremely heterogeneous in their structure, with considerable variety even for substantially similar entities, and (iv) the data sources are of widely differing qualities, with significant differences in the coverage, accuracy and timeliness of data provided. This seminar explores the progress that has been made by the data integration community on the topics of schema mapping, record linkage and data fusion in addressing these novel challenges faced by big data integration, and identifies a range of open problems for the community.
引用
收藏
页码:1245 / 1248
页数:4
相关论文
共 50 条
  • [31] A big Data Analytics Framework for the Integration of Heterogeneous Federated Data Centers
    Hewapathirana, Ishara
    Silva, Thushari
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 650 - 657
  • [32] Research on Big Data Integration Based on Karma Modeling
    Wang Xiao
    Li Guoqi
    Liu Bin
    PROCEEDINGS OF 2017 8TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2017), 2017, : 245 - 248
  • [33] Systematic method for big manufacturing data integration and sharing
    Xiang, Feng
    Yin, Qi
    Wang, Zihan
    Jiang, Guo Zhang
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2018, 94 (9-12): : 3345 - 3358
  • [34] A data model-independent approach to big research data integration
    Bartalesi V.
    Meghini C.
    Thanos C.
    International Journal of Metadata, Semantics and Ontologies, 2019, 13 (04): : 330 - 345
  • [35] Data Integration and Transformation Proposal for Big Data Analyses in Automotive Industry
    Tanuska, Pavol
    Spendla, Lukas
    Kebisek, Michal
    Vazan, Pavel
    Hrcka, Lukas
    2017 IEEE 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS (INES), 2017, : 97 - 102
  • [36] Integration of Big Data Using Semantic Web Technologies
    Ostrowski, David
    Rychtyckyj, Nestor
    MacNeille, Perry
    Kim, Mira
    2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2016, : 381 - 384
  • [37] A Semantic Based Framework for the purpose of Big Data Integration
    Ostrowski, David
    Kim, Mira
    2017 11TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2017, : 305 - 309
  • [38] Foreword to the Special Issue: "Semantics for Big Data Integration"
    Beneventano, Domenico
    Vincini, Maurizio
    INFORMATION, 2019, 10 (02)
  • [39] Blockchain integration in big data: Review, vision, and opportunities
    Bakhtiary, Vahid
    Rahmani, Amir Masoud
    SECURITY AND PRIVACY, 2024, 7 (05)
  • [40] Integration and optimization of multiple big data processing platforms
    Chang, Bao-Rong
    Tsai, Hsiu-Fen
    Tsai, Yun-Che
    Kuo, Chin-Fu
    Chen, Chi-Chung
    ENGINEERING COMPUTATIONS, 2016, 33 (06) : 1680 - 1704