Big Data Integration

被引:0
|
作者
Dong, Xin Luna [1 ]
Srivastava, Divesh [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
TRUTH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Big Data era is upon us: data is being generated, collected and analyzed at an unprecedented scale, and data-driven decision making is sweeping through all aspects of society. Since the value of data explodes when it can be linked and fused with other data, addressing the big data integration (BDI) challenge is critical to realizing the promise of Big Data. BDI differs from traditional data integration in many dimensions: (i) the number of data sources, even for a single domain, has grown to be in the tens of thousands, (ii) many of the data sources are very dynamic, as a huge amount of newly collected data are continuously made available, (iii) the data sources are extremely heterogeneous in their structure, with considerable variety even for substantially similar entities, and (iv) the data sources are of widely differing qualities, with significant differences in the coverage, accuracy and timeliness of data provided. This seminar explores the progress that has been made by the data integration community on the topics of schema mapping, record linkage and data fusion in addressing these novel challenges faced by big data integration, and identifies a range of open problems for the community.
引用
收藏
页码:1245 / 1248
页数:4
相关论文
共 50 条
  • [1] Big Data Integration: The Big Promise of Data Integration
    Gal, Avigdor
    2015 3RD INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD) AND INTERNATIONAL CONFERENCE ON OPEN AND BIG (OBD), 2015, : XLIV - XLIV
  • [2] Big Data Integration
    Dong, Xin Luna
    Srivastava, Divesh
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (11): : 1188 - 1189
  • [3] Big Data Integration
    Cudre-Mauroux, Philippe
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS CONTEL 2017, 2017, : 5 - 5
  • [4] Challenges of Data Integration and Interoperability in Big Data
    Kadadi, Anirudh
    Agrawal, Rajeev
    Nyamful, Christopher
    Atiq, Rahman
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [5] Keynote: Big Data Integration for eGovernment
    Cudre-Mauroux, Philippe
    2017 FOURTH INTERNATIONAL CONFERENCE ON EDEMOCRACY & EGOVERNMENT (ICEDEG), 2017, : 5 - 5
  • [6] Multistore Big Data Integration with CloudMdsQL
    Bondiombouy, Carlyna
    Kolev, Boyan
    Levchenko, Oleksandra
    Valduriez, Patrick
    TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XXVIII: SPECIAL ISSUE ON DATABASE- AND EXPERT-SYSTEMS APPLICATIONS, 2016, 9940 : 48 - 74
  • [7] Big Heterogeneous Data Integration and Analysis
    Vetova, Stella
    APPLICATIONS OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE20), 2021, 2333
  • [8] Big data, generalities and integration in radiotherapy
    Le Fevre, C.
    Poty, L.
    Noel, G.
    CANCER RADIOTHERAPIE, 2018, 22 (01): : 101 - 101
  • [9] An Integration of Big Data and Cloud Computing
    Thingom, Chintureena
    Yeon, Guydeuk
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 729 - 737
  • [10] Exploiting Semantics for Big Data Integration
    Knoblock, Craig A.
    Szekely, Pedro
    AI MAGAZINE, 2015, 36 (01) : 25 - 38