Safely Managing Data Variety in Big Data Software Development

被引:0
|
作者
Cerqueus, Thomas [1 ]
de Almeida, Eduardo Cunha [2 ]
Scherzinger, Stefanie [3 ]
机构
[1] Univ Lyon, CNRS, INSA Lyon, LIRIS,UMR5205, Lyon, France
[2] Univ Fed Parana, BR-80060000 Curitiba, Parana, Brazil
[3] OTH Regensburg, Regensburg, Germany
关键词
SCHEMA EVOLUTION; MODEL;
D O I
10.1109/BIGDSE.2015.9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We consider the task of building Big Data software systems, offered as software-as-a-service. These applications are commonly backed by NoSQL data stores that address the proverbial Vs of Big Data processing: NoSQL data stores can handle large volumes of data and many systems do not enforce a global schema, to account for structural variety in data. Thus, software engineers can design the data model on the go, a flexibility that is particularly crucial in agile software development. However, NoSQL data stores commonly do not yet account for the veracity of changes when it comes to changes in the structure of persisted data. Yet this is an inevitable consequence of agile software development. In most NoSQL-based application stacks, schema evolution is completely handled within the application code, usually involving object mapper libraries. Yet simple code refactorings, such as renaming a class attribute at the source code level, can cause data loss or runtime errors once the application has been deployed to production. We address this pain point by contributing type checking rules that we have implemented within an IDE plugin. Our plugin ControVol statically type checks the object mapper class declarations against the code release history. ControVol is thus capable of detecting common yet risky cases of mismatched data and schema, and can even suggest automatic fixes.
引用
收藏
页码:4 / 10
页数:7
相关论文
共 50 条
  • [41] Big Data analytics in Agile software development: A systematic mapping study
    Biesialska, Katarzyna
    Franch, Xavier
    Muntes-Mulero, Victor
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 132 (132)
  • [42] Big data and intelligent software systems
    Jalal, Ahmed Adeeb
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2018, 22 (03) : 177 - 193
  • [43] Applying Software Engineering Processes for Big Data Analytics Applications Development
    Al-Jaroodi, Jameela
    Hollein, Brandon
    Mohamed, Nader
    2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
  • [44] Big(ger) Data in Software Engineering
    Nagappan, Meiyappan
    Mirakhorli, Mehdi
    2015 IEEE/ACM 37th IEEE International Conference on Software Engineering, Vol 2, 2015, : 957 - 958
  • [45] Software Engineering for Big Data Systems
    Gorton, Ian
    Bener, Ayse Basar
    Mockus, Audris
    IEEE SOFTWARE, 2016, 33 (02) : 32 - 35
  • [48] Influence of Big Data in managing cyber assets
    Mitra, Amit
    Munir, Kamran
    BUILT ENVIRONMENT PROJECT AND ASSET MANAGEMENT, 2019, 9 (04) : 503 - 514
  • [49] A Middleware for Managing Big-Data Flows
    Gupta, Rajeev
    Gupta, Himanshu
    Gupta, Sanjeev
    Padmanabhan, Sriram
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT II, 2013, 8181 : 410 - 424
  • [50] Design of an aggregator for managing informative big data
    Blazquez-Ochando, Manuel
    PROFESIONAL DE LA INFORMACION, 2016, 25 (04): : 671 - 683