A Comprehensive Scenario Agnostic Data LifeCycle Model for an Efficient Data Complexity Management

被引:0
|
作者
Sinaeepourfard, Amir [1 ]
Garcia, Jordi [1 ]
Masip-Bruin, Xavier [1 ]
Marin-Tordera, Eva [1 ]
机构
[1] Univ Politecn Cataluna, BarcelonaTech, Adv Network Architectures Lab CRAAX, Barcelona, Spain
关键词
Data LifeCycle; Data Management; Data Complexity; Vs Challenges; BIG DATA;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
There is a vast amount of data being generated every day in the world, coming from a variety of sources, with different formats, quality levels, etc. This new data, together with the archived historical data, constitute the seed for future knowledge discovery and value generation in several fields of eScience. Discovering value from data is a complex computing process where data is the key resource, not only during its processing, but also during its entire life cycle. However, there is still a huge concern about how to organize and manage this data in all fields, and at all scales, for efficient usage and exploitation during all data life cycles. Although several specific Data LifeCycle (DLC) models have been recently defined for particular scenarios, we argue that there is no global and comprehensive DLC framework to be widely used in different fields. For this reason, in this paper we present and describe a comprehensive scenario agnostic Data LifeCycle (COSA-DLC) model successfully addressing all challenges included in the 6Vs, namely Value, Volume, Variety, Velocity, Variability and Veracity, not tailored to any specific environment, but easy to be adapted to fit the requirements of any particular field. We conclude that a comprehensive scenario agnostic DLC model provides several advantages, such as facilitating global data organization and integration, easing the adaptation to any kind of scenario, guaranteeing good quality data levels, and helping save design time and efforts for the research and industrial communities.
引用
收藏
页码:276 / 281
页数:6
相关论文
共 50 条
  • [31] TID-MOP: The Comprehensive Framework of Security Management and Control in the Scenario of Data Exchange
    Ziran D.
    Yue D.
    Chengqi Y.
    Boran H.
    Mingze G.
    Lin L.
    Data Analysis and Knowledge Discovery, 2022, 6 (01): : 13 - 21
  • [32] Big Data LifeCycle: Threats and Security Model
    Alshboul, Yazan
    Wang, Yong
    Nepali, Raj Kumar
    AMCIS 2015 PROCEEDINGS, 2015,
  • [33] Methods to Evaluate Lifecycle Models for Research Data Management
    Weber, Tobias
    Kranzlmueller, Dieter
    BIBLIOTHEK FORSCHUNG UND PRAXIS, 2019, 43 (01) : 75 - 81
  • [34] Comparison of data models for plant lifecycle information management
    Siltanen, Pekka
    Parnanen, Antti
    LEADING THE WEB IN CONCURRENT ENGINEERING: NEXT GENERATION CONCURRENT ENGINEERING, 2006, 143 : 346 - +
  • [35] DSL Approach to Deep Learning Lifecycle Data Management
    Celms, Edgars
    Barzdins, Janis
    Kalnins, Audris
    Barzdins, Paulis
    Sprogis, Arturs
    Grasmanis, Mikus
    Rikacovs, Sergejs
    BALTIC JOURNAL OF MODERN COMPUTING, 2020, 8 (04): : 597 - 617
  • [36] A framework for Big Data driven product lifecycle management
    Zhang, Yingfeng
    Ren, Shan
    Liu, Yang
    Sakao, Tomohiko
    Huisingh, Donald
    JOURNAL OF CLEANER PRODUCTION, 2017, 159 : 229 - 240
  • [37] Towards Unified Data and Lifecycle Management for Deep Learning
    Miao, Hui
    Li, Ang
    Davis, Larry S.
    Deshpande, Amol
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 571 - 582
  • [38] Information Visualization for Product Lifecycle Management (PLM) Data
    Guo, Chen
    Chen, Yingjie Victor
    Miller, Craig L.
    Hartman, Nathan W.
    Mueller, Amy B.
    Connolly, Patrick E.
    2014 ASEE ANNUAL CONFERENCE, 2014,
  • [39] Towards data exchange interoperability in building lifecycle management
    Kubler, Sylvain
    Madhikermi, Manik
    Buda, Andrea
    Framling, Kary
    Derigent, William
    Thomas, Andre
    2014 IEEE EMERGING TECHNOLOGY AND FACTORY AUTOMATION (ETFA), 2014,
  • [40] Algebricks: A Data Model-Agnostic Compiler Backend for Big Data Languages
    Borkar, Vinayak
    Bu, Yingyi
    Carman, E. Preston, Jr.
    Onose, Nicola
    Westmann, Till
    Pirzadeh, Pouria
    Carey, Michael J.
    Tsotras, Vassilis J.
    ACM SOCC'15: PROCEEDINGS OF THE SIXTH ACM SYMPOSIUM ON CLOUD COMPUTING, 2015, : 422 - 433