A guide to creating an effective big data management framework

被引:1
|
作者
Arundel, S. T. [1 ]
Mckeehan, K. G. [1 ]
Campbell, B. B. [2 ]
Bulen, A. N. [2 ]
Thiem, P. T. [1 ]
机构
[1] US Geol Survey, Ctr Excellence Geospatial Informat Sci, 1400 Independence Rd, Rolla, MO 65401 USA
[2] US Geol Survey, Natl Geospatial Tech Operat Ctr, 1400 Independence Rd, Rolla, MO 65401 USA
关键词
ADOM; Data movement; Ingress; Egress; Rclone; INFORMATION-SYSTEMS; MAPREDUCE;
D O I
10.1186/s40537-023-00801-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many agencies and organizations, such as the U.S. Geological Survey, handle massive geospatial datasets and their auxiliary data and are thus faced with challenges in storing data and ingesting it, transferring it between internal programs, and egressing it to external entities. As a result, these agencies and organizations may inadvertently devote unnecessary time and money to convey data without existing or outdated standards. This research aims to evaluate the components of data conveyance systems, such as transfer methods, tracking, and automation, to guide their improved performance. Specifically, organizations face the challenges of slow dispatch time and manual intervention when conveying data into, within, and from their systems. Conveyance often requires skilled workers when the system depends on physical media such as hard drives, particularly when terabyte transfers are required. In addition, incomplete or inconsistent metadata may necessitate manual intervention, process changes, or both. A proposed solution is organization-wide guidance for efficient data conveyance. That guidance involves systems analysis to outline a data management framework, which may include understanding the minimum requirements of data manifests, specification of transport mechanisms, and improving automation capabilities.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] A Big Aurora Data Management Framework toward Aurora Classification
    Wang, Yuhang
    Zhang, Xian
    Zhao, Hui
    Liang, Jimin
    2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 1284 - 1287
  • [22] A framework for investigating the role of big data in service parts management
    Boone, Christopher A.
    Skipper, Joseph B.
    Hazen, Benjamin T.
    JOURNAL OF CLEANER PRODUCTION, 2017, 153 (01) : 687 - 691
  • [23] Serendipitous, Open Big Data Management and Analytics: The SeDaSOMA Framework
    Cuzzocrea, Alfredo
    Ciancarini, Paolo
    MODELLING, 2024, 5 (03): : 1173 - 1196
  • [24] Privacy Preserving Framework for Big Data Management in Smart Buildings
    Inibhunu, Catherine
    McGregor, Carolyn
    2021 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2021, : 667 - 673
  • [25] Big Data Analytics Framework for Natural Disaster Management in Malaysia
    Abdullah, Mohammad Fikry
    Ibrahim, Mardhiah
    Zulkifli, Harlisa
    IOTBDS: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY, 2017, : 406 - 411
  • [26] A framework of pavement management system based on IoT and big data
    Dong, Jichang
    Meng, Weina
    Liu, Ying
    Ti, Jing
    ADVANCED ENGINEERING INFORMATICS, 2021, 47
  • [27] Big Data Framework
    Tekiner, Firat
    Keane, John A.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1494 - 1499
  • [28] Big data: creating the right balance
    Palamuthusingam, Dharmenaan
    Reyaldeen, Reza
    Nath, Karthik
    KIDNEY INTERNATIONAL, 2019, 96 (06) : 1422 - 1422
  • [29] Big Data for Managers: Creating Value
    Dutta, Bidyarthi
    JOURNAL OF SCIENTOMETRIC RESEARCH, 2019, 8 (02) : 117 - 118
  • [30] Towards effective GML data management: framework and prototype
    Wang, Weili
    Zhu, Fubao
    Jin, Ting
    Qian, Zhiping
    Zhang, Long
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2014, 6 (04) : 413 - 432