A guide to creating an effective big data management framework

被引:1
|
作者
Arundel, S. T. [1 ]
Mckeehan, K. G. [1 ]
Campbell, B. B. [2 ]
Bulen, A. N. [2 ]
Thiem, P. T. [1 ]
机构
[1] US Geol Survey, Ctr Excellence Geospatial Informat Sci, 1400 Independence Rd, Rolla, MO 65401 USA
[2] US Geol Survey, Natl Geospatial Tech Operat Ctr, 1400 Independence Rd, Rolla, MO 65401 USA
关键词
ADOM; Data movement; Ingress; Egress; Rclone; INFORMATION-SYSTEMS; MAPREDUCE;
D O I
10.1186/s40537-023-00801-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many agencies and organizations, such as the U.S. Geological Survey, handle massive geospatial datasets and their auxiliary data and are thus faced with challenges in storing data and ingesting it, transferring it between internal programs, and egressing it to external entities. As a result, these agencies and organizations may inadvertently devote unnecessary time and money to convey data without existing or outdated standards. This research aims to evaluate the components of data conveyance systems, such as transfer methods, tracking, and automation, to guide their improved performance. Specifically, organizations face the challenges of slow dispatch time and manual intervention when conveying data into, within, and from their systems. Conveyance often requires skilled workers when the system depends on physical media such as hard drives, particularly when terabyte transfers are required. In addition, incomplete or inconsistent metadata may necessitate manual intervention, process changes, or both. A proposed solution is organization-wide guidance for efficient data conveyance. That guidance involves systems analysis to outline a data management framework, which may include understanding the minimum requirements of data manifests, specification of transport mechanisms, and improving automation capabilities.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A guide to creating an effective big data management framework
    S. T. Arundel
    K. G. McKeehan
    B. B. Campbell
    A. N. Bulen
    P. T. Thiem
    Journal of Big Data, 10
  • [2] Effective and efficient distributed management of big clinical data: a framework
    Cuzzocrea, Alfredo
    Grasso, Giorgio Mario
    Nolich, Massimiliano
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2019, 11 (03) : 284 - 313
  • [3] A big data analytics framework for scientific data management
    Fiore, Sandro
    Palazzo, Cosimo
    D'Anca, Alessandro
    Foster, Ian
    Williams, Dean N.
    Aloisio, Giovanni
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [4] A Holistic Framework for Big Scientific Data Management
    Kantere, Verena
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 220 - 226
  • [5] Preprocessing framework for scholarly big data management
    Khan, Samiya
    Alam, Mansaf
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 39719 - 39743
  • [6] Preprocessing framework for scholarly big data management
    Samiya Khan
    Mansaf Alam
    Multimedia Tools and Applications, 2023, 82 : 39719 - 39743
  • [7] Factors influencing effective use of big data: A research framework
    Surbakti, Feliks P. Sejahtera
    Wang, Wei
    Indulska, Marta
    Sadiq, Shazia
    INFORMATION & MANAGEMENT, 2020, 57 (01)
  • [8] Development of big data assisted effective enterprise resource planning framework for smart human resource management
    Zhao, Yaxuan
    PLOS ONE, 2024, 19 (05):
  • [9] A guide to reverse metabolomics-a framework for big data discovery strategy
    Charron-Lamoureux, Vincent
    Mannochio-Russo, Helena
    Lamichhane, Santosh
    Xing, Shipei
    Patan, Abubaker
    Gomes, Paulo Wender Portal
    Rajkumar, Prajit
    Deleray, Victoria
    Caraballo-Rodriguez, Andres Mauricio
    Chua, Kee Voon
    Lee, Lye Siang
    Liu, Zhao
    Ching, Jianhong
    Wang, Mingxun
    Dorrestein, Pieter C.
    NATURE PROTOCOLS, 2025,
  • [10] Creating Strategic Business Value from Big Data Analytics: A Research Framework
    Grover, Varun
    Chiang, Roger H. L.
    Liang, Ting-Peng
    Zhang, Dongsong
    JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2018, 35 (02) : 388 - 423