Successes and Lessons Learned From OOI End-to-End System Data Quality Audit

被引:0
|
作者
Belabbassi, Leila [1 ]
Garzio, Lori [1 ]
Smith, Mike [1 ]
Knuth, Friedrich [1 ]
Kerfoot, John [1 ]
Vardaro, Michael [1 ]
Crowley, Mike [1 ]
机构
[1] Rutgers State Univ, Dept Marine & Coastal Sci, New Brunswick, NJ 08901 USA
基金
美国国家科学基金会;
关键词
OOI Observatory; Instruments; Platforms; Operation; Information Management System; Cyberinfrastructure; End-To-End System; Data quality Audit;
D O I
10.1109/OCEANS.2016.7761419
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
The Ocean Observatories Initiative (OOI) consists of seven research sites with over 800 instruments collecting ocean, seafloor, and meteorological data in the world's oceans, extending from the Irminger Sea to the Southern Ocean. The scale with which data are produced from the instruments and their platforms presents key challenges, including how to (1) build data and information management infrastructure that associates measurements from multiple instruments for concurrent observations, (2) develop data delivery mechanisms that meet a variety of needs, (3) ensure timely release of data, and (4) formulate capabilities to provide stable, long-term support for research and societal needs. As a strategy for maintenance of the sustained, large-scale, and variety of scientific observations collected, the OOI Cyberinfrastructure (CI) developed an end-to-end system designed to store, query, process, and disseminate the compiled information. These resources include raw instrument values and derived data products, metadata associated with instrument and platform deployments (e.g., deployment dates, water depths, instrument manufacturer, etc.), calibration coefficients, data provenance and descriptors of computational algorithms and transformation functions, and other related outputs. As with many data and information management systems, the need to monitor and improve upon the performance of the various interrelated components of the CI is an integral part in establishing the success of the system. For this reason, the data evaluation team at Rutgers initiated an end-to-end system data quality audit that ensures the system can accurately and completely deliver data within the required specifications. This was applied to a subset of representative platforms for all instrument types. Thus, based on specific system failures that can prevent production of quality data products, we prepared a set of tests and built tools that function as a troubleshooting method for system repair and enhancement. The method was used to report on the system's ability to: (a) Respond to data queries (b) Provide links to data product files (c) Produce all relevant data products (d) Produce correct provenance information (e) Produce quality science data products (f) Parse data with the correct specifications and the correct number of particles (g) Calculate different data levels with correct dimensions and units To further assess the system fidelity to produce high quality data, a subject matter expert (SME) was consulted to provide outside validation of the OOI data quality, especially for some of the unique instrument classes that have less available documentation or publication records to consult. If the end-to-end system data quality audit logs successful system performance, the data checked were marked as ready to release to the public. If the data quality audit logs failures, they were investigated and a repair ensued, followed by another data check. Feedback from the end-to-end system data quality audit will be communicated to the public in form of annotations. This is needed to complete the end-to-end system's ability to communicate information from and about the system. Considering that the data are continuously collected in the water and the system configuration can change over time, it is very important not to become complacent with the end-to-end system once it is in place. To ensure continuity of the required quality, an end-to-end system data quality audit is needed on regular basis.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] SYSTEM FOR ANALYSES OF END-TO-END QUALITY OF DATA SERVICES IN CELLULAR NETWORKS
    Janevski, Toni
    Tudzarov, Aleksandar
    Porjazoski, Marko
    Latkoski, Pero
    EUROCON 2009: INTERNATIONAL IEEE CONFERENCE DEVOTED TO THE 150 ANNIVERSARY OF ALEXANDER S. POPOV, VOLS 1- 4, PROCEEDINGS, 2009, : 1624 - 1631
  • [2] Towards an End-to-End Data Quality Optimizer
    Restat, Valerie
    Klettke, Meike
    Stoerl, Uta
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 262 - 266
  • [3] End-to-End Data Quality: Insights from Two Case Studies
    Hasan, M. Redwan
    Legner, Christine
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 392 - 399
  • [4] Quality Assessment of End-to-End Learned Image Compression: The Benchmark and Objective Measure
    Li, Yang
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Wang, Yue
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4297 - 4305
  • [5] The NOAO end-to-end data management system: An overview
    Smith, R. Chris
    Dickinson, Mark
    Lowry, Sonya
    Miller, Christopher J.
    Trueblood, Mark
    Valdes, Frank
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVI, 2007, 376 : 615 - +
  • [6] End-to-end Integration of ScientificWorkflows on Distributed Cyberinfrastructures: Challenges and Lessons Learned with an Earth Science Application
    Roa, Camila
    Rynge, Mats
    Olaya, Paula
    Vahi, Karan
    Miller, Todd
    Goodhue, John
    Griffioen, James
    Hudak, David
    Knuth, Shelley
    Romanella, Alana
    Llamas, Ricardo
    Vargas, Rodrigo
    Livny, Miron
    Deelman, Ewa
    Taufer, Michela
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,
  • [7] Monitoring End-to-End Quality of Service in a Video Streaming System
    Lehikoinen, L.
    Raty, T.
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 750 - 754
  • [8] An End-to-End Data-Adaptive Pancreas Segmentation System with an Image Quality Control Toolbox
    Zhu Y.
    Hu P.
    Li X.
    Tian Y.
    Bai X.
    Liang T.
    Li J.
    Journal of Healthcare Engineering, 2023, 2023
  • [9] Multi-modality end-to-end audit by the ACDS
    Lye, J.
    Gibbons, F.
    Shaw, M.
    Alves, A.
    Keehan, S.
    Williams, I.
    RADIOTHERAPY AND ONCOLOGY, 2017, 123 : S966 - S967
  • [10] End-to-End Learned Self-Interference Cancellation
    Balatsoukas-Stimming, Alexios
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 1334 - 1338