Open Science and Data Science

被引:4
|
作者
Wittenburg, Peter [1 ]
机构
[1] Max Planck Comp & Data Facil, Giessenbachstr 2, D-85748 Garching, Germany
关键词
Open Science by Design; Open Science by Publication; Data Science; Data infrastructure; Digital Objects; FAIR;
D O I
10.1162/dint_a_00082
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data Science (DS) as defined by Jim Gray is an emerging paradigm in all research areas to help finding non-obvious patterns of relevance in large distributed data collections. "Open Science by Design" (OSD), i.e., making artefacts such as data, metadata, models, and algorithms available and re-usable to peers and beyond as early as possible, is a pre-requisite for a flourishing DS landscape. However, a few major aspects can be identified hampering a fast transition: (1) The classical "Open Science by Publication" (OSP) is not sufficient any longer since it serves different functions, leads to non-acceptable delays and is associated with high curation costs. Changing data lab practices towards OSD requires more fundamental changes than OSP. 2) The classical publication-oriented models for metrics, mainly informed by citations, will not work anymore since the roles of contributors are more difficult to assess and will often change, i.e., other ways for assigning incentives and recognition need to be found. (3) The huge investments in developing DS skills and capacities by some global companies and strong countries is leading to imbalances and fears by different stakeholders hampering the acceptance of Open Science (OS). (4) Finally, OSD will depend on the availability of a global infrastructure fostering an integrated and interoperable data domain-"one data-domain" as George Strawn calls it-which is still not visible due to differences about the technological key pillars. OS therefore is a need for DS, but it will take much more time to implement it than we may have expected.
引用
收藏
页码:95 / 105
页数:11
相关论文
共 50 条
  • [31] Beyond the data management plan: Expanding roles for librarians in data science and open science
    Federer L.M.
    Qin J.
    Proceedings of the Association for Information Science and Technology, 2019, 56 (01) : 529 - 531
  • [32] The Design of a Community Science Cloud: The Open Science Data Cloud Perspective
    Grossman, Robert L.
    Greenway, Matthew
    Heath, Allison P.
    Powell, Ray
    Suarez, Rafael D.
    Wells, Walt
    White, Kevin
    Atkinson, Malcolm
    Klampanos, Iraklis
    Alvarez, Heidi L.
    Harvey, Christine
    Mambretti, Joe J.
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1051 - 1057
  • [33] Open Science, Open Research Data and some Open Questions
    Novotny, Jakub
    HRADEC ECONOMIC DAYS, PT II, 2019, 2019, 9 : 174 - 181
  • [34] OPEN SCIENCE, OPEN RESEARCH DATA AND THE ROLE OF IOSSG
    Gargiulo, Paola
    SCIRES-IT-SCIENTIFIC RESEARCH AND INFORMATION TECHNOLOGY, 2020, 10 : 53 - 58
  • [35] OPEN DATA INFRASTRUCTURES: EUROPEAN OPEN SCIENCE CLOUD
    Vevera, V. A.
    Barbu, D.
    14TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2020), 2020, : 5573 - 5577
  • [36] Open science: The open clinical trials data journey
    Rockhold, Frank
    Bromley, Christina
    Wagner, Erin K.
    Buyse, Marc
    CLINICAL TRIALS, 2019, 16 (05) : 539 - 546
  • [37] Open access to research data as a driver for open science
    Giglia, Elena
    JLIS.IT, 2015, 6 (02): : 225 - 247
  • [38] FAQ on Open Data and Open Science in the Sport Psychology
    Schoenbrodt, Felix D.
    Scheel, Anne
    ZEITSCHRIFT FUR SPORTPSYCHOLOGIE, 2017, 24 (04): : 134 - 139
  • [39] Open Molecular Science for the Open Science Cloud
    Lagana, Antonio
    Terstyanszky, Gabor
    Krueger, Jens
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT III, 2017, 10406 : 29 - 43
  • [40] Open Sourcing Education for Data Engineering and Data Science
    Drummond, David E.
    2016 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE), 2016,