Navigating Data-Centric Artificial Intelligence with DC-Check: Advances, Challenges, and Opportunities

被引:3
|
作者
Seedat N. [2 ]
Imrie F. [1 ]
Van Der Schaar M. [2 ,3 ]
机构
[1] University of California, Los Angeles
[2] University of Cambridge, Cambridge
[3] Alan Turing Institute, London
来源
关键词
Data-centric artificial intelligence (AI); machine learning (ML) pipelines; reliable-ML;
D O I
10.1109/TAI.2023.3345805
中图分类号
学科分类号
摘要
Data-centric artificial intelligence (AI) is an emerging paradigm that emphasizes the critical role of data in real-world machine learning (ML) systems - as a complement to model development. However, data-centric AI is still in its infancy, lacking a standardized framework that outlines necessary data-centric considerations at various stages of the ML pipeline: Data, Training, Testing, and Deployment. This lack of guidance hampers effective communication and design of data-centric driven ML systems. To address this critical gap, we introduce the Data-Centric Checklist (DC-Check), an actionable checklist-style framework that encapsulates data-centric considerations for ML systems. DC-Check is aimed at both practitioners and researchers to serve as a reference guide to data-centric AI development. Around each question in DC-Check, we discuss the applicability of different approaches, survey the state of the art, and highlight specific data-centric AI challenges and research opportunities. While developing DC-Check, we also undertook an analysis of the current data-centric AI landscape. The insights obtained from this exploration support the DC-Check framework, reinforcing its utility and relevance in the rapidly evolving field. To make DC-Check and related resources easily accessible, we provide a DC-Check companion website (https://www.vanderschaar-lab.com/dc-check/), which will serve as a living resource, updated as methods and tools evolve. © 2020 IEEE.
引用
收藏
页码:2589 / 2603
页数:14
相关论文
共 50 条
  • [21] Navigating the Future of Psychiatry: A Review of Research on Opportunities, Applications, and Challenges of Artificial Intelligence
    Jake Linardon
    Current Treatment Options in Psychiatry, 12 (1)
  • [22] A review on data-centric decision tools for offshore wind operation and maintenance activities: Challenges and opportunities
    Hadjoudj, Yannis
    Pandit, Ravi
    ENERGY SCIENCE & ENGINEERING, 2023, 11 (04) : 1501 - 1515
  • [23] Data-centric explainable artificial intelligence techniques for cyber-attack detection in microgrid networks
    Trivedi, Rohit
    Patra, Sandipan
    Khadem, Shafi
    ENERGY REPORTS, 2025, 13 : 217 - 229
  • [24] USER AND DATA-CENTRIC ARTIFICIAL INTELLIGENCE FOR MAPPING URBAN DEPRIVATION IN MULTIPLE CITIES ACROSS THE GLOBE
    Tareke, Bedru
    Silva Filho, Paulo
    Persello, Claudio
    Kuffer, Monika
    Maretto, Raian, V
    Wang, Jon
    Abascal, Angela
    Pillai, Priam
    Singh, Binti
    D'Attoli, Juan Manuel
    Kabaria, Caroline
    Pedrassoli, Juilo
    Brito, Patricia
    Elias, Peter
    Atenogenes, Elio
    Ramirez Santiago, Andrea
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 1553 - 1557
  • [25] Navigating Opportunities and Challenges of Artificial Intelligence: ChatGPT and Generative Models in Science Teacher Education
    Verma, Geeta
    Campbell, Todd
    Melville, Wayne
    Park, Byung-Yeol
    JOURNAL OF SCIENCE TEACHER EDUCATION, 2023, 34 (08) : 793 - 798
  • [26] Data-centric artificial olfactory system based on the eigengraph
    Sung, Seung-Hyun
    Suh, Jun Min
    Hwang, Yun Ji
    Jang, Ho Won
    Park, Jeon Gue
    Jun, Seong Chan
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [27] Artificial Intelligence, Big Data, and Regulation of Immunity: Challenges and Opportunities
    Singh, Bhagirath
    Jevnikar, Anthony M.
    Desjardins, Eric
    ARCHIVUM IMMUNOLOGIAE ET THERAPIAE EXPERIMENTALIS, 2024, 72 (01)
  • [28] Challenges of Information Retrieval and Evaluation in Data-Centric Biology
    Yu, Yi-Kuo
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2011, 15 (04) : 239 - 240
  • [29] Data-centric artificial olfactory system based on the eigengraph
    Seung-Hyun Sung
    Jun Min Suh
    Yun Ji Hwang
    Ho Won Jang
    Jeon Gue Park
    Seong Chan Jun
    Nature Communications, 15
  • [30] Artificial Intelligence in Radiology: Opportunities and Challenges
    Flory, Marta N.
    Napel, Sandy
    Tsai, Emily B.
    SEMINARS IN ULTRASOUND CT AND MRI, 2024, 45 (02) : 152 - 160