Opportunities and Challenges in Data-Centric AI

Cited by: 8
Authors
Kumar, Sushant [1]
Datta, Sumit [2]
Singh, Vishakha [1]
Singh, Sanjay Kumar [1]
Sharma, Ritesh [3]
Affiliations
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, India
[2] Digital Univ Kerala Formerly IIITM Kerala, Sch Elect Syst & Automat, Thiruvananthapuram 695317, India
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Informat & Commun Technol, Manipal 576104, Karnataka, India
Keywords
Artificial intelligence; model-centric AI; data-centric AI; data
DOI
10.1109/ACCESS.2024.3369417
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Artificial intelligence (AI) systems are trained on large volumes of data to solve complex problems and learn to perform specific tasks, such as prediction, classification, recognition, and decision-making. In the past three decades, AI research has focused more on the model-centric approach than on the data-centric approach. In the model-centric approach, the focus is to improve the code or model architecture to enhance performance, whereas in data-centric AI, the focus is to improve the dataset to enhance performance. Data is food for AI. As a result, there has been a recent push in the AI community from model-centric AI toward data-centric AI. This paper provides a comprehensive and critical analysis of the current state of research in data-centric AI, presenting insights into the latest developments in this rapidly evolving field. By emphasizing the importance of data in AI, the paper identifies the key challenges and opportunities that must be addressed to improve the effectiveness of AI systems. Finally, this paper gives some recommendations for research opportunities in data-centric AI.
Pages: 33173 - 33189
Number of pages: 17
Related papers
50 in total
  • [1] Data-centric AI: Perspectives and Challenges
    Zha, Daochen
    Bhat, Zaid Pervaiz
    Lai, Kwei-Herng
    Yang, Fan
    Hu, Xia
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 945 - 948
  • [2] Data-Centric AI
    Malerba, Donato
    Pasquadibisceglie, Vincenzo
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (06) : 1493 - 1502
  • [3] The Principles of Data-Centric AI
    Jarrahi, Mohammad Hossein
    Memariani, Ali
    Guha, Shion
    COMMUNICATIONS OF THE ACM, 2023, 66 (08) : 84 - 92
  • [4] Challenges and Opportunities for Data-Centric Peer Evaluation Tools for Teamwork
    Shi W.W.
    Jagannadharao A.
    Lee J.
    Bailey B.P.
    Proceedings of the ACM on Human-Computer Interaction, 2021, 5 (CSCW2)
  • [5] Data collection and quality challenges in deep learning: a data-centric AI perspective
    Steven Euijong Whang
    Yuji Roh
    Hwanjun Song
    Jae-Gil Lee
    The VLDB Journal, 2023, 32 : 791 - 813
  • [6] Data collection and quality challenges in deep learning: a data-centric AI perspective
    Whang, Steven Euijong
    Roh, Yuji
    Song, Hwanjun
    Lee, Jae-Gil
    VLDB JOURNAL, 2023, 32 (04) : 791 - 813
  • [7] A Data-Centric AI Paradigm for Socio-Industrial and Global Challenges
    Majeed, Abdul
    Hwang, Seong Oun
    ELECTRONICS, 2024, 13 (11)
  • [8] Maximizing Relation Extraction Potential: A Data-Centric Study to Unveil Challenges and Opportunities
    Swarup, Anushka
    Bhandarkar, Avanti
    Dizon-Paradis, Olivia P.
    Wilson, Ronald
    Woodard, Damon L.
    IEEE ACCESS, 2024, 12 : 167655 - 167682
  • [9] dcbench: A Benchmark for Data-Centric AI Systems
    Eyuboglu, Sabri
    Karlas, Bojan
    Re, Christopher
    Zhang, Ce
    Zou, James
    PROCEEDINGS OF THE 6TH WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2022, 2022
  • [10] Potential Impact of Data-Centric AI on Society
    Kumar, Sushant
    Sharma, Ritesh
    Singh, Vishakha
    Tiwari, Shrikant
    Singh, Sanjay Kumar
    Datta, Sumit
    IEEE TECHNOLOGY AND SOCIETY MAGAZINE, 2023, 42 (03) : 98 - 107