LIFEDATA - A Framework for Traceable Active Learning Projects

Cited by: 1
Authors
Stieler, Fabian [1, 3]
Elia, Miriam [1]
Weigell, Benjamin [1]
Bauer, Bernhard [1, 3]
Kienle, Peter [2]
Roth, Anton [2]
Muellegger, Gregor [2]
Nann, Marius [2]
Dopfer, Sarah [2]
Affiliations
[1] Univ Augsburg, Inst Comp Sci, Augsburg, Germany
[2] GS Elekt Med Gerate G Stemple GmbH, Kaufering, Germany
[3] Ctr Responsible AI Technol, Munich, Germany
Keywords
Active Learning; Data Labeling; Traceability; Data-Centric AI; Python Framework; Open Source; MODEL;
DOI
10.1109/REW57809.2023.00088
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Active Learning has become a popular method for iteratively improving data-intensive Artificial Intelligence models. However, handling the large volumes of volatile data that arise in such projects, as in an Active Learning loop, is often a significant challenge. This paper introduces LIFEDATA, a Python-based framework designed to assist developers in implementing Active Learning projects with a focus on traceability. It supports seamless tracking of all artifacts, from data selection and labeling to model interpretation, thus promoting transparency throughout the entire model learning process and enhancing error debugging efficiency while ensuring experiment reproducibility. To showcase its applicability, we present two life science use cases. Moreover, the paper proposes an algorithm that combines query strategies to demonstrate LIFEDATA's ability to reduce data labeling effort.
Pages: 465-474
Number of pages: 10