Lill-DATA - A Framework for Traceable Active Learning Projects

被引:1
|
作者
Stieler, Fabian [1 ,3 ]
Elia, Miriam [1 ]
Weigell, Benjamin [1 ]
Bauer, Bernhard [1 ,3 ]
Kienle, Peter [2 ]
Roth, Anton [2 ]
Muellegger, Gregor [2 ]
Nann, Marius [2 ]
Dopfer, Sarah [2 ]
机构
[1] Univ Augsburg, Inst Comp Sci, Augsburg, Germany
[2] GS Elekt Med Gerate G Stemple GmbH, Kaufering, Germany
[3] Ctr Responsible AI Technol, Munich, Germany
关键词
Active Learning; Data Labeling; Traceability; Data-Centric AI; !text type='Python']Python[!/text] Framework; Open Source; MODEL;
D O I
10.1109/REW57809.2023.00088
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Active Learning has become a popular method for iteratively improving data -intensive Artificial Intelligence models. However, it often presents a significant challenge when dealing with large volumes of volatile data in projects, as with an Active Learning loop. This paper introduces LIFEDATA, a Python-based framework designed to assist developers in implementing Active Learning projects focusing on traceability. It supports seamless tracking of all artifacts, from data selection and labeling to model interpretation, thus promoting transparency throughout the entire model learning process and enhancing error debugging efficiency while ensuring experiment reproducibility. To showcase its applicability, we present two life science use cases. Moreover, the paper proposes an algorithm that combines query strategies to demonstrate LIFEDATA's ability to reduce data labeling effort.
引用
收藏
页码:465 / 474
页数:10
相关论文
共 50 条
  • [31] An active learning framework for set inversion
    Nguyen, Binh T.
    Nguyen, Duy M.
    Ho, Lam Si Tung
    Vu Dinh
    KNOWLEDGE-BASED SYSTEMS, 2019, 185
  • [32] JCLAL: A Java framework for active learning
    1600, Microtome Publishing (17):
  • [33] Design and Supervision Model of Group Projects for Active Learning
    Lau, Yi Meng
    Shim, Kyong Jin
    Gottipati, Swapna
    2021 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE 2021), 2021,
  • [34] An Active Learning Framework for Alpha Matting
    Shen, Yang
    Wang, Pengjie
    Pan, Zhifang
    Bao, Yanxia
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (09)
  • [35] BioMart: a data federation framework for large collaborative projects
    Zhang, Junjun
    Haider, Syed
    Baran, Joachim
    Cros, Anthony
    Guberman, Jonathan M.
    Hsu, Jack
    Liang, Yong
    Yao, Long
    Kasprzyk, Arek
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,
  • [36] SEMESTER PROJECTS IN MEDICAL BIOPHYSICS PROMOTE ACTIVE LEARNING
    Kralova, Eva
    Ferencova, Elena
    Trnka, Michal
    9TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES (EDULEARN17), 2017, : 9302 - 9306
  • [37] Online Active Learning Framework for Data Stream Classification With Density-Peaks Recognition
    Zhang, Kuangyan
    Liu, Sanmin
    Chen, Yanfei
    IEEE ACCESS, 2023, 11 : 27853 - 27864
  • [38] MKGB: A Medical Knowledge Graph Construction Framework Based on Data Lake and Active Learning
    Ren, Peng
    Hou, Wei
    Sheng, Ming
    Li, Xin
    Li, Chao
    Zhang, Yong
    HEALTH INFORMATION SCIENCE, HIS 2021, 2021, 13079 : 245 - 253
  • [39] Active Learning Framework Combining Semi-supervised Approach for Data Stream Mining
    Kholghi, Mahnoosh
    Keyvanpour, MohammadReza
    INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT II, 2011, 135 : 238 - +
  • [40] ATTDC: An Active and Traceable Trust Data Collection Scheme for Industrial Security in Smart Cities
    Shen, Mengqiu
    Liu, Anfeng
    Huang, Guosheng
    Xiong, Neal N.
    Lu, Huimin
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (08) : 6437 - 6453