Automated Metadata Annotation:What Is and Is Not Possible with Machine Learning

被引:0
|
作者
Mingfang Wu [1 ]
Hans Brandhorst [2 ]
MariaCristina Marinescu [3 ]
Joaquim Mor Lpez [3 ]
Margorie Hlava [4 ]
Joseph Busch [5 ]
机构
[1] Australian Research Data Commons
[2] Iconclass
[3] Barcelona Supercomputing Center
[4] Access Innovations
[5] Taxonomy
关键词
D O I
暂无
中图分类号
TP181 [自动推理、机器学习];
学科分类号
摘要
Automated metadata annotation is only as good as training dataset, or rules that are available for the domain. It's important to learn what type of data content a pre-trained machine learning algorithm has been trained on to understand its limitations and potential biases. Consider what type of content is readily available to train an algorithm—what's popular and what's available. However, scholarly and historical content is often not available in consumable, homogenized, and interoperable formats at the large volume that is required for machine learning. There are exceptions such as science and medicine, where large, well documented collections are available. This paper presents the current state of automated metadata annotation in cultural heritage and research data, discusses challenges identified from use cases, and proposes solutions.
引用
收藏
页码:122 / 138
页数:17
相关论文
共 50 条
  • [1] Automated metadata annotation: What is and is not possible with machine learning
    Wu, Mingfang
    Brandhorst, Hans
    Marinescu, Maria-Cristina
    Lopez, Joaquim More
    Hlava, Margorie
    Busch, Joseph
    DATA INTELLIGENCE, 2023, 5 (01) : 122 - 138
  • [2] Semi-Automated Machine Learning Video Annotation for Gastroenterologists
    Krenzer, Adrian
    Makowski, Kevin
    Hekalo, Amar
    Puppe, Frank
    PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021, 2021, 281 : 484 - 485
  • [3] Enhancing Semantic Metadata of Reliable Hadith with Automated Annotation
    Jaafar, S. N.
    Masrom, S.
    Mahtar, S. N. Mohamed
    Khairudin, N.
    Rahim, S. K. N. Abdul
    Azizan, A.
    ADVANCED SCIENCE LETTERS, 2016, 22 (10) : 2947 - 2950
  • [4] Automated Semantic Annotation Deploying Machine Learning Approaches: A Systematic Review
    Chang W.C.
    Sangodiah A.
    Mendel, 2023, 29 (02) : 111 - 130
  • [5] Fast machine learning annotation in the medical domain: a semi-automated video annotation tool for gastroenterologists
    Adrian Krenzer
    Kevin Makowski
    Amar Hekalo
    Daniel Fitting
    Joel Troya
    Wolfram G. Zoller
    Alexander Hann
    Frank Puppe
    BioMedical Engineering OnLine, 21
  • [6] Fast machine learning annotation in the medical domain: a semi-automated video annotation tool for gastroenterologists
    Krenzer, Adrian
    Makowski, Kevin
    Hekalo, Amar
    Fitting, Daniel
    Troya, Joel
    Zoller, Wolfram G.
    Hann, Alexander
    Puppe, Frank
    BIOMEDICAL ENGINEERING ONLINE, 2022, 21 (01)
  • [7] Automated annotation of keywords for proteins related to mycoplasmataceae using machine learning techniques
    Bazzan, ALC
    Engel, PM
    Schroeder, LF
    da Silva, SC
    BIOINFORMATICS, 2002, 18 : S35 - S43
  • [8] Enhancing Annotation Efficiency with Machine Learning: Automated Partitioning of a Lung Ultrasound Dataset by View
    VanBerlo, Bennett
    Smith, Delaney
    Tschirhart, Jared
    VanBerlo, Blake
    Wu, Derek
    Ford, Alex
    McCauley, Joseph
    Wu, Benjamin
    Chaudhary, Rushil
    Dave, Chintan
    Ho, Jordan
    Deglint, Jason
    Li, Brian
    Arntfield, Robert
    DIAGNOSTICS, 2022, 12 (10)
  • [9] Metadata for collaborative learning: What to reuse?
    Tamaura, Y
    3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, 2003, : 522 - 523
  • [10] Metadata and Metrics for Automated Repurposing of Learning Resources
    Sanand, S.
    Raghavan, S. V.
    ICALT: 2009 IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, 2009, : 460 - 464