Automated Metadata Annotation:What Is and Is Not Possible with Machine Learning

被引:0
|
作者
Mingfang Wu [1 ]
Hans Brandhorst [2 ]
MariaCristina Marinescu [3 ]
Joaquim Mor Lpez [3 ]
Margorie Hlava [4 ]
Joseph Busch [5 ]
机构
[1] Australian Research Data Commons
[2] Iconclass
[3] Barcelona Supercomputing Center
[4] Access Innovations
[5] Taxonomy
关键词
D O I
暂无
中图分类号
TP181 [自动推理、机器学习];
学科分类号
摘要
Automated metadata annotation is only as good as training dataset, or rules that are available for the domain. It's important to learn what type of data content a pre-trained machine learning algorithm has been trained on to understand its limitations and potential biases. Consider what type of content is readily available to train an algorithm—what's popular and what's available. However, scholarly and historical content is often not available in consumable, homogenized, and interoperable formats at the large volume that is required for machine learning. There are exceptions such as science and medicine, where large, well documented collections are available. This paper presents the current state of automated metadata annotation in cultural heritage and research data, discusses challenges identified from use cases, and proposes solutions.
引用
收藏
页码:122 / 138
页数:17
相关论文
共 50 条
  • [31] Validation of Machine Learning-Based Automated Surgical Instrument Annotation Using Publicly Available Intraoperative Video
    Markarian, Nicholas
    Kugener, Guillaume
    Pangal, Dhiraj J.
    Unadkat, Vyom
    Sinha, Aditya
    Zhu, Yichao
    Roshannai, Arman
    Chan, Justin
    Hung, Andrew J.
    Wrobel, Bozena B.
    Anandkumar, Animashree
    Zada, Gabriel
    Donoho, Daniel A.
    OPERATIVE NEUROSURGERY, 2022, 23 (03) : 235 - 240
  • [32] Instagram Hashtags as Image Annotation Metadata
    Giannoulakis, Stamatios
    Tsapatsoulis, Nicolas
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2015, 458 : 206 - 220
  • [33] Comparison of a machine and deep learning model for automated tumor annotation on digitized whole slide prostate cancer histology
    Duenweg, Savannah R.
    Brehler, Michael
    Bobholz, Samuel A.
    Lowman, Allison K.
    Winiarz, Aleksandra
    Kyereme, Fitzgerald
    Nencka, Andrew
    Iczkowski, Kenneth A.
    LaViolette, Peter S.
    PLOS ONE, 2023, 18 (03):
  • [34] Annotation Time Stamps - Temporal Metadata from the Linguistic Annotation Process
    Tomanek, Katrin
    Hahn, Udo
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [35] A Machine Learning Approach for Accurate Annotation of Noncoding RNAs
    Song, Yinglei
    Liu, Chunmei
    Wang, Zhi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (03) : 551 - 559
  • [36] Social Annotation in Query Expansion: a Machine Learning Approach
    Lin, Yuan
    Lin, Hongfei
    Jin, Song
    Ye, Zheng
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 405 - 414
  • [37] Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation
    Mathieu, Alban
    Leclercq, Mickael
    Sanabria, Melissa
    Perin, Olivier
    Droit, Arnaud
    FRONTIERS IN MICROBIOLOGY, 2022, 13
  • [38] Implementing a Metadata Manager for Machine Learning with the Asset Administration Shell
    da Silva, Alexandre Sawczuk
    Van, Hoai My
    Weiss, Gereon
    2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2022,
  • [39] Chemical machine vision: Automated extraction of chemical metadata from raster images
    Gkoutos, GV
    Rzepa, H
    Clark, RM
    Adjei, O
    Johal, H
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (05): : 1342 - 1355
  • [40] Coverage for Identifying Critical Metadata in Machine Learning Operating Envelopes
    Lanus, Erin
    Lee, Brian
    Pol, Luis
    Sobien, Daniel
    Kauffman, Justin
    Freeman, Laura J.
    2024 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS, ICSTW 2024, 2024, : 217 - 226