The "Collections as ML Data" checklist for machine learning and cultural heritage

被引:8
|
作者
Lee, Benjamin Charles Germain [1 ,2 ]
机构
[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA USA
[2] Univ Washington, Paul GAllen Sch Comp Sci & Engn, 185 East Stevens Way Northeast, Seattle, WA 98195 USA
关键词
D O I
10.1002/asi.24765
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Within cultural heritage, there has been a growing and concerted effort to consider a critical sociotechnical lens when applying machine learning techniques to digital collections. Though the cultural heritage community has collectively developed an emerging body of work detailing responsible operations for machine learning in galleries, museums, archives, and libraries at the organizational level, there remains a paucity of guidelines created for researchers embarking on machine learning projects with digital collections. The manifold stakes and sensitivities involved in applying machine learning to cultural heritage underscore the importance of developing such guidelines. This article contributes to this need by formulating a detailed checklist with guiding questions and practices that can be employed while developing a machine learning project that utilizes cultural heritage data. I call the resulting checklist the "Collections as ML Data" checklist, which, when completed, can be published with the deliverables of the project. By surveying existing projects, including my own project, Newspaper Navigator, I justify the "Collections as ML Data" checklist and demonstrate how the formulated guiding questions can be employed by researchers.
引用
收藏
页码:375 / 396
页数:22
相关论文
共 50 条
  • [1] Cultural Heritage Collections as Data
    Disli, Meltem
    Tonta, Yasar
    TURKISH LIBRARIANSHIP, 2023, 37 (03) : 191 - 214
  • [2] Copyright and Licencing for Cultural Heritage Collections As Data
    Disli, Meltem
    Candela, Gustavo
    JOURNAL OF OPEN HUMANITIES DATA, 2025, 11
  • [3] Cultural Heritage Checklist
    Braun, Jascha Philipp
    DENKMALPFLEGE, 2021, 79 (02): : 183 - 184
  • [4] Machine Learning for Cultural Heritage: A Survey
    Fiorucci, Marco
    Khoroshiltseva, Marina
    Pontil, Massimiliano
    Traviglia, Arianna
    Del Bue, Alessio
    James, Stuart
    PATTERN RECOGNITION LETTERS, 2020, 133 : 102 - 108
  • [5] Datafication and Cultural Heritage Collections Data Infrastructures: Critical Perspectives on Documentation, Cataloguing and Data-sharing in Cultural Heritage Institutions
    Belteki, Daniel
    Rees, Arran j.
    Sichani, Anna-maria
    JOURNAL OF OPEN HUMANITIES DATA, 2025, 11
  • [6] Publishing Cultural Heritage Collections of Ghent with Linked Data Event Streams
    Van de Vyvere, Brecht
    Van D'Huynslager, Olivier
    Atauil, Achraf
    Segers, Maarten
    Van Campe, Leen
    Vandekeybus, Niels
    Teugels, Sofie
    Saenko, Alina
    Pauwels, Pieter-Jan
    Colpaert, Pieter
    METADATA AND SEMANTIC RESEARCH, MTSR 2021, 2022, 1537 : 357 - 369
  • [7] CULTURAL AND HISTORICAL HERITAGE IN VIRTUAL ENVIRONMENTS. CREATION OF VIRTUAL LEARNING COLLECTIONS
    Vasileva, S.
    Nikolova, M.
    EDULEARN19: 11TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2019, : 4319 - 4322
  • [8] University Collections and Cultural Heritage - Inventory of the Heritage of the University of Strasbourg
    Issenmann, Delphine
    IN SITU-REVUE DE PATRIMOINES, 2011, (17):
  • [9] Machine Learning Models for Artist Classification of Cultural Heritage Sketches
    Chirosca, Gianina
    Radvan, Roxana
    Musat, Silviu
    Pop, Matei
    Chirosca, Alecsandru
    APPLIED SCIENCES-BASEL, 2025, 15 (01):
  • [10] Machine learning in analytical chemistry for cultural heritage: A comprehensive review
    Towarek, Aleksandra
    Halicz, Ludwik
    Matwin, Stan
    Wagner, Barbara
    JOURNAL OF CULTURAL HERITAGE, 2024, 70 : 64 - 70