Controlled vocabularies in digital libraries: challenges and solutions for increased discoverability of digital objects

Cited by: 1
Authors
Chipangila, Bertha [1]
Liswaniso, Eric [1]
Mawila, Andrew [1]
Mwanza, Philomena [1]
Nawila, Daisy [1]
M'sendo, Robert [2]
Nyirenda, Mayumbo [2]
Phiri, Lighton [1]
Affiliations
[1] Univ Zambia, Dept Lib & Informat Sci, POB 32379, Lusaka 10101, Zambia
[2] Univ Zambia, Dept Comp Sci, POB 32379, Lusaka 10101, Zambia
Keywords
Controlled vocabularies; Digital libraries; Document classification; Institutional repositories; Metadata quality
DOI
10.1007/s00799-023-00374-1
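Chinese Library Classification number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract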
Digital Library Systems are widely used in the Higher Education sector, through Institutional Repositories (IRs), to collect, store, manage and make available the scholarly research output produced by Higher Education Institutions (HEIs). This wide application of IRs is a direct response to the increase in scholarly research output. To facilitate discoverability of digital content in IRs, accurate, consistent and comprehensive association of descriptive metadata with digital objects during ingestion is crucial. However, due to human errors resulting from complex IR ingestion workflows, most digital content in IRs has incorrect and inconsistent descriptive metadata. While there exists a broad spectrum of descriptive metadata elements, subject headings are a classic example of a crucial metadata element that adversely affects discoverability of digital content when incorrectly or inconsistently specified. This paper outlines a case study conducted at an HEI, The University of Zambia, to demonstrate the effectiveness of integrating controlled subject vocabularies during the ingestion of digital objects into IRs. A situational analysis was conducted to understand how subject headings are associated with digital objects and to analyse the subject headings associated with already ingested digital objects. In addition, an exploratory study was conducted to determine domain-specific subject headings to be integrated with the IR. Furthermore, a usability study was conducted to comparatively determine the usefulness of controlled vocabularies during the ingestion of digital objects into IRs. Finally, multi-label classification experiments were carried out in which digital objects were assigned more than one class.
The results of the study revealed that a noticeable proportion of digital content is associated with incorrect subject categories and, additionally, with few subject headings: two or fewer subject headings (71.2%), with a significant proportion of subject headings (92.1%) each associated with only a single publication. The comparative study suggests that IRs integrated with controlled vocabularies are perceived to be more usable (SUS Score = 68.9) than IRs without controlled vocabularies (SUS Score = 66.2). Furthermore, the effectiveness of the multi-label arXiv subjects classifier demonstrates the viability of integrating automated techniques for subject classification.
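The SUS scores quoted above refer to the System Usability Scale. This record does not describe how the authors computed them, so the following is only a minimal sketch of the standard SUS scoring rule: ten items answered on a 1 to 5 scale, with odd-numbered items contributing (answer - 1), even-numbered items contributing (5 - answer), and the sum multiplied by 2.5. The function names `sus_score` and `mean_sus` and the sample answers are hypothetical, not taken from the paper.

```python
def sus_score(responses):
    """Return the SUS score (0-100) for one respondent.

    `responses` holds the ten answers in questionnaire order,
    each an integer from 1 (strongly disagree) to 5 (strongly agree).
    """
    if len(responses) != 10:
        raise ValueError("SUS requires exactly 10 responses")
    if any(r not in (1, 2, 3, 4, 5) for r in responses):
        raise ValueError("each response must be an integer from 1 to 5")

    total = 0
    for item_number, answer in enumerate(responses, start=1):
        if item_number % 2 == 1:
            # Odd-numbered items are positively worded.
            total += answer - 1
        else:
            # Even-numbered items are negatively worded.
            total += 5 - answer
    return total * 2.5


def mean_sus(all_responses):
    """Average the per-respondent SUS scores for one system."""
    scores = [sus_score(r) for r in all_responses]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical answers from two respondents, for illustration only.
    sample = [[3] * 10, [5, 1] * 5]
    print(mean_sus(sample))
```

A comparison such as the one reported would apply `mean_sus` separately to the responses collected for each IR configuration and compare the two averages.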
Pages: 139-155
Page count: 17