Time Period Categorization in Fiction: A Comparative Analysis of Machine Learning Techniques

Cited by: 0
Author
Westin, Fereshta [1, 2]
Affiliation
[1] Univ Boras, Boras, Sweden
[2] Univ Boras, Allegatan 1, Boras, Sweden
Keywords
Cataloging for digital resources; time period categorization; machine learning; text analysis; fiction; LDA; SBERT; TF-IDF; classification
DOI
10.1080/01639374.2024.2315548
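Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Subject classification code
1205; 120501
Abstract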
This study investigates the automatic categorization of time period metadata in fiction, a critical but often overlooked aspect of cataloging. Using a comparative analysis approach, the performance of three machine learning techniques, namely Latent Dirichlet Allocation (LDA), Sentence-BERT (SBERT), and Term Frequency-Inverse Document Frequency (TF-IDF), was assessed by examining their precision, recall, F1 scores, and confusion matrix results. LDA identifies underlying topics within the text, TF-IDF measures word importance, and SBERT measures sentence semantic similarity. Based on F1-score analysis and confusion matrix outcomes, TF-IDF and LDA effectively categorize text data by time period, while SBERT performed poorly across all time period categories.
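The evaluation measures named in the abstract (precision, recall, F1 score, confusion matrix) can be illustrated with a minimal sketch. The period labels and the toy predictions below are invented for illustration only; they are not data or results from the study, and the function names are hypothetical rather than the author's code.

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Build a matrix whose rows are true labels and columns are predicted labels."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]


def per_class_scores(matrix, labels):
    """Derive precision, recall, and F1 for each label from a confusion matrix."""
    scores = {}
    for i, label in enumerate(labels):
        tp = matrix[i][i]
        fp = sum(row[i] for row in matrix) - tp  # predicted as label, but wrong
        fn = sum(matrix[i]) - tp                 # truly label, but missed
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = {"precision": precision, "recall": recall, "f1": f1}
    return scores


# Hypothetical time period categories and toy predictions (assumptions).
LABELS = ["medieval", "1800s", "contemporary"]
Y_TRUE = ["medieval", "medieval", "1800s", "1800s", "contemporary", "contemporary"]
Y_PRED = ["medieval", "1800s", "1800s", "1800s", "contemporary", "medieval"]

MATRIX = confusion_matrix(Y_TRUE, Y_PRED, LABELS)
SCORES = per_class_scores(MATRIX, LABELS)

if __name__ == "__main__":
    for label in LABELS:
        s = SCORES[label]
        print(f"{label}: P={s['precision']:.2f} R={s['recall']:.2f} F1={s['f1']:.2f}")
```

Reporting scores per category, rather than as a single average, is what makes it possible to say that a technique performs poorly "across all time period categories" and not only on some of them.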
Pages: 124-153
Page count: 30