Time Period Categorization in Fiction: A Comparative Analysis of Machine Learning Techniques

被引:0
|
作者
Westin, Fereshta [1 ,2 ]
机构
[1] Univ Boras, Boras, Sweden
[2] Univ Boras, Allegatan 1, Boras, Sweden
关键词
Cataloging for digital resources; time period categorization; machine learning; text analysis; fiction; LDA; SBERT; TF-IDF; CLASSIFICATION;
D O I
10.1080/01639374.2024.2315548
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
This study investigates the automatic categorization of time period metadata in fiction, a critical but often overlooked aspect of cataloging. Using a comparative analysis approach, the performance of three machine learning techniques, namely Latent Dirichlet Allocation (LDA), Sentence-BERT (SBERT), and Term Frequency-Inverse Document Frequency (TF-IDF) were assessed, by examining their precision, recall, F1 scores, and confusion matrix results. LDA identifies underlying topics within the text, TF-IDF measures word importance, and SBERT measures sentence semantic similarity. Based on F1-score analysis and confusion matrix outcomes, TF-IDF and LDA effectively categorize text data by time period, while SBERT performed poorly across all time period categories.
引用
收藏
页码:124 / 153
页数:30
相关论文
共 50 条
  • [21] A Comparative Analysis of Data sets using Machine Learning Techniques
    Abhilash, C. B.
    Rohitaksha, K.
    Biradar, Shankar
    SOUVENIR OF THE 2014 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2014, : 24 - 29
  • [22] Mortality Prediction using Machine Learning Techniques: Comparative Analysis
    Verma, Akash
    Goyal, Shreya
    Thakur, Shridhar Kumar
    Gupta, Archit
    Gupta, Indrajeet
    PROCEEDINGS OF THE 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC 2019), 2019, : 230 - 234
  • [23] Comparative Analysis of Machine Learning Techniques for Island Heightmap Generation
    Demergis, Dimitri
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [24] A Comparative Analysis of Machine Learning Techniques for Cyberbullying Detection on Twitter
    Muneer, Amgad
    Fati, Suliman Mohamed
    FUTURE INTERNET, 2020, 12 (11) : 1 - 21
  • [25] Comparative Analysis of Machine Learning Techniques Using Predictive Modeling
    Khandelwal, Ritu
    Goyal, Hemlata
    Shekhawat, Rajveer S.
    Recent Advances in Computer Science and Communications, 2022, 15 (03) : 466 - 477
  • [26] Comparative Analysis of Machine Learning Techniques in Assessing Cognitive Workload
    Elkin, Colin
    Devabhaktuni, Vijay
    ADVANCES IN NEUROERGONOMICS AND COGNITIVE ENGINEERING, 2020, 953 : 185 - 195
  • [27] Comparative Analysis of Machine Learning Techniques for Cryptocurrency Price Prediction
    Salehi, Sara
    JOURNAL OF INFORMATION AND ORGANIZATIONAL SCIENCES, 2024, 48 (02) : 341 - 352
  • [28] Categorization of Mouse Ultrasonic Vocalizations Using Machine Learning Techniques
    Kouzoupis, Spyros
    Neocleous, Andreas
    Athanassakis, Irene
    ACOUSTICS, 2019, 1 (04): : 837 - 846
  • [29] Evaluation of Machine Learning Techniques for Motivational Quotes Classification and Categorization
    Kapuria, Adhiveer
    Bhavsar, Parth
    Kejriwal, Nishant
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2024, 4 (03): : 2746 - 2763
  • [30] A comparative analysis of the automatic modeling of Learning Styles through Machine Learning techniques
    Ferreira, Lucas D.
    Spadon, Gabriel
    Carvalho, Andre C. P. L. F.
    Rodrigues-, Jose F., Jr.
    2018 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE), 2018,