Automatic Detection of Four-Panel Cartoon in Large-Scale Korean Digitized Newspapers using Deep Learning

Authors
Lee, Seojoon [1]
Kim, Byungjun [2]
Jun, Bong Gwan [3]
Affiliations
[1] Korea Adv Inst Sci & Technol, Grad Sch Culture Technol, Daejeon, South Korea
[2] Korea Adv Inst Sci & Technol, Ctr Digital Humanities & Computat Social Sci, Daejeon, South Korea
[3] Korea Adv Inst Sci & Technol, Sch Digital Humanities & Computat Social Sci, Daejeon, South Korea
Keywords
big data; object detection; data strategy; four-panel cartoon; digital newspaper; data science
DOI
10.5334/johd.205
Chinese Library Classification (CLC) number
C [General Social Sciences]
Discipline classification code
03; 0303
Abstract
In cultural and historical studies, collecting image-based content from big data is a fundamental part of data analysis, but the process is as intricate as extracting resources from vast terrains. Scholars increasingly recognize the "Four-panel Cartoon" (FPC) as a valuable source of image content in large digitized newspaper archives in the Republic of Korea. Yet identifying FPCs within these archives is arduous, especially given their unstructured image data format, which makes the task both time-intensive and costly. To address this issue, this paper presents a computational FPC detection mechanism: the YOLOv5_FPC model, developed by fine-tuning the You Only Look Once Version 5 (YOLOv5) deep learning model specifically for FPC image detection. We applied the YOLOv5_FPC model to the Chosun Ilbo News Library archive (1920-1940) for automatic FPC data mining, spanning 47,777 JPG image files. We identified 1040 FPC objects within 1035 files, including FPCs not identified by previous researchers. We provide a detailed description of our methodology, which covers the collection, labeling, training, detection, and distribution of the data we discovered from big data newspaper archives. Our findings, now available as an open-access dataset in the Journal of Open Humanities Data (JOHD) Dataverse, invite discussion among humanities researchers focusing on the culture and history of Korea between 1920 and 1940.
Pages: 1-15
Number of pages: 15