Automatic Detection of Four-Panel Cartoon in Large-Scale Korean Digitized Newspapers using Deep Learning

Authors
Lee, Seojoon [1]
Kim, Byungjun [2]
Jun, Bong Gwan [3]
Affiliations
[1] Korea Adv Inst Sci & Technol, Grad Sch Culture Technol, Daejeon, South Korea
[2] Korea Adv Inst Sci & Technol, Ctr Digital Humanities & Computat Social Sci, Daejeon, South Korea
[3] Korea Adv Inst Sci & Technol, Sch Digital Humanities & Computat Social Sci, Daejeon, South Korea
Keywords
big data; object detection; data strategy; four-panel cartoon; digital newspaper; data science
DOI
10.5334/johd.205
Chinese Library Classification (CLC) number
C [Social Sciences: General]
Discipline classification code
03; 0303
Abstract
In cultural and historical studies, collecting image-based content from big data is a fundamental part of data analysis, yet the process is as intricate as extracting resources from vast terrains. Scholars increasingly value the "Four-panel Cartoon" (FPC) as an image content source in big data digital newspapers in the Republic of Korea. However, identifying FPCs in large archives is arduous, time-intensive, and costly, because the archives store them as unstructured image data. To address this issue, this paper presents a novel computational FPC detection mechanism: the YOLOv5_FPC model, developed by fine-tuning the You Only Look Once Version 5 (YOLOv5) deep learning model specifically for FPC image detection. We applied the YOLOv5_FPC model to the Chosun Ilbo News Library archive (1920-1940) for automatic FPC data mining across 47,777 JPG image files. We identified 1,040 FPC objects within 1,035 files, including FPCs that previous researchers had not found. We describe our methodology in detail, covering the collection, labeling, training, detection, and distribution of the data we discovered in the newspaper archive. Our findings, now available as an open-access dataset in the Journal of Open Humanities Data (JOHD) Dataverse, invite discussion among humanities researchers focusing on the culture and history of Korea between 1920 and 1940.
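The abstract reports more FPC objects (1,040) than files containing them (1,035), which implies that a few pages carry more than one cartoon. The sketch below shows one way such a tally could be computed from YOLO-style text labels, where each line reads `class x_center y_center width height [confidence]` with normalized coordinates. It is only an illustration under stated assumptions: the function name `summarize_detections`, the class id `0` for FPC, the `0.5` confidence threshold, and the sample file names are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def summarize_detections(labels_by_file, fpc_class_id=0, min_conf=0.5):
    """Count FPC detections per image.

    labels_by_file maps an image file name to the text of its YOLO-style
    label file. The class id and threshold are assumptions for this sketch.
    """
    per_file = Counter()
    for filename, text in labels_by_file.items():
        for line in text.splitlines():
            parts = line.split()
            if len(parts) < 5:
                continue  # skip blank or malformed lines
            class_id = int(parts[0])
            # A sixth column, when present, is the confidence score.
            conf = float(parts[5]) if len(parts) > 5 else 1.0
            if class_id == fpc_class_id and conf >= min_conf:
                per_file[filename] += 1
    return {
        "files_with_fpc": len(per_file),
        "fpc_objects": sum(per_file.values()),
        "files_with_multiple": sorted(f for f, n in per_file.items() if n > 1),
    }


# Hypothetical label data for four scanned pages.
sample = {
    "page_a.jpg": "0 0.5 0.5 0.2 0.4 0.91\n0 0.2 0.7 0.2 0.4 0.88",
    "page_b.jpg": "0 0.3 0.3 0.2 0.4 0.35",  # below the threshold
    "page_c.jpg": "",                          # no detections
    "page_d.jpg": "0 0.6 0.4 0.2 0.4 0.77",
}
summary = summarize_detections(sample)
print(summary)
```

Reporting the object count and the file count separately, as the abstract does, makes pages with several cartoons easy to spot for manual review.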
Pages: 1-15
Page count: 15