Automatic Detection of Four-Panel Cartoon in Large-Scale Korean Digitized Newspapers using Deep Learning

被引:0
|
作者
Lee, Seojoon [1 ]
Kim, Byungjun [2 ]
Jun, Bong Gwan [3 ]
机构
[1] Korea Adv Inst Sci & Technol, Grad Sch Culture Technol, Daejeon, South Korea
[2] Korea Adv Inst Sci & Technol, Ctr Digital Humanities & Computat Social Sci, Daejeon, South Korea
[3] Korea Adv Inst Sci & Technol, Sch Digital Humanities & Computat Social Sci, Daejeon, South Korea
关键词
big data; object detection; data strategy; four-panel cartoon; digital newspaper; data science;
D O I
10.5334/johd.205
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
In the realm of cultural and historical studies, the collection of image -based content from big data is a fundamental aspect of data analysis. However, this process is as intricate as extracting resources from vast terrains. Echoing this sentiment, there is a growing appreciation in scholarly circles for "Four -panel Cartoon" (FPC) as a valuable image content source in big data digital newspapers in the Republic of Korea. Yet, identifying these FPCs amidst the vastness of big data archives is an arduous journey, especially given their unstructured image data format - a task both time -intensive and costly. To address this issue, this research paper presents a novel computational FPC detection mechanism: the development of the YOLOv5 _ FPC model, via finetuning the You Only Look Once Version 5 (YOLOv5) deep learning model, tailored precisely for FPC image detection. We applied our YOLOv5 _ FPC model to the Chosun Ilbo News Library archive (1920-1940) for automatic FPC data mining, spanning 47,777 JPG image files. We identified 1040 FPC objects within 1035 files, which include previously undiscovered FPCs by previous researchers. We provide a detailed description of our methodology, which includes the collection, labeling, training, detection, and distribution of the data we discovered from big data newspaper archives. Our findings, now available as an open -access dataset in the Journal of Open Humanities Data (JOHD) Dataverse, invite discussions among humanities researchers focusing on the culture and history of Korea between 1920 and 1940.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [41] Implementation of a Large-Scale Retinal Image Curation Workflow Using Deep Learning Framework
    Balaji, Rohit
    Heathcote, Jen
    Slater, Robert
    Barrett, Nancy
    Voland, Rick
    Tomic, Vesna
    McDonald, Jared
    Blodi, Barbara A.
    Domalpally, Amitha
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2022, 63 (07)
  • [42] Classification of large-scale image database of various skin diseases using deep learning
    Masaya Tanaka
    Atsushi Saito
    Kosuke Shido
    Yasuhiro Fujisawa
    Kenshi Yamasaki
    Manabu Fujimoto
    Kohei Murao
    Youichirou Ninomiya
    Shin’ichi Satoh
    Akinobu Shimizu
    International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 1875 - 1887
  • [43] Large-scale semantic web image retrieval using bimodal deep learning techniques
    Huang, Changqin
    Xu, Haijiao
    Xie, Liang
    Zhu, Jia
    Xu, Chunyan
    Tang, Yong
    INFORMATION SCIENCES, 2018, 430 : 331 - 348
  • [44] Large-scale pancreatic cancer detection via non-contrast CT and deep learning
    Kai Cao
    Yingda Xia
    Jiawen Yao
    Xu Han
    Lukas Lambert
    Tingting Zhang
    Wei Tang
    Gang Jin
    Hui Jiang
    Xu Fang
    Isabella Nogues
    Xuezhou Li
    Wenchao Guo
    Yu Wang
    Wei Fang
    Mingyan Qiu
    Yang Hou
    Tomas Kovarnik
    Michal Vocka
    Yimei Lu
    Yingli Chen
    Xin Chen
    Zaiyi Liu
    Jian Zhou
    Chuanmiao Xie
    Rong Zhang
    Hong Lu
    Gregory D. Hager
    Alan L. Yuille
    Le Lu
    Chengwei Shao
    Yu Shi
    Qi Zhang
    Tingbo Liang
    Ling Zhang
    Jianping Lu
    Nature Medicine, 2023, 29 : 3033 - 3043
  • [45] DeepLesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning
    Yan, Ke
    Wang, Xiaosong
    Lu, Le
    Summers, Ronald M.
    JOURNAL OF MEDICAL IMAGING, 2018, 5 (03)
  • [46] Automated curation of large-scale cancer histopathology image datasets using deep learning
    Hilgers, Lars
    Laleh, Narmin Ghaffari
    West, Nicholas P.
    Westwood, Alice
    Hewitt, Katherine J.
    Quirke, Philip
    Grabsch, Heike, I
    Carrero, Zunamys, I
    Matthaei, Emylou
    Loeffler, Chiara M. L.
    Brinker, Titus J.
    Yuan, Tanwei
    Brenner, Hermann
    Brobeil, Alexander
    Hoffmeister, Michael
    Kather, Jakob Nikolas
    HISTOPATHOLOGY, 2024, 84 (07) : 1139 - 1153
  • [47] Large-Scale Transportation Network Congestion Evolution Prediction Using Deep Learning Theory
    Ma, Xiaolei
    Yu, Haiyang
    Wang, Yunpeng
    Wang, Yinhai
    PLOS ONE, 2015, 10 (03):
  • [48] Large-scale traffic control using autonomous vehicles and decentralized deep reinforcement learning
    Maske, Harshal
    Chu, Tianshu
    Kalabic, Uros
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3816 - 3821
  • [49] Large-scale prediction of stream water quality using an interpretable deep learning approach
    Zheng, Hang
    Liu, Yueyi
    Wan, Wenhua
    Zhao, Jianshi
    Xie, Guanti
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 331
  • [50] AI for Discovery and Diagnosis of Brain Diseases using Deep Learning and Large-Scale Neuroimaging
    Thompson, Paul
    ANNALS OF NEUROLOGY, 2024, 96 : S302 - S303