STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset

被引:47
|
作者
Yoshikawa, Yuya [1 ]
Shigeto, Yutaro [1 ]
Takeuchi, Akikazu [1 ]
机构
[1] Chiba Inst Technol, STAIR Lab, 2-17-1 Tsudanuma, Narashino, Chiba, Japan
关键词
D O I
10.18653/v1/P17-2066
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In recent years, automatic generation of image descriptions (captions), that is, image captioning, has attracted a great deal of attention. In this paper, we particularly consider generating Japanese captions for images. Since most available caption datasets have been constructed for English language, there are few datasets for Japanese. To tackle this problem, we construct a large-scale Japanese image caption dataset based on images from MS-COCO, which is called STAIR Captions. STAIR Captions consists of 820,310 Japanese captions for 164,062 images. In the experiment, we show that a neural network trained using STAIR Captions can generate more natural and better Japanese captions, compared to those generated using English-Japanese machine translation after generating English captions.
引用
收藏
页码:417 / 421
页数:5
相关论文
共 50 条
  • [1] Quality Estimation for Image Captions Based on Large-scale Human Evaluations
    Levinboim, Tomer
    Thapliyal, Ashish V.
    Sharma, Piyush
    Soricut, Radu
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3157 - 3166
  • [2] Constructing a Large-Scale Database of Japanese Word Associations
    Joyce, Terry
    GLOTTOMETRICS, 2005, 10 : 82 - 90
  • [3] A lightweight convolutional neural network for large-scale Chinese image caption
    赵德新
    杨瑞雪
    郭淑涛
    OptoelectronicsLetters, 2021, 17 (06) : 361 - 366
  • [4] A lightweight convolutional neural network for large-scale Chinese image caption
    Dexin Zhao
    Ruixue Yang
    Shutao Guo
    Optoelectronics Letters, 2021, 17 : 361 - 366
  • [5] A lightweight convolutional neural network for large-scale Chinese image caption
    Zhao, Dexin
    Yang, Ruixue
    Guo, Shutao
    OPTOELECTRONICS LETTERS, 2021, 17 (06) : 361 - 366
  • [6] SDFC dataset: a large-scale benchmark dataset for hyperspectral image classification
    Sun, Liwei
    Zhang, Junjie
    Li, Jia
    Wang, Yueming
    Zeng, Dan
    OPTICAL AND QUANTUM ELECTRONICS, 2023, 55 (02)
  • [7] SDFC dataset: a large-scale benchmark dataset for hyperspectral image classification
    Liwei Sun
    Junjie Zhang
    Jia Li
    Yueming Wang
    Dan Zeng
    Optical and Quantum Electronics, 2023, 55
  • [8] MARVEL: A Large-Scale Image Dataset for Maritime Vessels
    Gundogdu, Erhan
    Solmaz, Berkan
    Yucesoy, Veysel
    Koc, Aykut
    COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 165 - 180
  • [9] SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding
    Bastani, Favyen
    Wolters, Piper
    Gupta, Ritwik
    Ferdinando, Joe
    Kembhavi, Aniruddha
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16726 - +
  • [10] A LARGE-SCALE SOLAR IMAGE DATASET WITH LABELED EVENT REGIONS
    Schuh, Michael A.
    Angryk, Rafal A.
    Pillai, Karthik Ganesan
    Banda, Juan M.
    Martens, Petrus C.
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 4349 - 4353