Products-6K: A Large-Scale Groceries Product Recognition Dataset

Cited by: 11
Authors
Georgiadis, Kostas [1 ]
Kordopatis-Zilos, Giorgos [1 ]
Kalaganis, Fotis P. [1 ]
Migkotzidis, Panagiotis [1 ]
Chatzilari, Elisavet [1 ]
Panakidou, Valasia [2 ]
Pantouvakis, Kyriakos [2 ]
Tortopidis, Savvas [2 ]
Papadopoulos, Symeon [1 ]
Nikolopoulos, Spiros [1 ]
Kompatsiaris, Ioannis [1 ]
Affiliations
[1] Ctr Res & Technol Hellas, Informat Technol Inst, Thermi 57001, Greece
[2] D Masoutis SA, Thermi, Greece
Keywords
Product Recognition; Groceries Dataset; Image Retrieval; OCR; Features
DOI
10.1145/3453892.3453894
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Product recognition receives continuous attention from the computer vision and deep learning community, mainly with the aim of providing robust solutions for automatic-checkout supermarkets. One of the main challenges is the lack of images that depict a large number of products in realistic conditions. Here, the product recognition task is framed slightly differently from the automatic checkout paradigm, but the challenges encountered are the same. The dataset is captured with the aim of helping individuals with visual impairment do their daily grocery shopping, in order to increase their autonomy. In particular, we propose a large-scale dataset for tackling the product recognition problem in a supermarket environment. The dataset is characterized by (a) large scale in terms of unique products, each associated with one or more photos from different viewpoints, (b) rich textual descriptions linked to different levels of annotation, and (c) images acquired both in laboratory conditions and in a realistic supermarket scenario, under various clutter and lighting conditions. A direct comparison with existing datasets of this category demonstrates the significantly higher number of available unique products, as well as the richness of its annotation, which enables different recognition scenarios. Finally, the dataset is benchmarked using various approaches based on both visual and textual descriptors.
Pages: 1-7
Number of pages: 7