Products-6K: A Large-Scale Groceries Product Recognition Dataset

Cited by: 11
Authors
Georgiadis, Kostas [1]
Kordopatis-Zilos, Giorgos [1]
Kalaganis, Fotis P. [1]
Migkotzidis, Panagiotis [1]
Chatzilari, Elisavet [1]
Panakidou, Valasia [2]
Pantouvakis, Kyriakos [2]
Tortopidis, Savvas [2]
Papadopoulos, Symeon [1]
Nikolopoulos, Spiros [1]
Kompatsiaris, Ioannis [1]
Affiliations
[1] Ctr Res & Technol Hellas, Informat Technol Inst, Thermi 57001, Greece
[2] D Masoutis SA, Thermi, Greece
Keywords
Product Recognition; Groceries Dataset; Image Retrieval; OCR; Features
DOI
10.1145/3453892.3453894
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Product recognition receives continuous attention from the computer vision and deep learning community, mainly with the aim of providing robust solutions for automatic-checkout supermarkets. One of the main challenges is the lack of images that depict a large number of products under realistic conditions. Here, the product recognition task is framed slightly differently from the automatic-checkout paradigm, but the challenges encountered are the same. The dataset was captured with the aim of helping individuals with visual impairment do their daily grocery shopping and thereby increase their autonomy. In particular, we propose a large-scale dataset for tackling the product recognition problem in a supermarket environment. The dataset is characterized by (a) large scale in terms of unique products, each associated with one or more photos from different viewpoints, (b) rich textual descriptions linked to different levels of annotation, and (c) images acquired both in laboratory conditions and in a realistic supermarket scenario with varying clutter and lighting conditions. A direct comparison with existing datasets of this category demonstrates the significantly higher number of available unique products, as well as the richness of the annotation, which enables different recognition scenarios. Finally, the dataset is benchmarked using various approaches based on both visual and textual descriptors.
Pages: 1-7
Number of pages: 7