Products-6K: A Large-Scale Groceries Product Recognition Dataset

Cited by: 11
Authors
Georgiadis, Kostas [1]
Kordopatis-Zilos, Giorgos [1]
Kalaganis, Fotis P. [1]
Migkotzidis, Panagiotis [1]
Chatzilari, Elisavet [1]
Panakidou, Valasia [2]
Pantouvakis, Kyriakos [2]
Tortopidis, Savvas [2]
Papadopoulos, Symeon [1]
Nikolopoulos, Spiros [1]
Kompatsiaris, Ioannis [1]
Affiliations
[1] Ctr Res & Technol Hellas, Informat Technol Inst, Thermi 57001, Greece
[2] D Masoutis SA, Thermi, Greece
Keywords
Product Recognition; Groceries Dataset; Image Retrieval; OCR; Features
DOI
10.1145/3453892.3453894
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Product recognition receives continuous attention from the computer vision and deep learning community, mainly with the aim of providing robust solutions for automatic checkout in supermarkets. One of the main challenges is the lack of images that depict a large number of products in realistic conditions. Here, the product recognition task is framed slightly differently from the automatic checkout paradigm, but the challenges encountered are the same: the dataset was captured with the aim of helping individuals with visual impairment do their daily grocery shopping and thereby increase their autonomy. In particular, we propose a large-scale dataset for tackling the product recognition problem in a supermarket environment. The dataset is characterized by (a) large scale in terms of unique products, each associated with one or more photos from different viewpoints, (b) rich textual descriptions linked to different levels of annotation, and (c) images acquired both in laboratory conditions and in a realistic supermarket scenario with varying clutter and lighting conditions. A direct comparison with existing datasets of this category demonstrates the significantly higher number of available unique products, as well as the richness of the annotation, which enables different recognition scenarios. Finally, the dataset is benchmarked using various approaches based on both visual and textual descriptors.
Pages: 1-7
Number of pages: 7