Shelf Management: A deep learning-based system for shelf visual monitoring

被引:1
|
作者
Pietrini, Rocco [1 ]
Paolanti, Marina [2 ]
Mancini, Adriano [1 ]
Frontoni, Emanuele [2 ]
Zingaretti, Primo [1 ]
机构
[1] Univ Politecn Marche, Dipartimento Ingn Informaz, VRAI Vis Robot & Artificial Intelligence Lab, via Brecce Bianche 12, I-60131 Ancona, Italy
[2] Univ Macerata, Dept Polit Sci Commun & Int Relat, Via Don Minzoni 22A, I-62100 Macerata, Italy
关键词
Shelf management; Retail; Shelf monitoring; SKU recognition; Planogram compliance; Planogram;
D O I
10.1016/j.eswa.2024.124635
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Shelf monitoring plays a key role in optimizing retail shelf layout, enhancing the customer shopping experience and maximizing profit margins. The process of automating shelf audit involves the detection, localization and recognition of objects on store shelves, including diverse products with varying attributes in unconstrained environments. This facilitates the assessment of planogram compliance. Accurate product localization within shelves requires the identification of specific shelf rows. To address the current technological challenges, we introduce "Shelf Management", a deep learning-based system that is carefully tailored to redesign shelf monitoring practices. Our system can navigate the complexities of shelf monitoring by using advanced deep learning techniques and object detection and recognition models. In addition, a complex semantic module enhances the accuracy of detecting and assigning products to their designated shelf rows and locations. In particular, we recognize the lack of finely annotated datasets at the SKU level. As a contribution to the field, we provide annotations for two novel datasets: SHARD (SHelf mAnagement Row Dataset) and SHAPE (SHelf mAnagement Product dataset). These datasets not only provide valuable resources, but also serve as benchmarks for further research in the field of retail. A complete pipeline is designed using a RetinaNet architecture for object detection with 0.752 mAP, followed by a Deep Hough transform to detect shelf rows as semantic lines with an F1 score of 97%, and a product recognition step using a MobileNetV3 architecture trained with triplet loss and used as a feature extractor together with FAISS for fast image retrieval with an accuracy of 93% on top-1 recognition. Localization is achieved using a deterministic approach based on product detection and shelf row detection. Source code and datasets are available at https://github.com/rokopibyte/shelf_management.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Machine Learning-Based Radon Monitoring System
    Valcarce, Diego
    Alvarellos, Alberto
    Rabunal, Juan Ramon
    Dorado, Julian
    Gestal, Marcos
    CHEMOSENSORS, 2022, 10 (07)
  • [32] Deep Learning-Based Driver Assistance System
    Kurtkaya, Bariscan
    Tezcan, Arda
    Taskiran, Murat
    ELECTRICA, 2023, 23 (03): : 607 - 618
  • [33] A Deep Learning-Based Framework for Visual Inspection of Plastic Bottles
    Kazmi, Majida
    Hafeez, Basra
    Aftab, Fakhra
    Shahid, Jamal
    Qazi, Saad Ahmed
    IEEE ACCESS, 2023, 11 : 125529 - 125542
  • [34] A Deep Learning-based Visual Perception Approach for Mobile Robots
    Shan, Guangcun
    Li, Xin
    Zhang, Yinan
    Wang, Tian
    Fang, Yinghong
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 825 - 829
  • [35] A Comparison of Deep Learning-Based Monocular Visual Odometry Algorithms
    Jeong, Eunju
    Lee, Jaun
    Kim, Pyojin
    PROCEEDINGS OF THE 2021 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY (APISAT 2021), VOL 2, 2023, 913 : 923 - 934
  • [36] Testing Deep Learning-based Visual Perception for Automated Driving
    Abrecht, Stephanie
    Gauerhof, Lydia
    Gladisch, Christoph
    Groh, Konrad
    Heinzemann, Christian
    Woehrle, Matthias
    ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2021, 5 (04)
  • [37] Deep Space In Situ Imaging Results of Commercial Off-the- Shelf Visual Monitoring System Aboard the Hayabusa2 Spacecraft
    Kimura, Shinichi
    Sawada, Hirotaka
    Saiki, Takanao
    Mimasu, Yuya
    Ogawa, Kazunori
    Tsuda, Yuichi
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2021, 36 (03) : 16 - 23
  • [38] Deep Learning-Based Approach for Arabic Visual Speech Recognition
    Alsulami, Nadia H.
    Jamal, Amani T.
    Elrefaei, Lamiaa A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 85 - 108
  • [39] Deep learning-based visual slam for indoor dynamic scenes
    Xu, Zhendong
    Song, Yong
    Pang, Bao
    Xu, Qingyang
    Yuan, Xianfeng
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [40] Deep learning-based visual detection of marine organisms: A survey
    Wang, Ning
    Chen, Tingkai
    Liu, Shaoman
    Wang, Rongfeng
    Karimi, Hamid Reza
    Lin, Yejin
    NEUROCOMPUTING, 2023, 532 : 1 - 32