Shelf Management: A deep learning-based system for shelf visual monitoring

Cited by: 1

Authors:

Pietrini, Rocco [1]

Paolanti, Marina [2]

Mancini, Adriano [1]

Frontoni, Emanuele [2]

Zingaretti, Primo [1]

Affiliations:

[1] Univ Politecn Marche, Dipartimento Ingn Informaz, VRAI Vis Robot & Artificial Intelligence Lab, via Brecce Bianche 12, I-60131 Ancona, Italy

[2] Univ Macerata, Dept Polit Sci Commun & Int Relat, Via Don Minzoni 22A, I-62100 Macerata, Italy

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2024 / Vol. 255

Keywords:

Shelf management; Retail; Shelf monitoring; SKU recognition; Planogram compliance; Planogram

DOI:

10.1016/j.eswa.2024.124635

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Shelf monitoring plays a key role in optimizing retail shelf layout, enhancing the customer shopping experience and maximizing profit margins. The process of automating shelf audit involves the detection, localization and recognition of objects on store shelves, including diverse products with varying attributes in unconstrained environments. This facilitates the assessment of planogram compliance. Accurate product localization within shelves requires the identification of specific shelf rows. To address the current technological challenges, we introduce "Shelf Management", a deep learning-based system that is carefully tailored to redesign shelf monitoring practices. Our system can navigate the complexities of shelf monitoring by using advanced deep learning techniques and object detection and recognition models. In addition, a complex semantic module enhances the accuracy of detecting and assigning products to their designated shelf rows and locations. In particular, we recognize the lack of finely annotated datasets at the SKU level. As a contribution to the field, we provide annotations for two novel datasets: SHARD (SHelf mAnagement Row Dataset) and SHAPE (SHelf mAnagement Product dataset). These datasets not only provide valuable resources, but also serve as benchmarks for further research in the field of retail. A complete pipeline is designed using a RetinaNet architecture for object detection with 0.752 mAP, followed by a Deep Hough transform to detect shelf rows as semantic lines with an F1 score of 97%, and a product recognition step using a MobileNetV3 architecture trained with triplet loss and used as a feature extractor together with FAISS for fast image retrieval with an accuracy of 93% on top-1 recognition. Localization is achieved using a deterministic approach based on product detection and shelf row detection. Source code and datasets are available at https://github.com/rokopibyte/shelf_management.
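
The abstract states that localization is deterministic and derived from product detections and shelf row detections, but gives no further detail. The following is only a minimal illustrative sketch of one way such a rule could look, not the authors' implementation. It assumes each shelf row is a horizontal line given by a single y coordinate and each product is an axis-aligned box; the names `Box` and `assign_to_rows` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Box:
    """Axis-aligned product detection in pixel coordinates (y grows downward)."""
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def assign_to_rows(boxes: List[Box], row_ys: List[float]) -> Dict[int, List[str]]:
    """Assign each box to the closest shelf line at or below its bottom edge.

    Assumption (not from the paper): a shelf row is a horizontal line
    described by its y coordinate. Rows are indexed from top to bottom.
    Returns {row_index: [labels ordered left to right]}.
    """
    if not row_ys:
        raise ValueError("at least one shelf row is required")
    rows = sorted(row_ys)
    placed: Dict[int, List[Tuple[float, str]]] = {}
    for box in boxes:
        # Candidate shelves are the lines the product could be standing on.
        below = [(y - box.y_max, i) for i, y in enumerate(rows) if y >= box.y_max]
        if below:
            _, idx = min(below)
        else:
            # No line below the box: fall back to the nearest line overall.
            idx = min(range(len(rows)), key=lambda i: abs(rows[i] - box.y_max))
        x_center = (box.x_min + box.x_max) / 2
        placed.setdefault(idx, []).append((x_center, box.label))
    # Order products inside each row by horizontal position.
    return {
        idx: [label for _, label in sorted(items)]
        for idx, items in sorted(placed.items())
    }


if __name__ == "__main__":
    detections = [
        Box("water", 60, 30, 90, 99),
        Box("cola", 10, 20, 50, 98),
        Box("chips", 5, 120, 45, 199),
    ]
    print(assign_to_rows(detections, row_ys=[100, 200]))
```

In practice the detected semantic lines are rarely perfectly horizontal, so a real system would likely evaluate each line at the box's horizontal center rather than use a single y value; that refinement is left out here for brevity.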


Pages: 14
