Towards Collaborative Robotics in Top View Surveillance: A Framework for Multiple Object Tracking by Detection Using Deep Learning

被引：8

作者：

Imran Ahmed ^{[1
,2
]}

Sadia Din ^{[3
]}

Gwanggil Jeon ^{[1
,4
,5
]}

Francesco Piccialli ^{[1
,6
]}

Giancarlo Fortino ^{[1
,7
]}

机构：

[1] IEEE

[2] the Center of excellence in Information Technology,Institute of Management Sciences

[3] the Department of Information and Communication Engineering, Yeungnam University

[4] the School of Electronic Engineering, Xidian University

[5] the Department of Embedded Systems Engineering, Incheon National University

[6] the Department of Mathematics and Applications “RCaccioppoli”, University of Naples Federico Ⅱ

[7] the Department of Informatics, Modeling, Electronics and Systems, University of Calabria

来源：

IEEE/CAAJournalofAutomaticaSinica | 2021年 / 8卷 / 07期

关键词：

D O I：

暂无

中图分类号：

TN948.6 [电视中心管理系统]; TP242 [机器人]; TP18 [人工智能理论];

学科分类号：

0810 ; 081001 ; 1111 ; 081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Collaborative Robotics is one of the high-interest research topics in the area of academia and industry. It has been progressively utilized in numerous applications, particularly in intelligent surveillance systems. It allows the deployment of smart cameras or optical sensors with computer vision techniques,which may serve in several object detection and tracking tasks.These tasks have been considered challenging and high-level perceptual problems, frequently dominated by relative information about the environment, where main concerns such as occlusion, illumination, background, object deformation, and object class variations are commonplace. In order to show the importance of top view surveillance, a collaborative robotics framework has been presented. It can assist in the detection and tracking of multiple objects in top view surveillance. The framework consists of a smart robotic camera embedded with the visual processing unit. The existing pre-trained deep learning models named SSD and YOLO has been adopted for object detection and localization. The detection models are further combined with different tracking algorithms, including GOTURN, MEDIANFLOW, TLD, KCF, MIL, and BOOSTING.These algorithms, along with detection models, help to track and predict the trajectories of detected objects. The pre-trained models are employed; therefore, the generalization performance is also investigated through testing the models on various sequences of top view data set. The detection models achievedmaximum True Detection Rate 93% to 90% with a maximum0.6% False Detection Rate. The tracking results of different algorithms are nearly identical, with tracking accuracy ranging from 90% to 94%. Furthermore, a discussion has been carried out on output results along with future guidelines.

引用

页码：1253 / 1270

页数：18

共 51 条

[1] Single Stage Vehicle Logo Detector Based on Multi-Scale Prediction [J].

Zhang, Junxing ;

Yang, Shuo ;

Bo, Chunjuan ;

Lu, Huimin .

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (10) :2188-2198

[2]

Video Object Segmentation and Tracking[J] . Rui Yao,Guosheng Lin,Shixiong Xia,Jiaqi Zhao,Yong Zhou.ACM Transactions on Intelligent Systems and Technology . 2020 (4)

[3]

Deep learning in video multi-object tracking: A survey[J] . Gioele Ciaparrone,Francisco Luque Sánchez,Siham Tabik,Luigi Troiano,Roberto Tagliaferri,Francisco Herrera.Neurocomputing . 2019 (C)

[4]

Efficient topview person detector using point based transformation and lookup table[J] . Imran Ahmed,Misbah Ahmad,Muhammad Nawaz,Khalid Haseeb,Sajidullah Khan,Gwanggil Jeon.Computer Communications . 2019 (C)

[5] Person detector for different overhead views using machine learning [J].

Ahmed, Imran ;

Ahmad, Misbah ;

Adnan, Awais ;

Ahmad, Awais ;

Khan, Murad .

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (10) :2657-2668

[6]

Rotation invariant person tracker using top view[J] . Kaleem Ullah,Imran Ahmed,Misbah Ahmad,Arif Ur Rahman,Muhammad Nawaz,Awais Adnan.Journal of Ambient Intelligence and Humanized Computing . 2019 (prep)

[7]

Learning Attribute-Specific Representations for Visual Tracking[J] . Yuankai Qi,Shengping Zhang,Weigang Zhang,Li Su,Qingming Huang,Ming Hsuan Yang.Proceedings of the AAAI Conference on Artificial Intelligence . 2019

[8] Energy Efficient Camera Solution for Video Surveillance [J].

Ahmad, Misbah ;

Ahmed, Imran ;

Ullah, Kaleem ;

Khan, Iqbal ;

Khattak, Ayesha ;

Adnan, Awais .

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (03) :522-529

[9]

Object Detection in 20 Years: A Survey[J] . Zhengxia Zou,Zhenwei Shi,Yuhong Guo,Jieping Ye.CoRR . 2019

[10]

Label-less Learning for Emotion Cognition[J] . Chen Min,Hao Yixue.IEEE Transactions on Neural Networks and Learning Systems . 2019 (7)

← 1 2 3 4 5 6 →