Visual Co-occurrence Network: Using Context for Large-Scale Object Recognition in Retail

被引：0

作者：

Advani, Siddharth ^{[1
]}

Smith, Brigid ^{[1
]}

Tanabe, Yasuki ^{[2
]}

Irick, Kevin ^{[3
]}

Cotter, Matthew ^{[1
]}

Sampson, Jack ^{[1
]}

Narayanan, Vijaykrishnan ^{[1
]}

机构：

[1] Penn State Univ, University Pk, PA 16802 USA

[2] Toshiba, Tokyo, Japan

[3] SiliconScapes LLC, State Coll, PA USA

来源：

2015 13TH IEEE SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA | 2015年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In any visual object recognition system, the classification accuracy will likely determine the usefulness of the system as a whole. In many real-world applications, it is also important to be able to recognize a large number of diverse objects for the system to be robust enough to handle the sort of tasks that the human visual system handles on an average day. These objectives are often at odds with performance, as running too large of a number of detectors on any one scene will be prohibitively slow for use in any real-time scenario. However, visual information has temporal and spatial context that can be exploited to reduce the number of detectors that need to be triggered at any given instance. In this paper, we propose a dynamic approach to encode such context, called Visual Co-occurrence Network (ViCoNet) that establishes relationships between objects observed in a visual scene. We investigate the utility of ViCoNet when integrated into a vision pipeline targeted for retail shopping. When evaluated on a large and deep dataset, we achieve a 50% improvement in performance and a 7% improvement in accuracy in the best case, and a 45% improvement in performance and a 3% improvement in accuracy in the average case over an established baseline. The memory overhead of ViCoNet is around 10KB, highlighting its effectiveness on temporal big data.

引用

页码：103 / 112

页数：10

共 50 条

[21] Context-Dependent Robust Text Recognition using Large-scale Restricted Bayesian Network
Nakada, Hidemoto
Ichisugi, Yuuji
8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 : 314 - 320
[22] Implementation of Large-scale Object Recognition System
Kim, Min-Uk
Yoon, Kyoungro
2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA 2013), 2013,
[23] HIGH ORDER CO-OCCURRENCE OF VISUAL WORDS FOR ACTION RECOGNITION
Zhang, Lei
Zhen, Xiantong
Shao, Ling
2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 757 - 760
[24] Object Classification Using Heterogeneous Co-occurrence Features
Ito, Satoshi
Kubota, Susumu
COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 209 - 222
[25] Object categorization using co-occurrence, location and appearance
Galleguillos, Carolina
Rabinovich, Andrew
Belongie, Serge
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3552 - 3559
[26] Object Classification Using Heterogeneous Co-occurrence Features
Ito, Satoshi
Kubota, Susumu
COMPUTER VISION-ECCV 2010, PT V, 2010, 6315 : 701 - 714
[27] CoANE: Modeling Context Co-occurrence for Attributed Network Embedding
Hsieh, I-Chung
Li, Cheng-Te
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1567 - 1568
[28] CoANE: Modeling Context Co-Occurrence for Attributed Network Embedding
Hsieh, I-Chung
Li, Cheng-Te
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 167 - 180
[29] VisualTextualRank: An Extension of VisualRank to Large-Scale Video Shot Extraction Exploiting Tag Co-occurrence
Do, Nga H.
Yanai, Keiji
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (01): : 166 - 172
[30] Trend and Co-occurrence Network of COVID-19 Symptoms From Large-Scale Social Media Data: Infoveillance Study
Wu, Jiageng
Wang, Lumin
Hua, Yining
Li, Minghui
Zhou, Li
Bates, David W.
Yang, Jie
JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25

← 1 2 3 4 5 →