Visual Co-occurrence Network: Using Context for Large-Scale Object Recognition in Retail

被引:0
|
作者
Advani, Siddharth [1 ]
Smith, Brigid [1 ]
Tanabe, Yasuki [2 ]
Irick, Kevin [3 ]
Cotter, Matthew [1 ]
Sampson, Jack [1 ]
Narayanan, Vijaykrishnan [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
[2] Toshiba, Tokyo, Japan
[3] SiliconScapes LLC, State Coll, PA USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In any visual object recognition system, the classification accuracy will likely determine the usefulness of the system as a whole. In many real-world applications, it is also important to be able to recognize a large number of diverse objects for the system to be robust enough to handle the sort of tasks that the human visual system handles on an average day. These objectives are often at odds with performance, as running too large of a number of detectors on any one scene will be prohibitively slow for use in any real-time scenario. However, visual information has temporal and spatial context that can be exploited to reduce the number of detectors that need to be triggered at any given instance. In this paper, we propose a dynamic approach to encode such context, called Visual Co-occurrence Network (ViCoNet) that establishes relationships between objects observed in a visual scene. We investigate the utility of ViCoNet when integrated into a vision pipeline targeted for retail shopping. When evaluated on a large and deep dataset, we achieve a 50% improvement in performance and a 7% improvement in accuracy in the best case, and a 45% improvement in performance and a 3% improvement in accuracy in the average case over an established baseline. The memory overhead of ViCoNet is around 10KB, highlighting its effectiveness on temporal big data.
引用
收藏
页码:103 / 112
页数:10
相关论文
共 50 条
  • [21] Context-Dependent Robust Text Recognition using Large-scale Restricted Bayesian Network
    Nakada, Hidemoto
    Ichisugi, Yuuji
    8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 : 314 - 320
  • [22] Implementation of Large-scale Object Recognition System
    Kim, Min-Uk
    Yoon, Kyoungro
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA 2013), 2013,
  • [23] HIGH ORDER CO-OCCURRENCE OF VISUAL WORDS FOR ACTION RECOGNITION
    Zhang, Lei
    Zhen, Xiantong
    Shao, Ling
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 757 - 760
  • [24] Object Classification Using Heterogeneous Co-occurrence Features
    Ito, Satoshi
    Kubota, Susumu
    COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 209 - 222
  • [25] Object categorization using co-occurrence, location and appearance
    Galleguillos, Carolina
    Rabinovich, Andrew
    Belongie, Serge
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3552 - 3559
  • [26] Object Classification Using Heterogeneous Co-occurrence Features
    Ito, Satoshi
    Kubota, Susumu
    COMPUTER VISION-ECCV 2010, PT V, 2010, 6315 : 701 - 714
  • [27] CoANE: Modeling Context Co-occurrence for Attributed Network Embedding
    Hsieh, I-Chung
    Li, Cheng-Te
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1567 - 1568
  • [28] CoANE: Modeling Context Co-Occurrence for Attributed Network Embedding
    Hsieh, I-Chung
    Li, Cheng-Te
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 167 - 180
  • [29] VisualTextualRank: An Extension of VisualRank to Large-Scale Video Shot Extraction Exploiting Tag Co-occurrence
    Do, Nga H.
    Yanai, Keiji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (01): : 166 - 172
  • [30] Trend and Co-occurrence Network of COVID-19 Symptoms From Large-Scale Social Media Data: Infoveillance Study
    Wu, Jiageng
    Wang, Lumin
    Hua, Yining
    Li, Minghui
    Zhou, Li
    Bates, David W.
    Yang, Jie
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25