Global relative position space based pooling for fine-grained vehicle recognition

被引:4
|
作者
Xiang, Ye [1 ]
Fu, Ying [1 ]
Huang, Hua [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Fine-grained vehicle recognition; Global relative position space based pooling (GRPSP); Convolution neural network (CNN); MODEL; CLASSIFICATION; FEATURES; REPRESENTATION; NETWORKS; SURF; BAG;
D O I
10.1016/j.neucom.2019.07.098
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of fine-grained vehicle recognition, large intra-class variation often affects the final prediction seriously. One of the principal reasons is unstable change of absolute positions for discriminative features, which can not be completely solved by the region of interest (ROI) localization for inevitable deviations. In this paper, we propose a global relative position space based pooling (GRPSP) method to replace the absolute position information with global relative position information, by which the discriminative features can be well aligned and thereby the intra-class variation is reduced effectively. Specifically, the discriminative features containing both absolute position information and appearance information are extracted by a convolution neural network (CNN). We then transform the absolute positions into global relative positions, and estimate position probabilities of locating at discrete representatives of global relative position space. Accordingly, the joint probabilities of different kinds of features locating at different representative positions can be calculated, and constitute the pooling vector. Lastly, the Support Vector Machine (SVM) is trained for classification. Experimental results on public and self-building challenging datasets both show that our proposed method can improve various CNNs and is the state-of-the-art. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:287 / 298
页数:12
相关论文
共 50 条
  • [41] A model for fine-grained vehicle classification based on deep learning
    Yu, Shaoyong
    Wu, Yun
    Li, Wei
    Song, Zhijun
    Zeng, Wenhua
    NEUROCOMPUTING, 2017, 257 : 97 - 103
  • [42] Deep LSAC for Fine-Grained Recognition
    Lin, Di
    Wang, Yi
    Liang, Lingyu
    Li, Ping
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 200 - 214
  • [43] FgER: Fine-Grained Entity Recognition
    Abhishek
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8008 - 8009
  • [44] A dataset for fine-grained seed recognition
    Yuan, Min
    Lv, Ningning
    Dong, Yongkang
    Hu, Xiaowen
    Lu, Fuxiang
    Zhan, Kun
    Shen, Jiacheng
    Wu, Xiaolin
    Zhu, Liye
    Xie, Yufei
    SCIENTIFIC DATA, 2024, 11 (01)
  • [45] Structure-guided attention network for fine-grained vehicle model recognition
    Li, Xiying
    Quan, Fengwei
    Jiang, Qianyin
    Lu, Qiang
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (02) : 23033
  • [46] AMLNet: Attention Multibranch Loss CNN Models for Fine-Grained Vehicle Recognition
    Lu, Hongchun
    Han, Min
    Wang, Chaoqing
    Cheng, Junlong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 375 - 384
  • [47] A NOVEL PART FEATURE INTEGRATION AND FUSION METHOD FOR FINE-GRAINED VEHICLE RECOGNITION
    Wang, Ping
    Cao, Yijie
    Lu, Lei
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1990 - 1994
  • [48] Dual-Rank Attention Module for Fine-Grained Vehicle Model Recognition
    Cai, Wen
    Zhu, Wenjia
    Cheng, Bo
    Xu, Longdao
    Yu, Ye
    Lu, Qiang
    Jia, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 17 - 28
  • [49] A Novel Smart Lightweight Visual Attention Model for Fine-Grained Vehicle Recognition
    Boukerche, Azzedine
    Ma, Xiren
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 13846 - 13862
  • [50] Local Temporal Bilinear Pooling for Fine-Grained Action Parsing
    Zhang, Yan
    Tang, Siyu
    Muandet, Krikamol
    Jarvers, Christian
    Neumann, Heiko
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11997 - 12007