I read, I saw, I tell: Texts Assisted Fine-Grained Visual Classification

被引:23
|
作者
Li, Jingjing [1 ]
Zhu, Lei [2 ]
Huang, Zi [3 ]
Lu, Ke [1 ]
Zhao, Jidong [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Shandong Normal Univ, Jinan, Peoples R China
[3] Univ Queensland, Sch ITEE, Brisbane, Qld, Australia
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Fine-grained visual classification; multi-modal analysis; deep learning; transfer learning;
D O I
10.1145/3240508.3240579
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In visual classification tasks, it is hard to tell the subtle differences from one species to another similar breeds. Such a challenging problem is generally known as Fine-Grained Visual Classification (FGVC). In this paper, we propose a novel FGVC approach called Texts Assisted Fine-Grained Visual Classification (TA-FGVC). TA-FGVC reads from texts to gain attention, sees the images with the gained attention and then tells the subtle differences. Technically, we propose a deep neural network which learns a visual-semantic embedding model. The proposed deep architecture mainly consists of two parts: visual localization and visual-to-semantic projection. The model is fed with both visual features which are extracted from raw images and semantic information which are learned from two sources: gleaned from unannotated texts and gathered from image attributes. At the very last layer of the model, each image is embedded into the semantic space which is related to class labels. Finally, the categorization results from both visual stream and visual-semantic stream are combined to achieve the ultimate decision. Extensive experiments on open standard benchmarks verify the superiority of our model against several state of the art work.
引用
收藏
页码:663 / 671
页数:9
相关论文
共 50 条
  • [1] FINE-GRAINED FERRITES I NICKEL FERRITE
    MALINOFS, WW
    BABBITT, RW
    JOURNAL OF APPLIED PHYSICS, 1961, 32 (03) : S237 - &
  • [2] Leveraging Fine-Grained Labels to Regularize Fine-Grained Visual Classification
    Wu, Junfeng
    Yao, Li
    Liu, Bin
    Ding, Zheyuan
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION (ICCMS 2019) AND 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS (ICICA 2019), 2019, : 133 - 136
  • [3] DriverGuard: A Fine-Grained Protection on I/O Flows
    Cheng, Yueqiang
    Ding, Xuhua
    Deng, Robert H.
    COMPUTER SECURITY - ESORICS 2011, 2011, 6879 : 227 - 244
  • [4] Grain boundary sliding in fine-grained ice I
    Univ of Minnesota, Minneapolis, United States
    Scripta Mater, 9 (1399-1406):
  • [5] Grain boundary sliding in fine-grained Ice I
    Goldsby, DL
    Kohlstedt, DL
    SCRIPTA MATERIALIA, 1997, 37 (09) : 1399 - 1406
  • [6] I SAW THE MOVIE BUT I COULDNT READ THE BOOK
    BARR, HR
    JOURNAL OF READING, 1986, 29 (06): : 511 - 515
  • [7] Reflector: A Fine-Grained I/O Tracker for HPC Systems
    Al-Mamun, Abdullah
    Liu, Jialin
    Li, Tonglin
    Koziol, Quincey
    Zhai, Zhongyi
    Qian, Junyan
    Shen, Haoting
    Zhao, Dongfang
    PROCEEDINGS OF THE 25TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '20), 2020, : 427 - 428
  • [8] Fine-Grained I/O Traffic Control Middleware for I/O Fairness in Virtualized System
    Lee, Jaehak
    Lee, Hwamin
    Yu, Heonchang
    IEEE ACCESS, 2022, 10 : 73122 - 73144
  • [9] An Erudite Fine-Grained Visual Classification Model
    Chang, Dongliang
    Tong, Yujun
    Du, Ruoyi
    Hospedales, Timothy
    Song, Yi-Zhe
    Ma, Zhanyu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7268 - 7277
  • [10] Pairwise Confusion for Fine-Grained Visual Classification
    Dubey, Abhimanyu
    Gupta, Otkrist
    Guo, Pei
    Raskar, Ramesh
    Farrell, Ryan
    Naik, Nikhil
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 71 - 88