Manifold regularized cross-modal embedding for zero-shot learning

Cited by: 32
Authors
Ji, Zhong [1]
Yu, Yunlong [1]
Pang, Yanwei [1]
Guo, Jichang [1]
Zhang, Zhongfei [2]
Affiliations
[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China
[2] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
Funding
National Natural Science Foundation of China
Keywords
Zero-shot learning; Image classification; Cross-modal embedding; Manifold; Domain adaptation; Recognition
DOI
10.1016/j.ins.2016.10.025
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Zero-Shot Learning (ZSL) aims to classify samples from previously unseen classes and has gained popularity in applications where training samples of some categories are scarce. The basic idea is to transfer knowledge from the seen classes to the unseen classes by mapping visual features to an embedding space spanned by class semantic information. This class semantic information can be obtained from human-labeled attributes or, in an unsupervised fashion, from text corpora. The embedding function from the visual space to the embedding space is therefore extremely important. However, existing embedding approaches to ZSL mainly focus on aligning pairwise semantic consistency across heterogeneous spaces and ignore the intrinsic structure of the locally homogeneous isomorph. To preserve the local visual structure in the embedding process, this paper proposes a Manifold regularized Cross-Modal Embedding (MCME) approach for ZSL that formulates a manifold constraint on the intrinsic structure of the visual features while also aligning pairwise consistency. The linear, closed-form solution makes MCME efficient to compute. Furthermore, rather than directly applying the embedding function learned from the seen classes, we also propose a new domain adaptation strategy to overcome the domain-shift problem during knowledge transfer. MCME with this domain adaptation method is called MCME-DA. Extensive experiments on the benchmark datasets AwA and CUB validate the superiority and promise of MCME and MCME-DA. (C) 2016 Elsevier Inc. All rights reserved.
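The abstract's idea of a linear, closed-form embedding with a manifold constraint can be illustrated with a generic graph-Laplacian-regularized least-squares sketch. This is not the paper's exact objective, which the abstract does not state. It assumes a loss of the form ||XW - S||^2 + lam * ||W||^2 + gamma * tr(W^T X^T L X W), where X holds visual features, S holds the semantic vectors of the seen-class samples, and L is the Laplacian of a k-nearest-neighbor graph over X. The function names, the k-NN graph construction, the hyperparameters lam, gamma and k, and the cosine nearest-prototype classifier are all hypothetical choices made for illustration. The domain adaptation step of MCME-DA is not covered.

```python
import numpy as np


def knn_graph_laplacian(X, k=5):
    """Unnormalized Laplacian L = D - A of a symmetrized k-NN graph over the rows of X.

    Assumption: a binary k-NN graph stands in for whatever neighborhood
    structure the paper actually uses.
    """
    n = X.shape[0]
    # Pairwise squared Euclidean distances between samples.
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    np.fill_diagonal(d2, np.inf)  # a sample is never its own neighbor
    idx = np.argsort(d2, axis=1)[:, :k]  # indices of the k nearest neighbors
    A = np.zeros((n, n))
    A[np.repeat(np.arange(n), k), idx.ravel()] = 1.0
    A = np.maximum(A, A.T)  # symmetrize the adjacency matrix
    return np.diag(A.sum(axis=1)) - A


def fit_embedding(X, S, lam=1.0, gamma=0.1, k=5):
    """Closed-form W minimizing the assumed Laplacian-regularized loss.

    Setting the gradient to zero gives
    (X^T X + lam * I + gamma * X^T L X) W = X^T S.
    """
    L = knn_graph_laplacian(X, k)
    d = X.shape[1]
    lhs = X.T @ X + lam * np.eye(d) + gamma * (X.T @ L @ X)
    return np.linalg.solve(lhs, X.T @ S)


def predict_unseen(X_test, W, prototypes):
    """Assign each test sample to the unseen-class prototype with highest cosine similarity."""
    Z = X_test @ W
    Zn = Z / (np.linalg.norm(Z, axis=1, keepdims=True) + 1e-12)
    Pn = prototypes / (np.linalg.norm(prototypes, axis=1, keepdims=True) + 1e-12)
    return np.argmax(Zn @ Pn.T, axis=1)
```

With gamma set to 0 the solution reduces to ordinary ridge regression, so the gamma term is the only place where the local structure of the visual features enters this sketch.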
Pages: 48-58
Number of pages: 11
Related papers
50 in total
  • [41] Semantic-Adversarial Graph Convolutional Network for Zero-Shot Cross-Modal Retrieval
    Li, Chuang
    Fei, Lunke
    Kang, Peipei
    Liang, Jiahao
    Fang, Xiaozhao
    Teng, Shaohua
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 459 - 472
  • [42] INTER-MODALITY FUSION BASED ATTENTION FOR ZERO-SHOT CROSS-MODAL RETRIEVAL
    Chakraborty, Bela
    Wang, Peng
    Wang, Lei
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2648 - 2652
  • [43] Disentangled Ontology Embedding for Zero-shot Learning
    Geng, Yuxia
    Chen, Jiaoyan
    Zhang, Wen
    Xu, Yajing
    Chen, Zhuo
    Pan, Jeff Z.
    Huang, Yufeng
    Xiong, Feiyu
    Chen, Huajun
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 443 - 453
  • [44] Learning a Deep Embedding Model for Zero-Shot Learning
    Zhang, Li
    Xiang, Tao
    Gong, Shaogang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3010 - 3019
  • [45] Variational autoencoder based on distributional semantic embedding and cross-modal reconstruction for generalized zero-shot fault diagnosis of industrial processes
    Mou, Miao
    Zhao, Xiaoqiang
    Liu, Kai
    Hui, Yongyong
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 177 : 1154 - 1167
  • [46] Cross-Modal Zero-Shot-Learning for Tactile Object Recognition
    Liu, Huaping
    Sun, Fuchun
    Fang, Bin
    Guo, Di
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (07): : 2466 - 2474
  • [47] ClusterE-ZSL: A Novel Cluster-Based Embedding for Enhanced Zero-Shot Learning in Contrastive Pre-Training Cross-Modal Retrieval
    Tariq, Umair
    Hu, Zonghai
    Tasneem, Khawaja Tauseef
    Bin Heyat, Md Belal
    Iqbal, Muhammad Shahid
    Aziz, Kamran
    IEEE ACCESS, 2024, 12 : 162622 - 162637
  • [48] Generalized Zero-Shot Learning Based on Manifold Alignment
    Xu, Rui
    Shao, Shuai
    Liu, Baodi
    Liu, Weifeng
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 202 - 207
  • [49] An Inverse Mapping with Manifold Alignment for Zero-Shot Learning
    Wu, Xixun
    Song, Binheng
    Wang, Zhixiang
    Yuan, Chun
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 400 - 411
  • [50] A Cross-Modal transfer approach for histological images: A case study in aquaculture for disease identification using Zero-Shot learning
    Mendieta, Milton
    Romero, Dennis
    2017 IEEE SECOND ECUADOR TECHNICAL CHAPTERS MEETING (ETCM), 2017,