Manifold regularized cross-modal embedding for zero-shot learning

Cited by: 32

Authors:

Ji, Zhong [1]

Yu, Yunlong [1]

Pang, Yanwei [1]

Guo, Jichang [1]

Zhang, Zhongfei [2]

Affiliations:

[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China

[2] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA

Source:

INFORMATION SCIENCES | 2017 / Volume 378

Funding:

National Natural Science Foundation of China;

Keywords:

Zero-shot learning; Image classification; Cross-modal embedding; Manifold; Domain adaptation; RECOGNITION;

DOI:

10.1016/j.ins.2016.10.025

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Zero-Shot Learning (ZSL) aims at classifying previously unseen class samples and has gained its popularity in applications where samples of some categories are scarce for training. The basic idea to address this issue is transferring knowledge from the seen classes to the unseen classes through mapping the visual feature to an embedding space spanned by class semantic information. The class semantic information can be obtained from human labeled attributes or text corpus in an unsupervised fashion. Therefore, the embedding function from visual space to the embedding space is extremely important. However, the existing embedding approaches to ZSL mainly focus on aligning pairwise semantic consistency from heterogeneous spaces but ignore the intrinsic structure of the locally homogeneous isomorph. In order to preserve the locally visual structure in the embedding process, this paper proposes a Manifold regularized Cross-Modal Embedding (MCME) approach for ZSL by formulating the manifold constraint for intrinsic structure of the visual features as well as aligning pairwise consistency. The linear, closed-form solution makes MCME efficient to compute. Furthermore, rather than applying the embedding function learned from the seen classes directly, we also propose a new domain adaptation strategy to overcome the domain-shift problem during the knowledge transfer process. The MCME with the domain adaptation method is called MCME-DA. Extensive experiments on the benchmark datasets of AwA and CUB validate the superiority and promise of MCME and MCME-DA. (C) 2016 Elsevier Inc. All rights reserved.
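The abstract describes a linear embedding with a manifold constraint and a closed-form solution. The following is a minimal sketch of that general idea, not the paper's actual formulation: it assumes a ridge-style objective ||XW - S||^2 + lam * tr(W^T X^T L X W) + gamma * ||W||^2, where L is the Laplacian of a k-nearest-neighbour graph over the visual features. The function names, the choice of graph, the hyperparameters lam, gamma and k, and the cosine-similarity classifier are all assumptions made for illustration. The domain adaptation step (MCME-DA) is not sketched because the abstract gives no detail about it.

```python
import numpy as np

def knn_laplacian(X, k=5):
    """Unnormalized Laplacian of a symmetrized k-nearest-neighbour graph (assumed graph choice)."""
    n = X.shape[0]
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T  # pairwise squared distances
    np.fill_diagonal(d2, np.inf)                    # a sample is not its own neighbour
    idx = np.argsort(d2, axis=1)[:, :k]             # k nearest neighbours per row
    A = np.zeros((n, n))
    A[np.repeat(np.arange(n), k), idx.ravel()] = 1.0
    A = np.maximum(A, A.T)                          # symmetrize the adjacency matrix
    return np.diag(A.sum(axis=1)) - A

def fit_embedding(X, S, lam=0.1, gamma=1.0, k=5):
    """Closed-form minimizer of the assumed objective.

    X: (n, d) visual features of seen-class samples.
    S: (n, m) semantic vector (attributes or word embedding) of each sample's class.
    """
    L = knn_laplacian(X, k)
    d = X.shape[1]
    lhs = X.T @ X + lam * X.T @ L @ X + gamma * np.eye(d)
    return np.linalg.solve(lhs, X.T @ S)            # W has shape (d, m)

def predict(X_test, W, prototypes):
    """Assign each test sample to the unseen-class prototype with highest cosine similarity."""
    E = X_test @ W
    E = E / (np.linalg.norm(E, axis=1, keepdims=True) + 1e-12)
    P = prototypes / (np.linalg.norm(prototypes, axis=1, keepdims=True) + 1e-12)
    return np.argmax(E @ P.T, axis=1)
```

Under these assumptions, training reduces to one d-by-d linear solve, which is the kind of efficiency the abstract attributes to a linear, closed-form solution.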


Pages: 48-58

Page count: 11

50 items in total

[41] Semantic-Adversarial Graph Convolutional Network for Zero-Shot Cross-Modal Retrieval
Li, Chuang
Fei, Lunke
Kang, Peipei
Liang, Jiahao
Fang, Xiaozhao
Teng, Shaohua
PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 459 - 472
[42] INTER-MODALITY FUSION BASED ATTENTION FOR ZERO-SHOT CROSS-MODAL RETRIEVAL
Chakraborty, Bela
Wang, Peng
Wang, Lei
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2648 - 2652
[43] Disentangled Ontology Embedding for Zero-shot Learning
Geng, Yuxia
Chen, Jiaoyan
Zhang, Wen
Xu, Yajing
Chen, Zhuo
Pan, Jeff Z.
Huang, Yufeng
Xiong, Feiyu
Chen, Huajun
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 443 - 453
[44] Learning a Deep Embedding Model for Zero-Shot Learning
Zhang, Li
Xiang, Tao
Gong, Shaogang
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3010 - 3019
[45] Variational autoencoder based on distributional semantic embedding and cross-modal reconstruction for generalized zero-shot fault diagnosis of industrial processes
Mou, Miao
Zhao, Xiaoqiang
Liu, Kai
Hui, Yongyong
PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 177 : 1154 - 1167
[46] Cross-Modal Zero-Shot-Learning for Tactile Object Recognition
Liu, Huaping
Sun, Fuchun
Fang, Bin
Guo, Di
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (07): : 2466 - 2474
[47] ClusterE-ZSL: A Novel Cluster-Based Embedding for Enhanced Zero-Shot Learning in Contrastive Pre-Training Cross-Modal Retrieval
Tariq, Umair
Hu, Zonghai
Tasneem, Khawaja Tauseef
Bin Heyat, Md Belal
Iqbal, Muhammad Shahid
Aziz, Kamran
IEEE ACCESS, 2024, 12 : 162622 - 162637
[48] Generalized Zero-Shot Learning Based on Manifold Alignment
Xu, Rui
Shao, Shuai
Liu, Baodi
Liu, Weifeng
2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 202 - 207
[49] An Inverse Mapping with Manifold Alignment for Zero-Shot Learning
Wu, Xixun
Song, Binheng
Wang, Zhixiang
Yuan, Chun
MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 400 - 411
[50] A Cross-Modal transfer approach for histological images: A case study in aquaculture for disease identification using Zero-Shot learning
Mendieta, Milton
Romero, Dennis
2017 IEEE SECOND ECUADOR TECHNICAL CHAPTERS MEETING (ETCM), 2017,
