Visual Recognition by Learning from Web Data: A Weakly Supervised Domain Generalization Approach

被引:0
|
作者
Niu, Li [1 ]
Li, Wen [1 ]
Xu, Dong [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
EVENT RECOGNITION; ADAPTATION; KERNEL; IMAGES; VIDEOS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we formulate a new weakly supervised domain generalization approach for visual recognition by using loosely labeled web images/videos as training data. Specifically, we aim to address two challenging issues when learning robust classifiers: 1) coping with noise in the labels of training web images/videos in the source domain; and 2) enhancing generalization capability of learnt classifiers to any unseen target domain. To address the first issue, we partition the training samples in each class into multiple clusters. By treating each cluster as a "bag" and the samples in each cluster as "instances", we formulate a multi-instance learning (MIL) problem by selecting a subset of training samples from each training bag and simultaneously learning the optimal classifiers based on the selected samples. To address the second issue, we assume the training web images/videos may come from multiple hidden domains with different data distributions. We then extend our MIL formulation to learn one classifier for each class and each latent domain such that multiple classifiers from each class can be effectively integrated to achieve better generalization capability. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our new approach for visual recognition by learning from web data.
引用
收藏
页码:2774 / 2783
页数:10
相关论文
共 50 条
  • [21] Domain Specific Facts Extraction Using Weakly Supervised Active Learning Approach
    Pande, Vinay
    Mukherjee, Tanmoy
    Varma, Vasudeva
    2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2013, : 246 - 251
  • [22] A weakly supervised approach for recycling code recognition
    Pellegrini, Lorenzo
    Maltoni, Davide
    Graffieti, Gabriele
    Lomonaco, Vincenzo
    Mazzini, Lisa
    Mondardini, Marco
    Zappoli, Milena
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [23] A Weakly Supervised Transfer Learning Approach for Radar Sounder Data Segmentation
    Garcia, Miguel Hoyo
    Donini, Elena
    Bovolo, Francesca
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [24] Visual object recognition with supervised learning
    Heisele, B
    IEEE INTELLIGENT SYSTEMS, 2003, 18 (03) : 38 - 42
  • [25] Weakly supervised learning for an effective focused web crawler
    Dhanith, P. R. Joe
    Saeed, Khalid
    Rohith, G.
    Raja, S. P.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [26] Weakly Supervised Action Recognition and Localization Using Web Images
    Liu, Cuiwei
    Wu, Xinxiao
    Jia, Yunde
    COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 642 - 657
  • [27] Weakly Supervised Learning of Object Segmentations from Web-Scale Video
    Hartmann, Glenn
    Grundmann, Matthias
    Hoffman, Judy
    Tsai, David
    Kwatra, Vivek
    Madani, Omid
    Vijayanarasimhan, Sudheendra
    Essa, Irfan
    Rehg, James
    Sukthankar, Rahul
    COMPUTER VISION - ECCV 2012: WORKSHOPS AND DEMONSTRATIONS, PT I, 2012, 7583 : 198 - 208
  • [28] CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images
    Guo, Sheng
    Huang, Weilin
    Zhang, Haozhi
    Zhuang, Chenfan
    Dong, Dengke
    Scott, Matthew R.
    Huang, Dinglong
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 139 - 154
  • [29] Multimodal Visual Concept Learning with Weakly Supervised Techniques
    Bouritsas, Giorgos
    Koutras, Petros
    Zlatintsi, Athanasia
    Maragos, Petros
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4914 - 4923
  • [30] Weakly Supervised Facial Action Unit Recognition With Domain Knowledge
    Wang, Shangfei
    Peng, Guozhu
    Chen, Shiyu
    Ji, Qiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (11) : 3265 - 3276