Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

Authors
Evans, Talfan [1]
Pathak, Shreya [1]
Merzic, Hamza [1, 2]
Schwarz, Jonathan [1, 3]
Tanno, Ryutaro [1]
Henaff, Olivier J. [1]
Affiliations
[1] Google DeepMind, London, England
[2] UCL, London, England
[3] Harvard Univ, Cambridge, MA 02138 USA
DOI
10.1007/978-3-031-72643-9_16
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Power-law scaling indicates that large-scale training with uniform sampling is prohibitively slow. Active learning methods aim to increase data efficiency by prioritizing learning on the most relevant examples. Despite their appeal, these methods have yet to be widely adopted, since no one algorithm has been shown to (a) generalize across models and tasks, (b) scale to large datasets, and (c) yield overall FLOP savings when accounting for the overhead of data selection. In this work we propose a method which satisfies these three properties, leveraging small, cheap proxy models to estimate "learnability" scores for datapoints, which are used to prioritize data for training much larger models. As a result, models trained using our methods, ClassAct and Active-CLIP, require 46% and 51% fewer training updates and up to 25% less total computation to reach the same performance as uniformly-trained visual classifiers on JFT and multimodal models on ALIGN, respectively. Finally, we find our data-prioritization scheme to be complementary with recent data-curation and learning objectives, yielding a new state-of-the-art in several multimodal transfer tasks.
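The prioritization idea in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the abstract does not define how "learnability" is computed, so the scoring rule here (a learner's loss minus a trained reference model's loss) is an assumption, and the function names and the hard-coded toy losses, which stand in for the outputs of small proxy models, are hypothetical.

```python
def learnability_scores(learner_losses, reference_losses):
    """Assumed scoring rule (not stated in the abstract): an example scores
    high when the current learner's loss is high but a trained reference
    model's loss is low, i.e. it is not yet learned but appears learnable."""
    return [l - r for l, r in zip(learner_losses, reference_losses)]


def select_top_fraction(scores, keep_fraction):
    """Return the indices of the highest-scoring examples, in index order."""
    k = max(1, int(len(scores) * keep_fraction))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


# Toy per-example losses; in practice these would be produced by small,
# cheap proxy models scoring a candidate batch (hypothetical values).
learner_losses = [2.0, 0.1, 1.5, 3.0]
reference_losses = [0.5, 0.1, 1.0, 3.5]

scores = learnability_scores(learner_losses, reference_losses)
selected = select_top_fraction(scores, keep_fraction=0.5)
print(selected)  # indices of the examples passed on to the large model
```

Under this assumed rule, an example that both models already fit (index 1) and one that even the reference model fits poorly (index 3) are deprioritized, and only the selected sub-batch would be used to update the larger model.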
Pages: 264-280
Page count: 17