Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding

被引:0
|
作者
Evans, Talfan [1 ]
Pathak, Shreya [1 ]
Merzic, Hamza [1 ,2 ]
Schwarz, Jonathan [1 ,3 ]
Tannol, Ryutaro [1 ]
Henaff, Olivier J. [1 ]
机构
[1] Google DeepMind, London, England
[2] UCL, London, England
[3] Harvard Univ, Cambridge, MA 02138 USA
来源
关键词
D O I
10.1007/978-3-031-72643-9_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Power-law scaling indicates that large-scale training with uniform sampling is prohibitively slow. Active learning methods aim to increase data efficiency by prioritizing learning on the most relevant examples. Despite their appeal, these methods have yet to be widely adopted since no one algorithm has been shown to a) generalize across models and tasks b) scale to large datasets and c) yield overall FLOP savings when accounting for the overhead of data selection. In this work we propose a method which satisfies these three properties, leveraging small, cheap proxy models to estimate "learnability" scores for datapoints, which are used to prioritize data for training much larger models. As a result, models trained using our methods - ClassAct and Active-CLIP - require 46% and 51% fewer training updates and up to 25% less total computation to reach the same performance as uniformly-trained visual classifiers on JFT and multimodal models on ALIGN, respectively. Finally, we find our data-prioritization scheme to be complementary with recent data-curation and learning objectives, yielding a new state-of-the-art in several multimodal transfer tasks.
引用
收藏
页码:264 / 280
页数:17
相关论文
共 50 条
  • [1] Large-Scale Visual Relationship Understanding
    Zhang, Ji
    Kalantidis, Yannis
    Rohrbach, Marcus
    Paluri, Manohar
    Elgammal, Ahmed
    Elhoseiny, Mohamed
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9185 - 9194
  • [2] Large-scale learning for media understanding
    Rocha, Anderson
    Scheirer, Walter J.
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2015,
  • [3] Large-scale learning for media understanding
    Anderson Rocha
    Walter J. Scheirer
    EURASIP Journal on Image and Video Processing, 2015
  • [4] Assistive Technology Approaches for Large-Scale Assessment: Perceptions of Teachers of Students with Visual Impairments
    Johnstone, Christopher
    Thurlow, Martha
    Altman, Jason
    Timmons, Joe
    Kato, Kentaro
    EXCEPTIONALITY, 2009, 17 (02) : 66 - 75
  • [5] Elementary school teachers' math anxiety and students' math learning: A large-scale replication
    Schaeffer, Marjorie W.
    Rozek, Christopher S.
    Maloney, Erin A.
    Berkowitz, Talia
    Levine, Susan C.
    Beilock, Sian L.
    DEVELOPMENTAL SCIENCE, 2021, 24 (04)
  • [6] ACTIVE LEARNING FOR LARGE-SCALE FACTOR ANALYSIS
    Silva, Jorge
    Carin, Lawrence
    2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 161 - 164
  • [7] Active Learning for Large-Scale Entity Resolution
    Qian, Kun
    Popa, Lucian
    Sen, Prithviraj
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1379 - 1388
  • [8] Teachers' and Students' Perceptions of the Nature and Impact of Large-Scale Reforms
    Ryan, Thomas
    Joong, Peter
    CANADIAN JOURNAL OF EDUCATIONAL ADMINISTRATION AND POLICY, 2005, (38): : 1 - 21
  • [9] An improved way to make large-scale SVR learning practical
    Yong Q.
    Jie Y.
    Lixiu Y.
    Chenzhou Y.
    Eurasip Journal on Applied Signal Processing, 2004, 2004 (08) : 1135 - 1141
  • [10] Visual Analytics to make sense of large-scale administrative and normative data
    Guarino, Alfonso
    Lettieri, Nicola
    Malandrino, Delfina
    Russo, Pietro
    Zaccagnino, Rocco
    2019 23RD INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV): BIOMEDICAL VISUALIZATION AND GEOMETRIC MODELLING & IMAGING, 2019, : 133 - 138