Active Metric Learning and Classification using Similarity Queries

被引:0
|
作者
Nadagouda, Namrata [1 ]
Xu, Austin [1 ]
Davenport, Mark A. [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning is commonly used to train label-efficient models by adaptively selecting the most informative queries. However, most active learning strategies are designed to either learn a representation of the data (e.g., embedding or metric learning) or perform well on a task (e.g., classification) on the data. However, many machine learning tasks involve a combination of both representation learning and a task-specific goal. Motivated by this, we propose a novel unified query framework that can be applied to any problem in which a key component is learning a representation of the data that reflects similarity. Our approach builds on similarity or nearest neighbor (NN) queries which seek to select samples that result in improved embeddings. The queries consist of a reference and a set of objects, with an oracle selecting the object most similar (i.e., nearest) to the reference. In order to reduce the number of solicited queries, they are chosen adaptively according to an information theoretic criterion. We demonstrate the effectiveness of the proposed strategy on two tasks - active metric learning and active classification - using a variety of synthetic and real world datasets. In particular, we demonstrate that actively selected NN queries outperform recently developed active triplet selection methods in a deep metric learning setting. Further, we show that in classification, actively selecting class labels can be reformulated as a process of selecting the most informative NN query, allowing direct application of our method.
引用
收藏
页码:1478 / 1488
页数:11
相关论文
共 50 条
  • [41] Learning with queries corrupted by classification noise
    Jackson, Jeffrey
    Shamir, Eli
    Shwartzman, Clara
    Discrete Applied Mathematics, 92 (02): : 157 - 175
  • [42] DENSE SEMANTIC REFINEMENT USING ACTIVE SIMILARITY LEARNING
    Clarkson, Connor
    Edwards, Michael
    Xie, Xianghua
    IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 19 (01): : 15 - 30
  • [43] Malware Family Classification using Active Learning by Learning
    Chen, Chin-Wei
    Su, Ching-Hung
    Lee, Kun-Wei
    Bair, Ping-Hao
    2020 22ND INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): DIGITAL SECURITY GLOBAL AGENDA FOR SAFE SOCIETY!, 2020, : 590 - 595
  • [44] A queries-based structure for similarity searching in static and dynamic metric spaces
    Hanyf, Youssef
    Silkan, Hassan
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (02) : 188 - 196
  • [45] Solving similarity joins and range queries in metric spaces with the list of twin clusters
    Paredes, Rodrigo
    Reyes, Nora
    JOURNAL OF DISCRETE ALGORITHMS, 2009, 7 (01) : 18 - 35
  • [46] A REGULARIZED MULTI-METRIC ACTIVE LEARNING FRAMEWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Zhang, Zhou
    Crawford, Melba M.
    2016 8TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2016,
  • [47] Speaker accent classification using distance metric learning approach
    Ullah, Sameeh
    Karray, Fakhri
    2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 856 - 861
  • [48] Improving Classification of Malware Families using Learning a Distance Metric
    Jurecek, Martin
    Jureckova, Olha
    Lorencz, Robert
    ICISSP: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2021, : 643 - 652
  • [49] SAR Image Classification Using CNN Embeddings and Metric Learning
    Li, Yibing
    Li, Xiang
    Sun, Qian
    Dong, Qianhui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [50] Similarity-based Active Learning for Image Classification under Class Imbalance
    Zhang, Chuanhai
    Tavanapong, Wallapak
    Kijkul, Gavin
    Wong, Johnny
    de Groen, Piet C.
    Oh, JungHwan
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1422 - 1427