On Multilabel Classification and Ranking with Bandit Feedback

被引:0
|
作者
Gentile, Claudio [1 ]
Orabona, Francesco [2 ]
机构
[1] Univ Insubria, DiSTA, I-21100 Varese, Italy
[2] Toyota Technol Inst Chicago, Chicago, IL 60637 USA
关键词
contextual bandits; structured prediction; ranking; online learning; regret bounds; generalized linear; ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a novel multilabel/ranking algorithm working in partial information settings. The algorithm is based on 2nd-order descent methods, and relies on upper-confidence bounds to trade-off exploration and exploitation. We analyze this algorithm in a partial adversarial setting, where covariates can be adversarial, but multilabel probabilities are ruled by (generalized) linear models. We show O(T-1/2 log T) regret bounds, which improve in several ways on the existing results. We test the effectiveness of our upper-confidence scheme by contrasting against full-information baselines on diverse real-world multilabel data sets, often obtaining comparable performance.
引用
收藏
页码:2451 / 2487
页数:37
相关论文
共 50 条
  • [21] Online Boosting with Bandit Feedback
    Brukhim, Nataly
    Hazan, Elad
    ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132
  • [22] Bandit Learning with Implicit Feedback
    Qi, Yi
    Wu, Qingyun
    Wang, Hongning
    Tang, Jie
    Sun, Maosong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [23] MaxGap bandit: Adaptive algorithms for approximate ranking
    Katariya, Sumeet
    Tripathy, Ardhendu
    Nowak, Robert
    arXiv, 2019,
  • [24] Optimal Clustering with Bandit Feedback
    Yang, Junwen
    Zhong, Zixin
    Tan, Vincent Y. F.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [25] Nearest Neighbour with Bandit Feedback
    Pasteris, Stephen
    Hicks, Chris
    Mavroudis, Vasilios
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [26] UniRank: Unimodal Bandit Algorithms for Online Ranking
    Gauthier, Camille-Sovanneary
    Gaudel, Romaric
    Fromont, Elisa
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [27] MaxGap Bandit: Adaptive Algorithms for Approximate Ranking
    Katariya, Sumeet
    Tripathy, Ardhendu
    Nowak, Robert
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [28] Multilabel Classification with Group Testing and Codes
    Ubaru, Shashanka
    Mazumdar, Arya
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [29] Gated Value Network for Multilabel Classification
    Hou, Yimin
    Wan, Sen
    Bao, Feng
    Ren, Zhiquan
    Dong, Yunfeng
    Dai, Qionghai
    Deng, Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4748 - 4754
  • [30] Dataset Sampler for a Multilabel Classification Task
    Kok, Yong En
    Woodward, Simon
    Ozcan, Ender
    Torres Torres, Mercedes
    MOLECULAR INFORMATICS, 2022, 41 (12)