Recurrent SubmodularWelfare and Matroid Blocking Semi-Bandits

被引:0
|
作者
Papadigenopoulos, Orestis [1 ]
Caramanis, Constantine [2 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas Austin, Elect & Comp Engn, Austin, TX 78712 USA
关键词
COMPLEXITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A recent line of research focuses on the study of stochastic multi-armed bandits (MAB), in the case where temporal correlations of specific structure are imposed between the player's actions and the reward distributions of the arms. These correlations lead to (sub-)optimal solutions that exhibit interesting dynamical patterns - a phenomenon that yields new challenges both from an algorithmic as well as a learning perspective. In this work, we extend the above direction to a combinatorial semi-bandit setting and study a variant of stochastic MAB, where arms are subject to matroid constraints and each arm becomes unavailable (blocked) for a fixed number of rounds after each play. A natural common generalization of the state-of-the-art for blocking bandits, and that for matroid bandits, only guarantees a 1/2-approximation for general matroids. In this paper we develop the novel technique of correlated (interleaved) scheduling, which allows us to obtain a polynomial-time (1 - (1)/(e))-approximation algorithm (asymptotically and in expectation) for any matroid. Along the way, we discover an interesting connection to a variant of Submodular Welfare Maximization, for which we provide (asymptotically) matching upper and lower approximability bounds. In the case where the mean arm rewards are unknown, our technique naturally decouples the scheduling from the learning problem, and thus allows to control the (1 - (1)/(e))-approximate regret of a UCB-based adaptation of our online algorithm.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Combinatorial Blocking Bandits with Stochastic Delays
    Atsidakou, Alexia
    Papadigenopoulos, Orestis
    Basu, Soumya
    Caramanis, Constantine
    Shakkottai, Sanjay
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [32] Semi-streaming Algorithms for Submodular Matroid Intersection
    Garg, Paritosh
    Jordan, Linus
    Svensson, Ola
    INTEGER PROGRAMMING AND COMBINATORIAL OPTIMIZATION, IPCO 2021, 2021, 12707 : 208 - 222
  • [33] A semi-small decomposition of the Chow ring of a matroid
    Braden, Tom
    Huh, June
    Matherne, Jacob P.
    Proudfoot, Nicholas
    Wang, Botong
    ADVANCES IN MATHEMATICS, 2022, 409
  • [34] Semi-streaming algorithms for submodular matroid intersection
    Garg, Paritosh
    Jordan, Linus
    Svensson, Ola
    MATHEMATICAL PROGRAMMING, 2023, 197 (02) : 967 - 990
  • [35] Semi-streaming algorithms for submodular matroid intersection
    Paritosh Garg
    Linus Jordan
    Ola Svensson
    Mathematical Programming, 2023, 197 : 967 - 990
  • [36] Variational Thompson Sampling for Relational Recurrent Bandits
    Lamprier, Sylvain
    Gisselbrecht, Thibault
    Gallinari, Patrick
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II, 2017, 10535 : 405 - 421
  • [37] Semi-Parametric Sampling for Stochastic Bandits with Many Arms
    Ou, Mingdong
    Li, Nan
    Yang, Cheng
    Zhu, Shenghuo
    Jin, Rong
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7933 - 7940
  • [38] ON THE MAXIMUM POSITIVE SEMI-DEFINITE NULLITY AND THE CYCLE MATROID OF GRAPHS
    Van der Holst, Hein
    ELECTRONIC JOURNAL OF LINEAR ALGEBRA, 2009, 18 : 192 - 201
  • [39] A Semi-streaming Algorithm for Monotone Regularized Submodular Maximization with a Matroid Constraint
    Nong, Qing-Qin
    Wang, Yue
    Gong, Su-Ning
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2024,
  • [40] Online Semi-supervised Learning in Contextual Bandits with Episodic Reward
    Lin, Baihan
    AI 2020: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 12576 : 407 - 419