Dynamic Pricing and Learning with Bayesian Persuasion

被引:0
|
作者
Agrawal, Shipra [1 ]
Feng, Yiding [2 ]
Tang, Wei [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ Chicago, Chicago, IL USA
关键词
INFORMATION; DESIGN;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider a novel dynamic pricing and learning setting where in addition to setting prices of products in sequential rounds, the seller also ex-ante commits to 'advertising schemes'. That is, in the beginning of each round the seller can decide what kind of signal they will provide to the buyer about the product's quality upon realization. Using the popular Bayesian persuasion framework to model the effect of these signals on the buyers' valuation and purchase responses, we formulate the problem of finding an optimal design of the advertising scheme along with a pricing scheme that maximizes the seller's expected revenue. Without any apriori knowledge of the buyers' demand function, our goal is to design an online algorithm that can use past purchase responses to adaptively learn the optimal pricing and advertising strategy. We study the regret of the algorithm when compared to the optimal clairvoyant price and advertising scheme. Our main result is a computationally efficient online algorithm that achieves an O(T (2/3)(mlog T)(1/3)) regret bound when the valuation function is linear in the product quality. Here m is the cardinality of the discrete product quality domain and T is the time horizon. This result requires some natural monotonicity and Lipschitz assumptions on the valuation function, but no Lipschitz or smoothness assumption on the buyers' demand function. For constant m, our result matches the regret lower bound for dynamic pricing within logarithmic factors, which is a special case of our problem. We also obtain several improved results for the widely considered special case of additive valuations, including an (O) over tilde( T (2/3) ) regret bound independent of m when m <= T-1/3.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] SIGNALLING BY BAYESIAN PERSUASION AND PRICING STRATEGY
    Chen, Yanlin
    Zhang, Jun
    ECONOMIC JOURNAL, 2020, 130 (628): : 976 - 1007
  • [2] Bayesian dynamic learning and pricing with strategic customers
    Chen, Xi
    Gao, Jianjun
    Ge, Dongdong
    Wang, Zizhuo
    PRODUCTION AND OPERATIONS MANAGEMENT, 2022, 31 (08) : 3125 - 3142
  • [3] Bayesian persuasion with optimal learning
    Liao, Xiaoye
    JOURNAL OF MATHEMATICAL ECONOMICS, 2021, 97
  • [4] Dynamic pricing and inventory management with demand learning: A bayesian approach
    Liu, Jue
    Pang, Zhan
    Qi, Linggang
    COMPUTERS & OPERATIONS RESEARCH, 2020, 124
  • [5] Dynamic pricing with Bayesian demand learning and reference price effect
    Cao, Ping
    Zhao, Nenggui
    Wu, Jie
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 279 (02) : 540 - 556
  • [6] Bayesian dithering for learning: Asymptotically optimal policies in dynamic pricing
    Huh, Woonghee Tim
    Kim, Michael Jong
    Lin, Meichun
    PRODUCTION AND OPERATIONS MANAGEMENT, 2022, 31 (09) : 3576 - 3593
  • [7] Bayesian Dynamic Pricing Policies: Learning and Earning Under a Binary Prior Distribution
    Harrison, J. Michael
    Keskin, N. Bora
    Zeevi, Assaf
    MANAGEMENT SCIENCE, 2012, 58 (03) : 570 - 586
  • [8] Keeping the Listener Engaged: A Dynamic Model of Bayesian Persuasion
    Che, Yeon-Koo
    Kim, Kyungmin
    Mierendorff, Konrad
    JOURNAL OF POLITICAL ECONOMY, 2023, : 1797 - 1844
  • [9] Sellers' Pricing By Bayesian Reinforcement Learning
    Han, Wei
    2009 INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY, VOLS 1 AND 2, 2009, : 1276 - 1280
  • [10] Bayesian Persuasion
    Kamenica, Emir
    Gentzkow, Matthew
    AMERICAN ECONOMIC REVIEW, 2011, 101 (06): : 2590 - 2615