Dynamic Pricing and Learning with Bayesian Persuasion

被引:0
|
作者
Agrawal, Shipra [1 ]
Feng, Yiding [2 ]
Tang, Wei [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Univ Chicago, Chicago, IL USA
关键词
INFORMATION; DESIGN;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider a novel dynamic pricing and learning setting where in addition to setting prices of products in sequential rounds, the seller also ex-ante commits to 'advertising schemes'. That is, in the beginning of each round the seller can decide what kind of signal they will provide to the buyer about the product's quality upon realization. Using the popular Bayesian persuasion framework to model the effect of these signals on the buyers' valuation and purchase responses, we formulate the problem of finding an optimal design of the advertising scheme along with a pricing scheme that maximizes the seller's expected revenue. Without any apriori knowledge of the buyers' demand function, our goal is to design an online algorithm that can use past purchase responses to adaptively learn the optimal pricing and advertising strategy. We study the regret of the algorithm when compared to the optimal clairvoyant price and advertising scheme. Our main result is a computationally efficient online algorithm that achieves an O(T (2/3)(mlog T)(1/3)) regret bound when the valuation function is linear in the product quality. Here m is the cardinality of the discrete product quality domain and T is the time horizon. This result requires some natural monotonicity and Lipschitz assumptions on the valuation function, but no Lipschitz or smoothness assumption on the buyers' demand function. For constant m, our result matches the regret lower bound for dynamic pricing within logarithmic factors, which is a special case of our problem. We also obtain several improved results for the widely considered special case of additive valuations, including an (O) over tilde( T (2/3) ) regret bound independent of m when m <= T-1/3.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Learning dynamic Bayesian networks
    Ghahramani, Z
    ADAPTIVE PROCESSING OF SEQUENCES AND DATA STRUCTURES, 1998, 1387 : 168 - 197
  • [22] Dynamic Pricing by Multiagent Reinforcement Learning
    Han, Wei
    Liu, Lingbo
    Zheng, Huaili
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 226 - 229
  • [23] Reinforcement Learning for Fair Dynamic Pricing
    Maestre, Roberto
    Duque, Juan
    Rubio, Alberto
    Arevalo, Juan
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2019, 868 : 120 - 135
  • [24] Dynamic pricing in the presence of individual learning
    Weng, Xi
    JOURNAL OF ECONOMIC THEORY, 2015, 155 : 262 - 299
  • [25] Learning and optimizing through dynamic pricing
    Kumar R.
    Li A.
    Wang W.
    Journal of Revenue and Pricing Management, 2018, 17 (2) : 63 - 77
  • [26] Dynamic pricing and inventory control with learning
    Petruzzi, NC
    Dada, M
    NAVAL RESEARCH LOGISTICS, 2002, 49 (03) : 303 - 325
  • [27] Dynamic pricing and learning in electricity markets
    Garcia, A
    Campos-Nañez, E
    Reitzes, J
    OPERATIONS RESEARCH, 2005, 53 (02) : 231 - 241
  • [28] Dynamic Pricing and Learning with Finite Inventories
    den Boer, Arnoud V.
    Zwart, Bert
    OPERATIONS RESEARCH, 2015, 63 (04) : 965 - 978
  • [29] Dynamic Learning and Pricing with Model Misspecification
    Nambiar, Mila
    Simchi-Levi, David
    Wang, He
    MANAGEMENT SCIENCE, 2019, 65 (11) : 4980 - 5000
  • [30] Bayesian estimation of dynamic asset pricing models with informative observations
    Fulop, Andras
    Li, Junye
    JOURNAL OF ECONOMETRICS, 2019, 209 (01) : 114 - 138