How Big Should Your Data Really Be? Data-Driven Newsvendor: Learning One Sample at a Time

Times cited: 18
Authors
Besbes, Omar [1]
Mouchtaki, Omar [1]
Affiliations
[1] Columbia Univ, Grad Sch Business, New York, NY 10027 USA
Keywords
limited data; data-driven decisions; minimax regret; sample average approximation; empirical optimization; finite samples; distributionally robust optimization; INVENTORY CONTROL; APPROXIMATION ALGORITHMS; NONPARAMETRIC ESTIMATOR; CENSORED NEWSVENDOR; OPTIMAL ACQUISITION; QUANTILE; OPTIMIZATION; MANAGEMENT; AMBIGUITY;
DOI
10.1287/mnsc.2023.4725
Chinese Library Classification (CLC) number
C93 [Management Science];
Discipline classification codes
12 ; 1201 ; 1202 ; 120202 ;
Abstract
We study the classical newsvendor problem in which the decision maker must trade off underage and overage costs. In contrast to the typical setting, we assume that the decision maker does not know the underlying distribution driving uncertainty but has access only to historical data. In turn, the key questions are how to map existing data to a decision and what type of performance to expect as a function of the data size. We analyze the classical setting with access to past samples drawn from the distribution (e.g., past demand), focusing not only on asymptotic performance but also on what we call the transient regime of learning, that is, performance for arbitrary data sizes. We evaluate the performance of any algorithm through its worst-case relative expected regret, compared with an oracle with knowledge of the distribution. We provide the first exact finite-sample analysis of the classical sample average approximation (SAA) algorithm for this class of problems across all data sizes. This uncovers novel fundamental insights on the value of data: it reveals that tens of samples are sufficient to perform very efficiently but also that more data can lead to worse out-of-sample performance for SAA. We then focus on the general class of mappings from data to decisions without any restriction on the set of policies, derive an optimal algorithm (in the minimax sense), and characterize its associated performance. This leads to significant improvements for limited data sizes and allows one to quantify exactly the value of historical information.
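As an illustration of the SAA approach named in the abstract, here is a minimal Python sketch. It is not taken from the paper: the function names (`newsvendor_cost`, `average_cost`, `saa_order`), the cost parameters, and the demand samples are all hypothetical. It relies only on the standard fact that minimizing the average newsvendor cost over past samples yields an empirical quantile at the critical ratio underage / (underage + overage).

```python
import math


def newsvendor_cost(order, demand, underage, overage):
    """Cost of ordering `order` units when realized demand is `demand`."""
    return underage * max(demand - order, 0) + overage * max(order - demand, 0)


def average_cost(order, samples, underage, overage):
    """Average cost of `order` over the historical demand samples."""
    total = sum(newsvendor_cost(order, d, underage, overage) for d in samples)
    return total / len(samples)


def saa_order(samples, underage, overage):
    """SAA decision: smallest sample whose empirical CDF reaches the critical ratio.

    Assumes a non-empty sample list and strictly positive costs.
    """
    ordered = sorted(samples)
    n = len(ordered)
    # Smallest rank k with k / n >= underage / (underage + overage).
    k = max(1, math.ceil(n * underage / (underage + overage)))
    return ordered[k - 1]


if __name__ == "__main__":
    # Hypothetical past demand and costs, chosen only for illustration.
    past_demand = [12, 7, 15, 9, 11, 8, 14, 10]
    print(saa_order(past_demand, underage=3, overage=1))
```

With these hypothetical numbers the critical ratio is 0.75, so the sketch returns the sixth-smallest of the eight samples. The abstract's minimax-optimal policy is a different mapping from data to decisions and is not reproduced here.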
Pages: 5848-5865
Page count: 18