How Big Should Your Data Really Be? Data-Driven Newsvendor: Learning One Sample at a Time

被引:18
|
作者
Besbas, Omar [1 ]
Mouchtaki, Omar [1 ]
机构
[1] Columbia Univ, Grad Sch Business, New York, NY 10027 USA
关键词
limited data; data-driven decisions; minimax regret; sample average approximation; empirical optimization; finite samples; distributionally robust optimization; INVENTORY CONTROL; APPROXIMATION ALGORITHMS; NONPARAMETRIC ESTIMATOR; CENSORED NEWSVENDOR; OPTIMAL ACQUISITION; QUANTILE; OPTIMIZATION; MANAGEMENT; AMBIGUITY;
D O I
10.1287/mnsc.2023.4725
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We study the classical newsvendor problem in which the decision maker must trade off underage and overage costs. In contrast to the typical setting, we assume that the decision maker does not know the underlying distribution driving uncertainty but has only access to historical data. In turn, the key questions are how to map existing data to a decision and what type of performance to expect as a function of the data size. We analyze the classical setting with access to past samples drawn from the distribution (e.g., past demand), focusing not only on asymptotic performance but also on what we call the transient regime of learning, that is, performance for arbitrary data sizes. We evaluate the performance of any algorithm through its worst-case relative expected regret, compared with an oracle with knowledge of the distribution. We provide the first finite sample exact analysis of the classical sample average approximation (SAA) algorithm for this class of problems across all data sizes. This allows one to uncover novel fundamental insights on the value of data: It reveals that tens of samples are sufficient to perform very efficiently but also that more data can lead to worse out-of-sample performance for SAA. We then focus on the general class of mappings from data to decisions without any restriction on the set of policies and derive an optimal algorithm (in the minimax sense) and characterize its associated performance. This leads to significant improvements for limited data sizes and allows to exactly quantify the value of historical information.
引用
收藏
页码:5848 / 5865
页数:18
相关论文
共 50 条
  • [41] AN APPROACH TO DATA-DRIVEN LEARNING
    MARKOV, Z
    LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1991, 535 : 127 - 140
  • [42] Metacognition and Data-Driven Learning
    Sato, Masatoshi
    TESOL QUARTERLY, 2024, 58 (03) : 1246 - 1255
  • [43] Data-Driven Personalized Learning
    Guo, Xue
    He, Xiangchun
    Pei, Zhuoyun
    PROCEEDINGS OF 2023 6TH INTERNATIONAL CONFERENCE ON EDUCATIONAL TECHNOLOGY MANAGEMENT, ICETM 2023, 2023, : 49 - 54
  • [44] Big Data Analysis with Momentum Strategy on Data-driven Trading
    Gao, Xiangyu
    Qiu, Meikang
    He, Zhen
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1328 - 1335
  • [45] Data-Driven Artificial Intelligence for Calibration of Hyperspectral Big Data
    Sagan, Vasit
    Maimaitijiang, Maitiniyazi
    Paheding, Sidike
    Bhadra, Sourav
    Gosselin, Nichole
    Burnette, Max
    Demieville, Jeffrey
    Hartling, Sean
    LeBauer, David
    Newcomb, Maria
    Pauli, Duke
    Peterson, Kyle T.
    Shakoor, Nadia
    Stylianou, Abby
    Zender, Charles S.
    Mockler, Todd C.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [46] Holistic data-driven requirements elicitation in the big data era
    Henriksson, Aron
    Zdravkovic, Jelena
    SOFTWARE AND SYSTEMS MODELING, 2022, 21 (04): : 1389 - 1410
  • [47] Holistic data-driven requirements elicitation in the big data era
    Aron Henriksson
    Jelena Zdravkovic
    Software and Systems Modeling, 2022, 21 : 1389 - 1410
  • [48] Big Data Analytics in Education: A Data-Driven Literature Review
    Shabihi, Negar
    Kim, Mi Song
    IEEE 21ST INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2021), 2021, : 154 - 156
  • [49] A Data-Driven Framework for Business Analytics in the Context of Big Data
    Lu, Jing
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2018, 2018, 909 : 339 - 351
  • [50] A Data-Driven Sequential Localization Framework for Big Telco Data
    Zhu, Fangzhou
    Yuan, Mingxuan
    Xie, Xike
    Wang, Ting
    Zhao, Shenglin
    Rao, Weixiong
    Zeng, Jia
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (08) : 3007 - 3019