How Big Should Your Data Really Be? Data-Driven Newsvendor: Learning One Sample at a Time

被引:18
|
作者
Besbas, Omar [1 ]
Mouchtaki, Omar [1 ]
机构
[1] Columbia Univ, Grad Sch Business, New York, NY 10027 USA
关键词
limited data; data-driven decisions; minimax regret; sample average approximation; empirical optimization; finite samples; distributionally robust optimization; INVENTORY CONTROL; APPROXIMATION ALGORITHMS; NONPARAMETRIC ESTIMATOR; CENSORED NEWSVENDOR; OPTIMAL ACQUISITION; QUANTILE; OPTIMIZATION; MANAGEMENT; AMBIGUITY;
D O I
10.1287/mnsc.2023.4725
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We study the classical newsvendor problem in which the decision maker must trade off underage and overage costs. In contrast to the typical setting, we assume that the decision maker does not know the underlying distribution driving uncertainty but has only access to historical data. In turn, the key questions are how to map existing data to a decision and what type of performance to expect as a function of the data size. We analyze the classical setting with access to past samples drawn from the distribution (e.g., past demand), focusing not only on asymptotic performance but also on what we call the transient regime of learning, that is, performance for arbitrary data sizes. We evaluate the performance of any algorithm through its worst-case relative expected regret, compared with an oracle with knowledge of the distribution. We provide the first finite sample exact analysis of the classical sample average approximation (SAA) algorithm for this class of problems across all data sizes. This allows one to uncover novel fundamental insights on the value of data: It reveals that tens of samples are sufficient to perform very efficiently but also that more data can lead to worse out-of-sample performance for SAA. We then focus on the general class of mappings from data to decisions without any restriction on the set of policies and derive an optimal algorithm (in the minimax sense) and characterize its associated performance. This leads to significant improvements for limited data sizes and allows to exactly quantify the value of historical information.
引用
收藏
页码:5848 / 5865
页数:18
相关论文
共 50 条
  • [31] Big data: How do your data grow?
    Lynch, Clifford
    NATURE, 2008, 455 (7209) : 28 - 29
  • [32] Data-driven research: How it can and should influence care policies
    Putrino, David
    INTERNATIONAL JOURNAL OF CARDIOLOGY, 2017, 240 : 90 - 91
  • [33] An end-to-end deep learning model for solving data-driven newsvendor problem with accessibility to textual review data
    Tian, Yu-Xin
    Zhang, Chuan
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2023, 265
  • [34] How to Use the Big Data to the Technology Planning: A Data-Driven Technology Roadmapping Using ARM
    Geum, Y.
    Lee, H.
    Park, Y.
    2012 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2012, : 661 - 665
  • [35] Data mining and visualization of data-driven news in the era of big data
    Qi, Erna
    Yang, Xingrui
    Wang, Zongjun
    Cluster Computing, 2019, 22 : 10333 - 10346
  • [36] The Big Data Newsvendor: Practical Insights from Machine Learning
    Ban, Gah-Yi
    Rudin, Cynthia
    OPERATIONS RESEARCH, 2019, 67 (01) : 90 - 108
  • [37] Big Data-Driven Deep Learning Ensembler for DDoS Attack Detection
    Alshdadi, Abdulrahman A.
    Almazroi, Abdulwahab Ali
    Ayub, Nasir
    Lytras, Miltiadis D.
    Alsolami, Eesa
    Alsubaei, Faisal S.
    FUTURE INTERNET, 2024, 16 (12)
  • [38] The Prediction of Flight Delay: Big Data-driven Machine Learning Approach
    Huo, Jiage
    Keung, K. L.
    Lee, C. K. M.
    Ng, Kam K. H.
    Li, K. C.
    2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, : 190 - 194
  • [39] Data-driven image restoration with option-driven learning for big and small astronomical image data sets
    Jia, Peng
    Ning, Runyu
    Sun, Ruiqi
    Yang, Xiaoshan
    Cai, Dongmei
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2021, 501 (01) : 291 - 301
  • [40] Approach to data-driven learning
    Markov, Z.
    International Workshop on Fundamentals of Artificial Intelligence Research, 1991,