A deep Q-learning approach to optimize ordering and dynamic pricing decisions in the presence of strategic customers

被引:6
|
作者
Alamdar, Parisa Famil [1 ]
Seifi, Abbas [1 ]
机构
[1] Amirkabir Univ Technol, Tehran Polytech, Dept Ind Engn & Management Syst, Tehran, Iran
关键词
Deep reinforcement learning; Dynamic pricing; Strategic customer; Neural network demand model; Multiple substitute products; INVENTORY; MODELS; CHOICE;
D O I
10.1016/j.ijpe.2024.109154
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this paper, we present an optimization method to analyze the simultaneous decisions on dynamic pricing and ordering quantities for seasonal products, by a retailer in monopolistic condition. Customers are assumed to be strategic and may postpone their purchase to get a lower price in future. The problem has been investigated in the context of multiple substitute products. We have developed a model based on deep neural networks to estimate customers' demand. The problem is complex and cannot be solved using classical optimization methods. Therefore, we have developed a reinforcement learning algorithm called deep Q -learning algorithm (DQL) to solve the problem. The proposed algorithm is a combination of a Q -learning algorithm and two deep neural networks for the primary and discount sales periods, which uses the neural network to estimate the Q -values in a large space of states and actions. The performances of the demand model and the proposed optimization algorithm have been tested using a real -world dataset taken from the clothing industry. The results of our experiments demonstrate that the proposed demand model performs better than a fully connected neural networkbased model and a latent class model tested in this paper. Furthermore, the performance of the DQL algorithm is significantly superior to those of two simulated annealing and genetic algorithms. In addition, the results of a comparison between the DQL algorithm and another reinforcement learning algorithm called State -ActionReward -State -Action (SARSA) indicate that the proposed algorithm results in higher revenues and takes less time to converge. Consequently, the proposed algorithm has a high potential for solving such a large scale integrated pricing and ordering optimization problem.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Multivariate Geostatistical Simulation and Deep Q-Learning to Optimize Mining Decisions
    Sebastian Avalos
    Julian M. Ortiz
    Mathematical Geosciences, 2023, 55 : 673 - 692
  • [2] Multivariate Geostatistical Simulation and Deep Q-Learning to Optimize Mining Decisions
    Avalos, Sebastian
    Ortiz, Julian M.
    MATHEMATICAL GEOSCIENCES, 2023, 55 (05) : 673 - 692
  • [3] Joint Strategy of Dynamic Ordering and Pricing for Competing Perishables with Q-Learning Algorithm
    Zheng, Jiangbo
    Gan, Yanhong
    Liang, Ying
    Jiang, Qingqing
    Chang, Jiatai
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [4] Dynamic Pricing Decision for Perishable Goods: A Q-learning Approach
    Cheng, Yan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11965 - 11969
  • [5] Bayesian dynamic learning and pricing with strategic customers
    Chen, Xi
    Gao, Jianjun
    Ge, Dongdong
    Wang, Zizhuo
    PRODUCTION AND OPERATIONS MANAGEMENT, 2022, 31 (08) : 3125 - 3142
  • [6] Pricing Decisions for a Sustainable Supply Chain in the Presence of Potential Strategic Customers
    Liu, Xinmin
    Lin, Kangkang
    Wang, Lei
    Ding, Lili
    SUSTAINABILITY, 2020, 12 (04)
  • [7] A Deep Q-Learning Approach for Dynamic Management of Heterogeneous Processors
    Gupta, Ujjwal
    Mandal, Sumit K.
    Mao, Manqing
    Chakrabarti, Chaitali
    Ogras, Umit Y.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (01) : 14 - 17
  • [8] Robust Dynamic Pricing with Demand Learning in the Presence of Outlier Customers
    Chen, Xi
    Wang, Yining
    OPERATIONS RESEARCH, 2023, 71 (04) : 1362 - 1386
  • [9] Deep Q-learning: A robust control approach
    Varga, Balazs
    Kulcsar, Balazs
    Chehreghani, Morteza Haghir
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (01) : 526 - 544
  • [10] Dynamic Pricing in the Presence of Social Learning and Strategic Consumers
    Papanastasiou, Yiangos
    Savva, Nicos
    MANAGEMENT SCIENCE, 2017, 63 (04) : 919 - 939