A deep Q-learning approach to optimize ordering and dynamic pricing decisions in the presence of strategic customers

被引:6
|
作者
Alamdar, Parisa Famil [1 ]
Seifi, Abbas [1 ]
机构
[1] Amirkabir Univ Technol, Tehran Polytech, Dept Ind Engn & Management Syst, Tehran, Iran
关键词
Deep reinforcement learning; Dynamic pricing; Strategic customer; Neural network demand model; Multiple substitute products; INVENTORY; MODELS; CHOICE;
D O I
10.1016/j.ijpe.2024.109154
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this paper, we present an optimization method to analyze the simultaneous decisions on dynamic pricing and ordering quantities for seasonal products, by a retailer in monopolistic condition. Customers are assumed to be strategic and may postpone their purchase to get a lower price in future. The problem has been investigated in the context of multiple substitute products. We have developed a model based on deep neural networks to estimate customers' demand. The problem is complex and cannot be solved using classical optimization methods. Therefore, we have developed a reinforcement learning algorithm called deep Q -learning algorithm (DQL) to solve the problem. The proposed algorithm is a combination of a Q -learning algorithm and two deep neural networks for the primary and discount sales periods, which uses the neural network to estimate the Q -values in a large space of states and actions. The performances of the demand model and the proposed optimization algorithm have been tested using a real -world dataset taken from the clothing industry. The results of our experiments demonstrate that the proposed demand model performs better than a fully connected neural networkbased model and a latent class model tested in this paper. Furthermore, the performance of the DQL algorithm is significantly superior to those of two simulated annealing and genetic algorithms. In addition, the results of a comparison between the DQL algorithm and another reinforcement learning algorithm called State -ActionReward -State -Action (SARSA) indicate that the proposed algorithm results in higher revenues and takes less time to converge. Consequently, the proposed algorithm has a high potential for solving such a large scale integrated pricing and ordering optimization problem.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A scalable Deep Q-Learning approach for hot stamping process under dynamic control environment
    Nievas, Nuria
    Pages-Bernaus, Adela
    Abio, Albert
    Lange, Danillo
    Garcia-Llamas, Eduard
    Grane, Marc
    Pujante, Jaume
    Echeverria, Lluis
    Bonada, Francesc
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024,
  • [22] A Deep Q-Learning based approach applied to the Snake game
    Sebastianelli, Alessandro
    Tipaldi, Massimo
    Ullo, Silvia Liberata
    Glielmo, Luigi
    2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 348 - 353
  • [23] Deep Q-learning Approach for Congestion Problem In Smart Cities
    Faqir, Nada
    En-Nahnahi, Noureddine
    Boumhidi, Jaouad
    2020 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS), 2020,
  • [24] New Dynamic Switch Migration Technique Based on Deep Q-learning
    Yao, Lin
    Li, Jia
    Wu, Guowei
    Wu, Bin
    2021 IEEE 19TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2021), 2021, : 125 - 130
  • [25] Deep Coalitional Q-Learning for Dynamic Coalition Formation in Edge Computing
    Ding, Shiyao
    Lin, Donghui
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (05): : 864 - 872
  • [26] Deep Q-Learning Based Reinforcement Learning Approach for Network Intrusion Detection
    Alavizadeh, Hooman
    Alavizadeh, Hootan
    Jang-Jaccard, Julian
    COMPUTERS, 2022, 11 (03)
  • [27] Dynamic pricing for the successive-generation products in the presence of strategic customers and limited trade-in duration
    Yuan, Xigang
    Ma, Zujun
    Zhang, Xiaoqing
    KYBERNETES, 2023, 52 (11) : 5329 - 5352
  • [28] Deep Q-learning to globally optimize a k-D parameter search for medical imaging
    Zhang, Hongmei
    Liang, Songshi
    Matkovic, Luke A.
    Momin, Shadab
    Wang, Kai
    Yang, Xiaofeng
    Insana, Michael F.
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2023, 13 (08) : 4879 - 4896
  • [29] Automate Page Layout Optimization: An Offline Deep Q-Learning Approach
    Qin, Zhou
    Liu, Wenyang
    PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 522 - 524
  • [30] Multiple Correlated Jammers Suppression: A Deep Dueling Q-Learning Approach
    Linh Manh Hoang
    Diep Nguyen
    Zhang, J. Andrew
    Dinh Thai Hoang
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 998 - 1003