Joint Strategy of Dynamic Ordering and Pricing for Competing Perishables with Q-Learning Algorithm

被引:2
|
作者
Zheng, Jiangbo [1 ]
Gan, Yanhong [2 ]
Liang, Ying [3 ]
Jiang, Qingqing [1 ]
Chang, Jiatai [1 ]
机构
[1] Jinan Univ, Sch Management, Guangzhou 510632, Guangdong, Peoples R China
[2] South China Univ Technol, Sch Business Adm, Guangzhou 510640, Peoples R China
[3] South China Normal Univ, Sch Econ & Management, Guangzhou 510006, Guangdong, Peoples R China
关键词
INVENTORY CONTROL; POLICIES; REPLENISHMENT; MANAGEMENT;
D O I
10.1155/2021/6643195
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We use Machine Learning (ML) to study firms' joint pricing and ordering decisions for perishables in a dynamic loop. The research assumption is as follows: at the beginning of each period, the retailer prices both the new and old products and determines how many new products to order, while at the end of each period, the retailer decides how much remaining inventory should be carried over to the next period. The objective is to determine a joint pricing, ordering, and disposal strategy to maximize the total expected discounted profit. We establish a decision model based on Markov processes and use the Q-learning algorithm to obtain a near-optimal policy. From numerical analysis, we find that (i) the optimal number of old products carried over to the next period depends on the upper quantitative bound for old inventory; (ii) the optimal prices for new products are positively related to potential demand but negatively related to the decay rate, while the optimal prices for old products have a positive relationship with both; and (iii) ordering decisions are unrelated to the quantity of old products. When the decay rate is low or the variable ordering cost is high, the optimal orders exhibit a trapezoidal decline as the quantity of new products increases.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] A deep Q-learning approach to optimize ordering and dynamic pricing decisions in the presence of strategic customers
    Alamdar, Parisa Famil
    Seifi, Abbas
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2024, 269
  • [2] Dynamic Pricing Decision for Perishable Goods: A Q-learning Approach
    Cheng, Yan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11965 - 11969
  • [3] Backward Q-learning: The combination of Sarsa algorithm and Q-learning
    Wang, Yin-Hao
    Li, Tzuu-Hseng S.
    Lin, Chih-Jui
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (09) : 2184 - 2193
  • [4] Dynamic feature selection algorithm based on Q-learning mechanism
    Ruohao Xu
    Mengmeng Li
    Zhongliang Yang
    Lifang Yang
    Kangjia Qiao
    Zhigang Shang
    Applied Intelligence, 2021, 51 : 7233 - 7244
  • [5] A Dynamic Planning Algorithm based on Q-Learning Routing in SDON
    Shang, Jingkun
    Li, Hui
    Man, Xiangkun
    Wu, Fang
    Zhao, Jia Wei
    Ma, Xiaomei
    2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,
  • [6] A New Algorithm to Track Dynamic Goal Position in Q-learning
    Mitra, Soumishila
    Banerjee, Dhrubojyoti
    Konar, Amit
    Janarthanan, R.
    2012 12TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2012, : 69 - 74
  • [7] Dynamic feature selection algorithm based on Q-learning mechanism
    Xu, Ruohao
    Li, Mengmeng
    Yang, Zhongliang
    Yang, Lifang
    Qiao, Kangjia
    Shang, Zhigang
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7233 - 7244
  • [8] Advanced Option Pricing and Hedging with Q-Learning: Performance Evaluation of the ALBS Algorithm
    Stoiljkovic, Zoran
    JOURNAL OF DERIVATIVES, 2025, 32 (03): : 48 - 79
  • [9] Study of Cooperation Strategy of Robot Based on Parallel Q-Learning Algorithm
    Wang, Shuda
    Si, Feng
    Yang, Jing
    Wang, Shuoning
    Yang, Jun
    INTELLIGENT ROBOTICS AND APPLICATIONS, PT I, PROCEEDINGS, 2008, 5314 : 633 - 642
  • [10] An improved Q-learning algorithm based on exploration region expansion strategy
    Gao, Qingji
    Hong, Bingong
    He, Zhendong
    Liu, Jie
    Niu, Guochen
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4167 - +