Joint Strategy of Dynamic Ordering and Pricing for Competing Perishables with Q-Learning Algorithm

被引:2
|
作者
Zheng, Jiangbo [1 ]
Gan, Yanhong [2 ]
Liang, Ying [3 ]
Jiang, Qingqing [1 ]
Chang, Jiatai [1 ]
机构
[1] Jinan Univ, Sch Management, Guangzhou 510632, Guangdong, Peoples R China
[2] South China Univ Technol, Sch Business Adm, Guangzhou 510640, Peoples R China
[3] South China Normal Univ, Sch Econ & Management, Guangzhou 510006, Guangdong, Peoples R China
关键词
INVENTORY CONTROL; POLICIES; REPLENISHMENT; MANAGEMENT;
D O I
10.1155/2021/6643195
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We use Machine Learning (ML) to study firms' joint pricing and ordering decisions for perishables in a dynamic loop. The research assumption is as follows: at the beginning of each period, the retailer prices both the new and old products and determines how many new products to order, while at the end of each period, the retailer decides how much remaining inventory should be carried over to the next period. The objective is to determine a joint pricing, ordering, and disposal strategy to maximize the total expected discounted profit. We establish a decision model based on Markov processes and use the Q-learning algorithm to obtain a near-optimal policy. From numerical analysis, we find that (i) the optimal number of old products carried over to the next period depends on the upper quantitative bound for old inventory; (ii) the optimal prices for new products are positively related to potential demand but negatively related to the decay rate, while the optimal prices for old products have a positive relationship with both; and (iii) ordering decisions are unrelated to the quantity of old products. When the decay rate is low or the variable ordering cost is high, the optimal orders exhibit a trapezoidal decline as the quantity of new products increases.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] PENALIZED Q-LEARNING FOR DYNAMIC TREATMENT REGIMENS
    Song, Rui
    Wang, Weiwei
    Zeng, Donglin
    Kosorok, Michael R.
    STATISTICA SINICA, 2015, 25 (03) : 901 - 920
  • [42] Adaptive Learning Recommendation Strategy Based on Deep Q-learning
    Tan, Chunxi
    Han, Ruijian
    Ye, Rougang
    Chen, Kani
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2020, 44 (04) : 251 - 266
  • [43] Optimization algorithm for dynamic spectrum access based on Q-learning in cognitive radio networks
    Huang, Ying
    Yan, Dingyu
    Li, Nan
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2015, 42 (06): : 179 - 183
  • [44] Dynamic obstacle avoidance based on multi-sensor fusion and Q-learning algorithm
    Zhang, Yi
    Wei, Xin
    Zhou, Xiangyu
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1569 - 1573
  • [45] Dynamic Switch Migration Algorithm with Q-learning towards Scalable SDN Control Plane
    Min, Zhu
    Hua, Qu
    Zhao Jihong
    2017 9TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2017,
  • [46] A novel dynamic integration approach for multiple load forecasts based on Q-learning algorithm
    Ma, Minhua
    Jin, Bingjie
    Luo, Shuxin
    Guo, Shaoqing
    Huang, Hongwei
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2020, 30 (07):
  • [47] A Multiagent Dynamic Assessment Approach for Water Quality Based on Improved Q-Learning Algorithm
    Ni, Jianjun
    Ren, Li
    Liu, Minghua
    Zhu, Daqi
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [48] Real Time Demand Learning-Based Q-learning Approach for Dynamic Pricing in E-retailing Setting
    Cheng, Yan
    IEEC 2009: FIRST INTERNATIONAL SYMPOSIUM ON INFORMATION ENGINEERING AND ELECTRONIC COMMERCE, PROCEEDINGS, 2009, : 594 - 598
  • [49] An Efficient Hardware Implementation of Reinforcement Learning: The Q-Learning Algorithm
    Spano, Sergio
    Cardarilli, Gian Carlo
    Di Nunzio, Luca
    Fazzolari, Rocco
    Giardino, Daniele
    Matta, Marco
    Nannarelli, Alberto
    Re, Marco
    IEEE ACCESS, 2019, 7 : 186340 - 186351
  • [50] A Multi-Step Joint Q-learning Cooperative Algorithm for Regional Interconnected Power Systems
    Xiong, Li
    Li, Ling
    Liu, Wei
    Liang, Zhencheng
    Ling, Wuneng
    Zhang, Ye
    2024 IEEE 2ND INTERNATIONAL CONFERENCE ON POWER SCIENCE AND TECHNOLOGY, ICPST 2024, 2024, : 2250 - 2255