Joint Strategy of Dynamic Ordering and Pricing for Competing Perishables with Q-Learning Algorithm

被引：2

作者：

Zheng, Jiangbo ^{[1
]}

Gan, Yanhong ^{[2
]}

Liang, Ying ^{[3
]}

Jiang, Qingqing ^{[1
]}

Chang, Jiatai ^{[1
]}

机构：

[1] Jinan Univ, Sch Management, Guangzhou 510632, Guangdong, Peoples R China

[2] South China Univ Technol, Sch Business Adm, Guangzhou 510640, Peoples R China

[3] South China Normal Univ, Sch Econ & Management, Guangzhou 510006, Guangdong, Peoples R China

来源：

WIRELESS COMMUNICATIONS & MOBILE COMPUTING | 2021年 / 2021卷

关键词：

INVENTORY CONTROL; POLICIES; REPLENISHMENT; MANAGEMENT;

D O I：

10.1155/2021/6643195

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We use Machine Learning (ML) to study firms' joint pricing and ordering decisions for perishables in a dynamic loop. The research assumption is as follows: at the beginning of each period, the retailer prices both the new and old products and determines how many new products to order, while at the end of each period, the retailer decides how much remaining inventory should be carried over to the next period. The objective is to determine a joint pricing, ordering, and disposal strategy to maximize the total expected discounted profit. We establish a decision model based on Markov processes and use the Q-learning algorithm to obtain a near-optimal policy. From numerical analysis, we find that (i) the optimal number of old products carried over to the next period depends on the upper quantitative bound for old inventory; (ii) the optimal prices for new products are positively related to potential demand but negatively related to the decay rate, while the optimal prices for old products have a positive relationship with both; and (iii) ordering decisions are unrelated to the quantity of old products. When the decay rate is low or the variable ordering cost is high, the optimal orders exhibit a trapezoidal decline as the quantity of new products increases.

引用

页数：19

共 50 条

[41] PENALIZED Q-LEARNING FOR DYNAMIC TREATMENT REGIMENS
Song, Rui
Wang, Weiwei
Zeng, Donglin
Kosorok, Michael R.
STATISTICA SINICA, 2015, 25 (03) : 901 - 920
[42] Adaptive Learning Recommendation Strategy Based on Deep Q-learning
Tan, Chunxi
Han, Ruijian
Ye, Rougang
Chen, Kani
APPLIED PSYCHOLOGICAL MEASUREMENT, 2020, 44 (04) : 251 - 266
[43] Optimization algorithm for dynamic spectrum access based on Q-learning in cognitive radio networks
Huang, Ying
Yan, Dingyu
Li, Nan
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2015, 42 (06): : 179 - 183
[44] Dynamic obstacle avoidance based on multi-sensor fusion and Q-learning algorithm
Zhang, Yi
Wei, Xin
Zhou, Xiangyu
PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1569 - 1573
[45] Dynamic Switch Migration Algorithm with Q-learning towards Scalable SDN Control Plane
Min, Zhu
Hua, Qu
Zhao Jihong
2017 9TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2017,
[46] A novel dynamic integration approach for multiple load forecasts based on Q-learning algorithm
Ma, Minhua
Jin, Bingjie
Luo, Shuxin
Guo, Shaoqing
Huang, Hongwei
INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2020, 30 (07):
[47] A Multiagent Dynamic Assessment Approach for Water Quality Based on Improved Q-Learning Algorithm
Ni, Jianjun
Ren, Li
Liu, Minghua
Zhu, Daqi
MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
[48] Real Time Demand Learning-Based Q-learning Approach for Dynamic Pricing in E-retailing Setting
Cheng, Yan
IEEC 2009: FIRST INTERNATIONAL SYMPOSIUM ON INFORMATION ENGINEERING AND ELECTRONIC COMMERCE, PROCEEDINGS, 2009, : 594 - 598
[49] An Efficient Hardware Implementation of Reinforcement Learning: The Q-Learning Algorithm
Spano, Sergio
Cardarilli, Gian Carlo
Di Nunzio, Luca
Fazzolari, Rocco
Giardino, Daniele
Matta, Marco
Nannarelli, Alberto
Re, Marco
IEEE ACCESS, 2019, 7 : 186340 - 186351
[50] A Multi-Step Joint Q-learning Cooperative Algorithm for Regional Interconnected Power Systems
Xiong, Li
Li, Ling
Liu, Wei
Liang, Zhencheng
Ling, Wuneng
Zhang, Ye
2024 IEEE 2ND INTERNATIONAL CONFERENCE ON POWER SCIENCE AND TECHNOLOGY, ICPST 2024, 2024, : 2250 - 2255

← 1 2 3 4 5 →