Real-time power optimization based on Q-learning algorithm for direct methanol fuel cell system

被引：1

作者：

Chi, Xuncheng ^{[1
]}

Chen, Fengxiang ^{[1
]}

Zhai, Shuang ^{[2
]}

Hu, Zhe ^{[2
]}

Zhou, Su ^{[3
]}

Wei, Wei ^{[4
]}

机构：

[1] Tongji Univ, Sch Automot Studies, Shanghai, Peoples R China

[2] Shanghai Refire Technol Co Ltd, Shanghai, Peoples R China

[3] Shanghai Zhongqiao Vocat & Tech Univ, Shanghai, Peoples R China

[4] CAS &M Zhangjiagang New Energy Technol Co Ltd, Zhangjiagang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF HYDROGEN ENERGY | 2024年 / 89卷

基金：

中国国家自然科学基金;

关键词：

Direct methanol fuel cell (DMFC) system; Real-time power optimization; Methanol supply control; Reinforcement learning; Q -learning algorithm; MASS-TRANSPORT MODEL; NUMERICAL-MODEL; PERFORMANCE; DMFC;

D O I：

10.1016/j.ijhydene.2024.09.084

中图分类号：

O64 [物理化学（理论化学）、化学物理学];

学科分类号：

070304 ; 081704 ;

摘要：

Efficient real-time power optimization of direct methanol fuel cell (DMFC) system is crucial for enhancing its performance and reliability. The power of DMFC system is mainly affected by stack temperature and circulating methanol concentration. However, the methanol concentration cannot be directly measured using reliable sensors, which poses a challenge for the real-time power optimization. To address this issue, this paper investigates the operating mechanism of DMFC system and establishes a system power model. Based on the established model, reinforcement learning using Q-learning algorithm is proposed to control methanol supply to optimize DMFC system power under varying operating conditions. This algorithm is simple, easy to implement, and does not rely on methanol concentration measurements. To validate the effectiveness of the proposed algorithm, simulation comparisons between the proposed method and the traditional perturbation and observation (P&O) algorithm are implemented under different operating conditions. The results show that proposed power optimization based on Q-learning algorithm improves net power by 1% and eliminates the fluctuation of methanol supply caused by P&O. For practical implementation considerations and real-time requirements of the algorithm, hardware-in-the-loop (HIL) experiments are conducted. The experiment results demonstrate that the proposed methods optimize net power under different operating conditions. Additionally, in terms of model accuracy, the experimental results are well matched with the simulation. Moreover, under varying load condition, compared with P&O, proposed power optimization based on Q-learning algorithm reduces root mean square error (RMSE) from 7.271% to 2.996% and mean absolute error (MAE) from 5.036% to 0.331%.

引用

页码：1241 / 1253

页数：13

共 50 条

[41] An Adaptive Routing Scheme Based on Q-learning and Real-time Traffic Monitoring for Network-on-Chip
Fan, Renshi
Du, Gaoming
Xu, Pengfei
Li, Zhenmin
Song, Yukun
Zhang, Duoli
PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 244 - 248
[42] Learning-Based Modeling and Optimization for Real-Time System Availability
Li, Liying
Zhou, Junlong
Wei, Tongquan
Chen, Mingsong
Hu, Xiaobo Sharon
IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (04) : 581 - 594
[43] Optimization and simulation of distribution system in a supply chain based on Q-learning
Li Suicheng
Lin Jun
Yin Hongying
2006 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1445 - 1449
[44] Combination Optimization Model of Urban Key Intersections Based on Q-Learning Algorithm
Dong, Dan-Ping
Wei, Fu-Lu
Chen, Ming-Tao
Guo, Yong-Qing
Yang, Chang-Hai
Han, Yu-Xin
CICTP 2023: INNOVATION-EMPOWERED TECHNOLOGY FOR SUSTAINABLE, INTELLIGENT, DECARBONIZED, AND CONNECTED TRANSPORTATION, 2023, : 849 - 859
[45] Real-Time Path Planning Through Q-learning's Exploration Strategy Adjustment
Kim, Howon
Lee, WonChang
2021 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2021,
[46] Hybrid genetic algorithm based fuel restricted real power optimization for utility system
Kumarappan, N
Mohan, MR
CEC: 2003 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-4, PROCEEDINGS, 2003, : 1294 - 1301
[47] Ddos attack real-time defense mechanism using deep q-learning network
Feng W.
Wu Y.
International Journal of Performability Engineering, 2020, 16 (09) : 1362 - 1373
[48] Real-time optimization of an experimental solid-oxide fuel-cell system
Ferreira, T. de Avila
Wuillemin, Z.
Marchetti, A. G.
Salzmann, C.
Van Herle, J.
Bonvin, D.
JOURNAL OF POWER SOURCES, 2019, 429 : 168 - 179
[49] Real-time temperature control for direct methanol fuel cell in off-grid renewable energy system with liquid level constraints
Chi, Xuncheng
Chen, Fengxiang
Zhang, Bo
Tong, Guangyao
Pei, Fenglai
Wei, Wei
RENEWABLE ENERGY, 2025, 242
[50] Deep Q-learning recommender algorithm with update policy for a real steam turbine system
Modirrousta, Mohammad Hossein
Shoorehdeli, Mahdi Aliyari
Yari, Mostafa
Ghahremani, Arash
IET COLLABORATIVE INTELLIGENT MANUFACTURING, 2023, 5 (03)

← 1 2 3 4 5 →