Model-Based Reinforcement Learning for Cavity Filter Tuning

Cited: 0
Authors
Nimara, Doumitrou Daniil [1 ]
Malek-Mohammadi, Mohammadreza [2 ]
Wei, Jieqiang [1 ]
Huang, Vincent [1 ]
Ogren, Petter [3 ]
Affiliations
[1] Ericsson GAIA, Stockholm, Sweden
[2] Qualcomm, San Diego, CA USA
[3] KTH, Div Robot Percept & Learning, Stockholm, Sweden
Source
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023
Keywords
Reinforcement Learning; Model Based Reinforcement Learning; Telecommunication;
DOI
Not available
CLC number
TP [automation and computer technology];
Discipline code
0812;
Abstract
The ongoing development of telecommunication systems such as 5G has led to an increased demand for well-calibrated base transceiver station (BTS) components. A pivotal component of every BTS is the cavity filter, which provides a sharp frequency characteristic to select a particular band of interest and reject the rest. Unfortunately, these sharp characteristics, combined with manufacturing tolerances, make the filters difficult to mass-produce and often necessitate costly manual post-production fine tuning. To address this, numerous approaches have been proposed to automate the tuning process. One particularly promising approach, which has emerged in the past few years, is model-free reinforcement learning (MFRL); however, MFRL agents are not sample efficient. This poses a serious bottleneck, as utilising complex simulators or training with real filters is prohibitively time-consuming. This work advocates for the use of model-based reinforcement learning (MBRL) and shows how it can significantly decrease sample complexity while maintaining a similar success rate. More specifically, we propose an improvement over a state-of-the-art (SoTA) MBRL algorithm, namely the Dreamer algorithm. This improvement can serve as a template for other similar, high-dimensional non-image problems. We carry out experiments on two complex filter types and show that our modification to the Dreamer architecture reduces sample complexity by factors of 4 and 10, respectively. Our findings pioneer the use of MBRL in this setting and pave the way for utilising more precise and accurate simulators, which was previously prohibitively time-consuming.
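To make the MBRL idea concrete, the sketch below illustrates the generic loop the abstract alludes to: collect a small number of real transitions, fit a world model to them, and then select actions by planning inside the learned model rather than by further real-environment trial and error. This is not the paper's Dreamer-based method; it is a minimal illustration using a hypothetical toy stand-in for a filter simulator (a linear system whose state plays the role of a detuning vector), a least-squares linear world model, and random-shooting planning.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy stand-in for a filter simulator: the state is a
# "detuning" vector, each action nudges it, and reward penalises detuning.
def env_step(s, a):
    s_next = 0.9 * s + a + 0.01 * rng.normal(size=s.shape)
    return s_next, -np.abs(s_next).sum()

# 1) Collect a small batch of real transitions with random actions.
data, s = [], rng.normal(size=2)
for _ in range(200):
    a = rng.uniform(-0.5, 0.5, size=2)
    s_next, _ = env_step(s, a)
    data.append((s, a, s_next))
    s = s_next

# 2) Fit a linear world model s' ~ [s, a] @ W by least squares.
X = np.array([np.concatenate([si, ai]) for si, ai, _ in data])
Y = np.array([sn for _, _, sn in data])
W, *_ = np.linalg.lstsq(X, Y, rcond=None)

def model_step(s, a):
    return np.concatenate([s, a]) @ W

# 3) Plan "in imagination": random shooting over model rollouts,
#    so no additional real-environment samples are consumed.
def plan(s, horizon=5, candidates=256):
    best_a, best_ret = None, -np.inf
    for _ in range(candidates):
        seq = rng.uniform(-0.5, 0.5, size=(horizon, 2))
        sim, ret = s, 0.0
        for a in seq:
            sim = model_step(sim, a)
            ret -= np.abs(sim).sum()
        if ret > best_ret:
            best_ret, best_a = ret, seq[0]
    return best_a

# 4) Act in the real environment using the model-based planner.
s = np.array([1.0, -1.0])
for _ in range(20):
    s, r = env_step(s, plan(s))
final_error = np.abs(s).sum()
```

The sample-efficiency argument lives in step 3: the planner evaluates hundreds of candidate action sequences against the learned model, while the real environment is queried only once per control step. Dreamer follows the same pattern but learns a latent dynamics model and trains an actor-critic on imagined latent trajectories instead of shooting.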
Pages: 11