Multi-asset closed-loop reservoir management using deep reinforcement learning

Cited by: 0

Authors:

Yusuf Nasir

Louis J. Durlofsky

Affiliation:

[1] Stanford University, Department of Energy Science and Engineering

Source:

Computational Geosciences | 2024 / Vol. 28

Keywords:

Deep reinforcement learning; Multitask learning; Vector embeddings; Closed-loop reservoir management; Optimal control; Proximal policy optimization; Transformers; 86A22; 68T05

DOI:

Not available

Abstract:

Closed-loop reservoir management (CLRM), in which history matching and production optimization are performed multiple times over the life of an asset, can provide significant improvement in the specified objective. These procedures are computationally expensive due to the large number of flow simulations required for history matching and optimization. Existing CLRM procedures are applied asset by asset, without taking advantage of similarities in geology or the temporal structure of well data across assets. Here, we develop a CLRM framework to treat multiple assets from related geological systems, which enables reductions in computational demands. Deep reinforcement learning is used to train a single global control policy that is applicable for all assets considered. The new framework is an extension of a recently introduced control policy methodology for individual assets. Embedding layers are incorporated into the representation to handle the different numbers of decision variables that arise for the different assets. Because the global control policy learns a unified representation of useful features from multiple related assets, it is less expensive to construct than asset-by-asset training (we observe about a 3× speedup in our examples). The production optimization problem includes a relative-change constraint on the well settings, which renders the results suitable for practical use. We apply the multi-asset CLRM framework to 2D and 3D water-flooding examples. In both cases, four assets with different well counts, well configurations, and geostatistical descriptions are considered. Numerical experiments demonstrate that the global control policy provides objective function values, for both the 2D and 3D cases, that are nearly identical to those from control policies trained individually for each asset. This promising finding suggests that multi-asset CLRM may indeed represent a viable practical strategy.
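
The relative-change constraint on the well settings mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the constraint, so the version below is an assumption: each new setting may differ from the previous one by at most a fixed fraction of the previous value. The function name `apply_relative_change_constraint` and the parameter `max_rel_change` are hypothetical and do not come from the paper.

```python
import numpy as np


def apply_relative_change_constraint(previous, proposed, max_rel_change=0.1):
    """Clip proposed well settings to stay near the previous settings.

    Assumed form of the constraint (not taken from the paper): each
    setting may change by at most max_rel_change * |previous setting|
    between consecutive control steps.
    """
    previous = np.asarray(previous, dtype=float)
    proposed = np.asarray(proposed, dtype=float)
    # Largest allowed absolute step for each well.
    max_step = max_rel_change * np.abs(previous)
    # Element-wise clipping to the band around the previous settings.
    return np.clip(proposed, previous - max_step, previous + max_step)


# Hypothetical example: two wells, 10% maximum relative change.
previous_settings = [100.0, 200.0]
proposed_settings = [150.0, 190.0]
constrained = apply_relative_change_constraint(
    previous_settings, proposed_settings, max_rel_change=0.1
)
```

In this example the first well's proposed change exceeds the assumed 10% band and is clipped, while the second well's change is within the band and passes through unchanged. A constraint defined relative to the allowable operating range, rather than the previous value, would be an equally plausible reading of the abstract.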

Pages: 23-42

Number of pages: 19
