Multi-asset closed-loop reservoir management using deep reinforcement learning

被引:0
|
作者
Yusuf Nasir
Louis J. Durlofsky
机构
[1] Stanford University,Department of Energy Science and Engineering
来源
Computational Geosciences | 2024年 / 28卷
关键词
Deep reinforcement learning; Multitask learning; Vector embeddings; Closed-loop reservoir management; Optimal control; Proximal policy optimization; Transformers; 86A22; 68T05;
D O I
暂无
中图分类号
学科分类号
摘要
Closed-loop reservoir management (CLRM), in which history matching and production optimization are performed multiple times over the life of an asset, can provide significant improvement in the specified objective. These procedures are computationally expensive due to the large number of flow simulations required for history matching and optimization. Existing CLRM procedures are applied asset by asset, without taking advantage of similarities in geology or the temporal structure of well data across assets. Here, we develop a CLRM framework to treat multiple assets from related geological systems, which enables reductions in computational demands. Deep reinforcement learning is used to train a single global control policy that is applicable for all assets considered. The new framework is an extension of a recently introduced control policy methodology for individual assets. Embedding layers are incorporated into the representation to handle the different numbers of decision variables that arise for the different assets. Because the global control policy learns a unified representation of useful features from multiple related assets, it is less expensive to construct than asset-by-asset training (we observe about 3×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3\times $$\end{document} speedup in our examples). The production optimization problem includes a relative-change constraint on the well settings, which renders the results suitable for practical use. We apply the multi-asset CLRM framework to 2D and 3D water-flooding examples. In both cases, four assets with different well counts, well configurations, and geostatistical descriptions are considered. Numerical experiments demonstrate that the global control policy provides objective function values, for both the 2D and 3D cases, that are nearly identical to those from control policies trained individually for each asset. This promising finding suggests that multi-asset CLRM may indeed represent a viable practical strategy.
引用
收藏
页码:23 / 42
页数:19
相关论文
共 50 条
  • [31] Closed-loop reservoir management on the Brugge test case
    Chaohui Chen
    Yudou Wang
    Gaoming Li
    Albert C. Reynolds
    Computational Geosciences, 2010, 14 : 691 - 703
  • [32] Closed-loop reservoir management on the Brugge test case
    Chen, Chaohui
    Wang, Yudou
    Li, Gaoming
    Reynolds, Albert C.
    COMPUTATIONAL GEOSCIENCES, 2010, 14 (04) : 691 - 703
  • [33] Delayed reinforcement learning for closed-loop object recognition
    Peng, J
    Bhanu, B
    IMAGE UNDERSTANDING WORKSHOP, 1996 PROCEEDINGS, VOLS I AND II, 1996, : 1429 - 1435
  • [34] Theoretical research on reservoir closed-loop production management
    ZHAO Hui~1*
    2 Petroleum Engineering College
    3 Exploration and Production Research Institute
    Science China(Technological Sciences), 2011, (10) : 2815 - 2824
  • [35] REINFORCEMENT LEARNING FOR TUNING PARAMETERS OF CLOSED-LOOP CONTROLLERS
    Serafini, M. C.
    Rosales, N.
    Garelli, F.
    DIABETES TECHNOLOGY & THERAPEUTICS, 2021, 23 : A84 - A85
  • [36] A reinforcement learning method with closed-loop stability guarantee
    Osinenko, Pavel
    Beckenbach, Lukas
    Goehrt, Thomas
    Streif, Stefan
    IFAC PAPERSONLINE, 2020, 53 (02): : 8043 - 8048
  • [37] Closed-loop stability analysis of deep reinforcement learning controlled systems with experimental validation
    Mohiuddin, Mohammed Basheer
    Boiko, Igor
    Azzam, Rana
    Zweiri, Yahya
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (13): : 1649 - 1668
  • [38] Closed-loop control of anesthesia and mean arterial pressure using reinforcement learning
    Padmanabhan, Regina
    Meskin, Nader
    Haddad, Wassim M.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 22 : 54 - 64
  • [39] Closed-Loop Control of Anesthesia and Mean Arterial Pressure Using Reinforcement Learning
    Padmanabhan, Regina
    Meskin, Nader
    Haddad, Wassim M.
    2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 265 - 272
  • [40] A closed-loop algorithm to detect human face using color and reinforcement learning
    吴东晖
    叶秀清
    顾伟康
    "JournalofZhejiangUniversityScienceJ", 2002, (01) : 73 - 77