Policy-based reinforcement learning for time series anomaly detection

被引:42
|
作者
Yu, Mengran [1 ]
Sun, Shiliang [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
基金
中国国家自然科学基金;
关键词
Time series anomaly detection; Reinforcement learning; Policy-based methods; OUTLIER DETECTION;
D O I
10.1016/j.engappai.2020.103919
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Time series anomaly detection has become a crucial and challenging task driven by the rapid increase of streaming data with the arrival of the Internet of Things. Existing methods are either domain-specific or require strong assumptions that cannot be met in realistic datasets. Reinforcement learning (RL), as an incremental self-learning approach, could avoid the two issues well. However, the current investigation is far from comprehensive. In this paper, we propose a generic policy-based RL framework to address the time series anomaly detection problem. The policy-based time series anomaly detector (PTAD) is progressively learned from the interactions with time-series data in the absence of constraints. Experimental results show that it outperforms the value-based temporal anomaly detector and other state-of-the-art detection methods whether training and test datasets come from the same source or not. Furthermore, the tradeoff between precision and recall is well respected by the PTAD, which is beneficial to fulfill various industrial requirements.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Intelligent Traffic Light via Policy-based Deep Reinforcement Learning
    Yue Zhu
    Mingyu Cai
    Chris W. Schwarz
    Junchao Li
    Shaoping Xiao
    International Journal of Intelligent Transportation Systems Research, 2022, 20 : 734 - 744
  • [22] Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms
    Murthy, Yashaswini
    Moharrami, Mehrdad
    Srikant, R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [23] Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling
    Deng, Ou
    Jin, Qun
    DIGITAL HUMAN MODELING AND APPLICATIONS IN HEALTH, SAFETY, ERGONOMICS AND RISK MANAGEMENT, DHM 2023, PT II, 2023, 14029 : 378 - 391
  • [24] Intelligent Traffic Light via Policy-based Deep Reinforcement Learning
    Zhu, Yue
    Cai, Mingyu
    Schwarz, Chris W.
    Li, Junchao
    Xiao, Shaoping
    INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2022, 20 (03) : 734 - 744
  • [25] Deep Learning for Time Series Anomaly Detection: A Survey
    Darban, Zahra zamanzadeh
    Webb, Geoffrey i.
    Pan, Shirui
    Aggarwal, Charu
    Salehi, Mahsa
    ACM COMPUTING SURVEYS, 2025, 57 (01)
  • [26] Early Action Recognition With Category Exclusion Using Policy-Based Reinforcement Learning
    Weng, Junwu
    Jiang, Xudong
    Zheng, Wei-Long
    Yuan, Junsong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) : 4626 - 4638
  • [27] Research of Anomaly Detection Based on Time Series
    Wang, Guilan
    Wang, Zhenqi
    Luo, Xianjin
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 1, PROCEEDINGS, 2009, : 444 - 448
  • [28] ADT: Time series anomaly detection for cyber-physical systems via deep reinforcement learning
    Yang, Xue
    Howley, Enda
    Schukat, Michael
    COMPUTERS & SECURITY, 2024, 141
  • [29] Time Series Anomaly Detection Based on GAN
    Sun, Yong
    Yu, Wenbo
    Chen, Yuting
    Kadam, Aishwarya
    2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 375 - 382
  • [30] Gmad: multivariate time series anomaly detection based on graph matching learning
    Kong, Jun
    Wang, Kang
    Jiang, Min
    Tao, Xuefeng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,