Towards Reliable Drift Detection and Explanation in Text Data

Citations: 0
Authors
Feldhans, Robert [1]
Hammer, Barbara [1]
Affiliations
[1] Bielefeld Univ, Bielefeld, Germany
Keywords
Drift Explanation; Text Data; Transformer; Visualization
DOI
10.1007/978-3-031-77731-8_28
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
When delivered to the market, machine learning models face new data which are possibly subject to novel characteristics - a phenomenon known as concept drift. As this might lead to performance degradation, it is necessary to detect such drift and, if required, adapt the model accordingly. While a variety of drift detection and adaptation methods exists for standard vectorial data, a suitable treatment of text data is less researched. In this work we present a novel approach which detects and explains drift in text data based on their representation via transformer embeddings. In a nutshell, the method generates suitable statistical features from the original distribution and the possibly shifted variation. Based on these representations, drift scores can be assigned to individual data points, allowing a visualization and human-readable characterization of the type of drift. We demonstrate the approach's effectiveness in reliably detecting drift in several experiments.
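The abstract does not spell out how the per-point drift scores are computed, so the following is only a minimal sketch of one common stand-in, not the authors' method. It assumes the texts have already been mapped to embedding vectors (for example by a transformer encoder) and scores each point of the newer window by the fraction of its k nearest neighbors, in the pooled sample, that also come from the newer window. The function name `knn_drift_scores`, the parameter `k`, and the synthetic 2-D "embeddings" are all hypothetical and chosen purely for illustration.

```python
import math
import random


def knn_drift_scores(reference, current, k=5):
    """Return one score in [0, 1] per point in `current`.

    The score is the fraction of the point's k nearest neighbors
    (Euclidean distance, pooled sample, the point itself excluded)
    that come from `current`. Scores close to 1 mark regions that
    the reference window barely covers. Illustrative stand-in only.
    """
    # Label 0 = reference window, label 1 = current window.
    pooled = [(vec, 0) for vec in reference] + [(vec, 1) for vec in current]
    scores = []
    for i, (vec, label) in enumerate(pooled):
        if label == 0:
            continue  # only points of the current window are scored
        dists = sorted(
            (math.dist(vec, other), other_label)
            for j, (other, other_label) in enumerate(pooled)
            if j != i
        )
        scores.append(sum(lbl for _, lbl in dists[:k]) / k)
    return scores


# Synthetic stand-ins for embeddings (assumption: real inputs would be
# transformer embeddings of the texts, with far more dimensions).
rng = random.Random(0)


def cloud(center, n):
    """Draw n 2-D points from a unit-variance Gaussian around `center`."""
    return [[rng.gauss(c, 1.0) for c in center] for _ in range(n)]


reference = cloud((0.0, 0.0), 50)
# First 25 current points follow the reference distribution,
# the last 25 are shifted to a region the reference never covers.
current = cloud((0.0, 0.0), 25) + cloud((8.0, 8.0), 25)

scores = knn_drift_scores(reference, current, k=5)
unshifted_mean = sum(scores[:25]) / 25
shifted_mean = sum(scores[25:]) / 25
print(f"mean score, unshifted points: {unshifted_mean:.2f}")
print(f"mean score, shifted points:   {shifted_mean:.2f}")
```

In this toy setup the shifted points should receive clearly higher scores than the unshifted ones. Ranking texts by such a score is one way to pick out examples for a human-readable characterization of the drift, under the assumptions stated above.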
Pages: 301-312
Number of pages: 12
Related papers
50 in total
  • [21] Handling Concept Drift in Data Streams by Using Drift Detection Methods
    Patil, Malini M.
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2018, VOL 2, 2019, 839 : 155 - 166
  • [22] Towards Non-Parametric Drift Detection via Dynamic Adapting Window Independence Drift Detection (DAWIDD)
    Hinder, Fabian
    Artelt, Andre
    Hammer, Barbara
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [23] Towards Reliable Driver Drowsiness Detection Leveraging Wearables
    Cao, Yetong
    Li, Fan
    Liu, Xiaochen
    Yang, Song
    Wang, Yu
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2023, 19 (02)
  • [24] Towards Reliable Real-time Person Detection
    Serban, Silviu-Tudor
    Simha, Srinidhi Mukanahallipatna
    Bathrinarayanan, Vasanth
    Corvee, Etienne
    Bremond, Francois
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014: 232 - 239
  • [25] Feature Drift Detection in Evolving Data Streams
    Zhao, Di
    Koh, Yun Sing
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT II, 2020, 12392 : 335 - 349
  • [26] New Drift Detection Method for Data Streams
    Sobhani, Parinaz
    Beigy, Hamid
    ADAPTIVE AND INTELLIGENT SYSTEMS, 2011, 6943 : 88 - 97
  • [27] Concept Drift Detection for Evolving Stream Data
    Lee, Jeonghoon
    Lee, Yoon-Joon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (11) : 2288 - 2292
  • [28] Towards Reliable Data Feature Retrieval and Decision Engine in Host-Based Anomaly Detection Systems
    Haider, Waqas
    Hu, Jiankun
    Xie, Miao
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015: 513 - 517
  • [29] Towards a robust and reliable deep learning approach for detection of compact binary mergers in gravitational wave data
    Jadhav, Shreejit
    Shrivastava, Mihir
    Mitra, Sanjit
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04)
  • [30] Contextual Anomaly Detection in Text Data
    Mahapatra, Amogh
    Srivastava, Nisheeth
    Srivastava, Jaideep
    ALGORITHMS, 2012, 5 (04) : 469 - 489