Validation of wastewater data using artificial intelligence tools and the evaluation of their performance regarding annotator agreement

被引:3
|
作者
Zidaoui, Imane [1 ,2 ]
Wemmert, Cedric [3 ]
Dufresne, Matthieu [2 ]
Joannis, Claude [4 ]
Isel, Sandra [2 ]
Wertel, Jonathan [2 ]
Vazquez, Jose [1 ]
机构
[1] ICube Lab, Dept Fluid Mech, 2 Rue Boussingault, F-67000 Strasbourg, France
[2] 3D EAU, 3 Quai Kleber, F-67000 Strasbourg, France
[3] ICube Lab, Data Sci & Knowledge Dept, 300 Bd Sebastien Brant, F-67400 Illkirch Graffenstaden, France
[4] CJ Conseil, 37 Rue Coteau, F-44100 Nantes, France
关键词
annotator agreement; artificial intelligence; data validation; matrix profile; one-class SVM; wastewater; REAL-TIME CONTROL; FAULT-DETECTION; URBAN;
D O I
10.2166/wst.2023.174
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
To prevent the pollution of water resources, the measurement and the limitation of wastewater discharges are required. Despite the progress in the field of data acquisition systems, sensors are subject to malfunctions that can bias the evaluation of the pollution flow. It is therefore essential to identify potential anomalies in the data before any use. The objective of this work is to deploy artificial intelligence tools to automate the data validation and to assess the added value of this approach in assisting the validation performed by an operator. To do so, we compare two state-of-the-art anomaly detection algorithms on turbidity data in a sewer network. On the one hand, we conclude that the One-class SVM model is not adapted to the nature of the studied data which is heterogeneous and noisy. The Matrix Profile model, on the other hand, provides promising results with a majority of anomalies detected and a relatively limited number of false positives. By comparing these results to the expert validation, it turns out that the use of the Matrix Profile model objectifies and accelerates the validation task while maintaining the same level of performance compared to the annotator agreement rate between two experts.
引用
收藏
页码:2957 / 2970
页数:14
相关论文
共 50 条
  • [41] Automatic recognition of industrial tools using artificial intelligence approach
    Les, Tomasz
    Kruk, Michal
    Osowski, Stanislaw
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (12) : 4777 - 4784
  • [42] Application of Predictive Maintenance Concepts Using Artificial Intelligence Tools
    Cardoso, Diogo
    Ferreira, Luis
    APPLIED SCIENCES-BASEL, 2021, 11 (01): : 1 - 18
  • [43] Decision-making in tunneling using artificial intelligence tools
    Mahmoodzadeh, Arsalan
    Mohammadi, Mokhtar
    Daraei, Ako
    Faraj, Rabar H.
    Omer, Rebaz Mohammed Dler
    Sherwani, Aryan Far H.
    TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2020, 103
  • [44] Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques
    Hosny, Mariam
    Abu Waraga, Omnia
    Abu Talib, Manar
    Abdallah, Mohamed
    DATA IN BRIEF, 2023, 51
  • [45] Using Generative Artificial Intelligence Tools in Software Engineering Courses
    Datta, Soma
    2024 36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING EDUCATION AND TRAINING, CSEE & T 2024, 2024,
  • [46] Big Data Analytics Using Artificial Intelligence
    Gandomi, Amir H.
    Chen, Fang
    Abualigah, Laith
    ELECTRONICS, 2023, 12 (04)
  • [47] Validity Evaluation for the Data Used for Artificial Intelligence System
    Son, Han Seong
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2020, 1037 : 362 - 369
  • [48] Prediction of Marathon Performance using Artificial Intelligence
    Lerebourg, Lucie
    Saboul, Damien
    Clemencon, Michel
    Coquart, Jeremy Bernard
    INTERNATIONAL JOURNAL OF SPORTS MEDICINE, 2023, 44 (05) : 352 - 360
  • [49] Towards Evaluation of Explainable Artificial Intelligence in Streaming Data
    Mozolewski, Maciej
    Bobek, Szymon
    Ribeiro, Rita P.
    Nalepa, Grzegorz J.
    Gama, Joao
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2024, PT IV, 2024, 2156 : 145 - 168
  • [50] Performance evaluation of artificial intelligence classifiers for the medical domain
    Smith, AE
    Nugent, CD
    McClean, SI
    HEALTH DATA IN THE INFORMATION SOCIETY, 2002, 90 : 553 - 556