Using Autonomous Outlier Detection Methods for Thermophysical Property Data

被引:0
|
作者
Schnorr, Andrea [1 ]
Kaldi, Daniel Johannes [1 ]
Staubach, Jens [2 ]
Garth, Christoph [1 ]
Stephan, Simon [2 ]
机构
[1] RPTU Kaiserslautern, Sci Visualizat Lab, D-67663 Kaiserslautern, Germany
[2] RPTU Kaiserslautern, Lab Engn Thermodynam LTD, D-67663 Kaiserslautern, Germany
来源
关键词
EQUATION-OF-STATE; VAPOR-LIQUID-EQUILIBRIA; LENNARD-JONES MIXTURES; MONTE-CARLO SIMULATIONS; XML-BASED APPROACH; THERMODYNAMIC PROPERTIES; PHASE-EQUILIBRIA; MOLECULAR SIMULATION; TRANSPORT-PROPERTIES; QUALITY ASSESSMENT;
D O I
10.1021/acs.jced.3c00588
中图分类号
O414.1 [热力学];
学科分类号
摘要
The reliability and accuracy of thermophysical property data are of central importance for the development of models that describe these properties. In this work, we compare different autonomous algorithms for identifying the outliers in an existing database. Therefore, the comprehensive database on thermophysical property data for the Lennard-Jones fluid [J. Chem. Inf. Model. 2019, 59, 4248-4265] is used. We focus on homogeneous state property data at given temperature and density for the pressure p, thermal expansion coefficient alpha, isothermal compressibility beta, thermal pressure coefficient gamma, internal energy u, isochoric heat capacity c(v), isobaric heat capacity c(p), Gr & uuml;neisen coefficient Gamma, Joule-Thomson coefficient mu(JT), speed of sound w, chemical potential mu, (reduced) Helmholtz energy a = a/T, and its derivatives a(nm). A comprehensive comparison of 19 outlier detection methods is carried out, which provides insights into the applicability of generic outlier detection algorithms for thermophysical property data. Different classes of outlier detection algorithms are included in the study, namely, machine learning, distance-based, density-based, statistical, ensemble, and model-informed. Two approaches are used for the method evaluation: in approach (a), the original database (comprising real outliers) is used. In approach (b), synthetic outliers are introduced. The results and findings from both approaches are consistent. Machine learning methods yield in some cases better performance compared to that of the distance-based, density-based, ensemble, and statistical methods. The best performance is obtained from the model-informed method (called MoDOD). The results also provide insights into the nature of the outliers in the Lennard-Jones database.
引用
收藏
页码:864 / 880
页数:17
相关论文
共 50 条
  • [21] Uncertainty analysis of thermophysical property measurements of solids using dynamic methods
    Malinaric, Svetozar
    INTERNATIONAL JOURNAL OF THERMOPHYSICS, 2007, 28 (01) : 20 - 32
  • [22] Uncertainty Analysis of Thermophysical Property Measurements of Solids Using Dynamic Methods
    Svetozár Malinarič
    International Journal of Thermophysics, 2007, 28 : 20 - 32
  • [23] Outlier detection and missing data filling methods for coastal water temperature data
    Cho, Hong Yeon
    Oh, Ji Hee
    Kim, Kyeong Ok
    Shim, Jae Seol
    JOURNAL OF COASTAL RESEARCH, 2013, : 1898 - 1903
  • [24] An Experimental Analysis of Fraud Detection Methods in Enterprise Telecommunication Data using Unsupervised Outlier Ensembles
    Kaiafas, Georgios
    Hammerschmidt, Christian
    State, Radu
    Nguyen, Cu D.
    Ries, Thorsten
    Ourdane, Mohamed
    2019 IFIP/IEEE SYMPOSIUM ON INTEGRATED NETWORK AND SERVICE MANAGEMENT (IM), 2019, : 37 - 42
  • [25] Patching rainfall data using regression methods .3. Grouping, patching and outlier detection
    Pegram, G
    JOURNAL OF HYDROLOGY, 1997, 198 (1-4) : 319 - 334
  • [26] Outlier detection in satellite data using spatial coherence
    Alvera-Azcarate, A.
    Sirjacobs, D.
    Barth, A.
    Beckers, J. -M.
    REMOTE SENSING OF ENVIRONMENT, 2012, 119 : 84 - 91
  • [27] Outlier Detection for Categorial Data Using Clustering Algorithms
    Nowak-Brzezinska, Agnieszka
    Lazarz, Weronika
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 714 - 727
  • [28] Outlier Detection in Data Streams Using OLAP Cubes
    Heine, Felix
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2017, 2017, 767 : 29 - 36
  • [29] Outlier Detection for Monitoring Data Using Stacked Autoencoder
    Wan, Fangyi
    Guo, Gaodeng
    Zhang, Chunlin
    Guo, Qing
    Liu, Jie
    IEEE ACCESS, 2019, 7 : 173827 - 173837
  • [30] Methods for outlier detection in prediction
    Pierna, JAF
    Wahl, F
    de Noord, OE
    Massart, DL
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 63 (01) : 27 - 39