Reproducibility and imputation of air toxics data

被引:12
|
作者
Le, Hien Q.
Batterman, Stuart A. [1 ]
Wahl, Robert L.
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Michigan Dept Community Hlth, Ann Arbor, MI 48109 USA
来源
JOURNAL OF ENVIRONMENTAL MONITORING | 2007年 / 9卷 / 12期
关键词
D O I
10.1039/b709816b
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Ambient air quality datasets include missing data, values below method detection limits and outliers, and the precision and accuracy of the measurements themselves are often unknown. At the same time, many analyses require continuous data sequences and assume that measurements are error-free. While a variety of data imputation and cleaning techniques are available, the evaluation of such techniques remains limited. This study evaluates the performance of these techniques for ambient air toxics measurements, a particularly challenging application, and includes the analysis of intra- and inter-laboratory precision. The analysis uses an unusually complete-dataset, consisting of daily measurements of over 70 species of carbonyls and volatile organic compounds ( VOCs) collected over a one year period in Dearborn, Michigan, including 122 pairs of replicates. Analysis was restricted to compounds found above detection limits in >= 20% of the samples. Outliers were detected using the Gumbell extreme value distribution. Error models for inter- and intra-laboratory reproducibility were derived from replicate samples. Imputation variables were selected using a generalized additive model, and the performance of two techniques, multiple imputation and optimal linear estimation, was evaluated for three missingness patterns. Many species were rarely detected or had very poor reproducibility. Error models developed for seven carbonyls showed median intra- and inter- laboratory errors of 22% and 25%, respectively. Better reproducibility was seen for the 16 VOCs meeting detection and reproducibility criteria. Imputation performance depended on the compound and missingness pattern. Data missing at random could be adequately imputed, but imputations for row-wise deletions, the most common type of missingness pattern encountered, were not informative. The analysis shows that air toxics data require significant efforts to identify and mitigate errors, outliers and missing observations, and that these steps are essential and should be performed prior to using these data in receptor, exposure, health and other applications.
引用
收藏
页码:1358 / 1372
页数:15
相关论文
共 50 条
  • [21] Missing data imputation for paired stream and air temperature sensor data
    Li, Han
    Deng, Xinwei
    Smith, Eric
    ENVIRONMETRICS, 2017, 28 (01)
  • [22] EPAS AIR TOXICS STUDY
    DOWD, RM
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 1984, 18 (12) : A373 - A373
  • [23] RECOMMENDATIONS FOR THE REGULATION OF AIR TOXICS
    SCHER, JA
    ENVIRONMENTAL PROGRESS, 1987, 6 (03): : A12 - A14
  • [24] UAV-Based Wildland Fire Air Toxics Data Collection and Analysis
    Ragbir, Prabhash
    Kaduwela, Ajith
    Passovoy, David
    Amin, Preet
    Ye, Shuchen
    Wallis, Christopher
    Alaimo, Christopher
    Young, Thomas
    Kong, Zhaodan
    SENSORS, 2023, 23 (07)
  • [25] Methods for imputation of missing values in air quality data sets
    Junninen, H
    Niska, H
    Tuppurainen, K
    Ruuskanen, J
    Kolehmainen, M
    ATMOSPHERIC ENVIRONMENT, 2004, 38 (18) : 2895 - 2907
  • [26] EPA tackles urban air toxics
    不详
    CHEMICAL ENGINEERING PROGRESS, 1999, 95 (10) : 23 - 23
  • [27] AIR TOXICS - SOURCES AND MONITORING IN TEXAS
    PENDLETON, DR
    ENVIRONMENTAL HEALTH PERSPECTIVES, 1995, 103 : 223 - 228
  • [28] RESEARCH PRIORITIES FOR MOBILE AIR TOXICS
    不详
    ENVIRONMENTAL HEALTH PERSPECTIVES, 1993, 101 (01) : 20 - 20
  • [29] AIR TOXICS - EMERGING LEGISLATION AND REGULATIONS
    REITANO, AJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1990, 200 : 7 - CHAL
  • [30] PHOTOCATALYTIC OXIDATION OF OXYGENATED AIR TOXICS
    RAUPP, GB
    JUNIO, CT
    APPLIED SURFACE SCIENCE, 1993, 72 (04) : 321 - 327