共 50 条
Nonparametric maximum likelihood estimation from samples with irrelevant data and verification bias
被引:5
|作者:
Lambert, D
[1
]
Tierney, L
[1
]
机构:
[1] UNIV MINNESOTA,SCH STAT,MINNEAPOLIS,MN 55455
关键词:
environmental data;
identifiability;
nondetect;
D O I:
10.2307/2965557
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
Suppose that some measurements come from a distribution F that is of interest and others come from another, irrelevant distribution G. Some measurements from F are verified and known to be from F. The other, unverified measurements may be from F or from G. Not all measurements from F are equally likely to be verified, and,lo measurement from G is ever verified. This model applies to measurements of low concentrations obtained using gas chromatography/mass spectroscopy, for example, as is shown in this article. How well a feature T(F) of F can be estimated when there are unverified data depends on what can be assumed about F, G, and the conditional probability v(z) of verifying a measurement of x from F. If F, G, and v are unrestricted, then more than one choice of (F, G, v) gives the same distribution p of the observable z, and thus T(F) cannot be uniquely estimated from data. But if the set of values of T(F) that correspond to a distribution p of a: is small enough, then it is reasonable to try to estimate that set of T(F) from data. This article shows that the set of possible values of T(F) is a finite interval for some choices of T, such as the mean, and proposes estimators of the interval of possible values. An example using data from a Love Canal study shows that the partially identified set of T(F) can be sufficiently small for estimators of T(F) to be useful.
引用
收藏
页码:937 / 944
页数:8
相关论文