Assessing Conformance with Benford's Law: Goodness-Of-Fit Tests and Simultaneous Confidence Intervals

被引:19
|
作者
Lesperance, M. [1 ]
Reed, W. J. [1 ]
Stephens, M. A. [2 ]
Tsao, C. [1 ]
Wilton, B. [3 ]
机构
[1] Univ Victoria, Dept Math & Stat, Victoria, BC, Canada
[2] Simon Fraser Univ, Burnaby, BC V5A 1S6, Canada
[3] Camosun Coll, Victoria, BC, Canada
来源
PLOS ONE | 2016年 / 11卷 / 03期
基金
加拿大自然科学与工程研究理事会;
关键词
STATISTICS;
D O I
10.1371/journal.pone.0151235
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Benford's Law is a probability distribution for the first significant digits of numbers, for example, the first significant digits of the numbers 871 and 0.22 are 8 and 2 respectively. The law is particularly remarkable because many types of data are considered to be consistent with Benford's Law and scientists and investigators have applied it in diverse areas, for example, diagnostic tests for mathematical models in Biology, Genomics, Neuroscience, image analysis and fraud detection. In this article we present and compare statistically sound methods for assessing conformance of data with Benford's Law, including discrete versions of Cramr- von Mises (CvM) statistical tests and simultaneous confidence intervals. We demonstrate that the common use of many binomial confidence intervals leads to rejection of Benford too often for truly Benford data. Based on our investigation, we recommend that the CvM statistic U2 d, Pearson's chi-square statistic and 100(1-a)% Goodman's simultaneous confidence intervals be computed when assessing conformance with Benford's Law. Visual inspection of the data with simultaneous confidence intervals is useful for understanding departures from Benford and the influence of sample size.
引用
收藏
页数:20
相关论文
共 50 条