Using the Benford's Law as a First Step to Assess the Quality of the Cancer Registry Data

被引:13
|
作者
Crocetti, Emanuele [1 ]
Randi, Giorgia [1 ]
机构
[1] European Commiss, JRC, Directorate Hlth Consumers & Reference Mat F, Hlth Soc Unit, Ispra, Italy
关键词
cancer registry; incidence; data quality; Benford; methodology; FRAUD;
D O I
10.3389/fpubh.2016.00225
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Benfords law states that the distribution of the first digit different from 0 [first significant digit (FSD)] in many collections of numbers is not uniform. The aim of this study is to evaluate whether population-based cancer incidence rates follow Benfords law, and if this can be used in their data quality check process. Methods: We sampled 43 population-based cancer registry populations (CRPs) from the Cancer Incidence in 5 Continents-volume X (CI5-X). The distribution of cancer incidence rate FSD was evaluated overall, by sex, and by CRP. Several statistics, including Pearsons coefficient of correlation and distance measures, were applied to check the adherence to the Benfords law. Results: In the whole dataset (146,590 incidence rates) and for each sex (70,722 male and 75,868 female incidence rates), the FSD distributions were Benford-like. The coefficient of correlation between observed and expected FSD distributions was extremely high (0.999), and the distance measures low. Considering single CRP (from 933 to 7,222 incidence rates), the results were in agreement with the Benfords law, and only a few CRPs showed possible discrepancies from it. Conclusion: This study demonstrated for the first time that cancer incidence rates follow Benfords law. This characteristic can be used as a new, simple, and objective tool in data quality evaluation. The analyzed data had been already checked for publication in CI5-X. Therefore, their quality was expected to be good. In fact, only for a few CRPs several statistics were consistent with possible violations.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Elaboration on Benford's law and the distribution of first digits
    Falk, W. R.
    CANADIAN JOURNAL OF PHYSICS, 2014, 92 (01) : 59 - 64
  • [22] Benford's law applied to aerobiological data and its potential as a quality control tool
    Docampo, Silvia
    del Mar Trigo, Maria
    Jesus Aira, Maria
    Cabezudo, Baltasar
    Flores-Moya, Antonio
    AEROBIOLOGIA, 2009, 25 (04) : 275 - 283
  • [23] Testing firm-level data quality in China against Benford's Law
    Huang, Yasheng
    Niu, Zhiyong
    Yang, Clair
    ECONOMICS LETTERS, 2020, 192
  • [24] Data Diagnostics Using Second-Order Tests of Benford's Law
    Nigrini, Mark J.
    Miller, Steven J.
    AUDITING-A JOURNAL OF PRACTICE & THEORY, 2009, 28 (02): : 305 - 324
  • [25] Benford’s law applied to aerobiological data and its potential as a quality control tool
    Silvia Docampo
    María del Mar Trigo
    María Jesús Aira
    Baltasar Cabezudo
    Antonio Flores-Moya
    Aerobiologia, 2009, 25 : 275 - 283
  • [26] Benford's law and Theil transform of financial data
    Clippe, Paulette
    Ausloos, Marcel
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2012, 391 (24) : 6556 - 6567
  • [27] Application of Benford's Law in Analyzing Geotechnical Data
    Alipour, A.
    Alipour, S.
    CIVIL ENGINEERING INFRASTRUCTURES JOURNAL-CEIJ, 2019, 52 (02): : 323 - 334
  • [28] Use of Benford's law in drug discovery data
    Orita, Masaya
    Moritomo, Ayako
    Niimi, Tatsuya
    Ohno, Kazuki
    DRUG DISCOVERY TODAY, 2010, 15 (9-10) : 328 - 331
  • [29] Benford's Law: Analyzing a Decade of Financial Data
    Alali, Fatima A.
    Romero, Silvia
    JOURNAL OF EMERGING TECHNOLOGIES IN ACCOUNTING, 2013, 10 (01) : 1 - 39
  • [30] Agreement of drug discovery data with Benford's law
    Orita, Masaya
    Hagiwara, Yosuke
    Moritomo, Ayako
    Tsunoyama, Kazuhisa
    Watanabe, Toshihiro
    Ohno, Kazuki
    EXPERT OPINION ON DRUG DISCOVERY, 2013, 8 (01) : 1 - 5