Using the Benford's Law as a First Step to Assess the Quality of the Cancer Registry Data

Cited by: 13
Authors
Crocetti, Emanuele [1]
Randi, Giorgia [1]
Affiliations
[1] European Commiss, JRC, Directorate Hlth Consumers & Reference Mat F, Hlth Soc Unit, Ispra, Italy
Keywords
cancer registry; incidence; data quality; Benford; methodology; FRAUD;
DOI
10.3389/fpubh.2016.00225
Chinese Library Classification number
R1 [Preventive medicine, hygiene]
Discipline classification code
1004; 120402
Abstract
Background: Benford's law states that the distribution of the first digit different from 0 [first significant digit (FSD)] in many collections of numbers is not uniform. The aim of this study is to evaluate whether population-based cancer incidence rates follow Benford's law, and whether this property can be used in their data quality check process. Methods: We sampled 43 population-based cancer registry populations (CRPs) from Cancer Incidence in Five Continents, Volume X (CI5-X). The distribution of the FSD of cancer incidence rates was evaluated overall, by sex, and by CRP. Several statistics, including Pearson's coefficient of correlation and distance measures, were applied to check adherence to Benford's law. Results: In the whole dataset (146,590 incidence rates) and for each sex (70,722 male and 75,868 female incidence rates), the FSD distributions were Benford-like. The coefficient of correlation between observed and expected FSD distributions was extremely high (0.999), and the distance measures were low. For individual CRPs (each with 933 to 7,222 incidence rates), the results were in agreement with Benford's law, and only a few CRPs showed possible discrepancies from it. Conclusion: This study demonstrated for the first time that cancer incidence rates follow Benford's law. This characteristic can be used as a new, simple, and objective tool in data quality evaluation. The analyzed data had already been checked for publication in CI5-X, so their quality was expected to be good. Indeed, for only a few CRPs were several statistics consistent with possible violations.
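The check described in the abstract can be illustrated with a minimal sketch. Under Benford's law the expected probability of first significant digit d is log10(1 + 1/d), so 1 appears about 30.1% of the time and 9 about 4.6%. The code below compares an observed FSD distribution with that expectation using Pearson's correlation and one distance measure. The function names are hypothetical, the Euclidean distance is an assumption (the abstract does not say which distance measures were used), and the powers of 2 are only a stand-in dataset, not CI5-X incidence rates.

```python
import math
from collections import Counter


def benford_expected():
    """Expected FSD probabilities for digits 1..9: P(d) = log10(1 + 1/d)."""
    return [math.log10(1 + 1 / d) for d in range(1, 10)]


def first_significant_digit(x):
    """Return the first non-zero digit of a non-zero number, e.g. 0.0042 -> 4."""
    # In scientific notation the first character is the first significant digit.
    return int(f"{abs(x):.15e}"[0])


def fsd_distribution(values):
    """Observed relative frequency of each first significant digit 1..9."""
    # Zero rates have no significant digit, so they are skipped.
    digits = [first_significant_digit(v) for v in values if v and math.isfinite(v)]
    counts = Counter(digits)
    return [counts.get(d, 0) / len(digits) for d in range(1, 10)]


def pearson_r(xs, ys):
    """Pearson's coefficient of correlation between two equal-length lists."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def euclidean_distance(observed, expected):
    """Assumed distance measure: Euclidean distance between the distributions."""
    return math.sqrt(sum((o - e) ** 2 for o, e in zip(observed, expected)))


# Stand-in data: powers of 2 are a classic Benford-like sequence.
sample = [2 ** k for k in range(1, 201)]
observed = fsd_distribution(sample)
expected = benford_expected()
print("Pearson r:", round(pearson_r(observed, expected), 4))
print("Euclidean distance:", round(euclidean_distance(observed, expected), 4))
```

With real registry data, `sample` would be replaced by the list of incidence rates for one CRP or for the whole dataset. A correlation close to 1 together with a small distance would indicate a Benford-like distribution.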
Pages: 5
Related papers
50 in total
  • [31] Benford's Law and Its Application to Lightning Data
    Manoochehrnia, Pooyan
    Rachidi, Farhad
    Rubinstein, Marcos
    Schulz, Wolfgang
    Diendorfer, Gerhard
    IEEE TRANSACTIONS ON ELECTROMAGNETIC COMPATIBILITY, 2010, 52 (04) : 956 - 961
  • [32] Data validity and statistical conformity with Benford's Law
    Cerqueti, Roy
    Maggi, Mario
    CHAOS SOLITONS & FRACTALS, 2021, 144
  • [33] Blind image quality assessment based on Benford's law
    Al-Bandawi, Hussein
    Deng, Guang
    IET IMAGE PROCESSING, 2018, 12 (11) : 1983 - 1993
  • [34] Using Benford's law on the seismic reflectivity analysis
    de Macedo, Isadora A. S.
    de Figueiredo, Jose Jadsom S.
    INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2018, 6 (03) : T689 - T697
  • [35] Conformance of Public Water Use Data to Benford's Law
    Sowby, Robert B.
    JOURNAL AMERICAN WATER WORKS ASSOCIATION, 2018, 110 (12) : E52 - E59
  • [36] Utilization of Benford's Law by Testing Government Macroeconomics Data
    Placek, Michal
    EUROPEAN FINANCIAL SYSTEMS 2013: PROCEEDINGS OF THE 10TH INTERNATIONAL SCIENTIFIC CONFERENCE, 2013 : 258 - 263
  • [37] Characterizing Memory Failures Using Benford's Law
    Ferreira, Kurt B.
    Levy, Scott
    EURO-PAR 2021: PARALLEL PROCESSING WORKSHOPS, 2022, 13098 : 310 - 321
  • [38] Selecting Audit Samples Using Benford's Law
    da Silva, Carlos Gomes
    Carreira, Pedro M. R.
    AUDITING-A JOURNAL OF PRACTICE & THEORY, 2013, 32 (02) : 53 - 65
  • [39] Adaptive Fraud Detection using Benford's Law
    Lu, Fletcher
    Boritz, J. Efrim
    Covvey, Dominic
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 347 - 358
  • [40] Countries with potential data misreport based on Benford's law
    Kilani, A.
    Georgiou, G. P.
    JOURNAL OF PUBLIC HEALTH, 2021, 43 (02) : E295 - E296