Probabilistic record linkage and a method to calculate the positive predictive value

被引:157
作者
Blakely, T [1 ]
Salmond, C [1 ]
机构
[1] Univ Otago, Wellington Sch Med, Dept Publ Hlth, Wellington, New Zealand
关键词
medical record linkage; predictive value of tests; sensitivity and specificity; epidemiological methods; censuses; mortality;
D O I
10.1093/ije/31.6.1246
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Computerized record linkage is commonly used in cohort studies to ascertain the study outcome, and as such its accuracy classifying the outcome can be described using the standard epidemiological terms of sensitivity and positive predictive value (PPV). Method We describe a 'duplicate method' to calculate the PPV of record linkage when each record can only be involved in one match (e.g. linking population files to death files). The method does not require a validation subset of records from both files with detailed personal information (e.g. name and address), and is therefore ideal for linkage projects using anonymous data. The duplicate method assumes that the number of records from one file with zero, one, two, etc., links from the other file is distributed in a manner predicted by combinatorial probabilities. Having made this assumption, the number of false positive links, and hence the PPV, are estimable. We demonstrate this duplicate method using output from anonymous and probabilistic record linkage of census and mortality records in New Zealand. Results The PPV estimates conform to the pattern expected based on the underlying theory of probabilistic record linkage, and were robust to sensitivity analyses. We encourage other researchers to further assess the accuracy of this method.
引用
收藏
页码:1246 / 1252
页数:7
相关论文
共 24 条
[1]   Anonymous linkage of New Zealand mortality and Census data [J].
Blakely, T ;
Woodward, A ;
Salmond, C .
AUSTRALIAN AND NEW ZEALAND JOURNAL OF PUBLIC HEALTH, 2000, 24 (01) :92-95
[2]  
Blakely T., 2001, SOCIOECONOMIC FACTOR
[3]  
Blakely T, 1999, ANONYMOUS RECORD LIN
[4]  
Brenner H, 1998, METHOD INFORM MED, V37, P69
[5]   USE OF THE POSITIVE PREDICTIVE VALUE TO CORRECT FOR DISEASE MISCLASSIFICATION IN EPIDEMIOLOGIC STUDIES [J].
BRENNER, H ;
GEFELLER, O .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1993, 138 (11) :1007-1015
[6]   UTILITY OF THE NATIONAL DEATH INDEX FOR ASCERTAINMENT OF MORTALITY AMONG CANCER PREVENTION STUDY-II PARTICIPANTS [J].
CALLE, EE ;
TERRELL, DD .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1993, 137 (02) :235-241
[7]   BIAS DUE TO MISCLASSIFICATION IN ESTIMATION OF RELATIVE RISK [J].
COPELAND, KT ;
CHECKOWAY, H ;
MCMICHAEL, AJ ;
HOLBROOK, RH .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1977, 105 (05) :488-495
[8]   COMPUTERIZED LINKING OF MEDICAL RECORDS - METHODOLOGICAL GUIDELINES [J].
GILL, L ;
GOLDACRE, M ;
SIMMONS, H ;
BETTLEY, G ;
GRIFFITH, M .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 1993, 47 (04) :316-319
[9]  
Gill LE, 1987, TXB MEDICAL RECORD L
[10]  
GOLDBERG MS, 1993, CAN J PUBLIC HEALTH, V84, P201