The Use of Prior Information in Very Robust Regression for Fraud Detection

被引:9
|
作者
Riani, Marco [1 ]
Corbellini, Aldo [1 ]
Atkinson, Anthony C. [2 ]
机构
[1] Univ Parma, Dept Econ & Management, Parma, Italy
[2] London Sch Econ, Dept Stat, London WC2A 2AE, England
关键词
big data; data cleaning; forward search; MM estimation; misinvoicing; money laundering; seafood; timeliness; OUTLIER DETECTION;
D O I
10.1111/insr.12247
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Misinvoicing is a major tool in fraud including money laundering. We develop a method of detecting the patterns of outliers that indicate systematic mis-pricing. As the data only become available year by year, we develop a combination of very robust regression and the use of cleaned' prior information from earlier years, which leads to early and sharp indication of potentially fraudulent activity that can be passed to legal agencies to institute prosecution. As an example, we use yearly imports of a specific seafood into the European Union. This is only one of over one million annual data sets, each of which can currently potentially contain 336 observations. We provide a solution to the resulting big data problem, which requires analysis with the minimum of human intervention.
引用
收藏
页码:205 / 218
页数:14
相关论文
共 50 条
  • [21] Credit Card Fraud Detection using Logistic Regression
    Wang, Tianyou
    Zhao, Yucheng
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 301 - 305
  • [22] On the use of robust regression in econometrics
    Baldauf, Markus
    Santos Silva, J. M. C.
    ECONOMICS LETTERS, 2012, 114 (01) : 124 - 127
  • [23] Confidence intervals in regression utilizing prior information
    Kabaila, Paul
    Giri, Khageswor
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (10) : 3419 - 3429
  • [24] Constrained inverse regression for incorporating prior information
    Naik, PA
    Tsai, CL
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) : 204 - 211
  • [25] Multiclass Regularized Regression Integrating Prior Information
    He, Jingxuan
    Zeng, Chubing
    Lewinger, Juan Pablo
    Conti, David V.
    GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 758 - 758
  • [26] Robustness Against Conflicting Prior Information in Regression*
    Gagnon, Philippe
    BAYESIAN ANALYSIS, 2023, 18 (03): : 841 - 864
  • [27] A two-stage Bayesian semiparametric model for novelty detection with robust prior information
    Francesco Denti
    Andrea Cappozzo
    Francesca Greselin
    Statistics and Computing, 2021, 31
  • [28] A two-stage Bayesian semiparametric model for novelty detection with robust prior information
    Denti, Francesco
    Cappozzo, Andrea
    Greselin, Francesca
    STATISTICS AND COMPUTING, 2021, 31 (04)
  • [29] Correction to: A two-stage Bayesian semiparametricmodel for novelty detection with robust prior information
    Francesco Denti
    Andrea Cappozzo
    Francesca Greselin
    Statistics and Computing, 2022, 32
  • [30] Robust Monitoring of Time Series with Application to Fraud Detection
    Rousseeuw, Peter
    Perrotta, Domenico
    Riani, Marco
    Hubert, Mia
    ECONOMETRICS AND STATISTICS, 2019, 9 : 108 - 121